2588 Commits

Author SHA1 Message Date
Jim Brennan
78bddd0d9f YARN-10697. Resources are displayed in bytes in UI for schedulers other than capacity. Contributed by Bilwa S T.
(cherry picked from commit 174f3a96b10a0ab0fd8aed1b0f904ca5f0c3f268)
2021-03-23 18:23:50 +00:00
Eric Badger
cd417f17ae YARN-10688. ClusterMetrics should support GPU capacity related metrics.. Contributed by Qi Zhu.
(cherry picked from commit 49f89f1d3de66f3bb4db5952e8873432ba62f71a)
2021-03-17 18:16:59 +00:00
Eric Payne
f5810ea83c YARN-10588. Percentage of queue and cluster is zero in WebUI . Contributed by Bilwa S T
(cherry picked from commit aa4c17b9d7af122163789a731ced05f740562e45)
2021-03-15 19:14:19 +00:00
Brahma Reddy Battula
f12293fba2 YARN-10671.Fix Typo in TestSchedulingRequestContainerAllocation. Contributed by D M Murali Krishna Reddy.
(cherry picked from commit b2a565629dba125be5b330e84c313ba26b50e80f)
2021-03-09 20:27:07 +05:30
Peter Bacsko
066f89af01 YARN-10672. All testcases in TestReservations are flaky. Contributed by Szilard Nemeth. 2021-03-08 11:42:59 +01:00
Neil
0396a721e3 YARN-10649. Fix RMNodeImpl.updateExistContainers leak (#2719). Contributed by Max Xie
(cherry picked from commit d615e2d3bd35780fbae379030a8231c301bda035)
2021-03-04 14:54:28 +05:30
Jonathan Hung
be6e99963d YARN-10651. CapacityScheduler crashed with NPE in AbstractYarnScheduler.updateNodeResource(). Contributed by Haibo Chen
(cherry picked from commit f348ab3f2f468751af329a1ffce4917cb000fcbf)
2021-02-25 15:09:33 -08:00
Jim Brennan
db457b056a [YARN-10613] Config to allow Intra- and Inter-queue preemption to enable/disable conservativeDRF. Contributed by Eric Payne
(cherry picked from commit c373da9f883e0f58c1fff0d2213ae973d1233e14)
2021-02-25 16:48:46 +00:00
Inigo Goiri
8c8ef2f444 YARN-9017. PlacementRule order is not maintained in CS. Contributed by Bilwa S T.
(cherry picked from commit 35010120fbbcad8618f99abf7130e53f98879a33)
2021-02-18 20:42:26 +05:30
Prabhu Joseph
72904c014d YARN-10361. Make custom DAO classes configurable into RMWebApp#JAXBContextResolver.
Contributed by Bilwa ST.

(cherry picked from commit c7e71a6c0beb2748988b339a851a129b5e57f8c4)
2021-02-18 14:25:16 +05:30
Prabhu Joseph
0c46ab51b5 YARN-8047. RMWebApp make external class pluggable.
Contributed by Bilwa S T.

(cherry picked from commit 3a4d05b850449c51a13f3a15fe0d756fdf50b4b2)
2021-02-18 13:59:50 +05:30
Masatake Iwasaki
4468378e4b YARN-10500. TestDelegationTokenRenewer fails intermittently. (#2619)
(cherry picked from commit f9a073c6c186848e09e2ee04118fd996ea8ace59)
2021-02-11 20:26:09 +00:00
bibinchundatt
1520b84b36 YARN-10519. Refactor QueueMetricsForCustomResources class to move to yarn-common package. Contributed by Minni Mittal
(cherry picked from commit 8bc2dfbf36bddd57de3496bece88ddb86529426b)
2021-01-22 08:30:12 +05:30
Neil
cd5ee0014f YARN-10541. capture the performance metrics of ZKRMStateStore (#2568)
(cherry picked from commit fa4cf91b5750327ed67355c35f1ed0f10bb0100d)
2021-01-08 10:38:08 -08:00
Szilard Nemeth
f6b9f82b3f YARN-10528. maxAMShare should only be accepted for leaf queues, not parent queues. Contributed by Siddharth Ahuja 2021-01-08 12:41:17 +01:00
srinivasst
98565b6c60 YARN-10538: Add RECOMMISSIONING nodes to the list of updated nodes returned to the AM (#2564)
Contributed by Srinivas S T

(cherry picked from commit 1b1791075a8a45ff8c7d25d52cb014d0ce858cc2)
2021-01-08 10:57:37 +05:30
Ayush Saxena
8378ab9f92 HADOOP-17288. Use shaded guava from thirdparty. Contributed by Ayush Saxena. #2505 2020-12-10 05:50:55 +05:30
Eric Payne
1fd6d81617 YARN-10278: CapacityScheduler test framework ProportionalCapacityPreemptionPolicyMockFramework. Contributed by Szilard Nemeth (snemeth)
(cherry picked from commit fa773a832652ac6f2a24f93fc2bfc2ae251bfe1d)
2020-12-01 22:51:20 +00:00
Peter Bacsko
0361837c9e YARN-10458. Hive On Tez queries fails upon submission to dynamically created pools. Contributed by Peter Bacsko. 2020-11-10 13:16:52 +01:00
Eric E Payne
2473e8b711 YARN-10475: Scale RM-NM heartbeat interval based on node utilization. Contributed by Jim Brennan (Jim_Brennan). 2020-11-02 17:16:28 +00:00
Jonathan Hung
f95c0824b0 YARN-10467. ContainerIdPBImpl objects can be leaked in RMNodeImpl.completedContainers. Contributed by Haibo Chen
(cherry picked from commit bab5bf9743f54f48cc2f31b4e5c8b6d4e5a5cfb8)
2020-10-28 10:33:41 -07:00
Eric Badger
8abf939152 YARN-10450. Add cpu and memory utilization per node and cluster-wide metrics.
Contributed by Jim Brennan.

(cherry picked from commit 8b8c672780c1505ab7f302c3116a0c5cdd122315)
2020-10-15 18:09:00 +00:00
Jim Brennan
c789e944b7 YARN-10451. RM (v1) UI NodesPage can NPE when yarn.io/gpu resource type is defined. Contributed by Eric Payne
(cherry picked from commit b361f29dda08fa9b4e406d07945d188a914bc437)
2020-10-06 18:19:53 +00:00
Eric Yang
9176e8fe5d YARN-9809. Added node manager health status to resource manager registration call.
Contributed by Eric Badger via eyang

(cherry picked from commit e8dc862d3856e9eaea124c625dade36f1dd53fe2)
2020-09-28 16:41:53 +00:00
Eric E Payne
87ff2f5597 YARN-10390: LeafQueue: retain user limits cache across assignContainers() calls. Contributed by Samir Khan (samkhan).
(cherry picked from commit 9afec2ed1721467aef7f2cd025d713273b12a6ca)
2020-09-11 16:19:09 +00:00
Brahma Reddy Battula
ea37a05d4b YARN-10397. SchedulerRequest should be forwarded to scheduler if custom scheduler supports placement constraints. Contributed by Bilwa S T.
(cherry picked from commit 43572fc7f88429a9804fa5889b82a0bbd5d3d78e)
2020-09-09 17:09:35 +05:30
bibinchundatt
acb219b9c9 YARN-10369. Make NMTokenSecretManagerInRM sending NMToken for nodeId DEBUG. Contributed by Jim Brennan.
(cherry picked from commit 5d8600e80ad7864b332b60d5a01585fdf00848ee)
2020-09-08 20:42:27 +00:00
Adam Antal
5710005a19 YARN-10332. RESOURCE_UPDATE event was repeatedly registered in DECOMMISSIONING state. Contributed by yehuanhuan
(cherry picked from commit 34fe74da0e9c68173e1de196c496b9cfca029618)
2020-09-07 11:46:30 +02:00
Sunil G
ea1c048a61 YARN-10396. Max applications calculation per queue disregards queue level settings in absolute mode. Contributed by Benjamin Teke.
(cherry picked from commit 82ec28f4421c162a505ba5e5b329e4be199878a7)
2020-08-19 11:56:40 +05:30
Jonathan Hung
706d14c41d YARN-10251. Show extended resources on legacy RM UI. Contributed by Eric Payne
(cherry picked from commit 64753addba9e25a13cc3a932ce9a5d40fd4e998f)
2020-08-07 17:43:16 -07:00
bibinchundatt
0c0ea4979e YARN-10388. RMNode updatedCapability flag not set while RecommissionNodeTransition. Contributed by Pranjal Protim Borah
(cherry picked from commit 81da221c757bef9ec35cd190f14b2f872324c661)
2020-08-07 09:01:53 +05:30
Eric Badger
723f1f61d6 YARN-4575. ApplicationResourceUsageReport should return ALL reserved resource.
Contributed by Bibin Chundatt and Eric Payne.

(cherry picked from commit 5edd8b925ef22b83350a21abed6ecc551adb92ee)
2020-08-05 18:56:56 +00:00
Jonathan Hung
d3c7f7cdbd YARN-10343. Legacy RM UI should include labeled metrics for allocated, total, and reserved resources. Contributed by Eric Payne
(cherry picked from commit 3eaf62726ffe90b3b096798fe501abd1ed0c5f15)
2020-07-28 13:43:44 -07:00
Ayush Saxena
e3b8d4eb05 HADOOP-17100. Replace Guava Supplier with Java8+ Supplier in Hadoop. Contributed by Ahmed Hussein. 2020-07-22 18:21:14 +05:30
Eric Badger
41bcef9486 YARN-10348. Allow RM to always cancel tokens after app completes. Contributed by
Jim Brennan

(cherry picked from commit 48f90115b5ecb37f814af281f09bb404361b2bba)
2020-07-13 23:12:18 +00:00
Eric E Payne
7044a007b3 YARN-10297. TestContinuousScheduling#testFairSchedulerContinuousSchedulingInitTime fails intermittently. Contributed by Jim Brennan (Jim_Brennan)
(cherry picked from commit 0427100b7543d412f4fafe631b7ace289662d28c)
2020-07-13 19:02:40 +00:00
Akira Ajisaka
20df70a895
HADOOP-17111. Replace Guava Optional with Java8+ Optional. Contributed by Ahmed Hussein.
(cherry picked from commit 639acb6d8921127cde3174a302f2e3d71b44f052)
2020-07-06 16:09:37 +09:00
Szilard Nemeth
439c51425e YARN-10330. Add missing test scenarios to TestUserGroupMappingPlacementRule and TestAppNameMappingPlacementRule. Contributed by Peter Bacsko 2020-07-01 17:42:45 +02:00
Szilard Nemeth
8b482744e9 YARN-10277. CapacityScheduler test TestUserGroupMappingPlacementRule should build proper hierarchy. Contributed by Szilard Nemeth 2020-06-30 11:32:59 +02:00
Eric E Payne
d7696453a0 YARN-9903: Support reservations continue looking for Node Labels. Contributed by Jim Brennan (Jim_Brennan).
(cherry picked from commit 74fc13cf91818a70f434401244f7560c4db3a676)
2020-06-29 18:59:52 +00:00
Szilard Nemeth
fa41e38450 YARN-10279. Avoid unnecessary QueueMappingEntity creations. Contributed by Marton Hudaky
(cherry picked from commit 6a8fd73b273629d0c7c071cf4d090f67d9b96fe4)
2020-06-25 17:28:48 +02:00
Szilard Nemeth
480919e42d YARN-10316. FS-CS converter: convert maxAppsDefault, maxRunningApps settings. Contributed by Peter Bacsko 2020-06-23 16:25:33 +02:00
Szilard Nemeth
8f1b70e367 YARN-9930. Support max running app logic for CapacityScheduler. Contributed by Peter Bacsko 2020-06-22 12:00:06 +02:00
Szilard Nemeth
ec913398a9 YARN-10281. Redundant QueuePath usage in UserGroupMappingPlacementRule and AppNameMappingPlacementRule. Contributed by Gergely Pollak 2020-06-17 14:36:08 +02:00
Szilard Nemeth
8be302a3b8 YARN-10274. Merge QueueMapping and QueueMappingEntity. Contributed by Gergely Pollak 2020-06-16 18:25:47 +02:00
Szilard Nemeth
52efe48d79 YARN-10292. FS-CS converter: add an option to enable asynchronous scheduling in CapacityScheduler. Contributed by Benjamin Teke 2020-06-16 18:01:39 +02:00
Vinayakumar B
534b15caf9
HADOOP-17046. Support downstreams' existing Hadoop-rpc implementations using non-shaded protobuf classes (#2026) 2020-06-12 23:20:10 +05:30
Eric E Payne
a7526ba9f7 YARN-10300: appMasterHost not set in RM ApplicationSummary when AM fails before first heartbeat. Contributed by Eric Badger (ebadger).
(cherry picked from commit 56247db3022705635580c4d2f8b0abde109f954f)
2020-06-09 18:51:46 +00:00
Szilard Nemeth
a266e32d82 YARN-10286. PendingContainers bugs in the scheduler outputs. Contributed by Andras Gyori 2020-06-05 09:50:43 +02:00
Szilard Nemeth
f65f64e8ae YARN-10254. CapacityScheduler incorrect User Group Mapping after leaf queue change. Contributed by Gergely Pollak 2020-06-02 18:32:06 +02:00