Benjamin Teke
b1fc00d4b2
YARN-11539. Fix leaf-templates in Flexible AQC. ( #5868 )
2023-07-27 09:24:35 +02:00
K0K0V0K
82c8070e93
YARN-9877 - Intermittent TIME_OUT of LogAggregationReport ( #5784 )
2023-07-20 06:30:55 +02:00
Benjamin Teke
193ff1c24e
YARN-11533. CapacityScheduler CapacityConfigType changed in legacy queue allocation mode ( #5852 )
...
Co-authored-by: Benjamin Teke <bteke@cloudera.com>
2023-07-20 06:26:29 +02:00
slfan1989
23ecc32d3a
YARN-11525. [Federation] Router CLI Supports Save the SubClusterPolicyConfiguration Of Queues. ( #5816 )
2023-07-19 11:51:47 -07:00
Susheel Gupta
c13d92996d
YARN-11464: TestFSQueueConverter#testAutoCreateV2FlagsInWeightMode has a missing dot before auto-queue-creation-v2.enabled for method call assertNoValueForQueues ( #5792 )
...
Co-authored-by: Susheel Gupta <susheel.gupta@cloudera.com>
2023-07-11 15:20:51 +02:00
Tamas Domok
a36d8adfd1
YARN-11000. Replace queue resource calculation logic in updateClusterResource ( #5783 )
...
Contributed by: Andras Gyori, Tamas Domok, Bence Kosztolnik and Benjamin Teke
---------
Co-authored-by: Benjamin Teke <benjamin.teke@gmail.com>
2023-07-06 10:18:14 +02:00
susheel-gupta
c82ea52e4c
YARN-11513: Applications submitted to ambiguous queue fail during recovery if "Specified" Placement Rule is used ( #5748 )
2023-06-23 13:09:23 +02:00
K0K0V0K
3f4de919bd
YARN-11511. Improve TestRMWebServices test config and data. ( #5745 )
...
Co-authored-by: Tamas Domok <tdomok@cloudera.com>
2023-06-21 14:15:11 +02:00
Tamas Domok
f4b70b353d
YARN-11429. Improve the updateTestDataAutomatically in TestRMWebServices. ( #5736 )
2023-06-14 14:39:07 +02:00
mudit-97
e69a077af8
YARN-11497 : Support removal of only selective node states in untracked removal flow ( #5681 )
...
Co-authored-by: mudit.sharma <mudit.sharma@flipkart.com>
Reviewed-by: Shilun Fan <slfan1989@apache.org>
Signed-off-by: Shilun Fan <slfan1989@apache.org>
2023-06-05 17:36:10 +08:00
Ayush Saxena
1d0c9ab433
Revert "HADOOP-18207. Introduce hadoop-logging module ( #5503 )"
...
This reverts commit 03a499821c
.
2023-06-05 09:34:40 +05:30
Viraj Jasani
03a499821c
HADOOP-18207. Introduce hadoop-logging module ( #5503 )
...
Reviewed-by: Duo Zhang <zhangduo@apache.org>
2023-06-02 18:07:34 -07:00
slfan1989
86c250a54a
YARN-7720. Race condition between second app attempt and UAM timeout when first attempt node is down. ( #5672 )
2023-05-29 10:37:08 -07:00
Xianming Lei
97afb33c73
YARN-11276. Add LRU cache for RMWebServices.getApps. ( #4793 )
...
Reviewed-by: Shilun Fan <slfan1989@apache.org>
Signed-off-by: Shilun Fan <slfan1989@apache.org>
2023-05-26 20:46:00 +08:00
Tamas Domok
aeb3f6f1a8
YARN-11490. Reverting YARN-11211 and eliminating the use of DefaultMetricsSystem during configuration validation ( #5644 )
2023-05-23 10:36:37 +02:00
Chun Chen
11af08d67a
YARN-11489. Fix memory leak of DelegationTokenRenewer futures in DelegationTokenRenewerPoolTracker. ( #5629 ). Contributed by Chun Chen.
...
Signed-off-by: He Xiaoqiao <hexiaoqiao@apache.org>
2023-05-14 21:38:04 +08:00
cxzl25
be50d221f5
YARN-11467. RM failover may fail when the nodes.exclude-path file does not exist ( #5565 )
2023-05-10 15:16:33 +08:00
slfan1989
d95b5c679d
YARN-11424. [Federation] Router Supports DeregisterSubCluster. ( #5363 )
2023-05-09 16:17:23 -07:00
Szilard Nemeth
c7699d3dcd
YARN-11079. Make an AbstractParentQueue to store common ParentQueue and ManagedParentQueue functionality. Contributed by Susheel Gupta
2023-05-04 22:16:18 -04:00
slfan1989
668c0a0930
YARN-11379. [Federation] Support mapAttributesToNodes, getGroupsForUser API's for Federation. ( #5596 )
2023-05-02 13:46:06 -07:00
Hexiaoqiao
70c0aa342e
YARN-11482. Fix bug of DRF comparision DominantResourceFairnessComparator2 in fair scheduler. ( #5607 ). Contributed by Xiaoqiao He.
...
Reviewed-by: Shilun Fan <slfan1989@apache.org>
2023-04-29 11:18:42 +08:00
Riya Khandelwal
60a7e8acaa
YARN-11459 Changed label called "max resource" on UIv1 and UIv2 ( #5527 )
2023-04-27 15:25:25 +02:00
slfan1989
a258f1f235
YARN-11326. [Federation] Add RM FederationStateStoreService Metrics. ( #4963 )
2023-04-18 09:13:08 -07:00
slfan1989
06f9bdffa6
YARN-10846. Add dispatcher metrics to NM. ( #4687 )
2023-04-12 09:53:20 -07:00
slfan1989
aa602381c5
YARN-11426. Improve YARN NodeLabel Memory Display. ( #5335 )
...
YARN-11426. Improve YARN NodeLabel Memory Display.
Co-authored-by: slfan1989 <louj1988@@>
Reviewed-by: Inigo Goiri <inigoiri@apache.org>
Reviewed-by: Chris Nauroth <cnauroth@apache.org>
Signed-off-by: Shilun Fan <slfan1989@apache.org>
2023-03-28 22:48:46 +08:00
Tamas Domok
69748aae32
YARN-11461. fix NPE in determineMissingParents (auto queue creation / CS). ( #5506 )
...
Change-Id: Iaaaf43a545588eaff8a0a20f6f3c27258a45f390
2023-03-24 09:38:53 +01:00
slfan1989
fa723ae839
YARN-11445. [Federation] Add getClusterInfo, getClusterUserInfo REST APIs for Router. ( #5472 )
2023-03-17 11:59:45 -07:00
Varun Saxena
2a0dc2ab2f
YARN-11383. Workflow priority mappings is case sensitive ( #5171 )
...
Contributed by Aparajita Choudhary
2023-03-05 21:25:16 +05:30
slfan1989
bcc51ce2c5
YARN-11375. [Federation] Support refreshAdminAcls、refreshServiceAcls API's for Federation. ( #5312 )
2023-02-28 14:44:00 -08:00
slfan1989
8798b94ee1
YARN-11221. [Federation] Add replaceLabelsOnNodes, replaceLabelsOnNode REST APIs for Router. ( #5302 )
2023-02-27 09:34:39 -08:00
slfan1989
27a54955f9
YARN-5604. [Federation] Add versioning for FederationStateStore. ( #5394 )
2023-02-24 10:51:19 -08:00
susheel-gupta
49b8ac19f2
YARN-11408. Add a check of autoQueueCreation is disabled for emitDefaultUserLimitFactor method ( #5278 )
...
* added a check of autoQueueCreationV2Disabled
Change-Id: If1e36c5969d270c1b81a4bbd2e883fa819c81f20
* added check of AutoCreateChildQueueDisabled
Change-Id: Ia011b4393ccd8d4d419a2e46b06a5237d050851c
* removed auto-create-child-queue-enabled check and emit
Change-Id: I7a154124519ecbd81379b46a238707c16db1e82a
2023-02-22 09:46:42 +01:00
slfan1989
4e6e2f318c
YARN-11394. Fix hadoop-yarn-server-resourcemanager module Java Doc Errors. ( #5288 )
...
Contributed by Shilun Fan
2023-02-21 14:39:32 +00:00
Viraj Jasani
90de1ff151
HADOOP-18206 Cleanup the commons-logging references and restrict its usage in future ( #5315 )
2023-02-14 03:24:06 +08:00
slfan1989
468135a4d9
YARN-11218. [Federation] Add getActivities, getBulkActivities REST APIs for Router. ( #5284 )
2023-01-26 11:14:05 -08:00
Szilard Nemeth
cf1b3711cb
YARN-10965. Centralize queue resource calculation based on CapacityVectors. Contributed by Andras Gyori
2023-01-26 19:45:54 +01:00
Szilard Nemeth
815cde9810
YARN-6971. Clean up different ways to create resources. Contributed by Riya Khandelwal
2023-01-25 17:28:29 +01:00
Szilard Nemeth
29f2230cb6
YARN-5607. Document TestContainerResourceUsage#waitForContainerCompletion. Contributed by Susheel Gupta
2023-01-25 15:13:24 +01:00
slfan1989
442a5fb285
YARN-11320. [Federation] Add getSchedulerInfo REST APIs for Router. ( #5217 )
2023-01-17 09:36:19 -08:00
slfan1989
168fa07801
YARN-11409. Fix Typo of ResourceManager#webapp moudle. ( #5285 )
2023-01-13 11:53:13 +08:00
Szilard Nemeth
7f6cc196f8
YARN-11190. CS Mapping rule bug: User matcher does not work correctly for usernames with dot ( #4471 )
2023-01-11 13:23:04 +01:00
susheel-gupta
c44c9f984b
YARN-11393. Fs2cs could be extended to set ULF to -1 upon conversion ( #5201 )
2023-01-02 15:35:16 +01:00
Chris Nauroth
6b67373d10
YARN-11388: Prevent resource leaks in TestClientRMService. ( #5187 )
...
Signed-off-by: Shilun Fan <slfan1989@apache.org>
2022-12-28 11:00:27 -08:00
curie71
9668a85d40
YARN-11392 Audit Log missing in ClientRMService ( #5250 ). Contributed by Beibei Zhao.
...
Signed-off-by: Chris Nauroth <cnauroth@apache.org>
2022-12-27 15:58:53 -08:00
Bence Kosztolnik
bf8ab83cd0
YARN-11395. RM UI, RMAttemptBlock can not render FINAL_SAVING. Contributed by Bence Kosztolnik
...
- In the YARN-1345 remove of FINAL_SAVING was missed from RMAttemptBlock
- Same issue was present after YARN-1345 in YARN-4411
- YARN-4411 logic was applied in this commit for FINAL_SAVING
2022-12-23 17:15:36 +01:00
susheel-gupta
e6056d128a
YARN-10879. Incorrect WARN text in ACL check for application tag based placement ( #5231 )
...
Change-Id: Id892e38fe4c834b1743a0df2f0a40146d3d5a878
2022-12-22 17:20:53 +01:00
slfan1989
63b9a6a2b6
YARN-11350. [Federation] Router Support DelegationToken With ZK. ( #5131 )
2022-12-14 09:09:38 -08:00
K0K0V0K
ee7d1787cd
YARN-11390. TestResourceTrackerService.testNodeRemovalNormally: Shutdown nodes should be 0 now expected: <1> but was: <0> ( #5190 )
...
Reviewed-by: Peter Szucs
Signed-off-by: Chris Nauroth <cnauroth@apache.org>
2022-12-08 09:52:19 -08:00
Szilard Nemeth
5440c75c4a
YARN-10946. AbstractCSQueue: Create separate class for constructing Queue API objects. Contributed by Peter Szucs
2022-12-01 15:11:58 +01:00
Szilard Nemeth
22c9f28f4d
YARN-10005. Code improvements in MutableCSConfigurationProvider. Contributed by Peter Szucs
2022-11-12 18:35:49 +01:00
slfan1989
5d6ab15860
YARN-11354. [Federation] Add Yarn Router's NodeLabel Web Page. ( #5073 )
2022-11-04 14:39:57 -07:00
slfan1989
b1cd88c598
YARN-11229. [Federation] Add checkUserAccessToQueue REST APIs for Router. ( #4929 )
2022-10-28 15:37:35 -07:00
Chris Nauroth
bfb84cd7f6
YARN-11360: Add number of decommissioning/shutdown nodes to YARN cluster metrics. ( #5060 )
2022-10-28 11:07:01 -07:00
slfan1989
9adf0ca089
YARN-11342. [Federation] Refactor getNewApplication, submitApplication Use FederationActionRetry. ( #5005 )
2022-10-20 09:22:24 -07:00
slfan1989
1962851356
YARN-11294. [Federation] Router Support DelegationToken store/update/remove Token With MemoryStateStore. ( #4915 )
2022-10-13 16:52:22 -07:00
slfan1989
647457e6ab
YARN-11327. [Federation] Refactoring Yarn Router's Node Web Page. ( #5009 )
2022-10-13 14:05:30 -07:00
PJ Fanning
bfce21ee08
YARN-11330. use secure XML parsers ( #4981 )
...
Move construction of XML parsers in YARN
modules to using the locked-down parser factory
of HADOOP-18469.
One exception: GpuDeviceInformationParser still supports DTD resolution;
all other features are disabled.
Contributed by P J Fanning
2022-10-13 18:19:19 +01:00
slfan1989
d78b0b39a6
YARN-11323. [Federation] Improve ResourceManager Handler FinishApps. ( #4954 )
2022-10-11 14:53:02 -07:00
slfan1989
9e16f1f883
YARN-11317. [Federation] Refactoring Yarn Router's About Web Page. ( #4946 )
2022-10-11 13:30:48 -07:00
Szilard Nemeth
0c515b0ef0
YARN-6766. Add helper method in FairSchedulerAppsBlock to print app info. Contributed by Riya Khandelwal
2022-10-10 15:30:33 +02:00
PJ Fanning
5eddec8c46
HADOOP-18468: Upgrade jettison to 1.5.1 to fix CVE-2022-40149 ( #4937 )
...
Contributed by PJ Fanning
2022-10-07 15:44:01 +01:00
slfan1989
342c4856b8
YARN-11293. [Federation] StoreNewMasterKey/removeStoredMasterKey With MemoryStateStore. ( #4852 )
2022-09-19 13:14:55 -07:00
9uapaw
5b85af87f0
YARN-11278. Fixed Ambiguous error message in mutation API. Contributed by Ashutosh Gupta.
2022-09-09 14:38:41 +02:00
slfan1989
37e213c3fc
YARN-11177. Support getNewReservation, submit / update/ Reservation API's for Federation. ( #4764 )
2022-09-01 16:35:20 -07:00
slfan1989
4031b0774e
YARN-11253. Add Configuration to delegationToken RemoverScanInterval. ( #4751 )
2022-08-27 10:02:59 -07:00
slfan1989
f8b9dd911c
YARN-11219. [Federation] Add getAppActivities, getAppStatistics REST APIs for Router. ( #4757 )
2022-08-26 16:01:17 -07:00
Ashutosh Gupta
c294a414b9
YARN-9425. Make initialDelay configurable for FederationStateStoreService#scheduledExecutorService ( #4731 ). Contributed by groot and Shen Yinjie.
...
Signed-off-by: Ayush Saxena <ayushsaxena@apache.org>
2022-08-22 03:40:00 +05:30
slfan1989
f75c58a1ca
YARN-11252. Yarn Federation Router Supports Update / Delete Reservation in MemoryStore. ( #4741 )
2022-08-18 21:13:43 -07:00
Ashutosh Gupta
f02ff1afe2
YARN-11248. Add unit test for FINISHED_CONTAINERS_PULLED_BY_AM event on DECOMMISSIONING ( #4721 )
...
Co-authored-by: Ashutosh Gupta <ashugpt@amazon.com>
Signed-off-by: Akira Ajisaka <aajisaka@apache.org>
2022-08-16 19:07:05 +09:00
slfan1989
d383cc4525
YARN-11236. Implement FederationReservationHomeSubClusterStore With MemoryStore. ( #4711 )
2022-08-13 10:37:20 -07:00
slfan1989
133e8aabf0
YARN-11227. [Federation] Add getAppTimeout, getAppTimeouts, updateApplicationTimeout REST APIs for Router. ( #4715 )
2022-08-10 14:53:46 -07:00
slfan1989
c5eba323bc
YARN-6972. Adding RM ClusterId in AppInfo. ( #4673 )
2022-08-03 09:35:40 -07:00
slfan1989
1f0a71a92b
YARN-8973. [Router] Add missing methods in RMWebProtocol. ( #4664 )
2022-08-02 14:07:09 -07:00
slfan1989
57da4bb0a1
YARN-11220. [Federation] Add getLabelsToNodes, getClusterNodeLabels, getLabelsOnNode REST APIs for Router ( #4657 )
2022-08-02 12:09:55 -07:00
slfan1989
e994635a95
YARN-11212. [Federation] Add getNodeToLabels REST APIs for Router. ( #4614 )
2022-07-28 11:53:04 -07:00
9uapaw
bf570bd4ac
YARN-11063. Support auto queue creation template wildcards for arbitrary queue depths. Contributed by Bence Kosztolnik.
2022-07-28 17:32:20 +02:00
Szilard Nemeth
f4b635c4dc
YARN-11211. QueueMetrics leaks Configuration objects when validation API is called multiple times. Contributed by Andras Gyori
2022-07-21 14:20:34 +02:00
PJ Fanning
34e548cb62
HADOOP-18332: remove rs-api dependency as it conflicts with jsr311-api ( #4547 )
...
This downgrades jackson from the version switched to in
HADOOP-18033 (2.13.0), to Jackson 2.12.7.
This removes the dependency on javax.ws.rs-api,
so avoiding runtime problems with applications using
jersey-core v1 and/or jsr311-api.
The 2.12.7 release still contains the fix for CVE-2020-36518.
Contributed by PJ Fanning
2022-07-17 21:37:54 +05:30
Ashutosh Gupta
57cbde9abf
YARN-10287.Update scheduler-conf corrupts the CS configuration when removing queue which is referred in queue mapping ( #4515 )
...
* YARN-10287.Update scheduler-conf corrupts the CS configuration when removing queue which is referred in queue mapping
Co-authored-by: Ashutosh Gupta <ashugpt@amazon.com>
2022-07-02 21:58:56 +05:30
Tamas Domok
3cad632709
YARN-11202. Optimize ClientRMService.getApplications. Contributed by Tamas Domok.
...
Change-Id: I55ddb46fd0e4cdb644747d6d43083215f10861b5
2022-07-01 10:50:48 +02:00
slfan1989
073b8ea1d5
HADOOP-18284. Remove Unnecessary semicolon ';' ( #4422 ). Contributed by fanshilun.
2022-06-29 15:20:41 +05:30
Ashutosh Gupta
734b6f19ad
YARN-9874.Remove unnecessary LevelDb write call in LeveldbConfigurationStore#confirmMutation ( #4487 )
...
Co-authored-by: Ashutosh Gupta <ashugpt@amazon.com>
2022-06-23 21:59:27 +05:30
Szilard Nemeth
3a66348fda
YARN-11185. Pending app metrics are increased doubly when a queue reaches its max-parallel-apps limit. Contributed by Andras Gyori
2022-06-20 15:03:58 +02:00
zhengchenyu
80446dcd08
YARN-11172. Fix TestClientRMTokens#testDelegationToken introduced by HDFS-16563. ( #4408 )
...
Regression caused by HDFS-16563; the hdfs exception text was changed, but because it was
a YARN test doing the check, Yetus didn't notice.
Contributed by zhengchenyu
2022-06-17 19:49:36 +01:00
Benjamin Teke
020201cb65
Queue filter in CS UI v1 does not work as expected. Contributed by Chengbing Liu.
2022-06-17 19:28:32 +02:00
Ashutosh Gupta
7c66266b4e
YARN-11142.Remove unused Imports in Hadoop YARN project ( #4381 ). Contributed by Ashutosh Gupta.
...
Signed-off-by: Ayush Saxena <ayushsaxena@apache.org>
2022-05-31 23:17:11 +05:30
Ashutosh Gupta
e3e9369c1d
YARN-11128. Fix comments in TestProportionalCapacityPreemptionPolicy* ( #4271 )
...
Co-authored-by: Ashutosh Gupta <ashugpt@amazon.com>
Reviewed-by: Hemanth Boyina <hemanthboyina@apache.org>
Signed-off-by: Akira Ajisaka <aajisaka@apache.org>
2022-05-26 15:51:55 +09:00
Owen O'Malley
f390edaec4
YARN-11162. Set the zk acl for nodes created by ZKConfigurationStore. ( #4350 )
2022-05-24 14:07:19 +09:00
9uapaw
0e6a6d1880
YARN-11152. QueueMetrics is leaking memory when creating a new queue during reinitialisation
2022-05-18 18:22:42 +02:00
9uapaw
54cd0174c0
YARN-11147. ResourceUsage and QueueCapacities classes provide node label iterators that are not thread safe
2022-05-18 18:16:34 +02:00
Jian Chen
d2c9eb653a
YARN-11073. Avoid unnecessary preemption for tiny queues under certain corner cases ( #4110 )
...
Co-authored-by: Jian Chen <jian.chen@airbnb.com>
Signed-off-by: Akira Ajisaka <aajisaka@apache.org>
2022-05-14 01:11:42 +09:00
Szilard Nemeth
eced5bec68
YARN-11123. ResourceManager webapps test failures due to org.apache.hadoop.metrics2.MetricsException and subsequent java.net.BindException: Address already in use. Contributed by Szilard Nemeth
2022-05-13 13:16:12 +02:00
Szilard Nemeth
5e2f4339fa
YARN-11126. ZKConfigurationStore Java deserialisation vulnerability. Contributed by Tamas Domok
2022-05-12 13:42:06 +02:00
Szilard Nemeth
9af3eabdca
YARN-11114. RMWebServices returns only apps matching exactly the submitted queue name. Contributed by Szilard Nemeth
2022-05-11 18:01:31 +02:00
Szilard Nemeth
b62d6ce6fd
YARN-11141. Capacity Scheduler does not support ambiguous queue names when moving application across queues. Contributed by Andras Gyori
2022-05-11 14:29:05 +02:00
PJ Fanning
63187083cc
HADOOP-15983. Use jersey-json that is built to use jackson2 ( #3988 )
...
Signed-off-by: Akira Ajisaka <aajisaka@apache.org>
2022-04-28 14:18:19 +09:00
章锡平
d5cba5c3d1
YARN-11107. Addendum. When NodeLabel is enabled for a YARN cluster, AM blacklist program does not work properly. ( #4175 )
2022-04-15 17:50:49 +09:00
章锡平
3b46aae977
YARN-11107. When NodeLabel is enabled for a YARN cluster, AM blacklist program does not work properly. Contributed by zhangxiping1
2022-04-13 15:57:18 +02:00
Junfan Zhang
e044a46f97
YARN-11088. Introduce the config to control the AM allocated to non-exclusive nodes. Contributed by Junfan Zhang
2022-03-31 12:39:56 +02:00
PJ Fanning
61e809b245
HADOOP-13386. Upgrade Avro to 1.9.2 ( #3990 )
...
Signed-off-by: Akira Ajisaka <aajisaka@apache.org>
2022-03-26 20:31:16 +09:00