Commit Graph

2321 Commits

Author SHA1 Message Date
Sunil G
5fb14e0635 YARN-8953. [CSI] CSI driver adaptor module support in NodeManager. Contributed by Weiwei Yang. 2018-11-20 08:02:11 +05:30
Weiwei Yang
d027a24f03 YARN-8833. Avoid potential integer overflow when computing fair shares. Contributed by liyakun. 2018-11-18 23:18:26 +08:00
Weiwei Yang
c741109522 YARN-8987. Usability improvements node-attributes CLI. Contributed by Bibin A Chundatt. 2018-11-12 18:18:23 +08:00
Sunil G
4e7284443e YARN-8902. [CSI] Add volume manager that manages CSI volume lifecycle. Contributed by Weiwei Yang. 2018-11-12 11:57:02 +05:30
Haibo Chen
524a7523c4 YARN-8990. Fix fair scheduler race condition in app submit and queue cleanup. (Contributed by Wilfred Spiegelenburg) 2018-11-08 16:02:48 -08:00
Weiwei Yang
e1bbf7dcdf YARN-8988. Reduce the verbose log on RM heartbeat path when distributed node-attributes is enabled. Contributed by Tao Yang. 2018-11-08 17:47:18 +08:00
Weiwei Yang
c96cbe8659 YARN-8977. Remove unnecessary type casting when calling AbstractYarnScheduler#getSchedulerNode. Contributed by Wanqiang Ji. 2018-11-07 22:42:22 +08:00
Akira Ajisaka
951c98f890 YARN-8233. NPE in CapacityScheduler#tryCommit when handling allocate/reserve proposal whose allocatedOrReservedContainer is null. Contributed by Tao Yang. 2018-11-07 11:17:35 +09:00
Jason Lowe
ab6aa4c726 YARN-8865. RMStateStore contains large number of expired RMDelegationToken. Contributed by Wilfred Spiegelenburg 2018-11-06 08:40:59 -06:00
Weiwei Yang
5d6554c722 YARN-8970. Improve the debug message in CS#allocateContainerOnSingleNode. Contributed by Zhankun Tang. 2018-11-06 14:50:09 +08:00
Weiwei Yang
c7fcca0d7e YARN-8969. AbstractYarnScheduler#getNodeTracker should return generic type to avoid type casting. Contributed by Wanqiang Ji. 2018-11-06 13:14:57 +08:00
Giovanni Matteo Fumarola
babc946d40 YARN-8954. Reservations list field in ReservationListInfo is not accessible. Contributed by Oleksandr Shevchenko. 2018-11-02 11:10:08 -07:00
Jonathan Hung
2ab611d48b YARN-7225. Add queue and partition info to RM audit log. Contributed by Eric Payne 2018-11-01 14:22:00 -07:00
Weiwei Yang
1d90a0dd23 YARN-8944. TestContainerAllocation.testUserLimitAllocationMultipleContainers failure after YARN-8896. Contributed by Wilfred Spiegelenburg. 2018-10-29 11:53:10 +08:00
Jason Lowe
93fb3b4b9c YARN-8904. TestRMDelegationTokens can fail in testRMDTMasterKeyStateOnRollingMasterKey. Contributed by Wilfred Spiegelenburg 2018-10-23 12:49:15 -05:00
Haibo Chen
285d2c0753 YARN-8449. RM HA for AM web server HTTPS Support. (Contributed by Robert Kanter) 2018-10-18 21:24:36 -07:00
Weiwei Yang
13cc0f50ea YARN-8907. Fix incorrect logging message in TestCapacityScheduler. Contributed by Zhankun Tang. 2018-10-19 09:55:59 +08:00
Wangda Tan
780be14f07 YARN-8896. Limit the maximum number of container assignments per heartbeat. (Zhankun Tang via wangda)
Change-Id: I6e72f8362bd7f5c2a844cb9e3c4732492314e9f1
2018-10-18 10:58:21 -07:00
Sunil G
5085e5fa9e YARN-8759. Copy of resource-types.xml is not deleted if test fails, causes other test failures. Contributed by Antal Bálint Steinbach. 2018-10-17 16:05:08 +05:30
Haibo Chen
84e22a6af4 YARN-8842. Expose metrics for custom resource types in QueueMetrics. (Contributed by Szilard Nemeth) 2018-10-16 14:14:30 -07:00
Haibo Chen
c2288ac45b YARN-8448. AM HTTPS Support for AM communication with RMWeb proxy. (Contributed by Robert Kanter) 2018-10-16 13:36:26 -07:00
Weiwei Yang
edce866489 YARN-8858. CapacityScheduler should respect maximum node resource when per-queue maximum-allocation is being used. Contributed by Wangda Tan. 2018-10-10 09:32:27 +08:00
Haibo Chen
7c13872cbb YARN-8659. RMWebServices returns only RUNNING apps when filtered with queue. (Contributed by Szilard Nemeth) 2018-10-08 09:40:37 -07:00
Jason Lowe
9bb2801e8c YARN-4254. ApplicationAttempt stuck for ever due to UnknownHostException. Contributed by Bibin A Chundatt 2018-10-05 15:52:46 -05:00
Haibo Chen
c968365650 YARN-8644. Improve unit test for RMAppImpl.FinalTransition. (Contributed by Szilard Nemeth) 2018-10-05 09:32:30 -07:00
Haibo Chen
e60b797c88 YARN-8750. Refactor TestQueueMetrics. (Contributed by Szilard Nemeth) 2018-10-04 13:00:31 -07:00
Haibo Chen
b6d5d84e07 YARN-8732. Add unit tests of min/max allocation for custom resource types in FairScheduler. (Contributed by Szilard Nemeth) 2018-10-04 12:47:31 -07:00
Haibo Chen
d0ee6fbe28 YARN-8621. Add test coverage of custom Resource Types for the apps/<appId> REST API endpoint. (Contributed by Szilard Nemeth) 2018-10-01 14:46:42 -07:00
Weiwei Yang
fd6be5898a YARN-8468. Enable the use of queue based maximum container allocation limit and implement it in FairScheduler. Contributed by Antal Bálint Steinbach. 2018-09-29 17:47:12 +08:00
Eric E Payne
8598b498bc YARN-8774. Memory leak when CapacityScheduler allocates from reserved container with non-default label. Contributed by Tao Yang. 2018-09-28 15:32:07 +00:00
Jason Lowe
6b988d821e YARN-8804. resourceLimits may be wrongly calculated when leaf-queue is blocked in cluster with 3+ level queues. Contributed by Tao Yang 2018-09-26 14:43:00 -07:00
Rohith Sharma K S
e5287a4fe0 YARN-8824. App Nodelabel missed after RM restart for finished apps. Contributed by Bibin A Chundatt. 2018-09-26 12:30:26 +05:30
Akira Ajisaka
44edcdfd6a YARN-8745. Misplaced the TestRMWebServicesFairScheduler.java file. Contributed by Y. SREENIVASULU REDDY. 2018-09-26 10:09:11 +09:00
Rohith Sharma K S
50bc7746d7 YARN-8815. RM fails to recover finished unmanaged AM. Contributed by Bibin A Chundatt. 2018-09-25 11:31:14 +05:30
Haibo Chen
29dad7d258 YARN-8616. systemClock should be used in RMAppImpl instead of System.currentTimeMills(), to be consistent. (Contributed by Szilard Nemeth) 2018-09-24 16:04:28 -07:00
Giovanni Matteo Fumarola
3090922805 YARN-8696. [AMRMProxy] FederationInterceptor upgrade: home sub-cluster heartbeat async. Contributed by Botong Huang. 2018-09-24 11:37:05 -07:00
Weiwei Yang
0712537e79 YARN-8771. CapacityScheduler fails to unreserve when cluster resource contains empty resource type. Contributed by Tao Yang. 2018-09-19 19:31:07 +08:00
Weiwei Yang
f1a893fdbc YARN-8720. CapacityScheduler does not enforce max resource allocation check at queue level. Contributed by Tarun Parimi. 2018-09-14 16:33:51 +08:00
Shane Kumpf
8e9afbfb66 YARN-8768. Javadoc error in node attributes. Contributed by Sunil Govindan. 2018-09-12 15:12:28 -06:00
Sunil G
5e64e62dee YARN-8740. Clear node attribute path after each test run. Contributed by Bibin A Chundatt. 2018-09-12 16:01:01 +05:30
bibinchundatt
c44088ac19 YARN-8739. Fix jenkins issues for Node Attributes branch. Contributed by Sunil Govindan. 2018-09-12 16:01:01 +05:30
Weiwei Yang
52194351e7 YARN-8721. Relax NE node-attribute check when attribute doesn't exist on a node. Contributed by Sunil Govindan. 2018-09-12 16:01:01 +05:30
Naganarasimha
67ae81f0e0 YARN-7863. Modify placement constraints to support node attributes. Contributed by Sunil Govindan. 2018-09-12 16:01:01 +05:30
Naganarasimha
eb08543c7a YARN-8103. Add CLI interface to query node attributes. Contributed by Bibin A Chundatt. 2018-09-12 16:01:01 +05:30
Sunil G
76183428b7 YARN-8351. Node attribute manager logs are flooding RM logs. Contributed by Weiwei Yang. 2018-09-12 16:01:00 +05:30
bibinchundatt
8cf6a9a2bd YARN-7892. Revisit NodeAttribute class structure. Contributed by Naganarasimha G R. 2018-09-12 16:01:00 +05:30
Naganarasimha
5dc7d6e0f3 YARN-8104. Add API to fetch node to attribute mapping. Contributed by Bibin A Chundatt. 2018-09-12 16:01:00 +05:30
Naganarasimha
0a01b1350d YARN-8100. Support API interface to query cluster attributes and attribute to nodes. Contributed by Bibin A Chundatt. 2018-09-12 16:01:00 +05:30
Sunil G
b9890d1f66 YARN-7875. Node Attribute store for storing and recovering attributes. Contributed by Bibin A Chundatt. 2018-09-12 16:01:00 +05:30
bibinchundatt
a6590c1f1f YARN-8117. Fix TestRMWebServicesNodes test failure. Contributed by Bibin A Chundatt. 2018-09-12 16:01:00 +05:30
bibinchundatt
901e85238d YARN-8033. CLI Integration with NodeAttributesManagerImpl. Contributed by Naganarasimha G R. 2018-09-12 16:01:00 +05:30
Sunil G
89b3ebd11e YARN-8092. Expose Node Attributes info via RM nodes REST API. Contributed by Weiwei Yang. 2018-09-12 16:01:00 +05:30
Sunil G
6f4bc49c6d YARN-7988. Refactor FSNodeLabelStore code for Node Attributes store support. Contributed by Bibin A Chundatt. 2018-09-12 16:01:00 +05:30
Naganarasimha
3b3b6efe21 YARN-7871. Node attributes reporting from NM to RM. Contributed by Weiwei Yang. 2018-09-12 16:01:00 +05:30
Naganarasimha
86d024ef2a YARN-7965. NodeAttributeManager add/get API is not working properly. Contributed by Weiwei Yang. 2018-09-12 16:01:00 +05:30
Sunil G
2f7712be09 YARN-6858. Attribute Manager to store and provide node attributes in RM. Contributed by Naganarasimha G R. 2018-09-12 16:01:00 +05:30
Naganarasimha
1f42ce907a YARN-6855. [YARN-3409] CLI Proto Modifications to support Node Attributes. Contributed by Naganarasimha G R. 2018-09-12 16:00:59 +05:30
Eric E Payne
987d8191ad YARN-8709: CS preemption monitor always fails since one under-served queue was deleted. Contributed by Tao Yang. 2018-09-10 19:55:20 +00:00
Giovanni Matteo Fumarola
9af96d4ed4 HADOOP-15707. Add IsActiveServlet to be used for Load Balancers. Contributed by Lukas Majercak. 2018-09-05 10:50:25 -07:00
Weiwei Yang
3fa4639421 YARN-8723. Fix a typo in CS init error message when resource calculator is not correctly set. Contributed by Abhishek Modi. 2018-08-29 11:13:44 +08:00
Sunil G
1ac01444a2 YARN-8015. Support all types of placement constraint support for Capacity Scheduler. Contributed by Weiwei Yang. 2018-08-23 10:05:43 +05:30
Weiwei Yang
9c3fc3ef28 YARN-7494. Add muti-node lookup mechanism and pluggable nodes sorting policies to optimize placement decision. Contributed by Sunil Govindan. 2018-08-21 22:42:28 +08:00
Weiwei Yang
54d0bf8935 YARN-8683. Support to display pending scheduling requests in RM app attempt page. Contributed by Tao Yang. 2018-08-21 19:00:31 +08:00
Giovanni Matteo Fumarola
8736fc39ac YARN-8673. [AMRMProxy] More robust responseId resync after an YarnRM master slave switch. Contributed by Botong Huang. 2018-08-20 12:22:36 -07:00
Rohith Sharma K S
4aacbfff60 YARN-8679. [ATSv2] If HBase cluster is down for long time, high chances that NM ContainerManager dispatcher get blocked. Contributed by Wangda Tan. 2018-08-18 10:26:55 +05:30
Wei-Chiu Chuang
bdd0e01ed9 HADOOP-14212. Addendum patch: Expose SecurityEnabled boolean field in JMX for other services besides NameNode. Contributed by Adam Antal. 2018-08-14 18:25:38 -07:00
Wei-Chiu Chuang
363bd16e31 HADOOP-14212. Expose SecurityEnabled boolean field in JMX for other services besides NameNode. Contributed by Adam Antal. 2018-08-14 17:19:00 -07:00
Weiwei Yang
0a71bf1452 YARN-8575. Avoid committing allocation proposal to unavailable nodes in async scheduling. Contributed by Tao Yang. 2018-08-10 14:37:45 +08:00
Weiwei Yang
08d5060605 YARN-8521. NPE in AllocationTagsManager when a container is removed more than once. Contributed by Weiwei Yang. 2018-08-10 08:32:02 +08:00
Robert Kanter
b2517dd66b YARN-4946. RM should not consider an application as COMPLETED when log aggregation is not in a terminal state (snemeth via rkanter) 2018-08-09 14:58:04 -07:00
Wangda Tan
344c335a92 YARN-8588. Logging improvements for better debuggability. (Suma Shivaprasad via wangda)
Change-Id: I66aa4b0ec031ae5ce0fae558e2f8cbcbbfebc442
2018-08-09 11:04:02 -07:00
Weiwei Yang
d352f167eb YARN-8559. Expose mutable-conf scheduler's configuration in RM /scheduler-conf endpoint. Contributed by Weiwei Yang. 2018-08-09 23:46:53 +08:00
Haibo Chen
889df6f194 YARN-7948. Enable fair scheduler to refresh maximum allocation for multiple resource types. (Szilard Nemeth via Haibo Chen) 2018-08-02 10:10:12 -07:00
Sunil G
12a095a496 YARN-7159. Normalize unit of resource objects in RM to avoid unit conversion in critical path. Contributed by Manikandan R. 2018-08-02 22:29:21 +05:30
Sunil G
735b492556 YARN-8593. Add RM web service endpoint to get user information. Contributed by Akhil PB. 2018-08-02 08:34:09 +05:30
Sunil G
a48a0cc7fd YARN-8606. Opportunistic scheduling does not work post RM failover. Contributed by Bibin A Chundatt. 2018-08-01 12:17:18 +05:30
Sunil G
6310c0d17d YARN-8397. Potential thread leak in ActivitiesManager. Contributed by Rohith Sharma K S. 2018-08-01 08:33:30 +05:30
Eric Yang
c7ebcd76bf YARN-8579. Recover NMToken of previous attempted component data.
Contributed by Gour Saha
2018-07-31 18:01:02 -04:00
Haibo Chen
8aa93a575e YARN-8605. TestDominantResourceFairnessPolicy.testModWhileSorting is flaky. (Wilfred Spiegelenburg via Haibo Chen) 2018-07-31 11:32:40 -07:00
Jonathan Hung
3e06a5dcea YARN-7974. Allow updating application tracking url after registration. Contributed by Jonathan Hung 2018-07-30 17:44:18 -07:00
Robert Kanter
2cccf4061c YARN-8517. getContainer and getContainers ResourceManager REST API methods are not documented (snemeth via rkanter) 2018-07-27 14:35:03 -07:00
Eric Yang
f93ecf5c1e YARN-8330. Improved publishing ALLOCATED events to ATS.
Contributed by Suma Shivaprasad
2018-07-25 18:49:30 -04:00
Eric E Payne
9485c9aee6 YARN-4606. CapacityScheduler: applications could get starved because computation of #activeUsers considers pending apps. Contributed by Manikandan R 2018-07-25 16:22:04 +00:00
Weiwei Yang
5be9f4a5d0 YARN-8546. Resource leak caused by a reserved container being released more than once under async scheduling. Contributed by Tao Yang. 2018-07-25 17:35:27 +08:00
Haibo Chen
ea2c6c8c9a YARN-7133. Clean up lock-try order in fair scheduler. (Szilard Nemeth via Haibo Chen) 2018-07-24 12:46:59 -07:00
bibinchundatt
e673dd1d4d YARN-8541. RM startup failure on recovery after user deletion. Contributed by Bibin A Chundatt. 2018-07-24 23:26:59 +05:30
Sunil G
35ce6eb1f5 YARN-7748. TestContainerResizing.testIncreaseContainerUnreservedWhenApplicationCompleted fails due to multiple container fail events. Contributed by Weiwei Yang. 2018-07-24 22:20:17 +05:30
bibinchundatt
ff7c2eda34 YARN-8548. AllocationRespose proto setNMToken initBuilder not done. Contributed by Bilwa S T. 2018-07-24 16:17:20 +05:30
bibinchundatt
8461278833 YARN-8544. [DS] AM registration fails when hadoop authorization is enabled. Contributed by Bibin A Chundatt. 2018-07-24 13:09:17 +05:30
Haibo Chen
8a6bb8409c YARN-6964. Fair scheduler misuses Resources operations. (Daniel Templeton and Szilard Nemeth via Haibo Chen) 2018-07-20 10:46:28 -07:00
Weiwei Yang
cbf2026483 YARN-8528. Final states in ContainerAllocation might be modified externally causing unexpected allocation results. Contributed by Xintong Song. 2018-07-20 22:34:06 +08:00
Haibo Chen
2564884757 YARN-8436. FSParentQueue: Comparison method violates its general contract. (Wilfred Spiegelenburg via Haibo Chen) 2018-07-19 13:22:31 -07:00
Eric Yang
5836e0a46b YARN-8501. Reduce complexity of RMWebServices getApps method.
Contributed by Szilard Nemeth
2018-07-19 12:30:38 -04:00
Wangda Tan
238ffff999 YARN-8524. Single parameter Resource / LightWeightResource constructor looks confusing. (Szilard Nemeth via wangda)
Change-Id: I4ae97548b5b8d76a6bcebb2d3d70bf8e0be3c125
2018-07-16 10:58:00 -07:00
Wangda Tan
a2e49f41a8 YARN-8361. Change App Name Placement Rule to use App Name instead of App Id for configuration. (Zian Chen via wangda)
Change-Id: I17e5021f8f611a9c5e3bd4b38f25e08585afc6b1
2018-07-16 10:57:37 -07:00
Wangda Tan
752dcce5f4 YARN-8511. When AM releases a container, RM removes allocation tags before it is released by NM. (Weiwei Yang via wangda)
Change-Id: I6f9f409f2ef685b405cbff547dea9623bf3322d9
2018-07-16 10:54:41 -07:00
Eric E Payne
937ef39b3f YARN-8421: when moving app, activeUsers is increased, even though app does not have outstanding request. Contributed by Kyungwan Nam 2018-07-16 16:24:21 +00:00
Akira Ajisaka
88625f5cd9 HADOOP-15531. Use commons-text instead of commons-lang in some classes to fix deprecation warnings. Contributed by Takanobu Asanuma. 2018-07-13 11:42:12 -04:00
Giovanni Matteo Fumarola
82ac3aa6d0 YARN-8502. Use path strings consistently for webservice endpoints in RMWebServices. Contributed by Szilard Nemeth. 2018-07-10 10:36:17 -07:00
Robert Kanter
a129e3e74e YARN-7451. Add missing tests to verify the presence of custom resources of RM apps and scheduler webservice endpoints (snemeth via rkanter) 2018-07-06 11:04:00 -07:00
Haibo Chen
9edc74f64a YARN-7556. Fair scheduler configuration should allow resource types in the minResources and maxResources properties. (Daniel Templeton and Szilard Nemeth via Haibo Chen) 2018-07-06 11:03:48 -07:00
Anu Engineer
39ad98903a Revert "Merge branch 'trunk' of https://git-wip-us.apache.org/repos/asf/hadoop into trunk"
This reverts commit c163d1797a, reversing
changes made to 0d9804dcef.
2018-07-05 12:22:18 -07:00
Robert Kanter
99febe7fd5 YARN-7451. Add missing tests to verify the presence of custom resources of RM apps and scheduler webservice endpoints (snemeth via rkanter) 2018-07-05 10:54:19 -07:00
Haibo Chen
1726247024 YARN-7556. Fair scheduler configuration should allow resource types in the minResources and maxResources properties. (Daniel Templeton and Szilard Nemeth via Haibo Chen) 2018-07-05 10:42:39 -07:00
Sunil G
51654a3962 YARN-8459. Improve Capacity Scheduler logs to debug invalid states. Contributed by Wangda Tan. 2018-07-03 09:59:20 -07:00
Junping Du
d9ba6f3656 yarn.resourcemanager.fail-fast is used inconsistently. Contributed by Yuanbo Liu. 2018-07-03 14:46:44 +08:00
Sunil G
291194302c YARN-8379. Improve balancing resources in already satisfied queues by using Capacity Scheduler preemption. Contributed by Zian Chen. 2018-06-28 10:23:31 -07:00
Eric Yang
384764cdea YARN-8409. Fixed NPE in ActiveStandbyElectorBasedElectorService.
Contributed by Chandni Singh
2018-06-28 12:39:49 -04:00
Wangda Tan
bedc4fe079 YARN-8464. Async scheduling thread could be interrupted when there are no NodeManagers in cluster. (Sunil G via wangda)
Change-Id: I4f5f856373378685713e77752ba6cf0988a66065
2018-06-26 19:27:17 -07:00
Eric Yang
b69ba0f330 YARN-8108. Added option to disable loading existing filters to prevent security filter from initialize twice.
Contributed by Sunil Govindan
2018-06-26 17:34:57 -04:00
Weiwei Yang
440140cea6 YARN-8443. Total #VCores in cluster metrics is wrong when CapacityScheduler reserved some containers. Contributed by Tao Yang. 2018-06-25 09:15:31 +08:00
Inigo Goiri
99948565cb YARN-8412. Move ResourceRequest.clone logic everywhere into a proper API. Contributed by Botong Huang. 2018-06-21 18:24:10 -07:00
Miklos Szegedi
46f9058164 YARN-8441. Typo in CSQueueUtils local variable names: queueGuranteedResource. Contributed by Szilard Nemeth. 2018-06-20 11:58:18 -07:00
Miklos Szegedi
9a9e969570 YARN-8391. Investigate AllocationFileLoaderService.reloadListener locking issue. Contributed by Szilard Nemeth. 2018-06-20 10:59:33 -07:00
Rohith Sharma K S
9119b3cf8f YARN-8155. Improve ATSv2 client logging in RM and NM publisher. Contributed by Abhishek Modi. 2018-06-14 12:38:10 +05:30
Sunil G
6307962b93 YARN-8404. Timeline event publish need to be async to avoid Dispatcher thread leak in case ATS is down. Contributed by Rohith Sharma K S 2018-06-13 16:09:16 +05:30
Haibo Chen
10d0e4be6e YARN-8325. Miscellaneous QueueManager code clean up. (Szilard Nemeth via Haibo Chen) 2018-06-12 10:11:30 -07:00
Akira Ajisaka
652bcbb3e4 YARN-8363. Upgrade commons-lang version to 3.7 in hadoop-yarn-project. Contributed by Takanobu Asanuma. 2018-06-13 00:35:37 +09:00
Rohith Sharma K S
2df73dace0 YARN-8405. RM zk-state-store.parent-path ACLs has been changed since HADOOP-14773. Contributed by Íñigo Goiri. 2018-06-12 17:13:09 +05:30
Haibo Chen
180b3c960b YARN-8321. AllocationFileLoaderService.getAllocationFile() should be declared as VisibleForTest. (Szilard Nemeth via Haibo Chen) 2018-06-11 11:19:33 -07:00
Haibo Chen
676dcffff5 YARN-8322. Change log level when there is an IOException when the allocation file is loaded. (Szilard Nemeth via Haibo Chen) 2018-06-11 11:16:21 -07:00
Haibo Chen
c190ac2be8 YARN-8323. FairScheduler.allocConf should be declared as volatile. (Szilard Nemeth via Haibo Chen) 2018-06-11 11:12:44 -07:00
Robert Kanter
0cd145a443 YARN-4677. RMNodeResourceUpdateEvent update from scheduler can lead to race condition (wilfreds and gphillips via rkanter) 2018-06-04 15:32:03 -07:00
Haibo Chen
ba12f87dcb YARN-8390. Fix API incompatible changes in FairScheduler's AllocationFileLoaderService. (Gergo Repas via Haibo Chen) 2018-06-04 09:15:23 -07:00
Wangda Tan
ff583d3fa3 YARN-8349. Remove YARN registry entries when a service is killed by the RM. (Billie Rinaldi via wangda)
Change-Id: Ia58db3637789a8921482f564aa9bdf99c45cc36c
2018-06-01 14:49:18 -07:00
Wangda Tan
931f78718f YARN-7962. Race Condition When Stopping DelegationTokenRenewer causes RM crash during failover. (BELUGA BEHR via wangda)
Change-Id: I617e2645f60a57080058ad5f06af860fb3f682c8
2018-06-01 14:49:18 -07:00
Yufei Gu
6b21a599fb YARN-7340. Fix the missing time stamp in exception message in Class NoOverCommitPolicy. Contributed by Dinesh Chitlangia.
2018-05-31 22:30:12 -07:00
Weiwei Yang
6468071f13 YARN-8367. Fix NPE in SingleConstraintAppPlacementAllocator when placement constraint in SchedulingRequest is null. Contributed by Weiwei Yang. 2018-05-31 20:46:39 +08:00
Wangda Tan
17aa40f669 YARN-8369. Javadoc build failed due to 'bad use of >'. (Takanobu Asanuma via wangda)
Change-Id: I79a42154e8f86ab1c3cc939b3745024b8eebe5f4
2018-05-29 09:27:36 -07:00
Sunil G
7c343669ba YARN-4781. Support intra-queue preemption for fairness ordering policy. Contributed by Eric Payne. 2018-05-28 16:32:53 +05:30
Wangda Tan
f24c842d52 YARN-8213. Add Capacity Scheduler performance metrics. (Weiwei Yang via wangda)
Change-Id: Ieea6f3eeb83c90cd74233fea896f0fcd0f325d5f
2018-05-25 21:53:20 -07:00
Eric E Payne
8d5509c681 YARN-8292: Fix the dominant resource preemption cannot happen when some of the resource vector becomes negative. Contributed by Wangda Tan. 2018-05-25 16:06:09 +00:00
Haibo Chen
86bc6425d4 YARN-8191. Fair scheduler: queue deletion without RM restart. (Gergo Repas via Haibo Chen) 2018-05-24 17:12:34 -07:00
Rohith Sharma K S
c05b5d424b YARN-8319. More YARN pages need to honor yarn.resourcemanager.display.per-user-apps. Contributed by Sunil G. 2018-05-24 14:19:46 +05:30
Wangda Tan
bd15d2396e YARN-8290. SystemMetricsPublisher.appACLsUpdated should be invoked after application information is published to ATS to avoid "User is not set in the application report" Exception. (Eric Yang via wangda)
Change-Id: I0ac6ddd19740d1aa7dd07111cd11af71ddc2fcaf
2018-05-22 13:33:33 -07:00
Eric E Payne
0b4c44bdee YARN-8179: Preemption does not happen due to natural_termination_factor when DRF is used. Contributed by Kyungwan Nam. 2018-05-21 20:14:58 +00:00
Haibo Chen
f48fec83d0 YARN-8248. Job hangs when a job requests a resource that its queue does not have. (Szilard Nemeth via Haibo Chen) 2018-05-21 08:10:41 -07:00
Arun Suresh
3159bffce2 YARN-7900. [AMRMProxy] AMRMClientRelayer for stateful FederationInterceptor. (Botong Huang via asuresh) 2018-05-17 20:00:52 -07:00
Haibo Chen
1f10a36021 YARN-8268. Fair scheduler: reservable queue is configured both as parent and leaf queue. (Gergo Repas via Haibo Chen) 2018-05-11 11:34:00 -07:00
Weiwei Yang
9db9cd95bd YARN-7003. DRAINING state of queues is not recovered after RM restart. Contributed by Tao Yang. 2018-05-11 10:47:04 +08:00
Vinod Kumar Vavilapalli (I am also known as @tshooter.)
d76fbbc9b8 YARN-8249. Fixed few REST APIs in RMWebServices to have static-user check. Contributed by Sunil Govindan. 2018-05-10 19:05:35 -07:00
Robert Kanter
c8b53c4364 YARN-8202. DefaultAMSProcessor should properly check units of requested custom resource types against minimum/maximum allocation (snemeth via rkanter) 2018-05-10 09:31:59 -07:00
bibinchundatt
cc0310a526 YARN-8201. Skip stacktrace of few exception from ClientRMService. Contributed by Bilwa S T. 2018-05-10 09:15:46 +05:30
Rohith Sharma K S
eb7b256cc2 YARN-8239. [UI2] Clicking on Node Manager UI under AM container info / App Attempt page goes to old RM UI. Contributed by Sunil G. 2018-05-09 12:00:43 +05:30
Wangda Tan
6b96a73bb0 YARN-8232. RMContainer lost queue name when RM HA happens. (Hu Ziqian via wangda)
Change-Id: Ia21e1da6871570c993bbedde76ce32929e95970f
2018-05-08 11:34:45 -07:00
Weiwei Yang
67f239c42f YARN-8025. UsersManangers#getComputedResourceLimitForActiveUsers throws NPE due to preComputedActiveUserLimit is empty. Contributed by Tao Yang. 2018-05-07 11:33:14 +08:00
Inigo Goiri
6a69239d86 YARN-8163. Add support for Node Labels in opportunistic scheduling. Contributed by Abhishek Modi. 2018-05-04 14:59:59 -07:00
Rohith Sharma K S
ee2ce923a9 YARN-8217. RmAuthenticationFilterInitializer and TimelineAuthenticationFilterInitializer should use Configuration.getPropsWithPrefix instead of iterator. Contributed by Suma Shivaprasad. 2018-05-03 14:43:40 +05:30
Inigo Goiri
e6a80e476d YARN-8151. Yarn RM Epoch should wrap around. Contributed by Young Chen. 2018-05-02 17:23:17 -07:00
Weiwei Yang
251f528814 YARN-8222. Fix potential NPE when gets RMApp from RM context. Contributed by Tao Yang. 2018-05-02 17:54:46 +08:00
Weiwei Yang
2d319e3793 YARN-8212. Pending backlog for async allocation threads should be configurable. Contributed by Tao Yang. 2018-05-01 09:47:10 +08:00
Konstantinos Karanasos
9b09555451 YARN-8195. Fix constraint cardinality check in the presence of multiple target allocation tags. Contributed by Weiwei Yang. 2018-04-30 11:54:30 -07:00
Wangda Tan
92c5331423 YARN-8005. Add unit tests for queue priority with dominant resource calculator. (Zian Chen via wangda)
Change-Id: I17a645f20869a1e5d86fa7a325c93fec908b91dc
2018-04-27 13:08:43 -07:00
Rohith Sharma K S
ef3ecc308d YARN-8221. RMWebServices also need to honor yarn.resourcemanager.display.per-user-apps. Contributed by Sunil G. 2018-04-27 22:58:10 +05:30
Sunil G
71220d218d YARN-8004. Add unit tests for inter queue preemption for dominant resource calculator. Contributed by Zian Chen. 2018-04-27 10:42:30 +05:30
Sunil G
1634de0fc1 YARN-8205. Application State is not updated to ATS if AM launching is delayed. Contributed by Rohith Sharma K S. 2018-04-27 10:26:57 +05:30
Wangda Tan
2a0fa50f9d YARN-8193. YARN RM hangs abruptly (stops allocating resources) when running successive applications. (Zian Chen via wangda)
Change-Id: Ia83dd2499ee9000b9e09ae5a932f21a13c0ddee6
2018-04-26 13:54:40 -07:00
Sunil G
7ac0abdc13 YARN-8188. RM Nodes UI data table index for sorting column need to be corrected post Application tags display. Contributed by Weiwei Yang. 2018-04-26 13:54:40 -07:00
Wangda Tan
ebedd69e25 YARN-8183. Fix ConcurrentModificationException inside RMAppAttemptMetrics#convertAtomicLongMaptoLongMap. (Suma Shivaprasad via wangda)
Change-Id: I347871d672001653a3afe2e99adefd74e0d798cd
2018-04-26 13:54:40 -07:00
Owen O'Malley
2adda92de1 HDFS-7240 git commit fixup
Signed-off-by: Owen O'Malley <omalley@apache.org>
2018-04-26 13:52:17 -07:00
Anu Engineer
4e61bc431e HDFS-13258. Ozone: restructure Hdsl/Ozone code to separated maven subprojects.
Contributed by Elek Marton, Mukul Kumar Singh, Xiaoyu Yao, Ajay Kumar, Anu Engineer, Lokesh Jain, Nanda Kumar.
2018-04-26 05:36:04 -07:00
Robert Kanter
7ab08a9c37 HADOOP-15390. Yarn RM logs flooded by DelegationTokenRenewer trying to renew KMS tokens (xiaochen via rkanter) 2018-04-23 15:44:15 -07:00
Konstantinos Karanasos
28e2244390 YARN-8111. Simplify PlacementConstraints API by removing allocationTagToIntraApp. Contributed by Weiwei Yang. 2018-04-20 12:24:48 -07:00
Jason Lowe
766544c0b0 YARN-7786. NullPointerException while launching ApplicationMaster. Contributed by lujie 2018-04-20 13:22:20 -05:00
Sunil G
7d06806dfd YARN-6827. [ATS1/1.5] NPE exception while publishing recovering applications into ATS during RM restart. Contributed by Rohith Sharma K S. 2018-04-20 00:05:53 +05:30
Shashikant Banerjee
e4c39f3247 HADOOP-15396. Some java source files are executable
Signed-off-by: Akira Ajisaka <aajisaka@apache.org>
2018-04-19 08:35:38 +09:00
Rohith Sharma K S
241d985467 YARN-8145. yarn rmadmin -getGroups doesn't return updated groups for user. Contributed by Sunil G. 2018-04-18 13:39:41 +05:30
Yufei
d426b980ac YARN-7966. Remove method AllocationConfiguration#getQueueAcl and related unit tests. Contributed by Sen Zhao. 2018-04-17 22:53:32 -07:00
Haibo Chen
bb92bfb4ef YARN-7088. Add application launch time to Resource Manager REST API. (Kanwaljeet Sachdev via Haibo Chen) 2018-04-17 11:15:20 -07:00
Haibo Chen
86dfb70875 Revert "YARN-7088. Add application launch time to Resource Manager REST API. (Kanwaljeet Sachdev via Haibo Chen)"
This reverts commit 0c14b6f0d8.
2018-04-17 06:04:38 -07:00
Haibo Chen
0c14b6f0d8 YARN-7088. Add application launch time to Resource Manager REST API. (Kanwaljeet Sachdev via Haibo Chen) 2018-04-17 05:59:03 -07:00
Inigo Goiri
f1461b2487 YARN-8164. Fix a potential NPE in AbstractSchedulerPlanFollower. Contributed by lujie. 2018-04-16 17:32:19 -07:00
Inigo Goiri
dd5e18c4ae YARN-8165. Incorrect queue name logging in AbstractContainerAllocator. Contributed by Weiwei Yan. 2018-04-16 16:52:17 -07:00
Wangda Tan
896b473f1b YARN-8138. Add unit test to validate queue priority preemption works under node partition. (Zian Chen via wangda)
Change-Id: Ibebfab98a714c12c2dc643b6d7b9754a7f813632
(cherry picked from commit 6ee62e6b1c9b4bc3447ce870446068e626b1a492)
2018-04-14 11:08:36 -07:00
Eric E Payne
18844599ae YARN-8147. TestClientRMService#testGetApplications sporadically fails. Contributed by Jason Lowe 2018-04-12 17:53:57 +00:00
Eric E Payne
6bb128dfb8 YARN-8120. JVM can crash with SIGSEGV when exiting due to custom leveldb logger. Contributed by Jason Lowe. 2018-04-12 16:04:23 +00:00
Weiwei Yang
7eb783e263 YARN-8127. Resource leak when async scheduling is enabled. Contributed by Tao Yang. 2018-04-11 17:15:25 +08:00
Sunil G
821b0de4c5 YARN-7574. Add support for Node Labels on Auto Created Leaf Queue Template. Contributed by Suma Shivaprasad. 2018-04-09 21:17:22 +05:30
Wangda Tan
d4e63ccca0 YARN-8048. Support auto-spawning of admin configured services during bootstrap of RM (Rohith Sharma K S via wangda)
Change-Id: I2d8d61ccad55e1118009294d7e17822df3cd0fd5
2018-04-06 21:24:58 -07:00
Vrushali C
345e7624d5 YARN-8073 TimelineClientImpl doesn't honor yarn.timeline-service.versions configuration. Contributed by Rohith Sharma K S 2018-04-04 15:08:03 -07:00
Konstantinos Karanasos
7853ec8d2f YARN-8013. Support application tags when defining application namespaces for placement constraints. Contributed by Weiwei Yang. 2018-04-04 10:51:58 -07:00
Haibo Chen
93d47a0ed5 YARN-8051. TestRMEmbeddedElector#testCallbackSynchronization is flaky. (Robert Kanter via Haibo Chen) 2018-04-03 07:59:20 -07:00
Wangda Tan
c78cb18c61 YARN-8091. Revisit checkUserAccessToQueue RM REST API. (wangda)
Change-Id: I5fab3fe229c34e967487b7327c7b3c8ddf7cb795
2018-04-02 15:22:05 -07:00
Weiwei Yang
1606dad133 YARN-7497. Add file system based scheduler configuration store. Contributed by Jiandan Yang. 2018-03-30 21:41:33 +08:00
Eric Yang
8fb00c3fce YARN-6257. Fix CapacityScheduler REST API JSON output.
Contributed By Tao Yang
2018-03-29 17:36:34 -04:00
Sunil G
7a59d60e0c YARN-8085. ResourceProfilesManager should be set in RMActiveServiceContext. Contributed by Tao Yang. 2018-03-29 21:41:16 +05:30
Wangda Tan
47f711eebc YARN-6629. NPE occurred when container allocation proposal is applied but its resource requests are removed before. (Tao Yang via wangda)
Change-Id: I805880f90b3f6798ec96ed8e8e75755f390a9ad5
2018-03-28 11:07:45 -07:00
Wangda Tan
24bc5e0e4d YARN-8072. RM log is getting flooded with MemoryPlacementConstraintManager info logs. (Zian Chen via wangda)
Change-Id: I2567f4669ad868e13c4301fe44c4bde85bb92c37
2018-03-26 11:20:16 -07:00
Wangda Tan
9eef19b2ad YARN-8068. Application Priority field causes NPE in app timeline publish when Hadoop 2.7 based clients to 2.8+ (Sunil G via wangda)
Change-Id: I7910bd1064a1b4dbbe2084080c060822ea6f3b48
2018-03-26 11:19:15 -07:00
Wangda Tan
5d381570f8 YARN-8062. yarn rmadmin -getGroups returns group from which the user has been removed. (Sunil G via wangda)
Change-Id: I80ed63846502bf7751b890b6c6c6a7c0679e2b4a
2018-03-26 11:16:06 -07:00
Wangda Tan
a90471b3e6 YARN-8016. Refine PlacementRule interface and add a app-name queue mapping rule as an example. (Zian Chen via wangda)
Change-Id: I35caf1480e0f76f5f3a53528af09312e39414bbb
2018-03-23 16:43:40 -07:00
Wangda Tan
a08921ca6c YARN-8002. Support NOT_SELF and ALL namespace types for allocation tag. (Weiwei Yang via wangda)
Change-Id: I63b4e4192a95bf7ded98c54e46a2871c72869700
2018-03-19 11:04:27 -07:00
Sunil G
6bd130ea45 YARN-8028. Support authorizeUserAccessToQueue in RMWebServices. Contributed by Wangda Tan. 2018-03-18 11:00:30 +05:30
Weiwei Yang
154cfb2b62 YARN-7636. Re-reservation count may overflow when cluster resource exhausted for a long time. contributed by Tao Yang. 2018-03-16 18:57:31 +08:00
Wangda Tan
4bf622043f YARN-7952. RM should be able to recover log aggregation status after restart/fail-over. (Xuan Gong via wangda)
Change-Id: I725c9afe64831eda0aa6b0bebdbc79d2dd165707
2018-03-15 13:26:45 -07:00
Wangda Tan
b167d60763 YARN-7657. Queue Mapping could provide options to provide 'user' specific auto-created queues under a specified group parent queue. (Suma Shivaprasad via wangda)
Change-Id: I32d566e8727840e43c0d66e39a77edef017e3a83
2018-03-13 17:55:51 -07:00
Yufei Gu
ff31d8aefa YARN-8024. LOG in class MaxRunningAppsEnforcer is initialized with a faulty class. Contributed by Sen Zhao. 2018-03-12 16:35:26 -07:00
Weiwei Yang
b451889e8e YARN-8011. TestOpportunisticContainerAllocatorAMService#testContainerPromoteAndDemoteBeforeContainerStart fails intermittently. Contributed by Tao Yang. 2018-03-08 18:13:36 +08:00
Wangda Tan
e718ac597f Revert "YARN-7891. LogAggregationIndexedFileController should support read from HAR file. (Xuan Gong via wangda)"
This reverts commit 4d53ef7eef.
2018-03-07 15:46:47 -08:00
Wangda Tan
4d53ef7eef YARN-7891. LogAggregationIndexedFileController should support read from HAR file. (Xuan Gong via wangda)
Change-Id: Ie16e34039d57df50128c73b37516ad0bc7c9590e
2018-03-07 11:30:06 -08:00