Wangda Tan
|
344c335a92
|
YARN-8588. Logging improvements for better debuggability. (Suma Shivaprasad via wangda)
Change-Id: I66aa4b0ec031ae5ce0fae558e2f8cbcbbfebc442
|
2018-08-09 11:04:02 -07:00 |
|
Weiwei Yang
|
d352f167eb
|
YARN-8559. Expose mutable-conf scheduler's configuration in RM /scheduler-conf endpoint. Contributed by Weiwei Yang.
|
2018-08-09 23:46:53 +08:00 |
|
Jason Lowe
|
cd04e954d2
|
YARN-8331. Race condition in NM container launched after done. Contributed by Pradeep Ambati
|
2018-08-09 10:17:34 -05:00 |
|
Giovanni Matteo Fumarola
|
d838179d8d
|
YARN-8626. Create HomePolicyManager that sends all the requests to the home subcluster. Contributed by Inigo Goiri.
|
2018-08-07 15:33:16 -07:00 |
|
Wangda Tan
|
d4258fcad7
|
YARN-8629. Container cleanup fails while trying to delete Cgroups. (Suma Shivaprasad via wangda)
Change-Id: I392ef4f8baa84d5d7b1f2e438c560b5426b6d4f2
|
2018-08-07 12:36:55 -07:00 |
|
Haibo Chen
|
889df6f194
|
YARN-7948. Enable fair scheduler to refresh maximum allocation for multiple resource types. (Szilard Nemeth via Haibo Chen)
|
2018-08-02 10:10:12 -07:00 |
|
Sunil G
|
12a095a496
|
YARN-7159. Normalize unit of resource objects in RM to avoid unit conversion in critical path. Contributed by Manikandan R.
|
2018-08-02 22:29:21 +05:30 |
|
Jason Lowe
|
7526815e32
|
YARN-8263. DockerClient still touches hadoop.tmp.dir. Contributed by Craig Condit
|
2018-08-02 10:43:48 -05:00 |
|
Sunil G
|
735b492556
|
YARN-8593. Add RM web service endpoint to get user information. Contributed by Akhil PB.
|
2018-08-02 08:34:09 +05:30 |
|
Billie Rinaldi
|
67c65da261
|
YARN-8403. Change the log level for fail to download resource from INFO to ERROR. Contributed by Eric Yang
|
2018-08-01 08:51:40 -07:00 |
|
Sunil G
|
a48a0cc7fd
|
YARN-8606. Opportunistic scheduling does not work post RM failover. Contributed by Bibin A Chundatt.
|
2018-08-01 12:17:18 +05:30 |
|
Sunil G
|
6310c0d17d
|
YARN-8397. Potential thread leak in ActivitiesManager. Contributed by Rohith Sharma K S.
|
2018-08-01 08:33:30 +05:30 |
|
Eric Yang
|
c7ebcd76bf
|
YARN-8579. Recover NMToken of previous attempted component data.
Contributed by Gour Saha
|
2018-07-31 18:01:02 -04:00 |
|
Wangda Tan
|
4b540bbfcf
|
YARN-8418. App local logs could leaked if log aggregation fails to initialize for the app. (Bibin A Chundatt via wangda)
Change-Id: I29a23ca4b219b48c92e7975cd44cddb8b0e04104
|
2018-07-31 12:08:00 -07:00 |
|
Haibo Chen
|
8aa93a575e
|
YARN-8605. TestDominantResourceFairnessPolicy.testModWhileSorting is flaky. (Wilfred Spiegelenburg via Haibo Chen)
|
2018-07-31 11:32:40 -07:00 |
|
Jonathan Hung
|
3e06a5dcea
|
YARN-7974. Allow updating application tracking url after registration. Contributed by Jonathan Hung
|
2018-07-30 17:44:18 -07:00 |
|
bibinchundatt
|
2b39ad2698
|
YARN-8584. Several typos in Log Aggregation related classes. Contributed by Szilard Nemeth.
|
2018-07-30 23:25:19 +05:30 |
|
Sunil G
|
63e08ec071
|
YARN-8591. [ATSv2] NPE while checking for entity acl in non-secure cluster. Contributed by Rohith Sharma K S.
|
2018-07-30 14:48:04 +05:30 |
|
bibinchundatt
|
3d586841ab
|
YARN-8558. NM recovery level db not cleaned up properly on container finish. Contributed by Bibin A Chundatt.
|
2018-07-28 20:52:39 +05:30 |
|
Eric Yang
|
ed9d60e888
|
YARN-8508. Release GPU resource for killed container.
Contributed by Chandni Singh
|
2018-07-27 19:33:58 -04:00 |
|
Inigo Goiri
|
79091cf76f
|
YARN-8596. Allow SQLFederationStateStore to submit the same app in the same subcluster. Contributed by Giovanni Matteo Fumarola.
|
2018-07-27 15:23:57 -07:00 |
|
Robert Kanter
|
2cccf4061c
|
YARN-8517. getContainer and getContainers ResourceManager REST API methods are not documented (snemeth via rkanter)
|
2018-07-27 14:35:03 -07:00 |
|
Eric Yang
|
f93ecf5c1e
|
YARN-8330. Improved publishing ALLOCATED events to ATS.
Contributed by Suma Shivaprasad
|
2018-07-25 18:49:30 -04:00 |
|
Eric E Payne
|
9485c9aee6
|
YARN-4606. CapacityScheduler: applications could get starved because computation of #activeUsers considers pending apps. Contributed by Manikandan R
|
2018-07-25 16:22:04 +00:00 |
|
Weiwei Yang
|
5be9f4a5d0
|
YARN-8546. Resource leak caused by a reserved container being released more than once under async scheduling. Contributed by Tao Yang.
|
2018-07-25 17:35:27 +08:00 |
|
Haibo Chen
|
ea2c6c8c9a
|
YARN-7133. Clean up lock-try order in fair scheduler. (Szilard Nemeth via Haibo Chen)
|
2018-07-24 12:46:59 -07:00 |
|
bibinchundatt
|
e673dd1d4d
|
YARN-8541. RM startup failure on recovery after user deletion. Contributed by Bibin A Chundatt.
|
2018-07-24 23:26:59 +05:30 |
|
Sunil G
|
35ce6eb1f5
|
YARN-7748. TestContainerResizing.testIncreaseContainerUnreservedWhenApplicationCompleted fails due to multiple container fail events. Contributed by Weiwei Yang.
|
2018-07-24 22:20:17 +05:30 |
|
bibinchundatt
|
ff7c2eda34
|
YARN-8548. AllocationRespose proto setNMToken initBuilder not done. Contributed by Bilwa S T.
|
2018-07-24 16:17:20 +05:30 |
|
bibinchundatt
|
8461278833
|
YARN-8544. [DS] AM registration fails when hadoop authorization is enabled. Contributed by Bibin A Chundatt.
|
2018-07-24 13:09:17 +05:30 |
|
Eric Yang
|
8688a0c7f8
|
YARN-8380. Support bind propagation options for mounts in docker runtime.
Contributed by Billie Rinaldi
|
2018-07-23 20:12:04 -04:00 |
|
Haibo Chen
|
9d3c39e9dd
|
YARN-6966. NodeManager metrics may return wrong negative values when NM restart. (Szilard Nemeth via Haibo Chen)
|
2018-07-23 11:07:24 -07:00 |
|
Haibo Chen
|
8a6bb8409c
|
YARN-6964. Fair scheduler misuses Resources operations. (Daniel Templeton and Szilard Nemeth via Haibo Chen)
|
2018-07-20 10:46:28 -07:00 |
|
Weiwei Yang
|
cbf2026483
|
YARN-8528. Final states in ContainerAllocation might be modified externally causing unexpected allocation results. Contributed by Xintong Song.
|
2018-07-20 22:34:06 +08:00 |
|
Haibo Chen
|
e6873dfde0
|
YARN-7300. DiskValidator is not used in LocalDirAllocator. (Szilard Nemeth via Haibo Chen)
|
2018-07-19 16:27:11 -07:00 |
|
Haibo Chen
|
2564884757
|
YARN-8436. FSParentQueue: Comparison method violates its general contract. (Wilfred Spiegelenburg via Haibo Chen)
|
2018-07-19 13:22:31 -07:00 |
|
Eric Yang
|
5836e0a46b
|
YARN-8501. Reduce complexity of RMWebServices getApps method.
Contributed by Szilard Nemeth
|
2018-07-19 12:30:38 -04:00 |
|
Eric Yang
|
efb4e274e5
|
YARN-8538. Fixed memory leaks in container-executor and test cases.
Contributed by Billie Rinaldi
|
2018-07-16 17:38:49 -04:00 |
|
Wangda Tan
|
238ffff999
|
YARN-8524. Single parameter Resource / LightWeightResource constructor looks confusing. (Szilard Nemeth via wangda)
Change-Id: I4ae97548b5b8d76a6bcebb2d3d70bf8e0be3c125
|
2018-07-16 10:58:00 -07:00 |
|
Wangda Tan
|
a2e49f41a8
|
YARN-8361. Change App Name Placement Rule to use App Name instead of App Id for configuration. (Zian Chen via wangda)
Change-Id: I17e5021f8f611a9c5e3bd4b38f25e08585afc6b1
|
2018-07-16 10:57:37 -07:00 |
|
Wangda Tan
|
752dcce5f4
|
YARN-8511. When AM releases a container, RM removes allocation tags before it is released by NM. (Weiwei Yang via wangda)
Change-Id: I6f9f409f2ef685b405cbff547dea9623bf3322d9
|
2018-07-16 10:54:41 -07:00 |
|
Eric E Payne
|
937ef39b3f
|
YARN-8421: when moving app, activeUsers is increased, even though app does not have outstanding request. Contributed by Kyungwan Nam
|
2018-07-16 16:24:21 +00:00 |
|
Akira Ajisaka
|
88625f5cd9
|
HADOOP-15531. Use commons-text instead of commons-lang in some classes to fix deprecation warnings. Contributed by Takanobu Asanuma.
|
2018-07-13 11:42:12 -04:00 |
|
Jason Lowe
|
17118f446c
|
YARN-8515. container-executor can crash with SIGPIPE after nodemanager restart. Contributed by Jim Brennan
|
2018-07-13 10:05:25 -05:00 |
|
Robert Kanter
|
1bc106a738
|
YARN-8518. test-container-executor test_is_empty() is broken (Jim_Brennan via rkanter)
|
2018-07-12 16:38:46 -07:00 |
|
Sunil G
|
7f1d3d0e9d
|
YARN-8512. ATSv2 entities are not published to HBase from second attempt onwards. Contributed by Rohith Sharma K S.
|
2018-07-11 12:26:32 +05:30 |
|
Giovanni Matteo Fumarola
|
82ac3aa6d0
|
YARN-8502. Use path strings consistently for webservice endpoints in RMWebServices. Contributed by Szilard Nemeth.
|
2018-07-10 10:36:17 -07:00 |
|
Sunil G
|
705e2c1f7c
|
YARN-8473. Containers being launched as app tears down can leave containers in NEW state. Contributed by Jason Lowe.
|
2018-07-10 20:11:47 +05:30 |
|
Giovanni Matteo Fumarola
|
ea9b608237
|
YARN-7899. [AMRMProxy] Stateful FederationInterceptor for pending requests. Contributed by Botong Huang.
|
2018-07-09 12:27:36 -07:00 |
|
Robert Kanter
|
0838fe8337
|
Only mount non-empty directories for cgroups (miklos.szegedi@cloudera.com via rkanter)
|
2018-07-09 10:37:20 -07:00 |
|
Rohith Sharma K S
|
ba68320449
|
YARN-8302. ATS v2 should handle HBase connection issue properly. Contributed by Billie Rinaldi.
|
2018-07-06 15:19:01 -07:00 |
|
Sunil G
|
e4bf38cf50
|
YARN-8492. ATSv2 HBase tests are failing with ClassNotFoundException. Contributed by Rohith Sharma K S.
|
2018-07-06 12:05:32 -07:00 |
|
Robert Kanter
|
a129e3e74e
|
YARN-7451. Add missing tests to verify the presence of custom resources of RM apps and scheduler webservice endpoints (snemeth via rkanter)
|
2018-07-06 11:04:00 -07:00 |
|
Haibo Chen
|
9edc74f64a
|
YARN-7556. Fair scheduler configuration should allow resource types in the minResources and maxResources properties. (Daniel Templeton and Szilard Nemeth via Haibo Chen)
|
2018-07-06 11:03:48 -07:00 |
|
Anu Engineer
|
39ad98903a
|
Revert "Merge branch 'trunk' of https://git-wip-us.apache.org/repos/asf/hadoop into trunk"
This reverts commit c163d1797a, reversing
changes made to 0d9804dcef.
|
2018-07-05 12:22:18 -07:00 |
|
Giovanni Matteo Fumarola
|
c163d1797a
|
Merge branch 'trunk' of https://git-wip-us.apache.org/repos/asf/hadoop into trunk
|
2018-07-05 10:55:05 -07:00 |
|
Giovanni Matteo Fumarola
|
0d9804dcef
|
YARN-8435. Fix NPE when the same client simultaneously contact for the first time Yarn Router. Contributed by Rang Jiaheng.
|
2018-07-05 10:54:31 -07:00 |
|
Robert Kanter
|
99febe7fd5
|
YARN-7451. Add missing tests to verify the presence of custom resources of RM apps and scheduler webservice endpoints (snemeth via rkanter)
|
2018-07-05 10:54:19 -07:00 |
|
Haibo Chen
|
1726247024
|
YARN-7556. Fair scheduler configuration should allow resource types in the minResources and maxResources properties. (Daniel Templeton and Szilard Nemeth via Haibo Chen)
|
2018-07-05 10:42:39 -07:00 |
|
Sunil G
|
51654a3962
|
YARN-8459. Improve Capacity Scheduler logs to debug invalid states. Contributed by Wangda Tan.
|
2018-07-03 09:59:20 -07:00 |
|
Junping Du
|
d9ba6f3656
|
yarn.resourcemanager.fail-fast is used inconsistently. Contributed by Yuanbo Liu.
|
2018-07-03 14:46:44 +08:00 |
|
Sunil G
|
fa9ef15ecd
|
YARN-8415. TimelineWebServices.getEntity should throw ForbiddenException instead of 404 when ACL checks fail. Contributed by Suma Shivaprasad.
|
2018-07-02 15:34:47 -07:00 |
|
Shane Kumpf
|
53e267fa72
|
YARN-8485. Priviledged container app launch is failing intermittently. Contributed by Eric Yang
|
2018-07-02 16:18:32 -06:00 |
|
Eric Yang
|
5cc2541a16
|
YARN-8465. Fixed docker container status for node manager restart.
Contributed by Shane Kumpf
|
2018-07-02 13:37:51 -04:00 |
|
Giovanni Matteo Fumarola
|
cdb084426b
|
YARN-8481. AMRMProxyPolicies should accept heartbeat response from new/unknown subclusters. Contributed by Botong Huang.
|
2018-06-29 11:47:30 -07:00 |
|
Jason Lowe
|
100470140d
|
YARN-8451. Multiple NM heartbeat thread created when a slow NM resync with RM. Contributed by Botong Huang
|
2018-06-29 13:06:28 -05:00 |
|
Sunil G
|
469b29c081
|
YARN-8455. Add basic ACL check for all ATS v2 REST APIs. Contributed by Rohith Sharma K S.
|
2018-06-29 10:02:53 -07:00 |
|
Sunil G
|
291194302c
|
YARN-8379. Improve balancing resources in already satisfied queues by using Capacity Scheduler preemption. Contributed by Zian Chen.
|
2018-06-28 10:23:31 -07:00 |
|
Eric Yang
|
384764cdea
|
YARN-8409. Fixed NPE in ActiveStandbyElectorBasedElectorService.
Contributed by Chandni Singh
|
2018-06-28 12:39:49 -04:00 |
|
Wangda Tan
|
bedc4fe079
|
YARN-8464. Async scheduling thread could be interrupted when there are no NodeManagers in cluster. (Sunil G via wangda)
Change-Id: I4f5f856373378685713e77752ba6cf0988a66065
|
2018-06-26 19:27:17 -07:00 |
|
Wangda Tan
|
ada8f63d0b
|
YARN-8423. GPU does not get released even though the application gets killed. (Sunil G via wangda)
Change-Id: I570db7d60f8c6c21762dd618a9207d1107c486a0
|
2018-06-26 19:25:57 -07:00 |
|
Miklos Szegedi
|
62d83ca536
|
YARN-8461. Support strict memory control on individual container with elastic control memory mechanism. Contributed by Haibo Chen.
|
2018-06-26 15:21:35 -07:00 |
|
Eric Yang
|
b69ba0f330
|
YARN-8108. Added option to disable loading existing filters to prevent
security filter from initialize twice.
Contributed by Sunil Govindan
|
2018-06-26 17:34:57 -04:00 |
|
Miklos Szegedi
|
35ec9401e8
|
YARN-8438. TestContainer.testKillOnNew flaky on trunk. Contributed by Szilard Nemeth.
|
2018-06-25 15:47:54 -07:00 |
|
Weiwei Yang
|
440140cea6
|
YARN-8443. Total #VCores in cluster metrics is wrong when CapacityScheduler reserved some containers. Contributed by Tao Yang.
|
2018-06-25 09:15:31 +08:00 |
|
Eric Yang
|
8a32bc39eb
|
YARN-8326. Removed exit code file check for launched container.
Contributed by Shane Kumpf
|
2018-06-22 19:12:48 -04:00 |
|
Yufei Gu
|
1cdce86d33
|
YARN-8184. Too many metrics if containerLocalizer/ResourceLocalizationService uses ReadWriteDiskValidator. Contributed by Yufei Gu
|
2018-06-22 14:03:55 -07:00 |
|
Inigo Goiri
|
99948565cb
|
YARN-8412. Move ResourceRequest.clone logic everywhere into a proper API. Contributed by Botong Huang.
|
2018-06-21 18:24:10 -07:00 |
|
Miklos Szegedi
|
46f9058164
|
YARN-8441. Typo in CSQueueUtils local variable names: queueGuranteedResource. Contributed by Szilard Nemeth.
|
2018-06-20 11:58:18 -07:00 |
|
Miklos Szegedi
|
9a9e969570
|
YARN-8391. Investigate AllocationFileLoaderService.reloadListener locking issue. Contributed by Szilard Nemeth.
|
2018-06-20 10:59:33 -07:00 |
|
Haibo Chen
|
4939ffedb1
|
YARN-8437. Build oom-listener fails on older versions. (Miklos Szegedi via Haibo Chen)
|
2018-06-20 10:42:13 -07:00 |
|
Dedunu Dhananjaya
|
f386e78a4b
|
YARN-7668. Remove unused variables from ContainerLocalizer
This closes #364
Signed-off-by: Akira Ajisaka <aajisaka@apache.org>
|
2018-06-18 16:45:50 -07:00 |
|
Inigo Goiri
|
b272b71095
|
HADOOP-15537. Clean up ContainerLaunch and ContainerExecutor pre-HADOOP-15528. Contributed by Giovanni Matteo Fumarola.
|
2018-06-15 09:40:45 -07:00 |
|
Rohith Sharma K S
|
9119b3cf8f
|
YARN-8155. Improve ATSv2 client logging in RM and NM publisher. Contributed by Abhishek Modi.
|
2018-06-14 12:38:10 +05:30 |
|
Eric Yang
|
22994889dc
|
YARN-8259. Improve privileged docker container liveliness checks.
Contributed by Shane Kumpf
|
2018-06-13 19:24:31 -04:00 |
|
Sunil G
|
6307962b93
|
YARN-8404. Timeline event publish need to be async to avoid Dispatcher thread leak in case ATS is down. Contributed by Rohith Sharma K S
|
2018-06-13 16:09:16 +05:30 |
|
Weiwei Yang
|
29024a6203
|
YARN-8394. Improve data locality documentation for Capacity Scheduler. Contributed by Weiwei Yang.
|
2018-06-13 09:28:05 +08:00 |
|
Inigo Goiri
|
6e756e8a62
|
HADOOP-15529. ContainerLaunch#testInvalidEnvVariableSubstitutionType is not supported in Windows. Contributed by Giovanni Matteo Fumarola.
|
2018-06-12 10:24:34 -07:00 |
|
Haibo Chen
|
10d0e4be6e
|
YARN-8325. Miscellaneous QueueManager code clean up. (Szilard Nemeth via Haibo Chen)
|
2018-06-12 10:11:30 -07:00 |
|
Haibo Chen
|
24a89825f0
|
YARN-6931. Make the aggregation interval in AppLevelTimelineCollector configurable. (Abhishek Modi via Haibo Chen)
|
2018-06-12 10:03:34 -07:00 |
|
Akira Ajisaka
|
652bcbb3e4
|
YARN-8363. Upgrade commons-lang version to 3.7 in hadoop-yarn-project. Contributed by Takanobu Asanuma.
|
2018-06-13 00:35:37 +09:00 |
|
Rohith Sharma K S
|
2df73dace0
|
YARN-8405. RM zk-state-store.parent-path ACLs has been changed since HADOOP-14773. Contributed by Íñigo Goiri.
|
2018-06-12 17:13:09 +05:30 |
|
Inigo Goiri
|
2b2f672022
|
YARN-8370. Some Node Manager tests fail on Windows due to improper path/file separator. Contributed by Anbang Hu.
|
2018-06-11 19:25:56 -07:00 |
|
Haibo Chen
|
180b3c960b
|
YARN-8321. AllocationFileLoaderService.getAllocationFile() should be declared as VisibleForTest. (Szilard Nemeth via Haibo Chen)
|
2018-06-11 11:19:33 -07:00 |
|
Haibo Chen
|
676dcffff5
|
YARN-8322. Change log level when there is an IOException when the allocation file is loaded. (Szilard Nemeth via Haibo Chen)
|
2018-06-11 11:16:21 -07:00 |
|
Haibo Chen
|
c190ac2be8
|
YARN-8323. FairScheduler.allocConf should be declared as volatile. (Szilard Nemeth via Haibo Chen)
|
2018-06-11 11:12:44 -07:00 |
|
Inigo Goiri
|
3b88fe25ba
|
YARN-8359. Exclude containermanager.linux test classes on Windows. Contributed by Jason Lowe.
|
2018-06-07 17:13:03 -07:00 |
|
Robert Kanter
|
351cf87c92
|
Disable mounting cgroups by default (miklos.szegedi@cloudera.com via rkanter)
|
2018-06-07 17:09:34 -07:00 |
|
Miklos Szegedi
|
d5eca1a6a0
|
YARN-6677. Preempt opportunistic containers when root container cgroup goes over memory limit. Contributed by Haibo Chen.
|
2018-06-07 16:38:23 -07:00 |
|
Rohith Sharma K S
|
58bc34f1e3
|
YARN-8399. NodeManager is giving 403 GSS exception post upgrade to 3.1 in secure mode. Contributed by Sunil Govindan.
|
2018-06-07 10:25:56 +05:30 |
|