Sunil G
|
51654a3962
|
YARN-8459. Improve Capacity Scheduler logs to debug invalid states. Contributed by Wangda Tan.
|
2018-07-03 09:59:20 -07:00 |
|
Junping Du
|
d9ba6f3656
|
yarn.resourcemanager.fail-fast is used inconsistently. Contributed by Yuanbo Liu.
|
2018-07-03 14:46:44 +08:00 |
|
Sunil G
|
fa9ef15ecd
|
YARN-8415. TimelineWebServices.getEntity should throw ForbiddenException instead of 404 when ACL checks fail. Contributed by Suma Shivaprasad.
|
2018-07-02 15:34:47 -07:00 |
|
Shane Kumpf
|
53e267fa72
|
YARN-8485. Priviledged container app launch is failing intermittently. Contributed by Eric Yang
|
2018-07-02 16:18:32 -06:00 |
|
Eric Yang
|
5cc2541a16
|
YARN-8465. Fixed docker container status for node manager restart.
Contributed by Shane Kumpf
|
2018-07-02 13:37:51 -04:00 |
|
Giovanni Matteo Fumarola
|
cdb084426b
|
YARN-8481. AMRMProxyPolicies should accept heartbeat response from new/unknown subclusters. Contributed by Botong Huang.
|
2018-06-29 11:47:30 -07:00 |
|
Jason Lowe
|
100470140d
|
YARN-8451. Multiple NM heartbeat thread created when a slow NM resync with RM. Contributed by Botong Huang
|
2018-06-29 13:06:28 -05:00 |
|
Sunil G
|
469b29c081
|
YARN-8455. Add basic ACL check for all ATS v2 REST APIs. Contributed by Rohith Sharma K S.
|
2018-06-29 10:02:53 -07:00 |
|
Sunil G
|
291194302c
|
YARN-8379. Improve balancing resources in already satisfied queues by using Capacity Scheduler preemption. Contributed by Zian Chen.
|
2018-06-28 10:23:31 -07:00 |
|
Eric Yang
|
384764cdea
|
YARN-8409. Fixed NPE in ActiveStandbyElectorBasedElectorService.
Contributed by Chandni Singh
|
2018-06-28 12:39:49 -04:00 |
|
Wangda Tan
|
bedc4fe079
|
YARN-8464. Async scheduling thread could be interrupted when there are no NodeManagers in cluster. (Sunil G via wangda)
Change-Id: I4f5f856373378685713e77752ba6cf0988a66065
|
2018-06-26 19:27:17 -07:00 |
|
Wangda Tan
|
ada8f63d0b
|
YARN-8423. GPU does not get released even though the application gets killed. (Sunil G via wangda)
Change-Id: I570db7d60f8c6c21762dd618a9207d1107c486a0
|
2018-06-26 19:25:57 -07:00 |
|
Miklos Szegedi
|
62d83ca536
|
YARN-8461. Support strict memory control on individual container with elastic control memory mechanism. Contributed by Haibo Chen.
|
2018-06-26 15:21:35 -07:00 |
|
Eric Yang
|
b69ba0f330
|
YARN-8108. Added option to disable loading existing filters to prevent
security filter from initialize twice.
Contributed by Sunil Govindan
|
2018-06-26 17:34:57 -04:00 |
|
Miklos Szegedi
|
35ec9401e8
|
YARN-8438. TestContainer.testKillOnNew flaky on trunk. Contributed by Szilard Nemeth.
|
2018-06-25 15:47:54 -07:00 |
|
Weiwei Yang
|
440140cea6
|
YARN-8443. Total #VCores in cluster metrics is wrong when CapacityScheduler reserved some containers. Contributed by Tao Yang.
|
2018-06-25 09:15:31 +08:00 |
|
Eric Yang
|
8a32bc39eb
|
YARN-8326. Removed exit code file check for launched container.
Contributed by Shane Kumpf
|
2018-06-22 19:12:48 -04:00 |
|
Yufei Gu
|
1cdce86d33
|
YARN-8184. Too many metrics if containerLocalizer/ResourceLocalizationService uses ReadWriteDiskValidator. Contributed by Yufei Gu
|
2018-06-22 14:03:55 -07:00 |
|
Inigo Goiri
|
99948565cb
|
YARN-8412. Move ResourceRequest.clone logic everywhere into a proper API. Contributed by Botong Huang.
|
2018-06-21 18:24:10 -07:00 |
|
Miklos Szegedi
|
46f9058164
|
YARN-8441. Typo in CSQueueUtils local variable names: queueGuranteedResource. Contributed by Szilard Nemeth.
|
2018-06-20 11:58:18 -07:00 |
|
Miklos Szegedi
|
9a9e969570
|
YARN-8391. Investigate AllocationFileLoaderService.reloadListener locking issue. Contributed by Szilard Nemeth.
|
2018-06-20 10:59:33 -07:00 |
|
Haibo Chen
|
4939ffedb1
|
YARN-8437. Build oom-listener fails on older versions. (Miklos Szegedi via Haibo Chen)
|
2018-06-20 10:42:13 -07:00 |
|
Dedunu Dhananjaya
|
f386e78a4b
|
YARN-7668. Remove unused variables from ContainerLocalizer
This closes #364
Signed-off-by: Akira Ajisaka <aajisaka@apache.org>
|
2018-06-18 16:45:50 -07:00 |
|
Inigo Goiri
|
b272b71095
|
HADOOP-15537. Clean up ContainerLaunch and ContainerExecutor pre-HADOOP-15528. Contributed by Giovanni Matteo Fumarola.
|
2018-06-15 09:40:45 -07:00 |
|
Rohith Sharma K S
|
9119b3cf8f
|
YARN-8155. Improve ATSv2 client logging in RM and NM publisher. Contributed by Abhishek Modi.
|
2018-06-14 12:38:10 +05:30 |
|
Eric Yang
|
22994889dc
|
YARN-8259. Improve privileged docker container liveliness checks.
Contributed by Shane Kumpf
|
2018-06-13 19:24:31 -04:00 |
|
Sunil G
|
6307962b93
|
YARN-8404. Timeline event publish need to be async to avoid Dispatcher thread leak in case ATS is down. Contributed by Rohith Sharma K S
|
2018-06-13 16:09:16 +05:30 |
|
Weiwei Yang
|
29024a6203
|
YARN-8394. Improve data locality documentation for Capacity Scheduler. Contributed by Weiwei Yang.
|
2018-06-13 09:28:05 +08:00 |
|
Inigo Goiri
|
6e756e8a62
|
HADOOP-15529. ContainerLaunch#testInvalidEnvVariableSubstitutionType is not supported in Windows. Contributed by Giovanni Matteo Fumarola.
|
2018-06-12 10:24:34 -07:00 |
|
Haibo Chen
|
10d0e4be6e
|
YARN-8325. Miscellaneous QueueManager code clean up. (Szilard Nemeth via Haibo Chen)
|
2018-06-12 10:11:30 -07:00 |
|
Haibo Chen
|
24a89825f0
|
YARN-6931. Make the aggregation interval in AppLevelTimelineCollector configurable. (Abhishek Modi via Haibo Chen)
|
2018-06-12 10:03:34 -07:00 |
|
Akira Ajisaka
|
652bcbb3e4
|
YARN-8363. Upgrade commons-lang version to 3.7 in hadoop-yarn-project. Contributed by Takanobu Asanuma.
|
2018-06-13 00:35:37 +09:00 |
|
Rohith Sharma K S
|
2df73dace0
|
YARN-8405. RM zk-state-store.parent-path ACLs has been changed since HADOOP-14773. Contributed by Íñigo Goiri.
|
2018-06-12 17:13:09 +05:30 |
|
Inigo Goiri
|
2b2f672022
|
YARN-8370. Some Node Manager tests fail on Windows due to improper path/file separator. Contributed by Anbang Hu.
|
2018-06-11 19:25:56 -07:00 |
|
Haibo Chen
|
180b3c960b
|
YARN-8321. AllocationFileLoaderService.getAllocationFile() should be declared as VisibleForTest. (Szilard Nemeth via Haibo Chen)
|
2018-06-11 11:19:33 -07:00 |
|
Haibo Chen
|
676dcffff5
|
YARN-8322. Change log level when there is an IOException when the allocation file is loaded. (Szilard Nemeth via Haibo Chen)
|
2018-06-11 11:16:21 -07:00 |
|
Haibo Chen
|
c190ac2be8
|
YARN-8323. FairScheduler.allocConf should be declared as volatile. (Szilard Nemeth via Haibo Chen)
|
2018-06-11 11:12:44 -07:00 |
|
Inigo Goiri
|
3b88fe25ba
|
YARN-8359. Exclude containermanager.linux test classes on Windows. Contributed by Jason Lowe.
|
2018-06-07 17:13:03 -07:00 |
|
Robert Kanter
|
351cf87c92
|
Disable mounting cgroups by default (miklos.szegedi@cloudera.com via rkanter)
|
2018-06-07 17:09:34 -07:00 |
|
Miklos Szegedi
|
d5eca1a6a0
|
YARN-6677. Preempt opportunistic containers when root container cgroup goes over memory limit. Contributed by Haibo Chen.
|
2018-06-07 16:38:23 -07:00 |
|
Rohith Sharma K S
|
58bc34f1e3
|
YARN-8399. NodeManager is giving 403 GSS exception post upgrade to 3.1 in secure mode. Contributed by Sunil Govindan.
|
2018-06-07 10:25:56 +05:30 |
|
Robert Kanter
|
0cd145a443
|
YARN-4677. RMNodeResourceUpdateEvent update from scheduler can lead to race condition (wilfreds and gphillips via rkanter)
|
2018-06-04 15:32:03 -07:00 |
|
Haibo Chen
|
04cf699dd5
|
YARN-8388. TestCGroupElasticMemoryController.testNormalExit() hangs on Linux. (Miklos Szegedi via Haibo Chen)
|
2018-06-04 14:23:08 -07:00 |
|
Miklos Szegedi
|
e2c172dc9f
|
YARN-8382. cgroup file leak in NM. Contributed by Hu Ziqian.
|
2018-06-04 10:41:10 -07:00 |
|
Haibo Chen
|
ba12f87dcb
|
YARN-8390. Fix API incompatible changes in FairScheduler's AllocationFileLoaderService. (Gergo Repas via Haibo Chen)
|
2018-06-04 09:15:23 -07:00 |
|
Billie Rinaldi
|
31998643a5
|
YARN-8342. Enable untrusted docker image to run with launch command. Contributed by Eric Yang
|
2018-06-02 14:46:32 -07:00 |
|
Rohith Sharma K S
|
8261f9e571
|
More YARN pages need to honor yarn.resourcemanager.display.per-user-apps(addendum). Contributed by Sunil G.
|
2018-06-02 09:29:06 +05:30 |
|
Haibo Chen
|
4880d890ee
|
YARN-8375. TestCGroupElasticMemoryController fails surefire build. (Miklos Szegedi via Haibo Chen)
|
2018-06-01 16:29:13 -07:00 |
|
Wangda Tan
|
ff583d3fa3
|
YARN-8349. Remove YARN registry entries when a service is killed by the RM. (Billie Rinaldi via wangda)
Change-Id: Ia58db3637789a8921482f564aa9bdf99c45cc36c
|
2018-06-01 14:49:18 -07:00 |
|
Wangda Tan
|
931f78718f
|
YARN-7962. Race Condition When Stopping DelegationTokenRenewer causes RM crash during failover. (BELUGA BEHR via wangda)
Change-Id: I617e2645f60a57080058ad5f06af860fb3f682c8
|
2018-06-01 14:49:18 -07:00 |
|
Wangda Tan
|
3a6bd77550
|
YARN-8384. stdout.txt, stderr.txt logs of a launched docker container is coming with primary group of submit user instead of hadoop. (Eric Yang via wangda)
Change-Id: Idbb2cd250f4f62dc32993e0d3ca3ec5684616baa
|
2018-06-01 14:49:17 -07:00 |
|
Yufei Gu
|
6b21a599fb
|
YARN-7340. Fix the missing time stamp in exception message in Class
NoOverCommitPolicy. Contributed by Dinesh Chitlangia.
|
2018-05-31 22:30:12 -07:00 |
|
Vinod Kumar Vavilapalli (I am also known as @tshooter.)
|
6b74f5d7fc
|
YARN-8197. Fixed AM IP Filter and Webapp proxy to redirect app tracking-URLs correctly when UI is secure. Contributed by Sunil Govindan.
|
2018-05-31 16:48:33 -07:00 |
|
Weiwei Yang
|
6468071f13
|
YARN-8367. Fix NPE in SingleConstraintAppPlacementAllocator when placement constraint in SchedulingRequest is null. Contributed by Weiwei Yang.
|
2018-05-31 20:46:39 +08:00 |
|
Eric E Payne
|
e44c0849d7
|
YARN-8377: Javadoc build failed in hadoop-yarn-server-nodemanager. Contributed by Takanobu Asanuma
|
2018-05-30 16:50:19 +00:00 |
|
Eric Yang
|
135941e00d
|
YARN-8362. Bugfix logic in container retries in node manager.
Contributed by Chandni Singh
|
2018-05-29 16:56:58 -04:00 |
|
Jason Lowe
|
4827e9a908
|
YARN-8329. Docker client configuration can still be set incorrectly. Contributed by Shane Kumpf
|
2018-05-29 14:43:17 -05:00 |
|
Wangda Tan
|
17aa40f669
|
YARN-8369. Javadoc build failed due to 'bad use of >'. (Takanobu Asanuma via wangda)
Change-Id: I79a42154e8f86ab1c3cc939b3745024b8eebe5f4
|
2018-05-29 09:27:36 -07:00 |
|
Jason Lowe
|
31ab960f4f
|
YARN-8338. TimelineService V1.5 doesn't come up after HADOOP-15406. Contributed by Vinod Kumar Vavilapalli
|
2018-05-29 11:00:30 -05:00 |
|
Sunil G
|
7c343669ba
|
YARN-4781. Support intra-queue preemption for fairness ordering policy. Contributed by Eric Payne.
|
2018-05-28 16:32:53 +05:30 |
|
Wangda Tan
|
f24c842d52
|
YARN-8213. Add Capacity Scheduler performance metrics. (Weiwei Yang via wangda)
Change-Id: Ieea6f3eeb83c90cd74233fea896f0fcd0f325d5f
|
2018-05-25 21:53:20 -07:00 |
|
Eric E Payne
|
8d5509c681
|
YARN-8292: Fix the dominant resource preemption cannot happen when some of the resource vector becomes negative. Contributed by Wangda Tan.
|
2018-05-25 16:06:09 +00:00 |
|
Haibo Chen
|
86bc6425d4
|
YARN-8191. Fair scheduler: queue deletion without RM restart. (Gergo Repas via Haibo Chen)
|
2018-05-24 17:12:34 -07:00 |
|
Shane Kumpf
|
1388de18ad
|
YARN-6919. Add default volume mount list. Contributed by Eric Badger
|
2018-05-24 09:30:39 -06:00 |
|
Rohith Sharma K S
|
c05b5d424b
|
YARN-8319. More YARN pages need to honor yarn.resourcemanager.display.per-user-apps. Contributed by Sunil G.
|
2018-05-24 14:19:46 +05:30 |
|
Haibo Chen
|
d996479954
|
YARN-4599. Set OOM control for memory cgroups. (Miklos Szegedi via Haibo Chen)
|
2018-05-23 16:35:37 -07:00 |
|
Inigo Goiri
|
d72615611c
|
YARN-8348. Incorrect and missing AfterClass in HBase-tests to fix NPE failures. Contributed by Giovanni Matteo Fumarola.
|
2018-05-23 14:43:59 -07:00 |
|
Inigo Goiri
|
e99e5bf104
|
YARN-8344. Missing nm.stop() in TestNodeManagerResync to fix testKillContainersOnResync. Contributed by Giovanni Matteo Fumarola.
|
2018-05-23 14:15:26 -07:00 |
|
Shane Kumpf
|
9837ca9cc7
|
YARN-8285. Remove unused environment variables from the Docker runtime. Contributed by Eric Badger
|
2018-05-23 06:43:44 -06:00 |
|
Robert Kanter
|
b22f56c471
|
YARN-8273. Log aggregation does not warn if HDFS quota in target directory is exceeded (grepas via rkanter)
|
2018-05-22 14:24:38 -07:00 |
|
Wangda Tan
|
bd15d2396e
|
YARN-8290. SystemMetricsPublisher.appACLsUpdated should be invoked after application information is published to ATS to avoid "User is not set in the application report" Exception. (Eric Yang via wangda)
Change-Id: I0ac6ddd19740d1aa7dd07111cd11af71ddc2fcaf
|
2018-05-22 13:33:33 -07:00 |
|
Eric Yang
|
6176d2b35c
|
YARN-7960. Added security flag no-new-privileges for YARN Docker integration.
Contributed by Eric Badger
|
2018-05-22 13:44:58 -04:00 |
|
Jason Lowe
|
5f11288e41
|
YARN-8206. Sending a kill does not immediately kill docker containers. Contributed by Eric Badger
|
2018-05-22 09:27:08 -05:00 |
|
Eric E Payne
|
0b4c44bdee
|
YARN-8179: Preemption does not happen due to natural_termination_factor when DRF is used. Contributed by Kyungwan Nam.
|
2018-05-21 20:14:58 +00:00 |
|
Haibo Chen
|
f48fec83d0
|
YARN-8248. Job hangs when a job requests a resource that its queue does not have. (Szilard Nemeth via Haibo Chen)
|
2018-05-21 08:10:41 -07:00 |
|
Arun Suresh
|
3159bffce2
|
YARN-7900. [AMRMProxy] AMRMClientRelayer for stateful FederationInterceptor. (Botong Huang via asuresh)
|
2018-05-17 20:00:52 -07:00 |
|
Eric Yang
|
d45a0b7d73
|
YARN-8141. Removed YARN_CONTAINER_RUNTIME_DOCKER_LOCAL_RESOURCE_MOUNTS flag.
Contributed by Chandni Singh
|
2018-05-17 17:29:34 -04:00 |
|
Jason Lowe
|
be53969047
|
YARN-8071. Add ability to specify nodemanager environment variables individually. Contributed by Jim Brennan
|
2018-05-16 16:17:28 -05:00 |
|
Haibo Chen
|
e3b7d7ac16
|
YARN-7933. [atsv2 read acls] Add TimelineWriter#writeDomain. (Rohith Sharma K S via Haibo Chen)
|
2018-05-16 13:00:01 -07:00 |
|
Jason Lowe
|
d47c09dcb1
|
YARN-8284. get_docker_command refactoring. Contributed by Eric Badger
|
2018-05-16 09:23:49 -05:00 |
|
Weiwei Yang
|
2bb647bb91
|
YARN-8278. DistributedScheduling is not working in HA. Contributed by Bibin A Chundatt.
|
2018-05-15 17:28:19 +08:00 |
|
Haibo Chen
|
2d00a0c71b
|
YARN-8130 Race condition when container events are published for KILLED applications. (Rohith Sharma K S via Haibo Chen)
|
2018-05-14 11:08:42 -07:00 |
|
Eric Yang
|
0ff94563b9
|
YARN-8265. Improve DNS handling on docker IP changes.
Contributed by Billie Rinaldi
|
2018-05-11 22:37:43 -07:00 |
|
Jason Lowe
|
6c8e51ca7e
|
YARN-7654. Support ENTRY_POINT for docker container. Contributed by Eric Yang
|
2018-05-11 18:56:05 -05:00 |
|
Jason Lowe
|
dc912994a1
|
YARN-8244. TestContainerSchedulerQueuing.testStartMultipleContainers failed. Contributed by Jim Brennan
|
2018-05-11 14:07:32 -05:00 |
|
Haibo Chen
|
1f10a36021
|
YARN-8268. Fair scheduler: reservable queue is configured both as parent and leaf queue. (Gergo Repas via Haibo Chen)
|
2018-05-11 11:34:00 -07:00 |
|
Eric Yang
|
8f7912e0fe
|
YARN-8274. Fixed a bug on docker start command.
Contributed by Jason Lowe
|
2018-05-11 14:23:16 -04:00 |
|
Weiwei Yang
|
9db9cd95bd
|
YARN-7003. DRAINING state of queues is not recovered after RM restart. Contributed by Tao Yang.
|
2018-05-11 10:47:04 +08:00 |
|
Vinod Kumar Vavilapalli (I am also known as @tshooter.)
|
d76fbbc9b8
|
YARN-8249. Fixed few REST APIs in RMWebServices to have static-user check. Contributed by Sunil Govindan.
|
2018-05-10 19:05:35 -07:00 |
|
Haibo Chen
|
6341c3a437
|
YARN-7715. Support NM promotion/demotion of running containers. (Miklos Szegedi via Haibo Chen)
|
2018-05-10 11:01:01 -07:00 |
|
Robert Kanter
|
c8b53c4364
|
YARN-8202. DefaultAMSProcessor should properly check units of requested custom resource types against minimum/maximum allocation (snemeth via rkanter)
|
2018-05-10 09:31:59 -07:00 |
|
Vrushali C
|
3c95ca4f21
|
YARN-8247 Incorrect HTTP status code returned by ATSv2 for non-whitelisted users. Contributed by Rohith Sharma K S
|
2018-05-09 22:17:48 -07:00 |
|
bibinchundatt
|
cc0310a526
|
YARN-8201. Skip stacktrace of few exception from ClientRMService. Contributed by Bilwa S T.
|
2018-05-10 09:15:46 +05:30 |
|
Eric Yang
|
af4fc2e628
|
YARN-8261. Fixed a bug in creation of localized container directory.
Contributed by Jason Lowe
|
2018-05-09 13:32:51 -04:00 |
|
Rohith Sharma K S
|
eb7b256cc2
|
YARN-8239. [UI2] Clicking on Node Manager UI under AM container info / App Attempt page goes to old RM UI. Contributed by Sunil G.
|
2018-05-09 12:00:43 +05:30 |
|
Jason Lowe
|
a2ea756420
|
YARN-8207. Docker container launch use popen have risk of shell expansion. Contributed by Eric Yang.
|
2018-05-08 15:30:36 -05:00 |
|
Wangda Tan
|
6b96a73bb0
|
YARN-8232. RMContainer lost queue name when RM HA happens. (Hu Ziqian via wangda)
Change-Id: Ia21e1da6871570c993bbedde76ce32929e95970f
|
2018-05-08 11:34:45 -07:00 |
|
Rohith Sharma K S
|
7450583721
|
YARN-8253. HTTPS Ats v2 api call fails with 'bad HTTP parsed'. Contributed by Charan Hebri.
|
2018-05-08 12:28:54 +05:30 |
|
Weiwei Yang
|
67f239c42f
|
YARN-8025. UsersManangers#getComputedResourceLimitForActiveUsers throws NPE due to preComputedActiveUserLimit is empty. Contributed by Tao Yang.
|
2018-05-07 11:33:14 +08:00 |
|
Inigo Goiri
|
6a69239d86
|
YARN-8163. Add support for Node Labels in opportunistic scheduling. Contributed by Abhishek Modi.
|
2018-05-04 14:59:59 -07:00 |
|