Commit Graph

3790 Commits

Author SHA1 Message Date
Inigo Goiri
c504eee0c2 YARN-9339. Apps pending metric incorrect after moving app to a new queue. Contributed by Abhishek Modi. 2019-04-23 12:40:44 -07:00
Zhankun Tang
8a95ea61e1 YARN-9475. [YARN-9473] Create basic VE plugin. Contributed by Peter Bacsko. 2019-04-23 17:33:58 +08:00
Weiwei Yang
1c8046d67e YARN-9325. TestQueueManagementDynamicEditPolicy fails intermittent. Contributed by Prabhu Joseph. 2019-04-23 14:21:13 +08:00
Inigo Goiri
96e3027e46 YARN-2889. Limit the number of opportunistic container allocated per AM heartbeat. Contributed by Abhishek Modi. 2019-04-22 09:49:03 -07:00
Inigo Goiri
aeadb9432f YARN-9448. Fix Opportunistic Scheduling for node local allocations. Contributed by Abhishek Modi. 2019-04-19 09:41:06 -07:00
Eric Yang
ef97a20831 YARN-8622. Fixed container-executor compilation on MacOSX.
Contributed by Siyao Meng
2019-04-18 18:59:21 -04:00
Eric Yang
df76cdc895 YARN-6695. Fixed NPE in publishing appFinished events to ATSv2.
Contributed by Prabhu Joseph
2019-04-18 12:29:37 -04:00
Prabhu Joseph
aa4c744aef
YARN-9470. Fix order of actual and expected expression in assert statements
Signed-off-by: Akira Ajisaka <aajisaka@apache.org>
2019-04-18 15:40:37 +09:00
Siyao Meng
6e4399ea61 YARN-9487. NodeManager native build shouldn't link against librt on macOS. Contributed by Siyao Meng.
Signed-off-by: Wei-Chiu Chuang <weichiu@apache.org>
2019-04-17 22:56:57 -07:00
Eric Yang
9cf7401794 YARN-9349. Improved log level practices for InvalidStateTransitionException.
Contributed by Anuhan Torgonshar

(cherry picked from commit fe2370e039e1ee980d74769ae85d67434e0993cf)
2019-04-16 19:53:45 -04:00
Szilard Nemeth
b8086aed86 YARN-9123. Clean up and split testcases in TestNMWebServices for GPU support. Contributed by Szilard Nemeth.
Signed-off-by: Wei-Chiu Chuang <weichiu@apache.org>
2019-04-16 11:06:25 -07:00
Eric Badger
5583e1b6fc YARN-7848 Force removal of docker containers that do not get removed on first try. Contributed by Eric Yang 2019-04-15 20:47:09 +00:00
Eric Badger
254efc9358 YARN-9379. Can't specify docker runtime through environment. Contributed by caozhiqiang 2019-04-15 18:24:37 +00:00
Weiwei Yang
7fa73fac26 YARN-9439. Support asynchronized scheduling mode and multi-node lookup mechanism for app activities. Contributed by Tao Yang. 2019-04-16 00:12:43 +08:00
Inigo Goiri
7a68e7abd5 YARN-9474. Remove hard coded sleep from Opportunistic Scheduler tests. Contributed by Abhishek Modi. 2019-04-14 20:11:20 -07:00
Gabor Bota
1943db5571
HADOOP-16237. Fix new findbugs issues after updating guava to 27.0-jre.
Author:    Gabor Bota <gabor.bota@cloudera.com>
2019-04-12 18:28:38 -07:00
Giovanni Matteo Fumarola
ed3747c1cc YARN-9435. Add Opportunistic Scheduler metrics in ResourceManager. Contributed by Abhishek Modi. 2019-04-11 11:49:19 -07:00
Weiwei Yang
8c1bba375b YARN-9463. Add queueName info when failing with queue capacity sanity check. Contributed by Aihua Xu. 2019-04-10 22:51:28 +08:00
Igor Rudenko
32722d2661
YARN-9433. Remove unused constants in YARN resource manager
Signed-off-by: Akira Ajisaka <aajisaka@apache.org>
2019-04-10 18:37:27 +09:00
Giovanni Matteo Fumarola
cfec455c45 YARN-999. In case of long running tasks, reduce node resource should balloon out resource quickly by calling preemption API and suspending running task. Contributed by Inigo Goiri. 2019-04-09 10:59:43 -07:00
Weiwei Yang
fc05b0e70e YARN-9313. Support asynchronized scheduling mode and multi-node lookup mechanism for scheduler activities. Contributed by Tao Yang. 2019-04-08 13:40:53 +08:00
Weiwei Yang
ec143cbf67 YARN-9413. Queue resource leak after app fail for CapacityScheduler. Contributed by Tao Yang. 2019-04-06 19:59:36 +08:00
Vrushali C
22362c876d YARN-9335 [atsv2] Restrict the number of elements held in timeline collector when backend is unreachable for async calls. Contributed by Abhishesk Modi. 2019-04-05 12:06:51 -07:00
Vrushali C
27039a29ae YARN-9382 Publish container killed, paused and resumed events to ATSv2. Contributed by Abhishesk Modi. 2019-04-05 12:02:43 -07:00
Eric Yang
8d150067e2 YARN-9396. Fixed duplicated RM Container created event to ATS.
Contributed by Prabhu Joseph
2019-04-04 13:01:56 -04:00
Vrushali C
eb03f7c419 YARN-9303 Username splits won't help timelineservice.app_flow table. Contributed by Prabhu Joseph. 2019-04-03 22:53:05 -07:00
Sunil G
002dcc4ebf YARN-4901. QueueMetrics needs to be cleared before MockRM is initialized. Contributed by Peter Bacsko. 2019-04-03 18:57:28 +05:30
Yufei Gu
2f752830ba YARN-9214. Add AbstractYarnScheduler#getValidQueues method to remove duplication. Contributed by Wanqiang Ji. 2019-04-01 20:05:15 -07:00
Giovanni Matteo Fumarola
ab2bda57bd YARN-9428. Add metrics for paused containers in NodeManager. Contributed by Abhishek Modi. 2019-04-01 14:21:17 -07:00
Giovanni Matteo Fumarola
da7f8c244d YARN-9431. Fix flaky junit test fair.TestAppRunnability after YARN-8967. Contributed by Wilfred Spiegelenburg. 2019-04-01 11:21:31 -07:00
Giovanni Matteo Fumarola
332cab5518 YARN-9418. ATSV2 /apps//entities/YARN_CONTAINER rest api does not show metrics. Contributed by Prabhu Joseph. 2019-04-01 11:06:51 -07:00
Devaraj K
56f1e131ec YARN-9270. Minor cleanup in TestFpgaDiscoverer. Contributed by Peter Bacsko. 2019-03-29 10:58:56 -07:00
Devaraj K
a4cd75e09c YARN-9269. Minor cleanup in FpgaResourceAllocator. Contributed by Peter Bacsko. 2019-03-27 10:08:07 -07:00
yufei
5257f50abb YARN-8967. Change FairScheduler to use PlacementRule interface. Contributed by Wilfred Spiegelenburg. 2019-03-25 22:47:24 -07:00
Devaraj K
eeda6891e4 YARN-9268. General improvements in FpgaDevice. Contributed by Peter Bacsko. 2019-03-25 13:22:53 -07:00
Eric Yang
3c45762a0b YARN-9391. Fixed node manager environment leaks into Docker containers.
Contributed by Jim Brennan
2019-03-25 15:53:24 -04:00
Giovanni Matteo Fumarola
509b20b292 YARN-9404. TestApplicationLifetimeMonitor#testApplicationLifetimeMonitor fails intermittent. Contributed by Prabhu Joseph. 2019-03-22 11:45:39 -07:00
Zoltan Siegl
ce5eb9cb2e YARN-9358. Add javadoc to new methods introduced in FSQueueMetrics with YARN-9322
(Contributed by Zoltan Siegl via Daniel Templeton)

Change-Id: I92d52c0ca630e71afb26b2b7587cbdbe79254a05
2019-03-22 12:28:34 +01:00
Giovanni Matteo Fumarola
548997d6c9 YARN-9402. Opportunistic containers should not be scheduled on Decommissioning nodes. Contributed by Abhishek Modi. 2019-03-21 12:04:05 -07:00
Devaraj K
a99eb80659 YARN-9267. General improvements in FpgaResourceHandlerImpl. Contributed by Peter Bacsko. 2019-03-21 11:15:56 -07:00
Eric Yang
506502bb83 YARN-9370. Added logging for recovering assigned GPU devices.
Contributed by Yesha Vora
2019-03-20 19:12:19 -04:00
Eric Yang
f2b862cac6 YARN-9398. Fixed javadoc errors for FPGA related java files.
Contributed by Peter Bacsko
2019-03-20 15:45:37 -04:00
Rohith Sharma K S
b3b0e332e6 YARN-9299. TestTimelineReaderWhitelistAuthorizationFilter ignores Http Errors. Contributed by Prabhu Joseph. 2019-03-20 21:24:31 +05:30
Rohith Sharma K S
0d24684eee YARN-9357. Modify HBase Liveness monitor log to debug. Contributed by Prabhu Joseph. 2019-03-20 21:22:54 +05:30
Rohith Sharma K S
c1a4eeb7c8 YARN-9389. FlowActivity and FlowRun table prefix is wrong. Contributed by Prabhu Joseph. 2019-03-20 21:18:19 +05:30
Giovanni Matteo Fumarola
5d8bd0e5cb YARN-9392. Handle missing scheduler events in Opportunistic Scheduler. Contributed by Abhishek Modi. 2019-03-19 11:00:21 -07:00
Eric Yang
09eabda314 YARN-9364. Remove commons-logging dependency from YARN.
Contributed by Prabhu Joseph
2019-03-18 19:58:42 -04:00
Eric Yang
5f6e225166 YARN-9363. Replaced debug logging with SLF4J parameterized log message.
Contributed by Prabhu Joseph
2019-03-18 13:57:18 -04:00
Shweta Yakkali
0e7e9013d4 YARN-9340. [Clean-up] Remove NULL check before instanceof in ResourceRequestSetKey
(Contributed by Shweta Yakkali via Daniel Templeton)

Change-Id: I932e29b36f086f7b7c76a250e33b473617ddbda1
2019-03-18 15:08:37 +01:00
Eric Yang
2064ca015d YARN-9349. Changed logging to use slf4j api.
Contributed by Prabhu Joseph
2019-03-15 19:20:59 -04:00
Eric Yang
03f3c8aed2 YARN-4404. Corrected typo in javadoc.
Contributed by Yesha Vora
2019-03-15 18:04:04 -04:00
Eric Badger
688b177fc6 YARN-8376. Separate white list for docker.trusted.registries and docker.privileged-container.registries. Contributed by Eric Yang 2019-03-14 19:39:00 +00:00
Vrushali C
f235a942d5 YARN-9016 DocumentStore as a backend for ATSv2. Contributed by Sushil Ks. 2019-03-13 16:45:23 -07:00
Vrushali C
17a3e14d25 YARN-9338 Timeline related testcases are failing. Contributed by Abhishek Modi. 2019-03-12 21:33:17 -07:00
Sunil G
8e1539eca8 YARN-9266. General improvements in IntelFpgaOpenclPlugin. Contributed by Peter Bacsko. 2019-03-13 02:45:17 +05:30
Sunil G
de15a66d78 YARN-9265. FPGA plugin fails to recognize Intel Processing Accelerator Card. Contributed by Peter Bacsko. 2019-03-08 17:39:22 +05:30
Eric Yang
39b4a37e02 YARN-9341. Fixed enentrant lock usage in YARN project.
Contributed by Prabhu Joseph
2019-03-07 16:47:45 -05:00
Vrushali C
491313ab84 YARN-8218 Add application launch time to ATSV1. Contributed by Abhishek Modi 2019-03-06 21:47:29 -08:00
Sunil G
46045c5cb3 YARN-9138. Improve test coverage for nvidia-smi binary execution of GpuDiscoverer. Contributed by Szilard Nemeth. 2019-03-06 16:01:08 +05:30
Eric Yang
7b42e0e32a YARN-7266. Fixed deadlock in Timeline Server thread initialization.
Contributed by Prabhu Joseph
2019-03-05 12:17:01 -05:00
Yufei Gu
0aefe2846f YARN-9298. Implement FS placement rules using PlacementRule interface. Contributed by Wilfred Spiegelenburg. 2019-03-04 23:49:07 -08:00
Prabhu Joseph
e40e2d6ad5
YARN-7243. Moving logging APIs over to slf4j in hadoop-yarn-server-resourcemanager.
Signed-off-by: Akira Ajisaka <aajisaka@apache.org>
2019-03-05 14:10:08 +09:00
bibinchundatt
15098df744 Revert "YARN-8132. Final Status of applications shown as UNDEFINED in ATS app queries. Contributed by Prabhu Joseph."
This reverts commit a63c358b78.
2019-03-04 16:57:31 +05:30
Suma Shivaprasad
cab8529ecb YARN-7904. Privileged, trusted containers should be supported only in ENTRYPOINT mode. Contributed by Eric Yang. 2019-03-01 11:06:09 -08:00
Sunil G
dcaca19871 YARN-9139. Simplify initializer code of GpuDiscoverer. Contributed by Szilard Nemeth. 2019-03-01 19:24:35 +05:30
Szilard Nemeth
538bb4880d YARN-9323. FSLeafQueue#computeMaxAMResource does not override zero values for custom resources
(Contributed by Szilard Nemeth via Daniel Templeton)

Change-Id: Id844ccf09488f367c0c7de0a3b2d4aca1bba31cc
2019-02-27 19:59:48 -08:00
Szilard Nemeth
7b928f19a4 YARN-9322. Store metrics for custom resource types into FSQueueMetrics and query them in FairSchedulerQueueInfo
(Contributed by Szilard Nemeth via Daniel Templeton)

Change-Id: I14c12f1265999d62102f2ec5506d90015efeefe8
2019-02-27 19:43:50 -08:00
Weiwei Yang
1779fc57a1 YARN-9324. TestSchedulingRequestContainerAllocation(Async) fails with junit-4.11. Contributed by Prabhu Joseph. 2019-02-28 09:56:29 +08:00
Vrushali C
ea3cdc60b3 YARN-3841 [atsv2 Storage implementation] Adding retry semantics to HDFS backing storage. Contributed by Abhishek Modi. 2019-02-27 14:55:35 -08:00
Vrushali C
0ec962ac8f YARN-5336 Limit the flow name size & consider cleanup for hex chars. Contributed by Sushil Ks 2019-02-27 14:43:39 -08:00
Eric Yang
fbc7bb315f YARN-9245. Added query docker image command ability to node manager.
Contributed by Chandni Singh
2019-02-27 14:57:24 -05:00
Weiwei Yang
8c30114b00 YARN-9248. RMContainerImpl:Invalid event: ACQUIRED at KILLED. Contributed by lujie. 2019-02-27 17:29:02 +08:00
Rohith Sharma K S
6c96f5e4b6 YARN-8378. ApplicationHistoryManagerImpl#getApplications doesn't honor filters. Contributed by Lantao Jin. 2019-02-27 10:32:58 +05:30
Rohith Sharma K S
8eae260af5 YARN-9311. Fix TestRMRestart hangs. Contributed by Prabhu Joseph. 2019-02-27 10:28:16 +05:30
Weiwei Yang
c6ea28c480 YARN-9331. [YARN-8851] Fix a bug that lacking cgroup initialization when bootstrap DeviceResourceHandlerImpl. Contributed by Zhankun Tang. 2019-02-26 10:05:31 +08:00
Giovanni Matteo Fumarola
95372657fc YARN-9287. Consecutive StringBuilder append should be reuse. Contributed by Ayush Saxena. 2019-02-25 11:45:37 -08:00
Weiwei Yang
3e1739d589 YARN-9329. updatePriority is blocked when using FairScheduler. Contributed by Jiandan Yang. 2019-02-26 00:08:13 +08:00
Sunil G
5e91ebd91a YARN-9121. Replace GpuDiscoverer.getInstance() to a readable object for easy access control. Contributed by Szilard Nemeth. 2019-02-25 11:30:46 +05:30
Weiwei Yang
9cd5c5447f YARN-9316. TestPlacementConstraintsUtil#testInterAppConstraintsByAppID fails intermittently. Contributed by Prabhu Joseph. 2019-02-24 22:42:27 +08:00
Weiwei Yang
50094d7fef YARN-9300. Lazy preemption should trigger an update on queue preemption metrics for CapacityScheduler. Contributed by Tao Yang. 2019-02-24 22:17:29 +08:00
Sunil G
dddcfa4d9f YARN-8821. [YARN-8851] GPU hierarchy/topology scheduling support based on pluggable device framework. Contributed by Zhankun Tang. 2019-02-24 14:37:06 +05:30
bibinchundatt
ed13cf8406 YARN-9317. Avoid repeated YarnConfiguration#timelineServiceV2Enabled check. Contributed by Prabhu Joseph 2019-02-22 21:01:34 +05:30
Sunil G
95fbbfed75 YARN-9118. Handle exceptions with parsing user defined GPU devices in GpuDiscoverer. Contributed by Szilard Nemeth. 2019-02-22 20:22:17 +05:30
Weiwei Yang
9c88695bcd YARN-9238. Avoid allocating opportunistic containers to previous/removed/non-exist application attempt. Contributed by lujie. 2019-02-22 20:58:33 +08:00
Weiwei Yang
632d5e8a98 YARN-9156. Improve debug message in device plugin method compatibility check of ResourcePluginManager. Contributed by Zhankun Tang. 2019-02-22 11:27:30 +08:00
Wangda Tan
b17a2602d1 YARN-9319. Fix compilation issue of handling typedef an existing name by gcc compiler. (Zhankun Tang via wangda)
Change-Id: Ie8293cc5b710ff355fb6a1ea7201be53ed0385bf
2019-02-21 11:17:32 -08:00
Weiwei Yang
eedcc8e26e YARN-9315. TestCapacitySchedulerMetrics fails intermittently. Contributed by Prabhu Joseph. 2019-02-21 17:21:21 +08:00
bibinchundatt
a63c358b78 YARN-8132. Final Status of applications shown as UNDEFINED in ATS app queries. Contributed by Prabhu Joseph. 2019-02-21 13:29:01 +05:30
bibinchundatt
b8de78c570 YARN-9286. [Timeline Server] Sorting based on FinalStatus shows pop-up message. Contributed by Bilwa S T. 2019-02-20 01:19:20 +05:30
Sunil G
db4d1a1e2f YARN-9060. [YARN-8851] Phase 1 - Support device isolation and use the Nvidia GPU plugin as an example. Contributed by Zhankun Tang. 2019-02-18 15:58:04 +05:30
Akira Ajisaka
0f2b65c3da
HADOOP-16116. Fix Spelling Mistakes - DECOMISSIONED. Contributed by BELUGA BEHR. 2019-02-18 15:21:46 +09:00
Sunil G
ba56bc24f5 YARN-9213. RM Web UI v1 does not show custom resource allocations for containers page. Contributed by Szilard Nemeth. 2019-02-17 20:08:20 +05:30
Eric Yang
7c1b561e33 YARN-8927. Added support for top level Dockerhub images to trusted registry using library keyword.
Contributed by Zhankun Tang
2019-02-15 11:18:07 -05:00
Adam Antal
9385ec45d7
YARN-9283. Javadoc of LinuxContainerExecutor#addSchedPriorityCommand has a wrong property name as reference
Signed-off-by: Akira Ajisaka <aajisaka@apache.org>
2019-02-15 18:47:10 +09:00
Eric Yang
3dc2523266 YARN-9184. Add a system flag to allow update to latest docker images.
Contributed by Zhaohui Xin
2019-02-12 16:16:35 -05:00
Giovanni Matteo Fumarola
7536488bbd YARN-996. REST API support for node resource configuration. Contributed by Inigo Goiri. 2019-02-11 15:27:43 -08:00
Weiwei Yang
0a1637c750 YARN-8555. Parameterize TestSchedulingRequestContainerAllocation(Async) to cover both PC handler options. Contributed by Prabhu Joseph. 2019-02-11 15:53:50 +08:00
Masatake Iwasaki
e0ab1bdece YARN-9282. Typo in javadoc of class LinuxContainerExecutor: hadoop.security.authetication should be 'authentication'. Contributed by Charan Hebri. 2019-02-09 00:26:50 +09:00
Eric E Payne
d1ca9432dd YARN-7171: RM UI should sort memory / cores numerically. Contributed by Ahmed Hussein 2019-02-07 16:38:11 +00:00
Vinayakumar B
3c96a03dea YARN-8498. Yarn NodeManager OOM Listener Fails Compilation on Ubuntu 18.04. Contributed by Ayush Saxena. 2019-02-07 13:02:35 +05:30