Szilard Nemeth
|
742e30b473
|
YARN-9096: Some GpuResourcePlugin and ResourcePluginManager methods are synchronized unnecessarily. Contributed by Gergely Pollak
|
2019-08-09 09:59:19 +02:00 |
|
Szilard Nemeth
|
72d7e570a7
|
YARN-9094: Remove unused interface method: NodeResourceUpdaterPlugin#handleUpdatedResourceFromRM. Contributed by Gergely Pollak
|
2019-08-09 09:49:18 +02:00 |
|
Haibo Chen
|
f51702d539
|
YARN-9559. Create AbstractContainersLauncher for pluggable ContainersLauncher logic. (Contributed by Jonathan Hung)
|
2019-08-06 13:52:30 -07:00 |
|
HUAN-PING SU
|
7c2042a44d
|
YARN-9678. Addendum: TestGpuResourceHandler / TestFpgaResourceHandler should be renamed. Contributed by kevin su.
Signed-off-by: Wei-Chiu Chuang <weichiu@apache.org>
|
2019-08-06 10:21:55 -07:00 |
|
HUAN-PING SU
|
b8bf09ba3d
|
YARN-9678. TestGpuResourceHandler / TestFpgaResourceHandler should be renamed. Contributed by kevin su.
Signed-off-by: Wei-Chiu Chuang <weichiu@apache.org>
|
2019-08-06 09:05:53 -07:00 |
|
Eric Yang
|
d6697da5e8
|
YARN-9667. Use setbuf with line buffer to reduce fflush complexity in container-executor.
Contributed by Peter Bacsko
|
2019-08-05 13:59:12 -04:00 |
|
Szilard Nemeth
|
54ac80176e
|
Logging fileSize of log files under NM Local Dir. Contributed by Prabhu Joseph
|
2019-08-02 13:38:06 +02:00 |
|
Vidura Mudalige
|
1930a7bf60
|
YARN-9093. Remove commented code block from the beginning of Tes… (#444)
|
2019-08-02 13:16:19 +02:00 |
|
Adam Antal
|
95fc38f2e9
|
YARN-9375. Use Configured in GpuDiscoverer and FpgaDiscoverer (#1131)
Contributed by Adam Antal
|
2019-08-02 11:24:09 +02:00 |
|
Szilard Nemeth
|
18ee1092b4
|
YARN-9127. Create more tests to verify GpuDeviceInformationParser. Contributed by Peter Bacsko
|
2019-07-15 11:59:11 +02:00 |
|
Szilard Nemeth
|
61b0c2bb7c
|
YARN-9337. GPU auto-discovery script runs even when the resource is given by hand. Contributed by Adam Antal
|
2019-07-12 17:28:14 +02:00 |
|
Szilard Nemeth
|
8b3c6791b1
|
YARN-9135. NM State store ResourceMappings serialization are tested with Strings instead of real Device objects. Contributed by Peter Bacsko
|
2019-07-12 17:20:42 +02:00 |
|
Szilard Nemeth
|
c416284bb7
|
YARN-9235. If linux container executor is not set for a GPU cluster GpuResourceHandlerImpl is not initialized and NPE is thrown. Contributed by Antal Balint Steinbach, Adam Antal
|
2019-07-12 16:51:58 +02:00 |
|
Haibo Chen
|
9b54dd7186
|
YARN-9668. UGI conf doesn't read user overridden configurations on RM and NM startup. (Contributed by Jonathan Hung)
|
2019-07-11 13:57:08 -07:00 |
|
Szilard Nemeth
|
a2a8be18cb
|
YARN-9629. Support configurable MIN_LOG_ROLLING_INTERVAL. Contributed by Adam Antal.
|
2019-07-03 13:45:00 +02:00 |
|
Weiwei Yang
|
570eee30e5
|
YARN-9655. AllocateResponse in FederationInterceptor lost applicationPriority. Contributed by hunshenshi.
|
2019-07-02 09:55:25 +08:00 |
|
Eric Yang
|
29465bf169
|
YARN-9560. Restructure DockerLinuxContainerRuntime to extend OCIContainerRuntime.
Contributed by Eric Badger, Jim Brennan, Craig Condit
|
2019-06-28 17:18:53 -04:00 |
|
Weiwei Yang
|
f09c31a97e
|
Revert "YARN-9655. AllocateResponse in FederationInterceptor lost applicationPriority. (hunshenshi via wwei) closes apache/hadoop#1023"
This reverts commit 5e7caf1287.
|
2019-06-29 00:29:17 +08:00 |
|
Weiwei Yang
|
5e7caf1287
|
YARN-9655. AllocateResponse in FederationInterceptor lost applicationPriority. (hunshenshi via wwei) closes apache/hadoop#1023
|
2019-06-29 00:08:40 +08:00 |
|
Giovanni Matteo Fumarola
|
1ac967a6b7
|
YARN-6055. ContainersMonitorImpl need be adjusted when NM resource changed. Contributed by Inigo Goiri.
|
2019-06-26 14:01:31 -07:00 |
|
Zhankun Tang
|
062eb605ac
|
YARN-9477. Implement VE discovery using libudev. Contributed by Peter Bacsko.
|
2019-06-26 23:53:14 +08:00 |
|
Zhankun Tang
|
67414a1a80
|
YARN-9584. Should put initializeProcessTrees method call before get pid. Contributed by Wanqiang Ji.
|
2019-06-18 12:23:52 +08:00 |
|
Giovanni Matteo Fumarola
|
bcfd228336
|
YARN-9599. TestContainerSchedulerQueuing#testQueueShedding fails intermittently. Contributed by Abhishek Modi.
|
2019-06-13 11:08:35 -07:00 |
|
bibinchundatt
|
2263ead365
|
YARN-9557. Application fails in diskchecker when ReadWriteDiskValidator is configured. Contributed by Bilwa S T.
|
2019-06-11 23:20:28 +05:30 |
|
bibinchundatt
|
6d80b9bc3f
|
YARN-9594. Fix missing break statement in ContainerScheduler#handle. Contributed by lujie.
|
2019-06-11 22:49:21 +05:30 |
|
bibinchundatt
|
f7df55f4a8
|
YARN-9602. Use logger format in Container Executor. Contributed by Abhishek Modi.
|
2019-06-11 22:29:00 +05:30 |
|
Zhankun Tang
|
606061aa14
|
YARN-9595. FPGA plugin: NullPointerException in FpgaNodeResourceUpdateHandler.updateConfiguredResource(). Contributed by Peter Bacsko.
|
2019-06-04 09:56:59 +08:00 |
|
Weiwei Yang
|
4530f4500d
|
YARN-9507. Fix NPE in NodeManager#serviceStop on startup failure. Contributed by Bilwa S T.
|
2019-06-03 14:09:37 +08:00 |
|
Giovanni Matteo Fumarola
|
2210897609
|
YARN-9592. Use Logger format in ContainersMonitorImpl. Contributed by Inigo Goiri.
|
2019-05-31 17:35:49 -07:00 |
|
Sunil G
|
7861a5eb1a
|
YARN-9033. ResourceHandlerChain#bootstrap is invoked twice during NM start if LinuxContainerExecutor enabled. Contributed by Zhankun Tang.
|
2019-05-31 10:22:26 +05:30 |
|
Sunil G
|
30c6dd92e1
|
YARN-9452. Fix TestDistributedShell and TestTimelineAuthFilterForV2 failures. Contributed by Prabhu Joseph.
|
2019-05-30 22:32:41 +05:30 |
|
Akira Ajisaka
|
afd844059c
|
HADOOP-16331. Fix ASF License check in pom.xml
Signed-off-by: Takanobu Asanuma <tasanuma@apache.org>
|
2019-05-29 17:25:13 +09:00 |
|
Akira Ajisaka
|
9f933e6446
|
HADOOP-16323. https everywhere in Maven settings.
|
2019-05-27 15:24:59 +09:00 |
|
Eric Yang
|
460ba7fb14
|
YARN-9558. Fixed LogAggregation test cases.
Contributed by Prabhu Joseph
|
2019-05-23 18:38:47 -04:00 |
|
Haibo Chen
|
597fa47ad1
|
YARN-9529. Log correct cpu controller path on error while initializing CGroups. (Contributed by Jonathan Hung)
|
2019-05-06 11:56:22 -07:00 |
|
Eric Yang
|
accb811e57
|
YARN-6929. Improved partition algorithm for yarn remote-app-log-dir.
Contributed by Prabhu Joseph
|
2019-04-30 17:04:59 -04:00 |
|
Zhankun Tang
|
7fbaa7d66f
|
YARN-9476. [YARN-9473] Create unit tests for VE plugin. Contributed by Peter Bacsko.
|
2019-04-30 11:06:44 +08:00 |
|
Eric Badger
|
79d3d35398
|
YARN-9486. Docker container exited with failure does not get clean up correctly. Contributed by Eric Yang
|
2019-04-26 01:21:28 +00:00 |
|
Sean Mackrory
|
a703dae25e
|
HADOOP-16222. Fix new deprecations after guava 27.0 update in trunk. Contributed by Gabor Bota.
|
2019-04-24 10:39:00 -06:00 |
|
Zhankun Tang
|
8a95ea61e1
|
YARN-9475. [YARN-9473] Create basic VE plugin. Contributed by Peter Bacsko.
|
2019-04-23 17:33:58 +08:00 |
|
Inigo Goiri
|
96e3027e46
|
YARN-2889. Limit the number of opportunistic container allocated per AM heartbeat. Contributed by Abhishek Modi.
|
2019-04-22 09:49:03 -07:00 |
|
Eric Yang
|
ef97a20831
|
YARN-8622. Fixed container-executor compilation on MacOSX.
Contributed by Siyao Meng
|
2019-04-18 18:59:21 -04:00 |
|
Prabhu Joseph
|
aa4c744aef
|
YARN-9470. Fix order of actual and expected expression in assert statements
Signed-off-by: Akira Ajisaka <aajisaka@apache.org>
|
2019-04-18 15:40:37 +09:00 |
|
Siyao Meng
|
6e4399ea61
|
YARN-9487. NodeManager native build shouldn't link against librt on macOS. Contributed by Siyao Meng.
Signed-off-by: Wei-Chiu Chuang <weichiu@apache.org>
|
2019-04-17 22:56:57 -07:00 |
|
Eric Yang
|
9cf7401794
|
YARN-9349. Improved log level practices for InvalidStateTransitionException.
Contributed by Anuhan Torgonshar
(cherry picked from commit fe2370e039e1ee980d74769ae85d67434e0993cf)
|
2019-04-16 19:53:45 -04:00 |
|
Szilard Nemeth
|
b8086aed86
|
YARN-9123. Clean up and split testcases in TestNMWebServices for GPU support. Contributed by Szilard Nemeth.
Signed-off-by: Wei-Chiu Chuang <weichiu@apache.org>
|
2019-04-16 11:06:25 -07:00 |
|
Eric Badger
|
5583e1b6fc
|
YARN-7848 Force removal of docker containers that do not get removed on first try. Contributed by Eric Yang
|
2019-04-15 20:47:09 +00:00 |
|
Eric Badger
|
254efc9358
|
YARN-9379. Can't specify docker runtime through environment. Contributed by caozhiqiang
|
2019-04-15 18:24:37 +00:00 |
|
Vrushali C
|
27039a29ae
|
YARN-9382 Publish container killed, paused and resumed events to ATSv2. Contributed by Abhishek Modi.
|
2019-04-05 12:02:43 -07:00 |
|
Giovanni Matteo Fumarola
|
ab2bda57bd
|
YARN-9428. Add metrics for paused containers in NodeManager. Contributed by Abhishek Modi.
|
2019-04-01 14:21:17 -07:00 |
|
Giovanni Matteo Fumarola
|
332cab5518
|
YARN-9418. ATSV2 /apps//entities/YARN_CONTAINER rest api does not show metrics. Contributed by Prabhu Joseph.
|
2019-04-01 11:06:51 -07:00 |
|
Devaraj K
|
56f1e131ec
|
YARN-9270. Minor cleanup in TestFpgaDiscoverer. Contributed by Peter Bacsko.
|
2019-03-29 10:58:56 -07:00 |
|
Devaraj K
|
a4cd75e09c
|
YARN-9269. Minor cleanup in FpgaResourceAllocator. Contributed by Peter Bacsko.
|
2019-03-27 10:08:07 -07:00 |
|
Devaraj K
|
eeda6891e4
|
YARN-9268. General improvements in FpgaDevice. Contributed by Peter Bacsko.
|
2019-03-25 13:22:53 -07:00 |
|
Eric Yang
|
3c45762a0b
|
YARN-9391. Fixed node manager environment leaks into Docker containers.
Contributed by Jim Brennan
|
2019-03-25 15:53:24 -04:00 |
|
Devaraj K
|
a99eb80659
|
YARN-9267. General improvements in FpgaResourceHandlerImpl. Contributed by Peter Bacsko.
|
2019-03-21 11:15:56 -07:00 |
|
Eric Yang
|
506502bb83
|
YARN-9370. Added logging for recovering assigned GPU devices.
Contributed by Yesha Vora
|
2019-03-20 19:12:19 -04:00 |
|
Eric Yang
|
f2b862cac6
|
YARN-9398. Fixed javadoc errors for FPGA related java files.
Contributed by Peter Bacsko
|
2019-03-20 15:45:37 -04:00 |
|
Eric Yang
|
09eabda314
|
YARN-9364. Remove commons-logging dependency from YARN.
Contributed by Prabhu Joseph
|
2019-03-18 19:58:42 -04:00 |
|
Eric Yang
|
5f6e225166
|
YARN-9363. Replaced debug logging with SLF4J parameterized log message.
Contributed by Prabhu Joseph
|
2019-03-18 13:57:18 -04:00 |
|
Eric Yang
|
2064ca015d
|
YARN-9349. Changed logging to use slf4j api.
Contributed by Prabhu Joseph
|
2019-03-15 19:20:59 -04:00 |
|
Eric Badger
|
688b177fc6
|
YARN-8376. Separate white list for docker.trusted.registries and docker.privileged-container.registries. Contributed by Eric Yang
|
2019-03-14 19:39:00 +00:00 |
|
Sunil G
|
8e1539eca8
|
YARN-9266. General improvements in IntelFpgaOpenclPlugin. Contributed by Peter Bacsko.
|
2019-03-13 02:45:17 +05:30 |
|
Sunil G
|
de15a66d78
|
YARN-9265. FPGA plugin fails to recognize Intel Processing Accelerator Card. Contributed by Peter Bacsko.
|
2019-03-08 17:39:22 +05:30 |
|
Eric Yang
|
39b4a37e02
|
YARN-9341. Fixed reentrant lock usage in YARN project.
Contributed by Prabhu Joseph
|
2019-03-07 16:47:45 -05:00 |
|
Sunil G
|
46045c5cb3
|
YARN-9138. Improve test coverage for nvidia-smi binary execution of GpuDiscoverer. Contributed by Szilard Nemeth.
|
2019-03-06 16:01:08 +05:30 |
|
Suma Shivaprasad
|
cab8529ecb
|
YARN-7904. Privileged, trusted containers should be supported only in ENTRYPOINT mode. Contributed by Eric Yang.
|
2019-03-01 11:06:09 -08:00 |
|
Sunil G
|
dcaca19871
|
YARN-9139. Simplify initializer code of GpuDiscoverer. Contributed by Szilard Nemeth.
|
2019-03-01 19:24:35 +05:30 |
|
Eric Yang
|
fbc7bb315f
|
YARN-9245. Added query docker image command ability to node manager.
Contributed by Chandni Singh
|
2019-02-27 14:57:24 -05:00 |
|
Weiwei Yang
|
c6ea28c480
|
YARN-9331. [YARN-8851] Fix a bug that lacking cgroup initialization when bootstrap DeviceResourceHandlerImpl. Contributed by Zhankun Tang.
|
2019-02-26 10:05:31 +08:00 |
|
Giovanni Matteo Fumarola
|
95372657fc
|
YARN-9287. Consecutive StringBuilder append should be reused. Contributed by Ayush Saxena.
|
2019-02-25 11:45:37 -08:00 |
|
Sunil G
|
5e91ebd91a
|
YARN-9121. Replace GpuDiscoverer.getInstance() to a readable object for easy access control. Contributed by Szilard Nemeth.
|
2019-02-25 11:30:46 +05:30 |
|
Sunil G
|
dddcfa4d9f
|
YARN-8821. [YARN-8851] GPU hierarchy/topology scheduling support based on pluggable device framework. Contributed by Zhankun Tang.
|
2019-02-24 14:37:06 +05:30 |
|
bibinchundatt
|
ed13cf8406
|
YARN-9317. Avoid repeated YarnConfiguration#timelineServiceV2Enabled check. Contributed by Prabhu Joseph
|
2019-02-22 21:01:34 +05:30 |
|
Sunil G
|
95fbbfed75
|
YARN-9118. Handle exceptions with parsing user defined GPU devices in GpuDiscoverer. Contributed by Szilard Nemeth.
|
2019-02-22 20:22:17 +05:30 |
|
Weiwei Yang
|
632d5e8a98
|
YARN-9156. Improve debug message in device plugin method compatibility check of ResourcePluginManager. Contributed by Zhankun Tang.
|
2019-02-22 11:27:30 +08:00 |
|
Wangda Tan
|
b17a2602d1
|
YARN-9319. Fix compilation issue of handling typedef an existing name by gcc compiler. (Zhankun Tang via wangda)
Change-Id: Ie8293cc5b710ff355fb6a1ea7201be53ed0385bf
|
2019-02-21 11:17:32 -08:00 |
|
Sunil G
|
db4d1a1e2f
|
YARN-9060. [YARN-8851] Phase 1 - Support device isolation and use the Nvidia GPU plugin as an example. Contributed by Zhankun Tang.
|
2019-02-18 15:58:04 +05:30 |
|
Eric Yang
|
7c1b561e33
|
YARN-8927. Added support for top level Dockerhub images to trusted registry using library keyword.
Contributed by Zhankun Tang
|
2019-02-15 11:18:07 -05:00 |
|
Adam Antal
|
9385ec45d7
|
YARN-9283. Javadoc of LinuxContainerExecutor#addSchedPriorityCommand has a wrong property name as reference
Signed-off-by: Akira Ajisaka <aajisaka@apache.org>
|
2019-02-15 18:47:10 +09:00 |
|
Eric Yang
|
3dc2523266
|
YARN-9184. Add a system flag to allow update to latest docker images.
Contributed by Zhaohui Xin
|
2019-02-12 16:16:35 -05:00 |
|
Masatake Iwasaki
|
e0ab1bdece
|
YARN-9282. Typo in javadoc of class LinuxContainerExecutor: hadoop.security.authetication should be 'authentication'. Contributed by Charan Hebri.
|
2019-02-09 00:26:50 +09:00 |
|
Vinayakumar B
|
3c96a03dea
|
YARN-8498. Yarn NodeManager OOM Listener Fails Compilation on Ubuntu 18.04. Contributed by Ayush Saxena.
|
2019-02-07 13:02:35 +05:30 |
|
Rohith Sharma K S
|
e3ec18b0c4
|
YARN-6735. Have a way to turn off container metrics from NMs. Contributed by Abhishek Modi.
|
2019-02-05 13:48:04 +05:30 |
|
Weiwei Yang
|
f20b043a02
|
YARN-9263. TestConfigurationNodeAttributesProvider fails after Mockito updated. Contributed by Weiwei Yang.
|
2019-02-02 23:04:34 +08:00 |
|
Sunil G
|
71c49fa60f
|
YARN-9099. GpuResourceAllocator#getReleasingGpus calculates number of GPUs in a wrong way. Contributed by Szilard Nemeth.
|
2019-01-31 09:25:29 +05:30 |
|
Akira Ajisaka
|
1129288cf5
|
HADOOP-14178. Move Mockito up to version 2.23.4. Contributed by Akira Ajisaka and Masatake Iwasaki.
|
2019-01-29 18:29:56 -08:00 |
|
Weiwei Yang
|
4f63ffe444
|
YARN-9237. NM should ignore sending finished apps to RM during RM fail-over. Contributed by Jiandan Yang.
|
2019-01-29 10:34:26 +08:00 |
|
Eric Yang
|
2e636dd3c4
|
YARN-9074. Consolidate docker removal logic in ContainerCleanup.
Contributed by Zhaohui Xin
|
2019-01-28 18:05:53 -05:00 |
|
Eric Yang
|
1ab69a9543
|
YARN-9221. Added flag to disable dynamic auxiliary service feature.
Contributed by Billie Rinaldi
|
2019-01-25 19:05:36 -05:00 |
|
Eric Yang
|
a33ef4fd31
|
YARN-8867. Added resource localization status to YARN service status call.
Contributed by Chandni Singh
|
2019-01-24 18:43:21 -05:00 |
|
Eric Yang
|
2fa9389c2e
|
YARN-9146. Added REST API to configure auxiliary service.
Contributed by Billie Rinaldi
|
2019-01-22 18:24:43 -05:00 |
|
Jason Lowe
|
6a923464af
|
YARN-6523. Optimize system credentials sent in node heartbeat responses. Contributed by Manikandan R
|
2019-01-08 16:54:05 -06:00 |
|
Weiwei Yang
|
0921b706f7
|
YARN-9037. [CSI] Ignore volume resource in resource calculators based on tags. Contributed by Sunil Govindan.
|
2019-01-08 14:30:53 +08:00 |
|
Giovanni Matteo Fumarola
|
489411579c
|
YARN-9169. Add metrics for queued opportunistic and guaranteed containers. Contributed by Abhishek Modi.
|
2019-01-07 15:16:55 -08:00 |
|
Wangda Tan
|
0a01d49917
|
YARN-8822. Nvidia-docker v2 support for YARN GPU feature. (Charo Zhang via wangda)
Change-Id: Id8af27134d3286a7a10d85eda9be25df9689d0e7
|
2019-01-07 12:07:26 -08:00 |
|
Giovanni Matteo Fumarola
|
6e35f7130f
|
YARN-9166. Fix logging for preemption of Opportunistic containers for Guaranteed containers. Contributed by Abhishek Modi.
|
2019-01-04 11:16:41 -08:00 |
|
Sunil G
|
f4906ac019
|
YARN-9038. [CSI] Add ability to publish/unpublish volumes on node managers. Contributed by Weiwei Yang.
|
2019-01-04 12:10:00 +05:30 |
|
Eric Yang
|
dfceffa70d
|
YARN-9147. Remove auxiliary services when manifest file is removed.
Contributed by Billie Rinaldi
|
2019-01-03 12:57:21 -05:00 |
|
Botong Huang
|
657aa433e2
|
YARN-9108. Fix FederationInterceptor merge home and secondary allocate response typo. Contributed by Abhishek Modi.
|
2018-12-22 12:41:49 -08:00 |
|