Szilard Nemeth
|
e8fa192f07
|
YARN-9217. Nodemanager will fail to start if GPU is misconfigured on the node or GPU drivers missing. Contributed by Peter Bacsko
|
2019-08-21 16:44:22 +02:00 |
|
bibinchundatt
|
e684b17e6f
|
YARN-5857. TestLogAggregationService.testFixedSizeThreadPool fails intermittently on trunk. Contributed by Bilwa S T.
|
2019-08-21 17:14:42 +05:30 |
|
Szilard Nemeth
|
2216ec54e5
|
YARN-9100. Add tests for GpuResourceAllocator and do minor code cleanup. Contributed by Peter Bacsko
|
2019-08-16 09:13:20 +02:00 |
|
Adam Antal
|
22c4f38c4b
|
YARN-9679. Regular code cleanup in TestResourcePluginManager (#1122)
|
2019-08-15 17:32:05 +02:00 |
|
Szilard Nemeth
|
3e0410449f
|
YARN-9133. Make tests more easy to comprehend in TestGpuResourceHandler. Contributed by Peter Bacsko
|
2019-08-14 17:13:54 +02:00 |
|
Szilard Nemeth
|
e5e609384f
|
YARN-9140. Code cleanup in ResourcePluginManager.initialize and in TestResourcePluginManager. Contributed by Peter Bacsko
|
2019-08-14 16:58:22 +02:00 |
|
Eric Yang
|
6ff0453ede
|
YARN-9527. Prevent rogue Localizer Runner from downloading same file repeatedly.
Contributed by Jim Brennan
|
2019-08-09 14:12:17 -04:00 |
|
HUAN-PING SU
|
7c2042a44d
|
YARN-9678. Addendum: TestGpuResourceHandler / TestFpgaResourceHandler should be renamed. Contributed by kevin su.
Signed-off-by: Wei-Chiu Chuang <weichiu@apache.org>
|
2019-08-06 10:21:55 -07:00 |
|
HUAN-PING SU
|
b8bf09ba3d
|
YARN-9678. TestGpuResourceHandler / TestFpgaResourceHandler should be renamed. Contributed by kevin su.
Signed-off-by: Wei-Chiu Chuang <weichiu@apache.org>
|
2019-08-06 09:05:53 -07:00 |
|
Vidura Mudalige
|
1930a7bf60
|
YARN-9093. Remove commented code block from the beginning of Tes… (#444)
|
2019-08-02 13:16:19 +02:00 |
|
Szilard Nemeth
|
18ee1092b4
|
YARN-9127. Create more tests to verify GpuDeviceInformationParser. Contributed by Peter Bacsko
|
2019-07-15 11:59:11 +02:00 |
|
Szilard Nemeth
|
61b0c2bb7c
|
YARN-9337. GPU auto-discovery script runs even when the resource is given by hand. Contributed by Adam Antal
|
2019-07-12 17:28:14 +02:00 |
|
Szilard Nemeth
|
8b3c6791b1
|
YARN-9135. NM State store ResourceMappings serialization are tested with Strings instead of real Device objects. Contributed by Peter Bacsko
|
2019-07-12 17:20:42 +02:00 |
|
Szilard Nemeth
|
c416284bb7
|
YARN-9235. If linux container executor is not set for a GPU cluster GpuResourceHandlerImpl is not initialized and NPE is thrown. Contributed by Antal Balint Steinbach, Adam Antal
|
2019-07-12 16:51:58 +02:00 |
|
Haibo Chen
|
9b54dd7186
|
YARN-9668. UGI conf doesn't read user overridden configurations on RM and NM startup. (Contributed by Jonathan Hung)
|
2019-07-11 13:57:08 -07:00 |
|
Szilard Nemeth
|
a2a8be18cb
|
YARN-9629. Support configurable MIN_LOG_ROLLING_INTERVAL. Contributed by Adam Antal.
|
2019-07-03 13:45:00 +02:00 |
|
Weiwei Yang
|
570eee30e5
|
YARN-9655. AllocateResponse in FederationInterceptor lost applicationPriority. Contributed by hunshenshi.
|
2019-07-02 09:55:25 +08:00 |
|
Eric Yang
|
29465bf169
|
YARN-9560. Restructure DockerLinuxContainerRuntime to extend OCIContainerRuntime.
Contributed by Eric Badger, Jim Brennan, Craig Condit
|
2019-06-28 17:18:53 -04:00 |
|
Giovanni Matteo Fumarola
|
1ac967a6b7
|
YARN-6055. ContainersMonitorImpl need be adjusted when NM resource changed. Contributed by Inigo Goiri.
|
2019-06-26 14:01:31 -07:00 |
|
Zhankun Tang
|
062eb605ac
|
YARN-9477. Implement VE discovery using libudev. Contributed by Peter Bacsko.
|
2019-06-26 23:53:14 +08:00 |
|
Giovanni Matteo Fumarola
|
bcfd228336
|
YARN-9599. TestContainerSchedulerQueuing#testQueueShedding fails intermittently. Contributed by Abhishek Modi.
|
2019-06-13 11:08:35 -07:00 |
|
bibinchundatt
|
2263ead365
|
YARN-9557. Application fails in diskchecker when ReadWriteDiskValidator is configured. Contributed by Bilwa S T.
|
2019-06-11 23:20:28 +05:30 |
|
Zhankun Tang
|
606061aa14
|
YARN-9595. FPGA plugin: NullPointerException in FpgaNodeResourceUpdateHandler.updateConfiguredResource(). Contributed by Peter Bacsko.
|
2019-06-04 09:56:59 +08:00 |
|
Eric Yang
|
460ba7fb14
|
YARN-9558. Fixed LogAggregation test cases.
Contributed by Prabhu Joseph
|
2019-05-23 18:38:47 -04:00 |
|
Eric Yang
|
accb811e57
|
YARN-6929. Improved partition algorithm for yarn remote-app-log-dir.
Contributed by Prabhu Joseph
|
2019-04-30 17:04:59 -04:00 |
|
Zhankun Tang
|
7fbaa7d66f
|
YARN-9476. [YARN-9473] Create unit tests for VE plugin. Contributed by Peter Bacsko.
|
2019-04-30 11:06:44 +08:00 |
|
Eric Badger
|
79d3d35398
|
YARN-9486. Docker container exited with failure does not get clean up correctly. Contributed by Eric Yang
|
2019-04-26 01:21:28 +00:00 |
|
Sean Mackrory
|
a703dae25e
|
HADOOP-16222. Fix new deprecations after guava 27.0 update in trunk. Contributed by Gabor Bota.
|
2019-04-24 10:39:00 -06:00 |
|
Prabhu Joseph
|
aa4c744aef
|
YARN-9470. Fix order of actual and expected expression in assert statements
Signed-off-by: Akira Ajisaka <aajisaka@apache.org>
|
2019-04-18 15:40:37 +09:00 |
|
Szilard Nemeth
|
b8086aed86
|
YARN-9123. Clean up and split testcases in TestNMWebServices for GPU support. Contributed by Szilard Nemeth.
Signed-off-by: Wei-Chiu Chuang <weichiu@apache.org>
|
2019-04-16 11:06:25 -07:00 |
|
Eric Badger
|
254efc9358
|
YARN-9379. Can't specify docker runtime through environment. Contributed by caozhiqiang
|
2019-04-15 18:24:37 +00:00 |
|
Vrushali C
|
27039a29ae
|
YARN-9382. Publish container killed, paused and resumed events to ATSv2. Contributed by Abhishek Modi.
|
2019-04-05 12:02:43 -07:00 |
|
Giovanni Matteo Fumarola
|
ab2bda57bd
|
YARN-9428. Add metrics for paused containers in NodeManager. Contributed by Abhishek Modi.
|
2019-04-01 14:21:17 -07:00 |
|
Giovanni Matteo Fumarola
|
332cab5518
|
YARN-9418. ATSV2 /apps//entities/YARN_CONTAINER rest api does not show metrics. Contributed by Prabhu Joseph.
|
2019-04-01 11:06:51 -07:00 |
|
Devaraj K
|
56f1e131ec
|
YARN-9270. Minor cleanup in TestFpgaDiscoverer. Contributed by Peter Bacsko.
|
2019-03-29 10:58:56 -07:00 |
|
Devaraj K
|
eeda6891e4
|
YARN-9268. General improvements in FpgaDevice. Contributed by Peter Bacsko.
|
2019-03-25 13:22:53 -07:00 |
|
Eric Yang
|
3c45762a0b
|
YARN-9391. Fixed node manager environment leaks into Docker containers.
Contributed by Jim Brennan
|
2019-03-25 15:53:24 -04:00 |
|
Devaraj K
|
a99eb80659
|
YARN-9267. General improvements in FpgaResourceHandlerImpl. Contributed by Peter Bacsko.
|
2019-03-21 11:15:56 -07:00 |
|
Eric Yang
|
5f6e225166
|
YARN-9363. Replaced debug logging with SLF4J parameterized log message.
Contributed by Prabhu Joseph
|
2019-03-18 13:57:18 -04:00 |
|
Sunil G
|
8e1539eca8
|
YARN-9266. General improvements in IntelFpgaOpenclPlugin. Contributed by Peter Bacsko.
|
2019-03-13 02:45:17 +05:30 |
|
Sunil G
|
de15a66d78
|
YARN-9265. FPGA plugin fails to recognize Intel Processing Accelerator Card. Contributed by Peter Bacsko.
|
2019-03-08 17:39:22 +05:30 |
|
Eric Yang
|
39b4a37e02
|
YARN-9341. Fixed reentrant lock usage in YARN project.
Contributed by Prabhu Joseph
|
2019-03-07 16:47:45 -05:00 |
|
Sunil G
|
46045c5cb3
|
YARN-9138. Improve test coverage for nvidia-smi binary execution of GpuDiscoverer. Contributed by Szilard Nemeth.
|
2019-03-06 16:01:08 +05:30 |
|
Sunil G
|
dcaca19871
|
YARN-9139. Simplify initializer code of GpuDiscoverer. Contributed by Szilard Nemeth.
|
2019-03-01 19:24:35 +05:30 |
|
Eric Yang
|
fbc7bb315f
|
YARN-9245. Added query docker image command ability to node manager.
Contributed by Chandni Singh
|
2019-02-27 14:57:24 -05:00 |
|
Weiwei Yang
|
c6ea28c480
|
YARN-9331. [YARN-8851] Fix a bug that lacking cgroup initialization when bootstrap DeviceResourceHandlerImpl. Contributed by Zhankun Tang.
|
2019-02-26 10:05:31 +08:00 |
|
Sunil G
|
5e91ebd91a
|
YARN-9121. Replace GpuDiscoverer.getInstance() to a readable object for easy access control. Contributed by Szilard Nemeth.
|
2019-02-25 11:30:46 +05:30 |
|
Sunil G
|
dddcfa4d9f
|
YARN-8821. [YARN-8851] GPU hierarchy/topology scheduling support based on pluggable device framework. Contributed by Zhankun Tang.
|
2019-02-24 14:37:06 +05:30 |
|
Sunil G
|
95fbbfed75
|
YARN-9118. Handle exceptions with parsing user defined GPU devices in GpuDiscoverer. Contributed by Szilard Nemeth.
|
2019-02-22 20:22:17 +05:30 |
|
Sunil G
|
db4d1a1e2f
|
YARN-9060. [YARN-8851] Phase 1 - Support device isolation and use the Nvidia GPU plugin as an example. Contributed by Zhankun Tang.
|
2019-02-18 15:58:04 +05:30 |
|
Eric Yang
|
3dc2523266
|
YARN-9184. Add a system flag to allow update to latest docker images.
Contributed by Zhaohui Xin
|
2019-02-12 16:16:35 -05:00 |
|
Rohith Sharma K S
|
e3ec18b0c4
|
YARN-6735. Have a way to turn off container metrics from NMs. Contributed by Abhishek Modi.
|
2019-02-05 13:48:04 +05:30 |
|
Weiwei Yang
|
f20b043a02
|
YARN-9263. TestConfigurationNodeAttributesProvider fails after Mockito updated. Contributed by Weiwei Yang.
|
2019-02-02 23:04:34 +08:00 |
|
Akira Ajisaka
|
1129288cf5
|
HADOOP-14178. Move Mockito up to version 2.23.4. Contributed by Akira Ajisaka and Masatake Iwasaki.
|
2019-01-29 18:29:56 -08:00 |
|
Eric Yang
|
2e636dd3c4
|
YARN-9074. Consolidate docker removal logic in ContainerCleanup.
Contributed by Zhaohui Xin
|
2019-01-28 18:05:53 -05:00 |
|
Eric Yang
|
1ab69a9543
|
YARN-9221. Added flag to disable dynamic auxiliary service feature.
Contributed by Billie Rinaldi
|
2019-01-25 19:05:36 -05:00 |
|
Eric Yang
|
a33ef4fd31
|
YARN-8867. Added resource localization status to YARN service status call.
Contributed by Chandni Singh
|
2019-01-24 18:43:21 -05:00 |
|
Eric Yang
|
2fa9389c2e
|
YARN-9146. Added REST API to configure auxiliary service.
Contributed by Billie Rinaldi
|
2019-01-22 18:24:43 -05:00 |
|
Jason Lowe
|
6a923464af
|
YARN-6523. Optimize system credentials sent in node heartbeat responses. Contributed by Manikandan R
|
2019-01-08 16:54:05 -06:00 |
|
Giovanni Matteo Fumarola
|
489411579c
|
YARN-9169. Add metrics for queued opportunistic and guaranteed containers. Contributed by Abhishek Modi.
|
2019-01-07 15:16:55 -08:00 |
|
Wangda Tan
|
0a01d49917
|
YARN-8822. Nvidia-docker v2 support for YARN GPU feature. (Charo Zhang via wangda)
Change-Id: Id8af27134d3286a7a10d85eda9be25df9689d0e7
|
2019-01-07 12:07:26 -08:00 |
|
Sunil G
|
f4906ac019
|
YARN-9038. [CSI] Add ability to publish/unpublish volumes on node managers. Contributed by Weiwei Yang.
|
2019-01-04 12:10:00 +05:30 |
|
Eric Yang
|
dfceffa70d
|
YARN-9147. Remove auxiliary services when manifest file is removed.
Contributed by Billie Rinaldi
|
2019-01-03 12:57:21 -05:00 |
|
Botong Huang
|
657aa433e2
|
YARN-9108. Fix FederationInterceptor merge home and secondary allocate response typo. Contributed by Abhishek Modi.
|
2018-12-22 12:41:49 -08:00 |
|
Eric Yang
|
f82922dcfa
|
YARN-5168. Added exposed port information for Docker container.
Contributed by Xun Liu
|
2018-12-21 19:44:07 -05:00 |
|
Eric Yang
|
ea724181d6
|
YARN-9132. Added file permission check for auxiliary services manifest file.
Contributed by Billie Rinaldi
|
2018-12-21 14:56:39 -05:00 |
|
Weiwei Yang
|
f659485ee8
|
YARN-8925. Updating distributed node attributes only when necessary. Contributed by Tao Yang.
|
2018-12-21 10:56:42 +08:00 |
|
Eric Yang
|
a80d321074
|
YARN-9152. Improved AuxServices REST API output.
Contributed by Billie Rinaldi
|
2018-12-20 19:21:55 -05:00 |
|
Eric Yang
|
e815fd9c49
|
YARN-9126. Fix container clean up for reinitialization.
Contributed by Chandni Singh
|
2018-12-19 14:55:56 -05:00 |
|
Eric Yang
|
c7a5a4435e
|
YARN-9075. Add ability to register/remove auxiliary service without restart node manager.
Contributed by Billie Rinaldi
|
2018-12-18 17:05:51 -05:00 |
|
Billie Rinaldi
|
c5c73182e5
|
YARN-9072. Send exit command to terminate docker exec on connection close. Contributed by Eric Yang
|
2018-12-18 10:06:33 -08:00 |
|
Eric Yang
|
b2d7204ed0
|
YARN-9125. Fixed Carriage Return detection in Docker container launch command.
Contributed by Billie Rinaldi
|
2018-12-14 17:52:26 -05:00 |
|
Wangda Tan
|
37eb919c59
|
YARN-8885. [DevicePlugin] Support NM APIs to query device resource allocation. (Zhankun Tang via wangda)
Change-Id: I2a9870709b512af1ac6c09c9701d0b3c0791ff32
|
2018-12-12 11:45:47 -08:00 |
|
Wangda Tan
|
61bdcb7b2b
|
YARN-9015. [DevicePlugin] Add an interface for device plugin to provide customized scheduler. (Zhankun Tang via wangda)
Change-Id: Ib2e4ae47a6f29bb3082c1f8520cf5a52ca720979
|
2018-12-12 11:44:22 -08:00 |
|
Haibo Chen
|
881230da21
|
YARN-9051. Integrate multiple CustomResourceTypesConfigurationProvider implementations into one. (Contributed by Szilard Nemeth)
|
2018-12-11 11:41:32 -08:00 |
|
Billie Rinaldi
|
154449fbd8
|
YARN-8914. Add xtermjs to YARN UI2. Contributed by Eric Yang and Akhil PB
|
2018-12-07 10:56:17 -08:00 |
|
Eric Yang
|
1b790f4dd1
|
YARN-9071. Improved status update for reinitialized containers.
Contributed by Chandni Singh
|
2018-12-05 17:00:56 -05:00 |
|
Wangda Tan
|
bad12031f6
|
YARN-9010. Fix the incorrect trailing slash deletion in constructor method of CGroupsHandlerImpl. (Zhankun Tang via wangda)
Change-Id: Iaecc66d57781cc10f19ead4647e47fc9556676da
|
2018-11-29 14:56:07 -08:00 |
|
Wangda Tan
|
fe7dab8ef5
|
YARN-8989. [YARN-8851] Move DockerCommandPlugin volume related APIs' invocation from DockerLinuxContainerRuntime#prepareContainer to #launchContainer. (Zhankun Tang via wangda)
Change-Id: Ia6d532c687168448416dfdf46f0ac34bff20e6ca
|
2018-11-28 15:03:06 -08:00 |
|
Wangda Tan
|
8ebeda98a9
|
YARN-8974. Improve the assertion message in TestGPUResourceHandler. (Zhankun Tang via wangda)
Change-Id: I4eb58e9d251d5f54e7feffc4fbb813b4f5ae4b1b
|
2018-11-28 14:36:30 -08:00 |
|
Wangda Tan
|
579ef4be06
|
YARN-8882. [YARN-8851] Add a shared device mapping manager (scheduler) for device plugins. (Zhankun Tang via wangda)
Change-Id: I9435136642c3d556971a357bf687f69df90bb45e
|
2018-11-28 14:09:52 -08:00 |
|
Jason Lowe
|
3ce99e32f7
|
YARN-8812. Containers fail during creating a symlink which started with hyphen for a resource file. Contributed by Oleksandr Shevchenko
|
2018-11-28 08:46:11 -06:00 |
|
Eric Yang
|
33e0df4b35
|
YARN-8986. Added port publish for Docker container running with bridge network.
Contributed by Charo Zhang
|
2018-11-26 19:45:05 -05:00 |
|
Billie Rinaldi
|
49824ed260
|
YARN-8838. Check that container user is same as websocket user for interactive shell. Contributed by Eric Yang
|
2018-11-20 11:12:24 -08:00 |
|
Wangda Tan
|
6357803645
|
YARN-8881. [YARN-8851] Add basic pluggable device plugin framework. (Zhankun Tang via wangda)
Change-Id: If9a2f68cd4713b4ec932cdeda68106f17437c3d3
|
2018-11-19 08:54:31 -08:00 |
|
Eric Yang
|
21ec4bdaef
|
YARN-8672. Improve token filename management for localization.
Contributed by Chandni Singh
|
2018-11-14 15:22:01 -05:00 |
|
Billie Rinaldi
|
1f9c4f32e8
|
YARN-8776. Implement Container Exec feature in LinuxContainerExecutor. Contributed by Eric Yang
|
2018-11-12 10:42:30 -08:00 |
|
Botong Huang
|
b5ec85d966
|
YARN-8933. [AMRMProxy] Fix potential empty fields in allocation response, move SubClusterTimeout to FederationInterceptor. Contributed by Botong Huang.
|
2018-11-11 11:12:53 -08:00 |
|
Weiwei Yang
|
f8c72d7b3a
|
YARN-8880. Add configurations for pluggable plugin framework. Contributed by Zhankun Tang.
|
2018-11-08 12:23:00 +08:00 |
|
Giovanni Matteo Fumarola
|
989715ec50
|
YARN-8893. [AMRMProxy] Fix thread leak in AMRMClientRelayer and UAM client. Contributed by Botong Huang.
|
2018-11-02 15:30:08 -07:00 |
|
Billie Rinaldi
|
d07e873b7d
|
YARN-8569. Create an interface to provide cluster information to application. Contributed by Eric Yang
|
2018-10-26 17:57:05 -07:00 |
|
Robert Kanter
|
f76e3c3db7
|
YARN-8930. CGroup-based strict container memory enforcement does not work with CGroupElasticMemoryController (haibochen via rkanter)
|
2018-10-25 11:09:47 -07:00 |
|
Robert Kanter
|
69b328943e
|
YARN-8929. DefaultOOMHandler should only pick running containers to kill upon oom events (haibochen via rkanter)
|
2018-10-24 13:15:50 -07:00 |
|
Haibo Chen
|
766b78ee07
|
YARN-8911. ContainerScheduler incorrectly uses percentage number as the cpu resource utilization.
|
2018-10-24 07:58:26 -07:00 |
|
Eric Yang
|
47ad98b2e1
|
YARN-8910. Fixed misleading log statement when container max retries is infinite.
Contributed by Chandni Singh
|
2018-10-19 13:49:04 -04:00 |
|
Wangda Tan
|
5e02b4915b
|
YARN-8916. Define a constant docker string in ContainerRuntimeConstants.java for better maintainability. (Zhankun Tang via wangda)
Change-Id: I1349e740037f81afdbe30edbe741f20e88fd0a90
|
2018-10-19 09:49:26 -07:00 |
|
Wangda Tan
|
a457a8951a
|
YARN-8456. Fix a configuration handling bug when user leave FPGA discover executable path configuration default but set OpenCL SDK path environment variable. (Zhankun Tang via wangda)
Change-Id: Iff150ea98ba0c60d448474fd940eb121afce6965
|
2018-10-18 10:57:11 -07:00 |
|
Haibo Chen
|
32fe351bb6
|
YARN-8864. NM incorrectly logs container user as the user who sent a start/stop container request in its audit log. (Contributed by Wilfred Spiegelenburg)
|
2018-10-18 08:28:07 -07:00 |
|
Haibo Chen
|
c2288ac45b
|
YARN-8448. AM HTTPS Support for AM communication with RMWeb proxy. (Contributed by Robert Kanter)
|
2018-10-16 13:36:26 -07:00 |
|
Jason Lowe
|
5ce70e1211
|
YARN-7644. NM gets backed up deleting docker containers. Contributed by Chandni Singh
|
2018-10-10 09:52:19 -05:00 |
|