Commit Graph

1172 Commits

Author SHA1 Message Date
Billie Rinaldi
ca90b2480e YARN-9032. Support sh shell for interactive container shell at command line. Contributed by Eric Yang 2018-12-14 14:08:22 -08:00
Billie Rinaldi
72e7c6a489 YARN-8962. Add ability to use interactive shell with normal yarn container. Contributed by Eric Yang 2018-12-12 18:18:57 -08:00
Wangda Tan
37eb919c59 YARN-8885. [DevicePlugin] Support NM APIs to query device resource allocation. (Zhankun Tang via wangda)
Change-Id: I2a9870709b512af1ac6c09c9701d0b3c0791ff32
2018-12-12 11:45:47 -08:00
Wangda Tan
61bdcb7b2b YARN-9015. [DevicePlugin] Add an interface for device plugin to provide customized scheduler. (Zhankun Tang via wangda)
Change-Id: Ib2e4ae47a6f29bb3082c1f8520cf5a52ca720979
2018-12-12 11:44:22 -08:00
Haibo Chen
881230da21 YARN-9051. Integrate multiple CustomResourceTypesConfigurationProvider implementations into one. (Contributed by Szilard Nemeth) 2018-12-11 11:41:32 -08:00
Haibo Chen
ac578c0e82 YARN-9087. Improve logging for initialization of Resource plugins. (Contributed by Szilard Nemeth) 2018-12-10 11:12:54 -08:00
Billie Rinaldi
154449fbd8 YARN-8914. Add xtermjs to YARN UI2. Contributed by Eric Yang and Akhil PB 2018-12-07 10:56:17 -08:00
Eric Yang
1b790f4dd1 YARN-9071. Improved status update for reinitialized containers.
Contributed by Chandni Singh
2018-12-05 17:00:56 -05:00
Wangda Tan
bad12031f6 YARN-9010. Fix the incorrect trailing slash deletion in constructor method of CGroupsHandlerImpl. (Zhankun Tang via wangda)
Change-Id: Iaecc66d57781cc10f19ead4647e47fc9556676da
2018-11-29 14:56:07 -08:00
Wangda Tan
fe7dab8ef5 YARN-8989. [YARN-8851] Move DockerCommandPlugin volume related APIs' invocation from DockerLinuxContainerRuntime#prepareContainer to #launchContainer. (Zhankun Tang via wangda)
Change-Id: Ia6d532c687168448416dfdf46f0ac34bff20e6ca
2018-11-28 15:03:06 -08:00
Wangda Tan
8ebeda98a9 YARN-8974. Improve the assertion message in TestGPUResourceHandler. (Zhankun Tang via wangda)
Change-Id: I4eb58e9d251d5f54e7feffc4fbb813b4f5ae4b1b
2018-11-28 14:36:30 -08:00
Wangda Tan
9ed87567ad YARN-9061. Improve the GPU/FPGA module log message of container-executor. (Zhankun Tang via wangda)
Change-Id: Iece9b47438357077a53984a820d4d6423f480518
2018-11-28 14:31:31 -08:00
Wangda Tan
579ef4be06 YARN-8882. [YARN-8851] Add a shared device mapping manager (scheduler) for device plugins. (Zhankun Tang via wangda)
Change-Id: I9435136642c3d556971a357bf687f69df90bb45e
2018-11-28 14:09:52 -08:00
Jason Lowe
3ce99e32f7 YARN-8812. Containers fail during creating a symlink which started with hyphen for a resource file. Contributed by Oleksandr Shevchenko 2018-11-28 08:46:11 -06:00
Eric Yang
33e0df4b35 YARN-8986. Added port publish for Docker container running with bridge network.
Contributed by Charo Zhang
2018-11-26 19:45:05 -05:00
Rohith Sharma K S
e223a790a7 YARN-9042. Fix javadoc error in deviceplugin package. Contributed by Zhankun Tang. 2018-11-22 20:04:30 +05:30
Billie Rinaldi
49824ed260 YARN-8838. Check that container user is same as websocket user for interactive shell. Contributed by Eric Yang 2018-11-20 11:12:24 -08:00
Wangda Tan
6357803645 YARN-8881. [YARN-8851] Add basic pluggable device plugin framework. (Zhankun Tang via wangda)
Change-Id: If9a2f68cd4713b4ec932cdeda68106f17437c3d3
2018-11-19 08:54:31 -08:00
Eric Yang
21ec4bdaef YARN-8672. Improve token filename management for localization.
Contributed by Chandni Singh
2018-11-14 15:22:01 -05:00
Billie Rinaldi
1f9c4f32e8 YARN-8776. Implement Container Exec feature in LinuxContainerExecutor. Contributed by Eric Yang 2018-11-12 10:42:30 -08:00
Botong Huang
b5ec85d966 YARN-8933. [AMRMProxy] Fix potential empty fields in allocation response, move SubClusterTimeout to FederationInterceptor. Contributed by Botong Huang. 2018-11-11 11:12:53 -08:00
Weiwei Yang
f8c72d7b3a YARN-8880. Add configurations for pluggable plugin framework. Contributed by Zhankun Tang. 2018-11-08 12:23:00 +08:00
Giovanni Matteo Fumarola
989715ec50 YARN-8893. [AMRMProxy] Fix thread leak in AMRMClientRelayer and UAM client. Contributed by Botong Huang. 2018-11-02 15:30:08 -07:00
Billie Rinaldi
d07e873b7d YARN-8569. Create an interface to provide cluster information to application. Contributed by Eric Yang 2018-10-26 17:57:05 -07:00
Robert Kanter
f76e3c3db7 YARN-8930. CGroup-based strict container memory enforcement does not work with CGroupElasticMemoryController (haibochen via rkanter) 2018-10-25 11:09:47 -07:00
Eric Yang
c16c49b8c3 YARN-8587. Added retries for fetching docker exit code.
Contributed by Charo Zhang
2018-10-24 17:29:46 -04:00
Robert Kanter
69b328943e YARN-8929. DefaultOOMHandler should only pick running containers to kill upon oom events (haibochen via rkanter) 2018-10-24 13:15:50 -07:00
Haibo Chen
766b78ee07 YARN-8911. ContainerScheduler incorrectly uses percentage number as the cpu resource utlization. 2018-10-24 07:58:26 -07:00
Eric Yang
66e7a2c787 YARN-8922. Fixed test-container-executor test setup and clean up.
Contributed by Robert Kanter
2018-10-22 15:28:58 -04:00
Eric Yang
e2a9fa8448 HADOOP-15821. Move YARN Registry to Hadoop Registry.
Contributed by Íñigo Goiri
2018-10-19 19:46:48 -04:00
Eric Yang
47ad98b2e1 YARN-8910. Fixed misleading log statement when container max retries is infinite.
Contributed by Chandni Singh
2018-10-19 13:49:04 -04:00
Wangda Tan
5e02b4915b YARN-8916. Define a constant docker string in ContainerRuntimeConstants.java for better maintainability. (Zhankun Tang via wangda)
Change-Id: I1349e740037f81afdbe30edbe741f20e88fd0a90
2018-10-19 09:49:26 -07:00
Wangda Tan
a457a8951a YARN-8456. Fix a configuration handling bug when user leave FPGA discover executable path configuration default but set OpenCL SDK path environment variable. (Zhankun Tang via wangda)
Change-Id: Iff150ea98ba0c60d448474fd940eb121afce6965
2018-10-18 10:57:11 -07:00
Haibo Chen
32fe351bb6 YARN-8864. NM incorrectly logs container user as the user who sent a start/stop container request in its audit log. (Contributed by Wilfred Spiegelenburg) 2018-10-18 08:28:07 -07:00
Haibo Chen
c2288ac45b YARN-8448. AM HTTPS Support for AM communication with RMWeb proxy. (Contributed by Robert Kanter) 2018-10-16 13:36:26 -07:00
Haibo Chen
f880ff418c YARN-8775. TestDiskFailures.testLocalDirsFailures sometimes can fail on concurrent File modifications. (Contributed by Antal Bálint Steinbach) 2018-10-15 09:37:20 -07:00
Billie Rinaldi
96d28b4750 YARN-8777. Container Executor C binary change to execute interactive docker command. Contributed by Eric Yang 2018-10-11 09:25:21 -07:00
Jason Lowe
e787d65a08 YARN-8861. executorLock is misleading in ContainerLaunch. Contributed by Chandni Singh 2018-10-11 10:54:57 -05:00
Jason Lowe
5ce70e1211 YARN-7644. NM gets backed up deleting docker containers. Contributed by Chandni Singh 2018-10-10 09:52:19 -05:00
Inigo Goiri
c3d22d3b45 YARN-7652. Handle AM register requests asynchronously in FederationInterceptor. Contributed by Botong Huang. 2018-10-09 10:29:40 -07:00
Eric Yang
347ea38581 YARN-8763. Added node manager websocket API for accessing containers.
Contributed by Zian Chen
2018-10-08 13:56:47 -04:00
Wangda Tan
2e9913caf2 YARN-8844. TestNMProxy unit test is failing. (Eric Yang via wangda)
Change-Id: I241fa8701b6f1dbcad87fd2e9a429e32e7aa40f5
(cherry picked from commit b3ac8869338faeabedbc27e95e3166c830a9a761)
2018-10-04 10:53:39 -07:00
Sunil G
58fa96b697 Changed version in trunk to 3.3.0-SNAPSHOT. 2018-10-02 22:41:41 +05:30
Shane Kumpf
5edb9d3b97 YARN-8785. Improve the error message when a bind mount is not whitelisted. Contributed by Simon Prewo 2018-10-02 07:16:29 -06:00
Giovanni Matteo Fumarola
59d5af21b7 YARN-8760. [AMRMProxy] Fix concurrent re-register due to YarnRM failover in AMRMClientRelayer. Contributed by Botong Huang. 2018-10-01 13:12:38 -07:00
Eric Yang
b237a0dd44 YARN-6456. Added config to set default container runtimes.
Contributed by Craig Condit
2018-09-27 15:31:18 -04:00
Eric Yang
913f87dada YARN-8665. Added Yarn service cancel upgrade option.
Contributed by Chandni Singh
2018-09-26 14:51:35 -04:00
Giovanni Matteo Fumarola
3090922805 YARN-8696. [AMRMProxy] FederationInterceptor upgrade: home sub-cluster heartbeat async. Contributed by Botong Huang. 2018-09-24 11:37:05 -07:00
Eric Yang
aa4bd493c3 YARN-8801. Fixed header comments for docker utility functions.
Contributed by Zian Chen
2018-09-20 13:08:59 -04:00
Jason Lowe
6b5838ed32 YARN-8784. DockerLinuxContainerRuntime prevents access to distributed cache entries on a full disk. Contributed by Eric Badger 2018-09-19 16:44:51 -05:00
Eric Yang
efdea85ad1 YARN-8791. Trim docker inspect output for line feed for STOPSIGNAL parsing.
Contributed by Chandni Singh
2018-09-19 13:16:11 -04:00
Jason Lowe
2df0a8dcb3 YARN-8648. Container cgroups are leaked when using docker. Contributed by Jim Brennan 2018-09-18 15:36:45 -05:00
Shane Kumpf
144a55f0e3 YARN-8045. Reduce log output from container status calls. Contributed by Craig Condit 2018-09-14 10:41:55 -06:00
Shane Kumpf
78902f0250 YARN-8748. Javadoc warnings within the nodemanager package. Contributed by Craig Condit 2018-09-14 10:28:36 -06:00
Eric Yang
99237607bf YARN-8706. Allow additional flag in docker inspect call.
Contributed by Chandni Singh
2018-09-14 11:46:59 -04:00
Jason Lowe
250b50018e YARN-8680. YARN NM: Implement Iterable Abstraction for LocalResourceTracker state. Contributed by Pradeep Ambati 2018-09-13 13:28:54 -05:00
Weiwei Yang
39c1ea1ed4 YARN-8729. Node status updater thread could be lost after it is restarted. Contributed by Tao Yang. 2018-09-13 22:21:35 +08:00
Giovanni Matteo Fumarola
02b9bfdf9e YARN-8658. [AMRMProxy] Metrics for AMRMClientRelayer inside FederationInterceptor. Contributed by Young Chen. 2018-09-12 11:46:35 -07:00
bibinchundatt
c44088ac19 YARN-8739. Fix jenkins issues for Node Attributes branch. Contributed by Sunil Govindan. 2018-09-12 16:01:01 +05:30
bibinchundatt
8cf6a9a2bd YARN-7892. Revisit NodeAttribute class structure. Contributed by Naganarasimha G R. 2018-09-12 16:01:00 +05:30
Sunil G
440ff7f563 YARN-8094. Support configuration based Node Attribute provider. Contributed by Weiwei Yang. 2018-09-12 16:01:00 +05:30
Naganarasimha
3b3b6efe21 YARN-7871. Node attributes reporting from NM to RM. Contributed by Weiwei Yang. 2018-09-12 16:01:00 +05:30
Sunil G
ffcabd24c3 YARN-7856. Validate Node Attributes from NM. Contributed by Weiwei Yang. 2018-09-12 16:01:00 +05:30
Sunil G
2f7712be09 YARN-6858. Attribute Manager to store and provide node attributes in RM. Contributed by Naganarasimha G R. 2018-09-12 16:01:00 +05:30
Naganarasimha
d312b5cf9f YARN-7757. Refactor NodeLabelsProvider to be more generic and reusable for node attributes providers. Contributed by Weiwei Yang. 2018-09-12 16:01:00 +05:30
Eric Yang
bf8a1750e9 YARN-8706. Updated docker container stop logic to avoid double kill.
Contributed by Chandni Singh
2018-09-07 20:18:09 -04:00
Eric Yang
7d62334387 YARN-8751. Reduce conditions that mark node manager as unhealthy.
Contributed by Craig Condit
2018-09-07 19:46:15 -04:00
Shane Kumpf
dffb7bfe6c YARN-8638. Allow linux container runtimes to be pluggable. Contributed by Craig Condit 2018-09-05 06:47:54 -06:00
Shane Kumpf
73625168c0 YARN-8642. Add support for tmpfs mounts with the Docker runtime. Contributed by Craig Condit 2018-08-29 07:08:37 -06:00
Giovanni Matteo Fumarola
602d13844a HADOOP-15699. Fix some of testContainerManager failures in Windows. Contributed by Botong Huang. 2018-08-27 12:25:46 -07:00
Billie Rinaldi
05b2bbeb35 YARN-8675. Remove default hostname for docker containers when net=host. Contributed by Suma Shivaprasad 2018-08-27 11:34:33 -07:00
Giovanni Matteo Fumarola
f152582562 YARN-8705. Refactor the UAM heartbeat thread in preparation for YARN-8696. Contributed by Botong Huang. 2018-08-27 10:32:22 -07:00
Jason Lowe
585ebd873a YARN-8649. NPE in localizer hearbeat processing if a container is killed while localizing. Contributed by lujie 2018-08-23 09:29:46 -05:00
Giovanni Matteo Fumarola
8736fc39ac YARN-8673. [AMRMProxy] More robust responseId resync after an YarnRM master slave switch. Contributed by Botong Huang. 2018-08-20 12:22:36 -07:00
Jason Lowe
65e7469712 YARN-8242. YARN NM: OOM error while reading back the state store on recovery. Contributed by Pradeep Ambati and Kanwaljeet Sachdev 2018-08-20 10:14:40 -05:00
Eric Yang
d42806160e YARN-8667. Cleanup symlinks when container restarted by NM.
Contributed by Chandni Singh
2018-08-16 18:41:58 -04:00
Jason Lowe
cb21eaa026 YARN-8656. container-executor should not write cgroup tasks files for docker containers. Contributed by Jim Brennan 2018-08-16 10:06:17 -05:00
Wei-Chiu Chuang
bdd0e01ed9 HADOOP-14212. Addendum patch: Expose SecurityEnabled boolean field in JMX for other services besides NameNode. Contributed by Adam Antal. 2018-08-14 18:25:38 -07:00
Wei-Chiu Chuang
363bd16e31 HADOOP-14212. Expose SecurityEnabled boolean field in JMX for other services besides NameNode. Contributed by Adam Antal. 2018-08-14 17:19:00 -07:00
Eric Yang
4cba0741d5 YARN-8160. Support upgrade of service that use docker containers.
Contributed by Chandni Singh
2018-08-14 11:36:26 -04:00
Jason Lowe
d1d129aa9d YARN-8640. Restore previous state in container-executor after failure. Contributed by Jim Brennan 2018-08-14 10:21:03 -05:00
Jason Lowe
cd04e954d2 YARN-8331. Race condition in NM container launched after done. Contributed by Pradeep Ambati 2018-08-09 10:17:34 -05:00
Wangda Tan
d4258fcad7 YARN-8629. Container cleanup fails while trying to delete Cgroups. (Suma Shivaprasad via wangda)
Change-Id: I392ef4f8baa84d5d7b1f2e438c560b5426b6d4f2
2018-08-07 12:36:55 -07:00
Jason Lowe
7526815e32 YARN-8263. DockerClient still touches hadoop.tmp.dir. Contributed by Craig Condit 2018-08-02 10:43:48 -05:00
Billie Rinaldi
67c65da261 YARN-8403. Change the log level for fail to download resource from INFO to ERROR. Contributed by Eric Yang 2018-08-01 08:51:40 -07:00
Wangda Tan
4b540bbfcf YARN-8418. App local logs could leaked if log aggregation fails to initialize for the app. (Bibin A Chundatt via wangda)
Change-Id: I29a23ca4b219b48c92e7975cd44cddb8b0e04104
2018-07-31 12:08:00 -07:00
bibinchundatt
2b39ad2698 YARN-8584. Several typos in Log Aggregation related classes. Contributed by Szilard Nemeth. 2018-07-30 23:25:19 +05:30
bibinchundatt
3d586841ab YARN-8558. NM recovery level db not cleaned up properly on container finish. Contributed by Bibin A Chundatt. 2018-07-28 20:52:39 +05:30
Eric Yang
ed9d60e888 YARN-8508. Release GPU resource for killed container.
Contributed by Chandni Singh
2018-07-27 19:33:58 -04:00
bibinchundatt
8461278833 YARN-8544. [DS] AM registration fails when hadoop authorization is enabled. Contributed by Bibin A Chundatt. 2018-07-24 13:09:17 +05:30
Eric Yang
8688a0c7f8 YARN-8380. Support bind propagation options for mounts in docker runtime.
Contributed by Billie Rinaldi
2018-07-23 20:12:04 -04:00
Haibo Chen
9d3c39e9dd YARN-6966. NodeManager metrics may return wrong negative values when NM restart. (Szilard Nemeth via Haibo Chen) 2018-07-23 11:07:24 -07:00
Haibo Chen
e6873dfde0 YARN-7300. DiskValidator is not used in LocalDirAllocator. (Szilard Nemeth via Haibo Chen) 2018-07-19 16:27:11 -07:00
Eric Yang
efb4e274e5 YARN-8538. Fixed memory leaks in container-executor and test cases.
Contributed by Billie Rinaldi
2018-07-16 17:38:49 -04:00
Jason Lowe
17118f446c YARN-8515. container-executor can crash with SIGPIPE after nodemanager restart. Contributed by Jim Brennan 2018-07-13 10:05:25 -05:00
Robert Kanter
1bc106a738 YARN-8518. test-container-executor test_is_empty() is broken (Jim_Brennan via rkanter) 2018-07-12 16:38:46 -07:00
Sunil G
7f1d3d0e9d YARN-8512. ATSv2 entities are not published to HBase from second attempt onwards. Contributed by Rohith Sharma K S. 2018-07-11 12:26:32 +05:30
Sunil G
705e2c1f7c YARN-8473. Containers being launched as app tears down can leave containers in NEW state. Contributed by Jason Lowe. 2018-07-10 20:11:47 +05:30
Giovanni Matteo Fumarola
ea9b608237 YARN-7899. [AMRMProxy] Stateful FederationInterceptor for pending requests. Contributed by Botong Huang. 2018-07-09 12:27:36 -07:00
Robert Kanter
0838fe8337 Only mount non-empty directories for cgroups (miklos.szegedi@cloudera.com via rkanter) 2018-07-09 10:37:20 -07:00