Jason Lowe
|
854d25b0c3
|
YARN-3727. For better error recovery, check if the directory exists before using it for localization. Contributed by Zhihai Xu
|
2015-09-30 14:59:44 +00:00 |
|
Rohith Sharma K S
|
8ed0d4b744
|
YARN-4152. NodeManager crash with NPE when LogAggregationService#stopContainer called for absent container. (Bibin A Chundatt via rohithsharmaks)
|
2015-09-24 11:24:14 +05:30 |
|
Jian He
|
c57eac5dfe
|
YARN-3868. Recovery support for container resizing. Contributed by Meng Ding
|
2015-09-23 13:29:38 -07:00 |
|
Jian He
|
c3dc1af072
|
YARN-1644. RM-NM protocol changes and NodeStatusUpdater implementation to support container resizing. Contributed by Meng Ding
|
2015-09-23 13:29:37 -07:00 |
|
Jian He
|
c59ae4eeb1
|
YARN-1643. Make ContainersMonitor support changing monitoring size of an allocated container. Contributed by Meng Ding and Wangda Tan
|
2015-09-23 13:29:37 -07:00 |
|
Jian He
|
5f5a968d65
|
YARN-3867. ContainerImpl changes to support container resizing. Contributed by Meng Ding
|
2015-09-23 13:29:37 -07:00 |
|
Jian He
|
ffd820c27a
|
YARN-1645. ContainerManager implementation to support container resizing. Contributed by Meng Ding & Wangda Tan
|
2015-09-23 13:29:37 -07:00 |
|
Jason Lowe
|
c890c51a91
|
YARN-4095. Avoid sharing AllocatorPerContext object in LocalDirAllocator between ShuffleHandler and LocalDirsHandlerService. Contributed by Zhihai Xu
|
2015-09-23 15:42:01 +00:00 |
|
Xuan
|
34ef1a092b
|
YARN-4149. yarn logs -am should provide an option to fetch all the log files. Contributed by Varun Vasudev
|
2015-09-15 14:36:30 -07:00 |
|
Wangda Tan
|
77666105b4
|
YARN-4106. NodeLabels for NM in distributed mode is not updated even after clusterNodelabel addition in RM. (Bibin A Chundatt via wangda)
|
2015-09-10 09:30:09 -07:00 |
|
Zhihai Xu
|
16b9037dc1
|
YARN-4096. App local logs are leaked if log aggregation fails to initialize for the app. Contributed by Jason Lowe.
|
2015-09-08 12:29:54 -07:00 |
|
Jian He
|
6f72f1e600
|
YARN-2884. Added a proxy service in NM to proxy the communication between AM and RM. Contributed by Kishore Chaliparambil
|
2015-09-08 09:35:46 +08:00 |
|
Varun Vasudev
|
1dbd8e34a7
|
YARN-3591. Resource localization on a bad disk causes subsequent containers failure. Contributed by Lavkesh Lahngir.
|
2015-09-07 11:32:12 +05:30 |
|
Rohith Sharma K S
|
095ab9ab5f
|
YARN-4073. Removed unused ApplicationACLsManager in ContainerManagerImpl constructor. (Naganarasimha G R via rohithsharmaks)
|
2015-09-02 14:13:33 +05:30 |
|
Xuan
|
37e1c3d82a
|
YARN-221. NM should provide a way for AM to tell it not to aggregate logs. Contributed by Ming Ma
|
2015-08-22 16:25:24 -07:00 |
|
Wangda Tan
|
fc07464d1a
|
YARN-2923. Support configuration based NodeLabelsProvider Service in Distributed Node Label Configuration Setup. (Naganarasimha G R)
|
2015-08-20 11:51:03 -07:00 |
|
Karthik Kambatla
|
def12933b3
|
YARN-3534. Collect memory/cpu usage on the node. (Inigo Goiri via kasha)
|
2015-08-16 06:24:16 -07:00 |
|
Jian He
|
38aed1a94e
|
YARN-4005. Completed container whose app is finished is possibly not removed from NMStateStore. Contributed by Jun Gong
|
2015-08-13 14:46:08 -07:00 |
|
Jason Lowe
|
469cfcd695
|
YARN-3965. Add startup timestamp to nodemanager UI. Contributed by Hong Zhiguo
|
2015-08-03 15:53:32 +00:00 |
|
Varun Vasudev
|
3e6fce91a4
|
YARN-3853. Add docker container runtime support to LinuxContainerExecutor. Contributed by Sidharta Seethana.
|
2015-07-27 11:57:40 -07:00 |
|
Jason Lowe
|
ff9c13e0a7
|
YARN-3925. ContainerLogsUtils#getContainerLogFile fails to read container log files from full disks. Contributed by zhihai xu
|
2015-07-24 22:14:39 +00:00 |
|
Wangda Tan
|
06e5dd2c84
|
YARN-3956. Fix TestNodeManagerHardwareUtils fails on Mac (Varun Vasudev via wangda)
|
2015-07-22 12:01:41 -07:00 |
|
Varun Vasudev
|
63d0365088
|
YARN-2194. Addendum patch to fix failing unit test in TestPrivilegedOperationExecutor. Contributed by Sidharta Seethana.
|
2015-07-09 11:51:59 +05:30 |
|
Varun Vasudev
|
c40bdb56a7
|
YARN-2194. Fix bug causing CGroups functionality to fail on RHEL7. Contributed by Wei Yan.
|
2015-07-07 16:59:29 +05:30 |
|
Jason Lowe
|
b5cdf78e8e
|
YARN-3793. Several NPEs when deleting local files on NM recovery. Contributed by Varun Saxena
|
2015-07-01 21:13:32 +00:00 |
|
Jian He
|
62e583c7dc
|
YARN-3695. ServerProxy (NMProxy, etc.) shouldn't retry forever for non network exception. Contributed by Raju Bairishetti
|
2015-06-29 13:37:32 -07:00 |
|
Jason Lowe
|
40b256949a
|
YARN-3850. NM fails to read files from full disks which can lead to container logs being lost and other issues. Contributed by Varun Saxena
|
2015-06-26 15:47:07 +00:00 |
|
Karthik Kambatla
|
5ebf2817e5
|
YARN-3842. NMProxy should retry on NMNotYetReadyException. (Robert Kanter via kasha)
|
2015-06-22 17:47:47 -07:00 |
|
Xuan
|
6c7a9d502a
|
YARN-3834. Scrub debug logging of tokens during resource localization. Contributed by Chris Nauroth
|
2015-06-21 17:13:44 -07:00 |
|
Devaraj K
|
126321eded
|
YARN-3747. TestLocalDirsHandlerService should delete the created test directory logDir2. Contributed by David Moore.
|
2015-06-08 15:32:13 +05:30 |
|
Junping Du
|
d7e7f6aa03
|
YARN-41. The RM should handle the graceful shutdown of the NM. Contributed by Devaraj K.
|
2015-06-04 04:59:27 -07:00 |
|
cnauroth
|
4102e5882e
|
YARN-3626. On Windows localized resources are not moved to the front of the classpath when they should be. Contributed by Craig Welch.
|
2015-05-27 14:31:49 -07:00 |
|
Vinod Kumar Vavilapalli
|
500a1d9c76
|
YARN-160. Enhanced NodeManager to automatically obtain cpu/memory values from underlying OS when configured to do so. Contributed by Varun Vasudev.
|
2015-05-26 11:38:35 -07:00 |
|
Vinod Kumar Vavilapalli
|
53fafcf061
|
YARN-3684. Changed ContainerExecutor's primary lifecycle methods to use a more extensible mechanism of context objects. Contributed by Sidharta Seethana.
|
2015-05-21 15:50:23 -07:00 |
|
Wangda Tan
|
b37da52a1c
|
YARN-3565. NodeHeartbeatRequest/RegisterNodeManagerRequest should use NodeLabel object instead of String. (Naganarasimha G R via wangda)
|
2015-05-19 16:34:17 -07:00 |
|
Ravi Prakash
|
c97f32e7b9
|
YARN-3302. TestDockerContainerExecutor should run automatically if it can detect docker in the usual place (Ravindra Kumar Naik via raviprak)
|
2015-05-19 10:28:11 -07:00 |
|
Xuan
|
0f95921447
|
YARN-3626. On Windows localized resources are not moved to the front of the classpath when they should be. Contributed by Craig Welch
|
2015-05-13 13:10:53 -07:00 |
|
Karthik Kambatla
|
70fb37cd79
|
YARN-1287. Consolidate MockClocks. (Sebastian Wong and Anubhav Dhoot via kasha)
|
2015-05-09 14:34:54 -07:00 |
|
Xuan
|
6471d18bc7
|
YARN-1912. ResourceLocalizer started without any jvm memory control. Contributed by Masatake Iwasaki
|
2015-05-08 20:01:21 -07:00 |
|
Xuan
|
333f9a896d
|
YARN-3602. TestResourceLocalizationService.testPublicResourceInitializesLocalDir fails intermittently due to IOException from cleanup. Contributed by zhihai xu
|
2015-05-08 17:52:54 -07:00 |
|
Xuan
|
088156de43
|
YARN-2331. Distinguish shutdown during supervision vs. shutdown for rolling upgrade. Contributed by Jason Lowe
|
2015-05-08 15:10:43 -07:00 |
|
Robert (Bobby) Evans
|
bcf2890502
|
YARN-644: Basic null check is not performed on passed in arguments before using them in ContainerManagerImpl.startContainer
|
2015-05-08 11:11:01 -05:00 |
|
Akira Ajisaka
|
b167fe7605
|
YARN-1832. Fix wrong MockLocalizerStatus#equals implementation. Contributed by Hong Zhiguo.
|
2015-05-08 11:14:45 +09:00 |
|
Akira Ajisaka
|
918af8efff
|
YARN-3577. Misspelling of threshold in log4j.properties for tests. Contributed by Brahma Reddy Battula.
|
2015-05-07 13:33:03 +09:00 |
|
Robert Kanter
|
b72507810a
|
YARN-3491. PublicLocalizer#addResource is too slow. (zxu via rkanter)
|
2015-05-06 14:19:06 -07:00 |
|
Robert Kanter
|
ac7d152901
|
YARN-3363. add localization and container launch time to ContainerMetrics at NM to show these timing information for each active container. (zxu via rkanter)
|
2015-05-01 16:39:21 -07:00 |
|
Vinod Kumar Vavilapalli
|
1b3b9e5c31
|
YARN-2619. Added NodeManager support for disk io isolation through cgroups. Contributed by Varun Vasudev and Wei Yan.
|
2015-04-30 21:41:07 -07:00 |
|
Karthik Kambatla
|
47279c3228
|
YARN-3464. Race condition in LocalizerRunner kills localizer before localizing all resources. (Zhihai Xu via kasha)
|
2015-04-26 09:13:46 -07:00 |
|
Vinod Kumar Vavilapalli
|
a100be685c
|
YARN-3366. Enhanced NodeManager to support classifying/shaping outgoing network bandwidth traffic originating from YARN containers Contributed by Sidharta Seethana.
|
2015-04-22 17:26:13 -07:00 |
|
Jian He
|
674c7ef649
|
YARN-3503. Expose disk utilization percentage and bad local and log dir counts in NM metrics. Contributed by Varun Vasudev
|
2015-04-21 20:57:02 -07:00 |
|
Jian He
|
1b89a3e173
|
YARN-3354. Add node label expression in ContainerTokenIdentifier to support RM recovery. Contributed by Wangda Tan
|
2015-04-15 13:57:06 -07:00 |
|
Junping Du
|
838b06ac87
|
YARN-3443. Create a 'ResourceHandler' subsystem to ease addition of support for new resource types on the NM. Contributed by Sidharta Seethana.
|
2015-04-13 18:35:56 -07:00 |
|
Vinod Kumar Vavilapalli
|
b21c72777a
|
YARN-3365. Enhanced NodeManager to support using the 'tc' tool via container-executor for outbound network traffic control. Contributed by Sidharta Seethana.
|
2015-04-02 16:53:59 -07:00 |
|
Wangda Tan
|
2a945d24f7
|
YARN-2495. Allow admin to specify labels from each NM (Distributed configuration for node label). (Naganarasimha G R via wangda)
|
2015-03-30 12:05:21 -07:00 |
|
Ravi Prakash
|
e0ccea33c9
|
YARN-3288. Document and fix indentation in the DockerContainerExecutor code
|
2015-03-28 08:00:41 -07:00 |
|
Tsuyoshi Ozawa
|
82eda771e0
|
YARN-3384. TestLogAggregationService.verifyContainerLogs fails after YARN-2777. Contributed by Naganarasimha G R.
|
2015-03-24 00:25:30 +09:00 |
|
Junping Du
|
d81109e588
|
YARN-3269. Yarn.nodemanager.remote-app-log-dir could not be configured to fully qualified path. Contributed by Xuan Gong
|
2015-03-20 13:41:22 -07:00 |
|
Ravi Prakash
|
56085203c4
|
YARN-3339. TestDockerContainerExecutor should pull a single image and not the entire centos repository. (Ravindra Kumar Naik via raviprak)
|
2015-03-16 16:17:58 -07:00 |
|
Vinod Kumar Vavilapalli
|
863079bb87
|
YARN-3154. Added additional APIs in LogAggregationContext to avoid aggregating running logs of application when rolling is enabled. Contributed by Xuan Gong.
|
2015-03-12 13:32:29 -07:00 |
|
Jian He
|
21101c01f2
|
YARN-2190. Added CPU and memory limit options to the default container executor for Windows containers. Contributed by Chuan Liu
|
2015-03-06 14:18:11 -08:00 |
|
Allen Wittenauer
|
d4ac6822e1
|
YARN-2980. Move health check script related functionality to hadoop-common (Varun Saxena via aw)
|
2015-02-24 11:25:26 -08:00 |
|
Junping Du
|
c33ae271c2
|
YARN-2799. Cleanup TestLogAggregationService based on the change in YARN-90. Contributed by Zhihai Xu
|
2015-02-20 09:43:39 -08:00 |
|
cnauroth
|
6804d68901
|
YARN-2899. Run TestDockerContainerExecutorWithMocks on Linux only. Contributed by Ming Ma.
|
2015-02-13 21:58:54 -08:00 |
|
Junping Du
|
04f5ef18f7
|
YARN-2079. Recover NonAggregatingLogHandler state upon nodemanager restart. (Contributed by Jason Lowe)
|
2015-02-12 11:46:47 -08:00 |
|
Jason Lowe
|
b379972ab3
|
YARN-3074. Nodemanager dies when localizer runner tries to write to a full disk. Contributed by Varun Saxena
|
2015-02-11 16:33:43 +00:00 |
|
Jason Lowe
|
3f5431a22f
|
YARN-2809. Implement workaround for linux kernel panic when removing cgroup. Contributed by Nathan Roberts
|
2015-02-10 17:27:21 +00:00 |
|
Arun C. Murthy
|
02f154a001
|
YARN-1537. Fix race condition in TestLocalResourcesTrackerImpl.testLocalResourceCache. Contributed by Xuan Gong.
|
2015-02-05 23:58:52 -08:00 |
|
Xuan
|
b73e776abc
|
YARN-3056. Add verification for containerLaunchDuration in TestNodeManagerMetrics. Contributed by zhihai Xu
|
2015-02-03 15:12:54 -08:00 |
|
Robert Kanter
|
f7a77819a1
|
YARN-3022. Expose Container resource information from NodeManager for monitoring (adhoot via rkanter)
|
2015-02-03 10:39:41 -08:00 |
|
Jian He
|
4e15fc0841
|
YARN-3011. Possible IllegalArgumentException in ResourceLocalizationService might lead NM to crash. Contributed by Varun Saxena
|
2015-01-27 13:31:22 -08:00 |
|
Jason Lowe
|
902c6ea7e4
|
YARN-3088. LinuxContainerExecutor.deleteAsUser can throw NPE if native executor returns an error. Contributed by Eric Payne
|
2015-01-26 15:40:21 +00:00 |
|
Xuan
|
0d6bd62102
|
YARN-3024. LocalizerRunner should give DIE action when all resources are localized. Contributed by Chengbing Liu
|
2015-01-25 19:37:57 -08:00 |
|
Tsuyoshi Ozawa
|
3aab354e66
|
YARN-3082. Non thread safe access to systemCredentials in NodeHeartbeatResponse processing. Contributed by Anubhav Dhoot.
|
2015-01-23 16:04:18 +09:00 |
|
Karthik Kambatla
|
84198564ba
|
YARN-2984. Metrics for container's actual memory usage. (kasha)
|
2015-01-17 05:44:04 +05:30 |
|
Junping Du
|
5d1cca34fa
|
YARN-3064. TestRMRestart/TestContainerResourceUsage/TestNodeManagerResync failure with allocation timeout. (Contributed by Jian He)
|
2015-01-16 00:08:36 -08:00 |
|
Jian He
|
cc2a745f7e
|
YARN-2997. Fixed NodeStatusUpdater to not send already-sent completed container statuses on heartbeat. Contributed by Chengbing Liu
|
2015-01-08 11:12:54 -08:00 |
|
Karthik Kambatla
|
954fb8581e
|
YARN-2675. containersKilled metrics is not updated when the container is killed during localization. (Zhihai Xu via kasha)
|
2014-12-19 16:02:20 -08:00 |
|
cnauroth
|
e996a1bfd4
|
HADOOP-11321. copyToLocal cannot save a file to an SMB share unless the user has Full Control permissions. Contributed by Chris Nauroth.
|
2014-12-16 15:29:22 -08:00 |
|
Steve Loughran
|
3681de2039
|
YARN-2912 Jersey Tests failing with port in use. (varun saxena via stevel)
|
2014-12-12 17:11:07 +00:00 |
|
Karthik Kambatla
|
db73cc9124
|
YARN-2931. PublicLocalizer may fail until directory is initialized by LocalizeRunner. (Anubhav Dhoot via kasha)
|
2014-12-08 22:26:18 -08:00 |
|
Junping Du
|
e65b7c5ff6
|
YARN-1156. Enhance NodeManager AllocatedGB and AvailableGB metrics for aggregation of decimal values. (Contributed by Tsuyoshi OZAWA)
|
2014-12-03 04:11:18 -08:00 |
|
Karthik Kambatla
|
233b61e495
|
YARN-2679. Add metric for container launch duration. (Zhihai Xu via kasha)
|
2014-11-21 14:22:21 -08:00 |
|
Jason Lowe
|
49c38898b0
|
YARN-2816. NM fails to start with NPE during container recovery. Contributed by Zhihai Xu
|
2014-11-14 21:25:59 +00:00 |
|
Karthik Kambatla
|
a04143039e
|
YARN-2236. [YARN-1492] Shared Cache uploader service on the Node Manager. (Chris Trezzo and Sanjin Lee via kasha)
|
2014-11-12 09:31:05 -08:00 |
|
Ravi Prakash
|
53f64ee516
|
YARN-1964. Create Docker analog of the LinuxContainerExecutor in YARN
|
2014-11-11 21:28:11 -08:00 |
|
Xuan
|
5c9a51f140
|
YARN-2841. RMProxy should retry EOFException. Contributed by Jian He
|
2014-11-10 18:25:01 -08:00 |
|
Arun C. Murthy
|
43cd07b408
|
YARN-2830. Add backwards compatible ContainerId.newInstance constructor. Contributed by Jonathan Eagles.
|
2014-11-09 14:57:37 -08:00 |
|
Jason Lowe
|
c3d475070a
|
YARN-2825. Container leak on NM. Contributed by Jian He
|
2014-11-07 23:16:37 +00:00 |
|
Vinod Kumar Vavilapalli
|
c5a46d4c8c
|
YARN-1922. Fixed NodeManager to kill process-trees correctly in the presence of races between the launch and the stop-container call and when root processes crash. Contributed by Billie Rinaldi.
|
2014-11-03 16:38:55 -08:00 |
|
Vinod Kumar Vavilapalli
|
58e9f24e0f
|
YARN-2788. Fixed backwards compatibility issues with log-aggregation feature that were caused when adding log-upload-time via YARN-2703. Contributed by Xuan Gong.
|
2014-11-03 13:16:29 -08:00 |
|
Vinod Kumar Vavilapalli
|
5c0381c96a
|
YARN-2790. Fixed a NodeManager bug that was causing log-aggregation to fail beyond HDFS delegation-token expiry even when RM is a proxy-user (YARN-2704). Contributed by Jian He.
|
2014-11-01 16:32:35 -07:00 |
|
Zhijie Shen
|
1cd088fd9d
|
YARN-2711. Fixed TestDefaultContainerExecutor#testContainerLaunchError failure on Windows. Contributed by Varun Vasudev.
|
2014-10-31 17:43:30 -07:00 |
|
Jason Lowe
|
73e626ad91
|
YARN-2755. NM fails to clean up usercache_DEL_<timestamp> dirs after YARN-661. Contributed by Siqi Li
|
2014-10-30 15:10:27 +00:00 |
|
Zhijie Shen
|
8984e9b177
|
YARN-2741. Made NM web UI serve logs on the drive other than C: on Windows. Contributed by Craig Welch.
|
2014-10-28 14:11:19 -07:00 |
|
Vinod Kumar Vavilapalli
|
a16d022ca4
|
YARN-2704. Changed ResourceManager to optionally obtain tokens itself for the sake of localization and log-aggregation for long-running services. Contributed by Jian He.
|
2014-10-27 15:49:47 -07:00 |
|
Zhijie Shen
|
f81dc3f995
|
YARN-2703. Added logUploadedTime into LogValue for better display. Contributed by Xuan Gong.
|
2014-10-24 14:10:46 -07:00 |
|
Jian He
|
3b12fd6cfb
|
YARN-2198. Remove the need to run NodeManager as privileged account for Windows Secure Container Executor. Contributed by Remus Rusanu
|
2014-10-22 15:57:46 -07:00 |
|
Jason Lowe
|
6f2028bd15
|
YARN-90. NodeManager should identify failed disks becoming good again. Contributed by Varun Vasudev
|
2014-10-21 17:31:13 +00:00 |
|
Jian He
|
4fa1fb3193
|
Missing file for YARN-2701
|
2014-10-20 19:57:45 -07:00 |
|
Jian He
|
2839365f23
|
YARN-2701. Potential race condition in startLocalizer when using LinuxContainerExecutor. Contributed by Xuan Gong
|
2014-10-20 18:45:47 -07:00 |
|