Jian He
|
83a18add10
|
YARN-1449. AM-NM protocol changes to support container resizing. Contributed by Meng Ding & Wangda Tan
|
2015-09-23 13:29:36 -07:00 |
|
Jason Lowe
|
c890c51a91
|
YARN-4095. Avoid sharing AllocatorPerContext object in LocalDirAllocator between ShuffleHandler and LocalDirsHandlerService. Contributed by Zhihai Xu
|
2015-09-23 15:42:01 +00:00 |
|
Xuan
|
34ef1a092b
|
YARN-4149. yarn logs -am should provide an option to fetch all the log files. Contributed by Varun Vasudev
|
2015-09-15 14:36:30 -07:00 |
|
Jason Lowe
|
8c1cdb17a0
|
YARN-4158. Remove duplicate close for LogWriter in AppLogAggregatorImpl#uploadLogsForContainers. Contributed by Zhihai Xu
|
2015-09-15 20:21:33 +00:00 |
|
Varun Vasudev
|
486d5cb803
|
YARN-4136. LinuxContainerExecutor loses info when forwarding ResourceHandlerException. Contributed by Bibin A Chundatt.
|
2015-09-11 14:37:48 +05:30 |
|
Wangda Tan
|
77666105b4
|
YARN-4106. NodeLabels for NM in distributed mode is not updated even after clusterNodelabel addition in RM. (Bibin A Chundatt via wangda)
|
2015-09-10 09:30:09 -07:00 |
|
Zhihai Xu
|
16b9037dc1
|
YARN-4096. App local logs are leaked if log aggregation fails to initialize for the app. Contributed by Jason Lowe.
|
2015-09-08 12:29:54 -07:00 |
|
Jian He
|
6f72f1e600
|
YARN-2884. Added a proxy service in NM to proxy the communication between AM and RM. Contributed by Kishore Chaliparambil
|
2015-09-08 09:35:46 +08:00 |
|
Varun Vasudev
|
1dbd8e34a7
|
YARN-3591. Resource localization on a bad disk causes subsequent containers failure. Contributed by Lavkesh Lahngir.
|
2015-09-07 11:32:12 +05:30 |
|
Rohith Sharma K S
|
095ab9ab5f
|
YARN-4073. Removed unused ApplicationACLsManager in ContainerManagerImpl constructor. (Naganarasimha G R via rohithsharmaks)
|
2015-09-02 14:13:33 +05:30 |
|
Xuan
|
b71c6006f5
|
YARN-221. Addendum patch to compilation issue which is caused by missing AllContainerLogAggregationPolicy. Contributed by Xuan Gong
|
2015-08-23 16:46:30 -07:00 |
|
Xuan
|
37e1c3d82a
|
YARN-221. NM should provide a way for AM to tell it not to aggregate logs. Contributed by Ming Ma
|
2015-08-22 16:25:24 -07:00 |
|
Wangda Tan
|
fc07464d1a
|
YARN-2923. Support configuration based NodeLabelsProvider Service in Distributed Node Label Configuration Setup. (Naganarasimha G R)
|
2015-08-20 11:51:03 -07:00 |
|
Zhihai Xu
|
14215c8ef8
|
YARN-4057. If ContainersMonitor is not enabled, only print related log info one time. Contributed by Jun Gong.
|
2015-08-18 11:36:02 -07:00 |
|
Karthik Kambatla
|
13604bd5f1
|
YARN-4055. Report node resource utilization in heartbeat. (Inigo Goiri via kasha)
|
2015-08-16 15:08:53 -07:00 |
|
Karthik Kambatla
|
def12933b3
|
YARN-3534. Collect memory/cpu usage on the node. (Inigo Goiri via kasha)
|
2015-08-16 06:24:16 -07:00 |
|
Jian He
|
38aed1a94e
|
YARN-4005. Completed container whose app is finished is possibly not removed from NMStateStore. Contributed by Jun Gong
|
2015-08-13 14:46:08 -07:00 |
|
Junping Du
|
cfee02b3bd
|
YARN-4019. Add JvmPauseMonitor to ResourceManager and NodeManager. Contributed by Robert Kanter.
|
2015-08-06 06:49:45 -07:00 |
|
Xuan
|
c3364ca8e7
|
YARN-4004. container-executor should print output of docker logs if the docker container exits with non-0 exit status. Contributed by Varun Vasudev
|
2015-08-03 18:10:11 -07:00 |
|
Jason Lowe
|
469cfcd695
|
YARN-3965. Add startup timestamp to nodemanager UI. Contributed by Hong Zhiguo
|
2015-08-03 15:53:32 +00:00 |
|
Xuan
|
f170934215
|
YARN-3982. container-executor parsing of container-executor.cfg broken in trunk and branch-2. Contributed by Varun Vasudev
|
2015-07-27 23:45:58 -07:00 |
|
Varun Vasudev
|
3e6fce91a4
|
YARN-3853. Add docker container runtime support to LinuxContainerExecutor. Contributed by Sidharta Seethana.
|
2015-07-27 11:57:40 -07:00 |
|
Varun Vasudev
|
f36835ff9b
|
YARN-3852. Add docker container support to container-executor. Contributed by Abin Shahab.
|
2015-07-27 10:14:51 -07:00 |
|
Jason Lowe
|
ff9c13e0a7
|
YARN-3925. ContainerLogsUtils#getContainerLogFile fails to read container log files from full disks. Contributed by zhihai xu
|
2015-07-24 22:14:39 +00:00 |
|
Colin Patrick Mccabe
|
419c51d233
|
YARN-3844. Make hadoop-yarn-project Native code -Wall-clean (Alan Burlison via Colin P. McCabe)
|
2015-07-17 11:38:59 -07:00 |
|
Akira Ajisaka
|
19295b36d9
|
YARN-3381. Fix typo InvalidStateTransitonException. Contributed by Brahma Reddy Battula.
|
2015-07-13 17:52:13 +09:00 |
|
Zhijie Shen
|
1ea36299a4
|
YARN-3116. RM notifies NM whether a container is an AM container or normal task container. Contributed by Giovanni Matteo Fumarola.
|
2015-07-10 18:58:10 -07:00 |
|
Karthik Kambatla
|
527c40e4d6
|
YARN-1012. Report NM aggregated container resource utilization in heartbeat. (Inigo Goiri via kasha)
|
2015-07-09 09:35:14 -07:00 |
|
Varun Vasudev
|
c40bdb56a7
|
YARN-2194. Fix bug causing CGroups functionality to fail on RHEL7. Contributed by Wei Yan.
|
2015-07-07 16:59:29 +05:30 |
|
Jason Lowe
|
b5cdf78e8e
|
YARN-3793. Several NPEs when deleting local files on NM recovery. Contributed by Varun Saxena
|
2015-07-01 21:13:32 +00:00 |
|
Jason Lowe
|
40b256949a
|
YARN-3850. NM fails to read files from full disks which can lead to container logs being lost and other issues. Contributed by Varun Saxena
|
2015-06-26 15:47:07 +00:00 |
|
Jason Lowe
|
8d58512d6e
|
YARN-3832. Resource Localization fails on a cluster due to existing cache directories. Contributed by Brahma Reddy Battula
|
2015-06-24 16:37:39 +00:00 |
|
Xuan
|
6c7a9d502a
|
YARN-3834. Scrub debug logging of tokens during resource localization. Contributed by Chris Nauroth
|
2015-06-21 17:13:44 -07:00 |
|
Junping Du
|
d7e7f6aa03
|
YARN-41. The RM should handle the graceful shutdown of the NM. Contributed by Devaraj K.
|
2015-06-04 04:59:27 -07:00 |
|
Jason Lowe
|
e13b671aa5
|
YARN-3585. NodeManager cannot exit on SHUTDOWN event triggered and NM recovery is enabled. Contributed by Rohith Sharmaks
|
2015-06-03 19:44:07 +00:00 |
|
Robert Kanter
|
6aec13cb33
|
YARN-3713. Remove duplicate function call storeContainerDiagnostics in ContainerDiagnosticsUpdateTransition (zxu via rkanter)
|
2015-05-29 15:34:37 -07:00 |
|
cnauroth
|
4102e5882e
|
YARN-3626. On Windows localized resources are not moved to the front of the classpath when they should be. Contributed by Craig Welch.
|
2015-05-27 14:31:49 -07:00 |
|
Vinod Kumar Vavilapalli
|
500a1d9c76
|
YARN-160. Enhanced NodeManager to automatically obtain cpu/memory values from underlying OS when configured to do so. Contributed by Varun Vasudev.
|
2015-05-26 11:38:35 -07:00 |
|
Junping Du
|
132d909d4a
|
YARN-3594. WintuilsProcessStubExecutor.startStreamReader leaks streams. Contributed by Lars Francke.
|
2015-05-22 04:23:25 -07:00 |
|
Vinod Kumar Vavilapalli
|
53fafcf061
|
YARN-3684. Changed ContainerExecutor's primary lifecycle methods to use a more extensible mechanism of context objects. Contributed by Sidharta Seethana.
|
2015-05-21 15:50:23 -07:00 |
|
Jian He
|
6329bd00fa
|
YARN-3654. ContainerLogsPage web UI should not have meta-refresh. Contributed by Xuan Gong
|
2015-05-20 17:20:21 -07:00 |
|
Wangda Tan
|
b37da52a1c
|
YARN-3565. NodeHeartbeatRequest/RegisterNodeManagerRequest should use NodeLabel object instead of String. (Naganarasimha G R via wangda)
|
2015-05-19 16:34:17 -07:00 |
|
Colin Patrick Mccabe
|
470c87dbc6
|
HADOOP-11970. Replace uses of ThreadLocal<Random> with JDK7 ThreadLocalRandom (Sean Busbey via Colin P. McCabe)
|
2015-05-19 10:50:15 -07:00 |
|
Junping Du
|
03a293aed6
|
YARN-3505 addendum: fix an issue in previous patch.
|
2015-05-15 06:39:39 -07:00 |
|
Ravi Prakash
|
53fe4eff09
|
YARN-1519. Check in container-executor if sysconf is implemented before using it (Radim Kolar and Eric Payne via raviprak)
|
2015-05-14 15:55:37 -07:00 |
|
Junping Du
|
15ccd967ee
|
YARN-3505. Node's Log Aggregation Report with SUCCEED should not be cached in RMApps. Contributed by Xuan Gong.
|
2015-05-14 10:58:12 -07:00 |
|
Jason Lowe
|
711d77cc54
|
YARN-3641. NodeManager: stopRecoveryStore() shouldn't be skipped when exceptions happen in stopping NM's sub-services. Contributed by Junping Du
|
2015-05-13 21:06:47 +00:00 |
|
Xuan
|
0f95921447
|
YARN-3626. On Windows localized resources are not moved to the front of the classpath when they should be. Contributed by Craig Welch
|
2015-05-13 13:10:53 -07:00 |
|
Devaraj K
|
5c2f05cd9b
|
YARN-3629. NodeID is always printed as "null" in node manager initialization log. Contributed by nijel.
|
2015-05-12 22:20:25 +05:30 |
|
Devaraj K
|
8badd82ce2
|
YARN-3513. Remove unused variables in ContainersMonitorImpl and add debug log for overall resource usage by all containers. Contributed by Naganarasimha G R.
|
2015-05-12 16:54:38 +05:30 |
|
Xuan
|
6471d18bc7
|
YARN-1912. ResourceLocalizer started without any jvm memory control. Contributed by Masatake Iwasaki
|
2015-05-08 20:01:21 -07:00 |
|
Jason Lowe
|
25e2b02122
|
YARN-3476. Nodemanager can fail to delete local logs if log aggregation fails. Contributed by Rohith
|
2015-05-08 22:45:52 +00:00 |
|
Xuan
|
088156de43
|
YARN-2331. Distinguish shutdown during supervision vs. shutdown for rolling upgrade. Contributed by Jason Lowe
|
2015-05-08 15:10:43 -07:00 |
|
Robert (Bobby) Evans
|
bcf2890502
|
YARN-644: Basic null check is not performed on passed in arguments before using them in ContainerManagerImpl.startContainer
|
2015-05-08 11:11:01 -05:00 |
|
Robert Kanter
|
b72507810a
|
YARN-3491. PublicLocalizer#addResource is too slow. (zxu via rkanter)
|
2015-05-06 14:19:06 -07:00 |
|
Junping Du
|
3810242062
|
YARN-3396. Handle URISyntaxException in ResourceLocalizationService. (Contributed by Brahma Reddy Battula)
|
2015-05-05 10:18:23 -07:00 |
|
Wangda Tan
|
71f4de220c
|
YARN-3375. NodeHealthScriptRunner.shouldRun() check is performing 3 times for starting NodeHealthScriptRunner (Devaraj K via wangda)
|
2015-05-04 15:49:19 -07:00 |
|
Jason Lowe
|
8f65c793f2
|
YARN-3097. Logging of resource recovery on NM restart has redundancies. Contributed by Eric Payne
|
2015-05-04 15:31:15 +00:00 |
|
Robert Kanter
|
ac7d152901
|
YARN-3363. add localization and container launch time to ContainerMetrics at NM to show these timing information for each active container. (zxu via rkanter)
|
2015-05-01 16:39:21 -07:00 |
|
Vinod Kumar Vavilapalli
|
1b3b9e5c31
|
YARN-2619. Added NodeManager support for disk io isolation through cgroups. Contributed by Varun Vasudev and Wei Yan.
|
2015-04-30 21:41:07 -07:00 |
|
Karthik Kambatla
|
47279c3228
|
YARN-3464. Race condition in LocalizerRunner kills localizer before localizing all resources. (Zhihai Xu via kasha)
|
2015-04-26 09:13:46 -07:00 |
|
Jason Lowe
|
5e093f0d40
|
YARN-3537. NPE when NodeManager.serviceInit fails and stopRecoveryStore invoked. Contributed by Brahma Reddy Battula
|
2015-04-24 22:02:53 +00:00 |
|
Xuan
|
0b3f8957a8
|
YARN-3516. killing ContainerLocalizer action doesn't take effect when private localizer receives FETCH_FAILURE status. Contributed by zhihai xu
|
2015-04-23 16:40:40 -07:00 |
|
Vinod Kumar Vavilapalli
|
a100be685c
|
YARN-3366. Enhanced NodeManager to support classifying/shaping outgoing network bandwidth traffic originating from YARN containers. Contributed by Sidharta Seethana.
|
2015-04-22 17:26:13 -07:00 |
|
Jian He
|
674c7ef649
|
YARN-3503. Expose disk utilization percentage and bad local and log dir counts in NM metrics. Contributed by Varun Vasudev
|
2015-04-21 20:57:02 -07:00 |
|
Junping Du
|
1db355a875
|
YARN-1402. Update related Web UI and CLI with exposing client API to check log aggregation status. Contributed by Xuan Gong.
|
2015-04-17 13:18:59 -07:00 |
|
Jian He
|
1b89a3e173
|
YARN-3354. Add node label expression in ContainerTokenIdentifier to support RM recovery. Contributed by Wangda Tan
|
2015-04-15 13:57:06 -07:00 |
|
Junping Du
|
838b06ac87
|
YARN-3443. Create a 'ResourceHandler' subsystem to ease addition of support for new resource types on the NM. Contributed by Sidharta Seethana.
|
2015-04-13 18:35:56 -07:00 |
|
Junping Du
|
92431c9617
|
YARN-1376. NM need to notify the log aggregation status to RM through Node heartbeat. Contributed by Xuan Gong.
|
2015-04-10 08:56:18 -07:00 |
|
Karthik Kambatla
|
6495940eae
|
YARN-3465. Use LinkedHashMap to preserve order of resource requests. (Zhihai Xu via kasha)
|
2015-04-09 00:07:49 -07:00 |
|
Tsuyoshi Ozawa
|
dd852f5b8c
|
YARN-3457. NPE when NodeManager.serviceInit fails and stopRecoveryStore called. Contributed by Bibin A Chundatt.
|
2015-04-08 15:56:18 +09:00 |
|
Wangda Tan
|
bad070fe15
|
YARN-2901. Add errors and warning metrics page to RM, NM web UI. (Varun Vasudev via wangda)
|
2015-04-02 17:23:20 -07:00 |
|
Vinod Kumar Vavilapalli
|
b21c72777a
|
YARN-3365. Enhanced NodeManager to support using the 'tc' tool via container-executor for outbound network traffic control. Contributed by Sidharta Seethana.
|
2015-04-02 16:53:59 -07:00 |
|
Tsuyoshi Ozawa
|
c69ba81497
|
YARN-3424. Change logs for ContainerMonitorImpl's resource monitoring from info to debug. Contributed by Anubhav Dhoot.
|
2015-04-01 17:44:25 +09:00 |
|
Karthik Kambatla
|
2daa478a64
|
YARN-3428. Debug log resources to be localized for a container. (kasha)
|
2015-03-31 17:34:47 -07:00 |
|
Wangda Tan
|
2a945d24f7
|
YARN-2495. Allow admin specify labels from each NM (Distributed configuration for node label). (Naganarasimha G R via wangda)
|
2015-03-30 12:05:21 -07:00 |
|
Vinod Kumar Vavilapalli
|
c358368f51
|
YARN-3304. Cleaning up ResourceCalculatorProcessTree APIs for public use and removing inconsistencies in the default values. Contributed by Junping Du and Karthik Kambatla.
|
2015-03-30 10:09:40 -07:00 |
|
Ravi Prakash
|
e0ccea33c9
|
YARN-3288. Document and fix indentation in the DockerContainerExecutor code
|
2015-03-28 08:00:41 -07:00 |
|
Junping Du
|
d81109e588
|
YARN-3269. yarn.nodemanager.remote-app-log-dir could not be configured to fully qualified path. Contributed by Xuan Gong
|
2015-03-20 13:41:22 -07:00 |
|
Karthik Kambatla
|
20b49224eb
|
YARN-3351. AppMaster tracking URL is broken in HA. (Anubhav Dhoot via kasha)
|
2015-03-18 16:30:33 -07:00 |
|
Tsuyoshi Ozawa
|
3da9a97cfb
|
YARN-1453. [JDK8] Fix Javadoc errors caused by incorrect or illegal tags in doc comments. Contributed by Akira AJISAKA, Andrew Purtell, and Allen Wittenauer.
|
2015-03-16 23:19:05 +09:00 |
|
Vinod Kumar Vavilapalli
|
863079bb87
|
YARN-3154. Added additional APIs in LogAggregationContext to avoid aggregating running logs of application when rolling is enabled. Contributed by Xuan Gong.
|
2015-03-12 13:32:29 -07:00 |
|
Jian He
|
21101c01f2
|
YARN-2190. Added CPU and memory limit options to the default container executor for Windows containers. Contributed by Chuan Liu
|
2015-03-06 14:18:11 -08:00 |
|
Karthik Kambatla
|
53947f37c7
|
YARN-3122. Metrics for container's actual CPU usage. (Anubhav Dhoot via kasha)
|
2015-03-04 17:33:30 -08:00 |
|
Konstantin V Shvachko
|
8ca0d957c4
|
YARN-3255. RM, NM, JobHistoryServer, and WebAppProxyServer's main() should support generic options. Contributed by Konstantin Shvachko.
|
2015-02-26 17:12:19 -08:00 |
|
Allen Wittenauer
|
d4ac6822e1
|
YARN-2980. Move health check script related functionality to hadoop-common (Varun Saxena via aw)
|
2015-02-24 11:25:26 -08:00 |
|
Xuan
|
f56c65bb3e
|
YARN-3237. AppLogAggregatorImpl fails to log error cause. Contributed by Rushabh S Shah
|
2015-02-20 14:02:40 -08:00 |
|
Tsuyoshi Ozawa
|
447bd7b5a6
|
YARN-3203. Correct a log message in AuxServices. Contributed by Brahma Reddy Battula.
|
2015-02-16 23:55:58 +09:00 |
|
Junping Du
|
ab0b958a52
|
YARN-2749. Fix some testcases from TestLogAggregationService fails in trunk. (Contributed by Xuan Gong)
|
2015-02-15 06:46:32 -08:00 |
|
Jason Lowe
|
1a0f508b63
|
YARN-2847. Linux native container executor segfaults if default banned user detected. Contributed by Olaf Flebbe
|
2015-02-13 20:20:55 +00:00 |
|
Akira Ajisaka
|
6a49e58cb8
|
YARN-3191. Log object should be initialized with its own class. Contributed by Rohith.
|
2015-02-12 17:58:54 -08:00 |
|
Junping Du
|
04f5ef18f7
|
YARN-2079. Recover NonAggregatingLogHandler state upon nodemanager restart. (Contributed by Jason Lowe)
|
2015-02-12 11:46:47 -08:00 |
|
Jason Lowe
|
b379972ab3
|
YARN-3074. Nodemanager dies when localizer runner tries to write to a full disk. Contributed by Varun Saxena
|
2015-02-11 16:33:43 +00:00 |
|
Jason Lowe
|
3f5431a22f
|
YARN-2809. Implement workaround for linux kernel panic when removing cgroup. Contributed by Nathan Roberts
|
2015-02-10 17:27:21 +00:00 |
|
Jason Lowe
|
4eb5f7fa32
|
YARN-3090. DeletionService can silently ignore deletion task failures. Contributed by Varun Saxena
|
2015-02-10 16:54:21 +00:00 |
|
Jason Lowe
|
4c484320b4
|
YARN-3089. LinuxContainerExecutor does not handle file arguments to deleteAsUser. Contributed by Eric Payne
|
2015-02-06 20:39:01 +00:00 |
|
Robert Kanter
|
f7a77819a1
|
YARN-3022. Expose Container resource information from NodeManager for monitoring (adhoot via rkanter)
|
2015-02-03 10:39:41 -08:00 |
|
Akira Ajisaka
|
342efa110a
|
HADOOP-9907. Webapp http://hostname:port/metrics link is not working. (aajisaka)
|
2015-01-30 02:49:10 +09:00 |
|
Allen Wittenauer
|
9dd0b7a2ab
|
YARN-2428. LCE default banned user list should have yarn (Varun Saxena via aw)
|
2015-01-29 09:30:26 -08:00 |
|
Jian He
|
4e15fc0841
|
YARN-3011. Possible IllegalArgumentException in ResourceLocalizationService might lead NM to crash. Contributed by Varun Saxena
|
2015-01-27 13:31:22 -08:00 |
|