Junping Du
|
838b06ac87
|
YARN-3443. Create a 'ResourceHandler' subsystem to ease addition of support for new resource types on the NM. Contributed by Sidharta Seethana.
|
2015-04-13 18:35:56 -07:00 |
|
Junping Du
|
92431c9617
|
YARN-1376. NM needs to notify the log aggregation status to RM through Node heartbeat. Contributed by Xuan Gong.
|
2015-04-10 08:56:18 -07:00 |
|
Karthik Kambatla
|
6495940eae
|
YARN-3465. Use LinkedHashMap to preserve order of resource requests. (Zhihai Xu via kasha)
|
2015-04-09 00:07:49 -07:00 |
|
Tsuyoshi Ozawa
|
dd852f5b8c
|
YARN-3457. NPE when NodeManager.serviceInit fails and stopRecoveryStore called. Contributed by Bibin A Chundatt.
|
2015-04-08 15:56:18 +09:00 |
|
Wangda Tan
|
bad070fe15
|
YARN-2901. Add errors and warning metrics page to RM, NM web UI. (Varun Vasudev via wangda)
|
2015-04-02 17:23:20 -07:00 |
|
Vinod Kumar Vavilapalli
|
b21c72777a
|
YARN-3365. Enhanced NodeManager to support using the 'tc' tool via container-executor for outbound network traffic control. Contributed by Sidharta Seethana.
|
2015-04-02 16:53:59 -07:00 |
|
Tsuyoshi Ozawa
|
c69ba81497
|
YARN-3424. Change logs for ContainerMonitorImpl's resource monitoring from info to debug. Contributed by Anubhav Dhoot.
|
2015-04-01 17:44:25 +09:00 |
|
Karthik Kambatla
|
2daa478a64
|
YARN-3428. Debug log resources to be localized for a container. (kasha)
|
2015-03-31 17:34:47 -07:00 |
|
Wangda Tan
|
2a945d24f7
|
YARN-2495. Allow admin to specify labels from each NM (Distributed configuration for node label). (Naganarasimha G R via wangda)
|
2015-03-30 12:05:21 -07:00 |
|
Vinod Kumar Vavilapalli
|
c358368f51
|
YARN-3304. Cleaning up ResourceCalculatorProcessTree APIs for public use and removing inconsistencies in the default values. Contributed by Junping Du and Karthik Kambatla.
|
2015-03-30 10:09:40 -07:00 |
|
Ravi Prakash
|
e0ccea33c9
|
YARN-3288. Document and fix indentation in the DockerContainerExecutor code
|
2015-03-28 08:00:41 -07:00 |
|
Junping Du
|
d81109e588
|
YARN-3269. yarn.nodemanager.remote-app-log-dir could not be configured to a fully qualified path. Contributed by Xuan Gong
|
2015-03-20 13:41:22 -07:00 |
|
Karthik Kambatla
|
20b49224eb
|
YARN-3351. AppMaster tracking URL is broken in HA. (Anubhav Dhoot via kasha)
|
2015-03-18 16:30:33 -07:00 |
|
Tsuyoshi Ozawa
|
3da9a97cfb
|
YARN-1453. [JDK8] Fix Javadoc errors caused by incorrect or illegal tags in doc comments. Contributed by Akira AJISAKA, Andrew Purtell, and Allen Wittenauer.
|
2015-03-16 23:19:05 +09:00 |
|
Vinod Kumar Vavilapalli
|
863079bb87
|
YARN-3154. Added additional APIs in LogAggregationContext to avoid aggregating running logs of application when rolling is enabled. Contributed by Xuan Gong.
|
2015-03-12 13:32:29 -07:00 |
|
Jian He
|
21101c01f2
|
YARN-2190. Added CPU and memory limit options to the default container executor for Windows containers. Contributed by Chuan Liu
|
2015-03-06 14:18:11 -08:00 |
|
Karthik Kambatla
|
53947f37c7
|
YARN-3122. Metrics for container's actual CPU usage. (Anubhav Dhoot via kasha)
|
2015-03-04 17:33:30 -08:00 |
|
Konstantin V Shvachko
|
8ca0d957c4
|
YARN-3255. RM, NM, JobHistoryServer, and WebAppProxyServer's main() should support generic options. Contributed by Konstantin Shvachko.
|
2015-02-26 17:12:19 -08:00 |
|
Allen Wittenauer
|
d4ac6822e1
|
YARN-2980. Move health check script related functionality to hadoop-common (Varun Saxena via aw)
|
2015-02-24 11:25:26 -08:00 |
|
Xuan
|
f56c65bb3e
|
YARN-3237. AppLogAggregatorImpl fails to log error cause. Contributed by Rushabh S Shah
|
2015-02-20 14:02:40 -08:00 |
|
Tsuyoshi Ozawa
|
447bd7b5a6
|
YARN-3203. Correct a log message in AuxServices. Contributed by Brahma Reddy Battula.
|
2015-02-16 23:55:58 +09:00 |
|
Junping Du
|
ab0b958a52
|
YARN-2749. Fix some test cases from TestLogAggregationService that fail in trunk. (Contributed by Xuan Gong)
|
2015-02-15 06:46:32 -08:00 |
|
Jason Lowe
|
1a0f508b63
|
YARN-2847. Linux native container executor segfaults if default banned user detected. Contributed by Olaf Flebbe
|
2015-02-13 20:20:55 +00:00 |
|
Akira Ajisaka
|
6a49e58cb8
|
YARN-3191. Log object should be initialized with its own class. Contributed by Rohith.
|
2015-02-12 17:58:54 -08:00 |
|
Junping Du
|
04f5ef18f7
|
YARN-2079. Recover NonAggregatingLogHandler state upon nodemanager restart. (Contributed by Jason Lowe)
|
2015-02-12 11:46:47 -08:00 |
|
Jason Lowe
|
b379972ab3
|
YARN-3074. Nodemanager dies when localizer runner tries to write to a full disk. Contributed by Varun Saxena
|
2015-02-11 16:33:43 +00:00 |
|
Jason Lowe
|
3f5431a22f
|
YARN-2809. Implement workaround for linux kernel panic when removing cgroup. Contributed by Nathan Roberts
|
2015-02-10 17:27:21 +00:00 |
|
Jason Lowe
|
4eb5f7fa32
|
YARN-3090. DeletionService can silently ignore deletion task failures. Contributed by Varun Saxena
|
2015-02-10 16:54:21 +00:00 |
|
Jason Lowe
|
4c484320b4
|
YARN-3089. LinuxContainerExecutor does not handle file arguments to deleteAsUser. Contributed by Eric Payne
|
2015-02-06 20:39:01 +00:00 |
|
Robert Kanter
|
f7a77819a1
|
YARN-3022. Expose Container resource information from NodeManager for monitoring (adhoot via rkanter)
|
2015-02-03 10:39:41 -08:00 |
|
Akira Ajisaka
|
342efa110a
|
HADOOP-9907. Webapp http://hostname:port/metrics link is not working. (aajisaka)
|
2015-01-30 02:49:10 +09:00 |
|
Allen Wittenauer
|
9dd0b7a2ab
|
YARN-2428. LCE default banned user list should have yarn (Varun Saxena via aw)
|
2015-01-29 09:30:26 -08:00 |
|
Jian He
|
4e15fc0841
|
YARN-3011. Possible IllegalArgumentException in ResourceLocalizationService might lead NM to crash. Contributed by Varun Saxena
|
2015-01-27 13:31:22 -08:00 |
|
Jason Lowe
|
902c6ea7e4
|
YARN-3088. LinuxContainerExecutor.deleteAsUser can throw NPE if native executor returns an error. Contributed by Eric Payne
|
2015-01-26 15:40:21 +00:00 |
|
Xuan
|
0d6bd62102
|
YARN-3024. LocalizerRunner should give DIE action when all resources are localized. Contributed by Chengbing Liu
|
2015-01-25 19:37:57 -08:00 |
|
Karthik Kambatla
|
84198564ba
|
YARN-2984. Metrics for container's actual memory usage. (kasha)
|
2015-01-17 05:44:04 +05:30 |
|
Jian He
|
cc2a745f7e
|
YARN-2997. Fixed NodeStatusUpdater to not send already-sent completed container statuses on heartbeat. Contributed by Chengbing Liu
|
2015-01-08 11:12:54 -08:00 |
|
Zhijie Shen
|
41a548a916
|
YARN-2937. Fixed new findbugs warnings in hadoop-yarn-nodemanager. Contributed by Varun Saxena.
|
2014-12-23 20:32:36 -08:00 |
|
Jian He
|
808cba3821
|
YARN-2952. Fixed incorrect version check in StateStore. Contributed by Rohith Sharmaks
|
2014-12-19 16:56:30 -08:00 |
|
Karthik Kambatla
|
954fb8581e
|
YARN-2675. containersKilled metrics is not updated when the container is killed during localization. (Zhihai Xu via kasha)
|
2014-12-19 16:02:20 -08:00 |
|
cnauroth
|
e996a1bfd4
|
HADOOP-11321. copyToLocal cannot save a file to an SMB share unless the user has Full Control permissions. Contributed by Chris Nauroth.
|
2014-12-16 15:29:22 -08:00 |
|
Karthik Kambatla
|
db73cc9124
|
YARN-2931. PublicLocalizer may fail until directory is initialized by LocalizeRunner. (Anubhav Dhoot via kasha)
|
2014-12-08 22:26:18 -08:00 |
|
Harsh J
|
a31e016491
|
YARN-2891. Failed Container Executor does not provide a clear error message. Contributed by Dustin Cote. (harsh)
|
2014-12-04 03:17:15 +05:30 |
|
Jason Lowe
|
03ab24aa01
|
MAPREDUCE-5932. Provide an option to use a dedicated reduce-side shuffle log. Contributed by Gera Shegalov
|
2014-12-03 17:02:14 +00:00 |
|
Junping Du
|
e65b7c5ff6
|
YARN-1156. Enhance NodeManager AllocatedGB and AvailableGB metrics for aggregation of decimal values. (Contributed by Tsuyoshi OZAWA)
|
2014-12-03 04:11:18 -08:00 |
|
Karthik Kambatla
|
233b61e495
|
YARN-2679. Add metric for container launch duration. (Zhihai Xu via kasha)
|
2014-11-21 14:22:21 -08:00 |
|
Jason Lowe
|
49c38898b0
|
YARN-2816. NM fails to start with NPE during container recovery. Contributed by Zhihai Xu
|
2014-11-14 21:25:59 +00:00 |
|
Jason Lowe
|
33ea5ae92b
|
YARN-2846. Incorrect persist exit code for running containers in reacquireContainer() that are interrupted by NodeManager restart. Contributed by Junping Du
|
2014-11-13 16:11:04 +00:00 |
|
Zhijie Shen
|
be7bf956e9
|
YARN-2794. Fixed log messages about distributing system-credentials. Contributed by Jian He.
|
2014-11-12 11:07:57 -08:00 |
|
Karthik Kambatla
|
a04143039e
|
YARN-2236. [YARN-1492] Shared Cache uploader service on the Node Manager. (Chris Trezzo and Sangjin Lee via kasha)
|
2014-11-12 09:31:05 -08:00 |
|
Ravi Prakash
|
53f64ee516
|
YARN-1964. Create Docker analog of the LinuxContainerExecutor in YARN
|
2014-11-11 21:28:11 -08:00 |
|
Karthik Kambatla
|
061bc293c8
|
YARN-2735. diskUtilizationPercentageCutoff and diskUtilizationSpaceCutoff are initialized twice in DirectoryCollection. (Zhihai Xu via kasha)
|
2014-11-11 10:31:39 -08:00 |
|
Jason Lowe
|
c3d475070a
|
YARN-2825. Container leak on NM. Contributed by Jian He
|
2014-11-07 23:16:37 +00:00 |
|
cnauroth
|
06b797947c
|
YARN-2803. MR distributed cache not working correctly on Windows after NodeManager privileged account changes. Contributed by Craig Welch.
|
2014-11-07 12:29:39 -08:00 |
|
Vinod Kumar Vavilapalli
|
c5a46d4c8c
|
YARN-1922. Fixed NodeManager to kill process-trees correctly in the presence of races between the launch and the stop-container call and when root processes crash. Contributed by Billie Rinaldi.
|
2014-11-03 16:38:55 -08:00 |
|
Jason Lowe
|
6157ace547
|
YARN-2730. DefaultContainerExecutor runs only one localizer at a time. Contributed by Siqi Li
|
2014-11-03 20:37:47 +00:00 |
|
Vinod Kumar Vavilapalli
|
5c0381c96a
|
YARN-2790. Fixed a NodeManager bug that was causing log-aggregation to fail beyond HDFS delegation-token expiry even when RM is a proxy-user (YARN-2704). Contributed by Jian He.
|
2014-11-01 16:32:35 -07:00 |
|
Xuan
|
86ff28dea0
|
YARN-2701. Addendum patch. Potential race condition in startLocalizer when using LinuxContainerExecutor. Contributed by Xuan Gong
|
2014-10-31 14:36:25 -07:00 |
|
Jason Lowe
|
73e626ad91
|
YARN-2755. NM fails to clean up usercache_DEL_<timestamp> dirs after YARN-661. Contributed by Siqi Li
|
2014-10-30 15:10:27 +00:00 |
|
Zhijie Shen
|
8984e9b177
|
YARN-2741. Made NM web UI serve logs on the drive other than C: on Windows. Contributed by Craig Welch.
|
2014-10-28 14:11:19 -07:00 |
|
Vinod Kumar Vavilapalli
|
a16d022ca4
|
YARN-2704. Changed ResourceManager to optionally obtain tokens itself for the sake of localization and log-aggregation for long-running services. Contributed by Jian He.
|
2014-10-27 15:49:47 -07:00 |
|
Jian He
|
3b12fd6cfb
|
YARN-2198. Remove the need to run NodeManager as privileged account for Windows Secure Container Executor. Contributed by Remus Rusanu
|
2014-10-22 15:57:46 -07:00 |
|
cnauroth
|
6637e3cf95
|
YARN-2720. Windows: Wildcard classpath variables not expanded against resources contained in archives. Contributed by Craig Welch.
|
2014-10-21 12:33:21 -07:00 |
|
Jason Lowe
|
6f2028bd15
|
YARN-90. NodeManager should identify failed disks becoming good again. Contributed by Varun Vasudev
|
2014-10-21 17:31:13 +00:00 |
|
Jian He
|
2839365f23
|
YARN-2701. Potential race condition in startLocalizer when using LinuxContainerExecutor. Contributed by Xuan Gong
|
2014-10-20 18:45:47 -07:00 |
|
Jian He
|
0fd0ebae64
|
YARN-2682. Updated WindowsSecureContainerExecutor to not use DefaultContainerExecutor#getFirstApplicationDir and use getWorkingDir() instead. Contributed by Zhihai Xu
|
2014-10-16 18:14:34 -07:00 |
|
Jian He
|
0af1a2b5bc
|
YARN-2312. Deprecated old ContainerId#getId API and updated MapReduce to use ContainerId#getContainerId instead. Contributed by Tsuyoshi OZAWA
|
2014-10-15 15:22:07 -07:00 |
|
Karthik Kambatla
|
cc93e7e683
|
YARN-2566. DefaultContainerExecutor should pick a working directory randomly. (Zhihai Xu via kasha)
|
2014-10-13 16:32:01 -07:00 |
|
Jason Lowe
|
a56ea01002
|
YARN-2377. Localization exception stack traces are not passed as diagnostic info. Contributed by Gera Shegalov
|
2014-10-13 18:31:16 +00:00 |
|
Zhijie Shen
|
4aed2d8e91
|
YARN-2651. Spun off LogRollingInterval from LogAggregationContext. Contributed by Xuan Gong.
|
2014-10-13 10:54:09 -07:00 |
|
Zhijie Shen
|
cb81bac002
|
YARN-2583. Modified AggregatedLogDeletionService to be able to delete rolling aggregated logs. Contributed by Xuan Gong.
|
2014-10-10 00:11:30 -07:00 |
|
Vinod Kumar Vavilapalli
|
34cdcaad71
|
YARN-2468. Enhanced NodeManager to support log handling APIs (YARN-2569) for use by long running services. Contributed by Xuan Gong.
|
2014-10-03 12:15:40 -07:00 |
|
Jason Lowe
|
29f520052e
|
YARN-2624. Resource Localization fails on a cluster due to existing cache directories. Contributed by Anubhav Dhoot
|
2014-10-02 17:39:34 +00:00 |
|
Jian He
|
3ef1cf187f
|
YARN-2617. Fixed NM to not send duplicate container status whose app is not running. Contributed by Jun Gong
|
2014-10-02 10:04:09 -07:00 |
|
Zhijie Shen
|
52bbe0f11b
|
YARN-2630. Prevented previous AM container status from being acquired by the current restarted AM. Contributed by Jian He.
|
2014-10-01 15:38:11 -07:00 |
|
Vinod Kumar Vavilapalli
|
ba7f31c2ee
|
YARN-1972. Added a secure container-executor for Windows. Contributed by Remus Rusanu.
|
2014-10-01 10:14:41 -07:00 |
|
Jian He
|
5391919b09
|
YARN-668. Changed NMTokenIdentifier/AMRMTokenIdentifier/ContainerTokenIdentifier to use protobuf object as the payload. Contributed by Junping Du.
|
2014-09-26 17:48:41 -07:00 |
|
Zhijie Shen
|
c86674a3a4
|
YARN-2581. Passed LogAggregationContext to NM via ContainerTokenIdentifier. Contributed by Xuan Gong.
|
2014-09-24 17:50:26 -07:00 |
|
Allen Wittenauer
|
034df0e2eb
|
YARN-2161. Fix build on macosx: YARN parts (Binglin Chang via aw)
|
2014-09-24 08:47:55 -07:00 |
|
junping_du
|
a9a55db065
|
YARN-2584. TestContainerManagerSecurity fails on trunk. (Contributed by Jian He)
|
2014-09-22 22:45:06 -07:00 |
|
Jian He
|
0a641496c7
|
YARN-1372. Ensure all completed containers are reported to the AMs across RM restart. Contributed by Anubhav Dhoot
|
2014-09-22 10:30:53 -07:00 |
|
Vinod Kumar Vavilapalli
|
9f6891d9ef
|
YARN-2531. Added a configuration for admins to be able to override app-configs and enforce/not-enforce strict control of per-container cpu usage. Contributed by Varun Vasudev.
|
2014-09-16 10:14:46 -07:00 |
|
Vinod Kumar Vavilapalli
|
4be95175cd
|
YARN-2440. Enabled Nodemanagers to limit the aggregate cpu usage across all containers to a preconfigured limit. Contributed by Varun Vasudev.
|
2014-09-10 19:22:52 -07:00 |
|
Jason Lowe
|
3fa5f728c4
|
YARN-2431. NM restart: cgroup is not removed for reacquired containers. Contributed by Jason Lowe
|
2014-09-04 21:11:27 +00:00 |
|
Hitesh Shah
|
3de66011c2
|
YARN-2450. Fix typos in log messages. Contributed by Ray Chiang.
|
2014-08-29 11:16:36 -07:00 |
|
Allen Wittenauer
|
7e75226e68
|
YARN-2424. LCE should support non-cgroups, non-secure mode (Chris Douglas via aw)
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1619421 13f79535-47bb-0310-9956-ffa450edef68
|
2014-08-21 14:57:11 +00:00 |
|
Junping Du
|
c2febdcbaa
|
YARN-1337. Recover containers upon nodemanager restart. (Contributed by Jason Lowe)
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1617448 13f79535-47bb-0310-9956-ffa450edef68
|
2014-08-12 10:56:13 +00:00 |
|
Junping Du
|
b8f151231b
|
YARN-1354. Recover applications upon nodemanager restart. (Contributed by Jason Lowe)
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1615550 13f79535-47bb-0310-9956-ffa450edef68
|
2014-08-04 13:25:37 +00:00 |
|
Jian He
|
a41c314373
|
YARN-2343. Improve NMToken expire exception message. Contributed by Li Lu
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1615270 13f79535-47bb-0310-9956-ffa450edef68
|
2014-08-01 23:44:48 +00:00 |
|
Xuan Gong
|
e52f67e389
|
YARN-1994. Expose YARN/MR endpoints on multiple interfaces. Contributed by Craig Welch, Milan Potocnik, and Arpit Agarwal
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1614981 13f79535-47bb-0310-9956-ffa450edef68
|
2014-07-31 20:06:02 +00:00 |
|
Zhijie Shen
|
1d6e178144
|
YARN-2347. Consolidated RMStateVersion and NMDBSchemaVersion into Version in yarn-server-common. Contributed by Junping Du.
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1614838 13f79535-47bb-0310-9956-ffa450edef68
|
2014-07-31 09:27:43 +00:00 |
|
Aaron Myers
|
5d4677b57b
|
YARN-1796. container-executor shouldn't require o-r permissions. Contributed by Aaron T. Myers.
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1613548 13f79535-47bb-0310-9956-ffa450edef68
|
2014-07-26 01:51:35 +00:00 |
|
Devarajulu K
|
2050e0dad6
|
YARN-1342. Recover container tokens upon nodemanager restart. Contributed by Jason Lowe.
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1612995 13f79535-47bb-0310-9956-ffa450edef68
|
2014-07-24 05:02:00 +00:00 |
|
Junping Du
|
537c361f5b
|
YARN-2013. The diagnostics is always the ExitCodeException stack when the container crashes. (Contributed by Tsuyoshi OZAWA)
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1612449 13f79535-47bb-0310-9956-ffa450edef68
|
2014-07-22 03:01:58 +00:00 |
|
Jason Darrell Lowe
|
1ad2d7b405
|
YARN-2321. NodeManager web UI can incorrectly report Pmem enforcement. Contributed by Leitao Guo
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1612411 13f79535-47bb-0310-9956-ffa450edef68
|
2014-07-21 21:55:06 +00:00 |
|
Jason Darrell Lowe
|
8a87085820
|
YARN-2045. Data persisted in NM should be versioned. Contributed by Junping Du
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1612285 13f79535-47bb-0310-9956-ffa450edef68
|
2014-07-21 14:43:59 +00:00 |
|
Junping Du
|
403ec8ea80
|
YARN-1341. Recover NMTokens upon nodemanager restart. (Contributed by Jason Lowe)
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1611512 13f79535-47bb-0310-9956-ffa450edef68
|
2014-07-17 23:33:22 +00:00 |
|
Jian He
|
6d7dbd4fed
|
YARN-1367. Changed NM to not kill containers on NM resync if RM work-preserving restart is enabled. Contributed by Anubhav Dhoot
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1608334 13f79535-47bb-0310-9956-ffa450edef68
|
2014-07-07 04:37:59 +00:00 |
|
Steve Loughran
|
d1f54f4f4b
|
YARN-2065. AM cannot create new containers after restart
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1607441 13f79535-47bb-0310-9956-ffa450edef68
|
2014-07-02 18:35:10 +00:00 |
|
Vinod Kumar Vavilapalli
|
e285b98f0f
|
YARN-2152. Added missing information into ContainerTokenIdentifier so that NodeManagers can report the same to RM when RM restarts. Contributed by Jian He.
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1605205 13f79535-47bb-0310-9956-ffa450edef68
|
2014-06-24 21:43:22 +00:00 |
|