Jason Lowe
|
6f2028bd15
|
YARN-90. NodeManager should identify failed disks becoming good again. Contributed by Varun Vasudev
|
2014-10-21 17:31:13 +00:00 |
|
Jian He
|
4fa1fb3193
|
Missing file for YARN-2701
|
2014-10-20 19:57:45 -07:00 |
|
Jian He
|
2839365f23
|
YARN-2701. Potential race condition in startLocalizer when using LinuxContainerExecutor. Contributed by Xuan Gong
|
2014-10-20 18:45:47 -07:00 |
|
Jian He
|
0af1a2b5bc
|
YARN-2312. Deprecated old ContainerId#getId API and updated MapReduce to use ContainerId#getContainerId instead. Contributed by Tsuyoshi OZAWA
|
2014-10-15 15:22:07 -07:00 |
|
Karthik Kambatla
|
cc93e7e683
|
YARN-2566. DefaultContainerExecutor should pick a working directory randomly. (Zhihai Xu via kasha)
|
2014-10-13 16:32:01 -07:00 |
|
Zhijie Shen
|
4aed2d8e91
|
YARN-2651. Spun off LogRollingInterval from LogAggregationContext. Contributed by Xuan Gong.
|
2014-10-13 10:54:09 -07:00 |
|
Zhijie Shen
|
cb81bac002
|
YARN-2583. Modified AggregatedLogDeletionService to be able to delete rolling aggregated logs. Contributed by Xuan Gong.
|
2014-10-10 00:11:30 -07:00 |
|
cnauroth
|
d3afd730ac
|
YARN-2662. TestCgroupsLCEResourcesHandler leaks file descriptors. Contributed by Chris Nauroth.
|
2014-10-09 22:44:44 -07:00 |
|
Vinod Kumar Vavilapalli
|
34cdcaad71
|
YARN-2468. Enhanced NodeManager to support log handling APIs (YARN-2569) for use by long running services. Contributed by Xuan Gong.
|
2014-10-03 12:15:40 -07:00 |
|
Jason Lowe
|
29f520052e
|
YARN-2624. Resource Localization fails on a cluster due to existing cache directories. Contributed by Anubhav Dhoot
|
2014-10-02 17:39:34 +00:00 |
|
Jian He
|
3ef1cf187f
|
YARN-2617. Fixed NM to not send duplicate container status whose app is not running. Contributed by Jun Gong
|
2014-10-02 10:04:09 -07:00 |
|
junping_du
|
c7cee9b455
|
YARN-1979. TestDirectoryCollection fails when the umask is unusual. (Contributed by Vinod Kumar Vavilapalli and Tsuyoshi OZAWA)
|
2014-10-02 08:02:59 -07:00 |
|
junping_du
|
0708827a93
|
YARN-2613. Support retry in NMClient for rolling-upgrades. (Contributed by Jian He)
|
2014-10-01 16:50:30 -07:00 |
|
Zhijie Shen
|
52bbe0f11b
|
YARN-2630. Prevented previous AM container status from being acquired by the current restarted AM. Contributed by Jian He.
|
2014-10-01 15:38:11 -07:00 |
|
Vinod Kumar Vavilapalli
|
ba7f31c2ee
|
YARN-1972. Added a secure container-executor for Windows. Contributed by Remus Rusanu.
|
2014-10-01 10:14:41 -07:00 |
|
Jian He
|
5391919b09
|
YARN-668. Changed NMTokenIdentifier/AMRMTokenIdentifier/ContainerTokenIdentifier to use protobuf object as the payload. Contributed by Junping Du.
|
2014-09-26 17:48:41 -07:00 |
|
Zhijie Shen
|
c86674a3a4
|
YARN-2581. Passed LogAggregationContext to NM via ContainerTokenIdentifier. Contributed by Xuan Gong.
|
2014-09-24 17:50:26 -07:00 |
|
Jian He
|
0a641496c7
|
YARN-1372. Ensure all completed containers are reported to the AMs across RM restart. Contributed by Anubhav Dhoot
|
2014-09-22 10:30:53 -07:00 |
|
Vinod Kumar Vavilapalli
|
9f6891d9ef
|
YARN-2531. Added a configuration for admins to be able to override app-configs and enforce/not-enforce strict control of per-container cpu usage. Contributed by Varun Vasudev.
|
2014-09-16 10:14:46 -07:00 |
|
Vinod Kumar Vavilapalli
|
4be95175cd
|
YARN-2440. Enabled Nodemanagers to limit the aggregate cpu usage across all containers to a preconfigured limit. Contributed by Varun Vasudev.
|
2014-09-10 19:22:52 -07:00 |
|
Jason Lowe
|
3fa5f728c4
|
YARN-2431. NM restart: cgroup is not removed for reacquired containers. Contributed by Jason Lowe
|
2014-09-04 21:11:27 +00:00 |
|
Jason Lowe
|
b03653f9a5
|
YARN-2462. TestNodeManagerResync#testBlockNewContainerRequestsOnStartAndResync should have a test timeout. Contributed by Eric Payne
|
2014-08-29 20:15:40 +00:00 |
|
Allen Wittenauer
|
7e75226e68
|
YARN-2424. LCE should support non-cgroups, non-secure mode (Chris Douglas via aw)
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1619421 13f79535-47bb-0310-9956-ffa450edef68
|
2014-08-21 14:57:11 +00:00 |
|
Junping Du
|
c2febdcbaa
|
YARN-1337. Recover containers upon nodemanager restart. (Contributed by Jason Lowe)
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1617448 13f79535-47bb-0310-9956-ffa450edef68
|
2014-08-12 10:56:13 +00:00 |
|
Junping Du
|
b8f151231b
|
YARN-1354. Recover applications upon nodemanager restart. (Contributed by Jason Lowe)
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1615550 13f79535-47bb-0310-9956-ffa450edef68
|
2014-08-04 13:25:37 +00:00 |
|
Zhijie Shen
|
1d6e178144
|
YARN-2347. Consolidated RMStateVersion and NMDBSchemaVersion into Version in yarn-server-common. Contributed by Junping Du.
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1614838 13f79535-47bb-0310-9956-ffa450edef68
|
2014-07-31 09:27:43 +00:00 |
|
Devarajulu K
|
2050e0dad6
|
YARN-1342. Recover container tokens upon nodemanager restart. Contributed by Jason Lowe.
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1612995 13f79535-47bb-0310-9956-ffa450edef68
|
2014-07-24 05:02:00 +00:00 |
|
Junping Du
|
537c361f5b
|
YARN-2013. The diagnostics is always the ExitCodeException stack when the container crashes. (Contributed by Tsuyoshi OZAWA)
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1612449 13f79535-47bb-0310-9956-ffa450edef68
|
2014-07-22 03:01:58 +00:00 |
|
Jason Darrell Lowe
|
8a87085820
|
YARN-2045. Data persisted in NM should be versioned. Contributed by Junping Du
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1612285 13f79535-47bb-0310-9956-ffa450edef68
|
2014-07-21 14:43:59 +00:00 |
|
Junping Du
|
403ec8ea80
|
YARN-1341. Recover NMTokens upon nodemanager restart. (Contributed by Jason Lowe)
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1611512 13f79535-47bb-0310-9956-ffa450edef68
|
2014-07-17 23:33:22 +00:00 |
|
Jian He
|
6d7dbd4fed
|
YARN-1367. Changed NM to not kill containers on NM resync if RM work-preserving restart is enabled. Contributed by Anubhav Dhoot
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1608334 13f79535-47bb-0310-9956-ffa450edef68
|
2014-07-07 04:37:59 +00:00 |
|
Karthik Kambatla
|
58e398f31d
|
YARN-2204. Explicitly enable vmem check in TestContainersMonitor#testContainerKillOnMemoryOverflow. (Anubhav Dhoot via kasha)
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1607231 13f79535-47bb-0310-9956-ffa450edef68
|
2014-07-02 02:05:37 +00:00 |
|
Vinod Kumar Vavilapalli
|
e285b98f0f
|
YARN-2152. Added missing information into ContainerTokenIdentifier so that NodeManagers can report the same to RM when RM restarts. Contributed by Jian He.
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1605205 13f79535-47bb-0310-9956-ffa450edef68
|
2014-06-24 21:43:22 +00:00 |
|
Thomas Graves
|
1f9a0fd927
|
YARN-2072. RM/NM UIs and webservices are missing vcore information. (Nathan Roberts via tgraves)
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1605162 13f79535-47bb-0310-9956-ffa450edef68
|
2014-06-24 19:34:34 +00:00 |
|
Junping Du
|
072360d128
|
YARN-1339. Recover DeletionService state upon nodemanager restart. (Contributed by Jason Lowe)
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1603036 13f79535-47bb-0310-9956-ffa450edef68
|
2014-06-17 01:02:16 +00:00 |
|
Bikas Saha
|
ecfd43a2f1
|
YARN-2091. Add more values to ContainerExitStatus and pass it from NM to RM and then to app masters (Tsuyoshi OZAWA via bikas)
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1601762 13f79535-47bb-0310-9956-ffa450edef68
|
2014-06-10 20:08:33 +00:00 |
|
Vinod Kumar Vavilapalli
|
23c325ad47
|
YARN-2115. Replaced RegisterNodeManagerRequest's ContainerStatus with a new NMContainerStatus which has more information that is needed for work-preserving RM-restart. Contributed by Jian He.
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1598790 13f79535-47bb-0310-9956-ffa450edef68
|
2014-05-31 00:20:50 +00:00 |
|
Junping Du
|
66598697a6
|
YARN-1338. Recover localized resource cache state upon nodemanager restart (Contributed by Jason Lowe)
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1598640 13f79535-47bb-0310-9956-ffa450edef68
|
2014-05-30 15:37:27 +00:00 |
|
Junping Du
|
b29434a5c8
|
YARN-1362. Distinguish between nodemanager shutdown for decommission vs shutdown for restart. (Contributed by Jason Lowe)
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1594421 13f79535-47bb-0310-9956-ffa450edef68
|
2014-05-14 00:20:53 +00:00 |
|
Junping Du
|
72ba2b35d8
|
YARN-766. TestNodeManagerShutdown in branch-2 should use Shell to form the output path and a format issue in trunk. (Contributed by Siddharth Seth)
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1593660 13f79535-47bb-0310-9956-ffa450edef68
|
2014-05-10 03:45:01 +00:00 |
|
Ivan Mitic
|
4810e2b849
|
YARN-1865. ShellScriptBuilder does not check for some error conditions. Contributed by Remus Rusanu.
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1588693 13f79535-47bb-0310-9956-ffa450edef68
|
2014-04-19 18:55:07 +00:00 |
|
Junping Du
|
0d087f7941
|
YARN-1750. TestNodeStatusUpdater#testNMRegistration is incorrect in test case. (Wangda Tan via junping_du)
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1588343 13f79535-47bb-0310-9956-ffa450edef68
|
2014-04-17 19:06:48 +00:00 |
|
Vinod Kumar Vavilapalli
|
c6b70f4760
|
YARN-1933. Fixed test issues with TestAMRestart and TestNodeHealthService. Contributed by Jian He.
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1587104 13f79535-47bb-0310-9956-ffa450edef68
|
2014-04-13 21:51:38 +00:00 |
|
Jian He
|
ed78328d50
|
YARN-1903. Set exit code and diagnostics when container is killed at NEW/LOCALIZING state. Contributed by Zhijie Shen
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1586522 13f79535-47bb-0310-9956-ffa450edef68
|
2014-04-11 01:26:36 +00:00 |
|
Karthik Kambatla
|
245012a9d9
|
YARN-1757. NM Recovery. Auxiliary service support. (Jason Lowe via kasha)
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1585783 13f79535-47bb-0310-9956-ffa450edef68
|
2014-04-08 17:15:58 +00:00 |
|
Vinod Kumar Vavilapalli
|
06579878df
|
YARN-1775. Enhanced ProcfsBasedProcessTree to optionally add the ability to use smaps for obtaining used memory information. Contributed by Rajesh Balamohan.
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1580087 13f79535-47bb-0310-9956-ffa450edef68
|
2014-03-22 00:01:59 +00:00 |
|
Jian He
|
6a89e57b8d
|
YARN-1206. Fixed AM container log to show on NM web page after application finishes if log-aggregation is disabled. Contributed by Rohith Sharmaks
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1578614 13f79535-47bb-0310-9956-ffa450edef68
|
2014-03-17 21:49:06 +00:00 |
|
Jonathan Turner Eagles
|
7944eab64a
|
YARN-1136. Replace junit.framework.Assert with org.junit.Assert (Chen He via jeagles)
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1578539 13f79535-47bb-0310-9956-ffa450edef68
|
2014-03-17 20:03:35 +00:00 |
|
Vinod Kumar Vavilapalli
|
96e0ca2d27
|
YARN-1824. Improved NodeManager and clients to be able to handle cross platform application submissions. Contributed by Jian He.
MAPREDUCE-4052. Improved MapReduce clients to use NodeManagers' ability to handle cross platform application submissions. Contributed by Jian He.
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1578135 13f79535-47bb-0310-9956-ffa450edef68
|
2014-03-16 18:32:05 +00:00 |
|
Vinod Kumar Vavilapalli
|
8aab8533a1
|
YARN-1800. Fixed NodeManager to gracefully handle RejectedExecutionException in the public-localizer thread-pool. Contributed by Varun Vasudev.
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1576545 13f79535-47bb-0310-9956-ffa450edef68
|
2014-03-11 23:33:56 +00:00 |
|