Arun Suresh
dc26601d8f
YARN-5180. Allow ResourceRequest to specify an enforceExecutionType flag. (asuresh)
2016-06-02 09:01:02 -07:00
Xuan
0bc05e40fa
YARN-5088. Improve "yarn log" command-line to read the last K bytes for the log files. Contributed by Xuan Gong
2016-06-01 13:44:21 -07:00
Varun Vasudev
86fb58b7dc
YARN-4007. Add support for different network setups when launching the docker container. Contributed by Sidharta Seethana.
2016-05-29 21:32:51 +05:30
Varun Vasudev
42f90ab885
YARN-4844. Add getMemorySize/getVirtualCoresSize to o.a.h.y.api.records.Resource. Contributed by Wangda Tan.
2016-05-29 21:24:16 +05:30
Arun Suresh
4fc09a897b
YARN-5117. QueuingContainerManager does not start GUARANTEED Container even if Resources are available. (Konstantinos Karanasos via asuresh)
2016-05-27 17:49:15 -07:00
Carlo Curino
013532a95e
YARN-4957. Add getNewReservation in ApplicationClientProtocol (Sean Po via curino)
2016-05-25 16:55:49 -07:00
Robert Kanter
e07519b8db
YARN-4766. NM should not aggregate logs older than the retention policy (haibochen via rkanter)
2016-05-25 10:25:44 -07:00
Varun Vasudev
c380a22031
YARN-857. Localization failures should be available in container diagnostics. Contributed by Vinod Kumar Vavilapalli.
2016-05-25 19:01:12 +05:30
Varun Vasudev
0287c49107
YARN-5016. Add support for a minimum retry interval for container retries. Contributed by Jun Gong.
2016-05-20 12:28:14 +05:30
Akira Ajisaka
d4274c64bc
YARN-5107. TestContainerMetrics fails. (aajisaka)
2016-05-19 19:53:59 +09:00
Arun Suresh
1597630681
YARN-5110. Fix OpportunisticContainerAllocator to insert complete HostAddress in issued ContainerTokenIds. (Konstantinos Karanasos via asuresh)
2016-05-18 18:46:00 -07:00
Jason Lowe
81effb7dcd
YARN-4325. Nodemanager log handlers fail to send finished/failed events in some cases. Contributed by Junping Du
2016-05-16 15:40:23 +00:00
Arun Suresh
f45bc5a83e
YARN-4738. Notify the RM about the status of OPPORTUNISTIC containers (Konstantinos Karanasos via asuresh)
2016-05-15 17:54:34 -07:00
Arun Suresh
f0ac18d001
YARN-2888. Corrective mechanisms for rebalancing NM container queues. (asuresh)
2016-05-13 13:38:36 -07:00
Sangjin Lee
0bbe01f8d5
YARN-4577. Enable aux services to have their own custom classpath/jar file (Xuan Gong via sale)
2016-05-12 10:11:23 -07:00
Arun Suresh
d464f4d1c4
YARN-5049. Extend NMStateStore to save queued container information. (Konstantinos Karanasos via asuresh)
2016-05-11 19:10:17 -07:00
Varun Vasudev
72b047715c
YARN-4595. Add support for configurable read-only mounts when launching Docker containers. Contributed by Billie Rinaldi.
2016-05-05 13:01:54 +05:30
Wangda Tan
7bd418e48c
YARN-4984. LogAggregationService shouldn't swallow exception in handling createAppDir() which cause thread leak. (Junping Du via wangda)
2016-05-04 11:38:55 -07:00
Junping Du
e61d431275
YARN-4920. ATS/NM should support a link to dowload/get the logs in text format. Contributed by Xuan Gong.
2016-05-04 10:35:49 -07:00
Robert Kanter
9d4d30243b
Remove parent's env vars from child processes
2016-04-29 09:25:51 -07:00
Varun Vasudev
0f25a1bb52
YARN-3998. Add support in the NodeManager to re-launch containers. Contributed by Jun Gong.
2016-04-29 16:09:07 +05:30
Jian He
4a8508501b
YARN-5009. NMLeveldbStateStoreService database can grow substantially leading to longer recovery times. Contributed by Jason Lowe
2016-04-28 21:54:11 -07:00
Karthik Kambatla
1a3f1482e2
YARN-4795. ContainerMetrics drops records. (Daniel Templeton via kasha)
2016-04-26 06:15:36 -07:00
Arun Suresh
c282a08f38
YARN-2885. Create AMRMProxy request interceptor and ContainerAllocator to distribute OPPORTUNISTIC containers to appropriate Nodes (asuresh)
...
(cherry picked from commit 2bf025278a318b0452fdc9ece4427b4c42124e39)
2016-04-24 22:38:33 -07:00
Jing Zhao
63e5412f1a
HDFS-9427. HDFS should not default to ephemeral ports. Contributed by Xiaobing Zhou.
2016-04-22 15:14:40 -07:00
Karthik Kambatla
c8172f5f14
YARN-2883. Queuing of container requests in the NM. (Konstantinos Karanasos and Arun Suresh via kasha)
2016-04-20 09:55:50 -07:00
Jason Lowe
3150ae8108
YARN-4924. NM recovery race can lead to container not cleaned up. Contributed by sandflee
2016-04-14 19:17:14 +00:00
Naganarasimha
437e9d6475
YARN-4810. NM applicationpage cause internal error 500. Contributed by Bibin A Chundatt.
2016-04-12 17:59:46 +05:30
Vinod Kumar Vavilapalli
44bbc50d91
YARN-4168. Fixed a failing test TestLogAggregationService.testLocalFileDeletionOnDiskFull. Contributed by Takashi Ohnishi.
2016-04-11 12:11:14 -07:00
Karthik Kambatla
e82f961a39
YARN-4756. Unnecessary wait in Node Status Updater during reboot. (Eric Badger via kasha)
2016-04-07 17:05:29 -07:00
Varun Vasudev
b41e65e5bc
YARN-4906. Capture container start/finish time in container metrics. Contributed by Jian He.
2016-04-06 13:41:33 +05:30
Junping Du
0005816743
YARN-4916. TestNMProxy.tesNMProxyRPCRetry fails. Contributed by Tibor Kiss.
2016-04-05 09:01:08 -07:00
naganarasimha
5092c94195
YARN-4746. yarn web services should convert parse failures of appId, appAttemptId and containerId to 400. Contributed by Bibin A Chundatt
2016-04-04 16:25:03 +05:30
Jian He
0dd9bcab97
YARN-4811. Generate histograms in ContainerMetrics for actual container resource usage
2016-03-31 14:28:13 -07:00
Allen Wittenauer
0a74610d1c
HADOOP-11393. Revert HADOOP_PREFIX, go back to HADOOP_HOME (aw)
2016-03-31 07:51:05 -07:00
Jason Lowe
948b758070
YARN-4773. Log aggregation performs extraneous filesystem operations when rolling log aggregation is disabled. Contributed by Jun Gong
2016-03-28 23:00:56 +00:00
Robert Kanter
22ca176dfe
TestResourceLocalizationService.testPublicResourceInitializesLocalDir fails Intermittently due to IllegalArgumentException from cleanup (templedf via rkanter)
2016-03-15 10:05:10 -07:00
Vinod Kumar Vavilapalli
b2661765a5
YARN-4762. Fixed CgroupHandler's creation and usage to avoid NodeManagers crashing when LinuxContainerExecutor is enabled. (Sidharta Seethana via vinodkv)
2016-03-07 11:08:17 -08:00
Jason Lowe
059caf9989
YARN-4744. Too many signal to container failure in case of LCE. Contributed by Sidharta Seethana
2016-03-07 15:40:01 +00:00
Haohui Mai
0fa54d45b1
HADOOP-12813. Migrate TestRPC and related codes to rebase on ProtobufRpcEngine. Contributed by Kai Zheng.
2016-02-29 11:41:00 -08:00
Ming Ma
7f3139e54d
YARN-4720. Skip unnecessary NN operations in log aggregation. (Jun Gong via mingma)
2016-02-26 08:40:05 -08:00
Robert Kanter
d7fdec1e6b
YARN-4579. Allow DefaultContainerExecutor container log directory permissions to be configurable (rchiang via rkanter)
2016-02-25 16:36:38 -08:00
Robert Kanter
954dd57043
YARN-4697. NM aggregation thread pool is not bound by limits (haibochen via rkanter)
2016-02-24 15:00:24 -08:00
Jason Lowe
d284e187b8
YARN-2046. Out of band heartbeats are sent only on container kill and possibly too early. Contributed by Ming Ma
2016-02-23 20:49:09 +00:00
Varun Vasudev
140cb5d745
YARN-4709. NMWebServices produces incorrect JSON for containers. Contributed by Varun Saxena.
2016-02-23 12:29:25 +05:30
Varun Vasudev
fa00d3e205
YARN-4655. Log uncaught exceptions/errors in various thread pools in YARN. Contributed by Sidharta Seethana.
2016-02-11 12:06:42 +05:30
Wangda Tan
9875325d5c
YARN-4340. Add list API to reservation system. (Sean Po via wangda)
2016-02-02 10:17:33 +08:00
Rohith Sharma K S
ac68666803
YARN-4543. Fix random test failure in TestNodeStatusUpdater.testStopReentrant. (Akihiro Suda via rohithsharmaks)
2016-01-29 12:29:54 +05:30
Jason Lowe
61382ff8fa
YARN-4643. Container recovery is broken with delegating container runtime. Contributed by Sidharta Seethana
2016-01-28 18:59:35 +00:00
Vinod Kumar Vavilapalli (I am also known as @tshooter.)
2085e60a96
YARN-3542. Refactored existing CPU cgroups support to use the newer and integrated ResourceHandler mechanism, and also deprecated the old LCEResourceHandler inteface hierarchy. Contributed by Varun Vasudev.
2016-01-25 16:19:36 -08:00
Xuan
618bfd6ac2
YARN-4496. Improve HA ResourceManager Failover detection on the client.
...
Contributed by Jian He
2016-01-22 18:20:38 -08:00
Varun Vasudev
b41a7e89d1
YARN-4578. Directories that are mounted in docker containers need to be more restrictive/container-specific. Contributed by Sidharta Seethana.
2016-01-22 14:43:14 +05:30
Wangda Tan
89d1fd5dac
HADOOP-12356. Fix computing CPU usage statistics on Windows. (Inigo Goiri via wangda)
2016-01-19 21:27:38 +08:00
Varun Vasudev
3ddb92bd30
YARN-4553. Add cgroups support for docker containers. Contributed by Sidharta Seethana.
2016-01-14 14:29:29 +05:30
Jason Lowe
13de8359a1
YARN-4414. Nodemanager connection errors are retried at multiple levels. Contributed by Chang Li
2016-01-12 15:56:15 +00:00
Steve Loughran
07d1cb612c
YARN-4550. Some tests in TestContainerLanch fails on non-english locale environment. (Takashi Ohnishi via stevel)
2016-01-07 14:30:20 +00:00
rohithsharmaks
791c1639ae
YARN-4393. Fix intermittent test failure for TestResourceLocalizationService#testFailedDirsResourceRelease (Varun Saxana via rohithsharmaks)
2016-01-07 09:38:47 +05:30
Gera Shegalov
2c17b81569
YARN-2934. Improve handling of container's stderr. (Naganarasimha G R via gera)
2015-12-24 23:48:05 -08:00
Uma Mahesh
0f82b5d878
YARN-4480. Clean up some inappropriate imports. (Kai Zheng via umamahesh)
2015-12-19 23:10:13 -08:00
Vinod Kumar Vavilapalli (I am also known as @tshooter.)
4e7d32c0db
YARN-1856. Added cgroups based memory monitoring for containers as another alternative to custom memory-monitoring. Contributed by Varun Vasudev.
2015-12-17 12:13:03 -08:00
Jian He
915cd6c3f4
YARN-4402. TestNodeManagerShutdown And TestNodeManagerResync fails with bind exception. Contributed by Brahma Reddy Battula
2015-12-14 14:59:01 -08:00
Wangda Tan
dfcbbddb09
YARN-4309. Add container launch related debug information to container logs when a container fails. (Varun Vasudev via wangda)
2015-12-14 11:13:22 -08:00
Junping Du
62e9348bc1
YARN-4408. Fix issue that NodeManager still reports negative running containers. Contributed by Robert Kanter.
2015-12-03 06:36:37 -08:00
Tsuyoshi Ozawa
0656d2dc83
YARN-4380. TestResourceLocalizationService.testDownloadingResourcesOnContainerKill fails intermittently. Contributed by Varun Saxena.
2015-11-26 01:10:02 +09:00
Jason Lowe
4ac6799d4a
YARN-4132. Separate configs for nodemanager to resourcemanager connection timeout and retries. Contributed by Chang Li
2015-11-24 22:35:37 +00:00
Junping Du
855d52927b
YARN-4354. Public resource localization fails with NPE. Contributed by Jason Lowe.
2015-11-15 04:43:57 -08:00
Jason Lowe
e2267de207
YARN-2902. Killing a container that is localizing can orphan resources in the DOWNLOADING state. Contributed by Varun Saxena
2015-10-29 16:34:25 +00:00
Wangda Tan
6f606214e7
YARN-4169. Fix racing condition of TestNodeStatusUpdaterForLabels. (Naganarasimha G R via wangda)
2015-10-26 16:36:34 -07:00
Rohith Sharma K S
5acdde4744
YARN-2729. Support script based NodeLabelsProvider Interface in Distributed Node Label Configuration Setup. (Naganarasimha G R via rohithsharmaks)
2015-10-26 15:42:42 +05:30
Tsuyoshi Ozawa
ce60b4fc8b
YARN-3528. Tests with 12345 as hard-coded port break jenkins. Contributed by Brahma Reddy Battula.
2015-10-26 16:45:11 +09:00
Varun Vasudev
e39ae0e676
YARN-4262. Allow whitelisted users to run privileged docker containers. Contributed by Sidharta Seethana.
2015-10-19 20:19:22 +05:30
Steve Loughran
e2d59e2c7b
YARN-4155. TestLogAggregationService.testLogAggregationServiceWithInterval failing. (Bibin A Chundatt via stevel)
2015-10-18 12:03:57 +01:00
Varun Vasudev
63020c54c1
YARN-4258. Add support for controlling capabilities for docker containers. Contributed by Sidharta Seethana.
2015-10-15 13:41:20 +05:30
Steve Loughran
dfa7848463
HADOOP-12478. Shell.getWinUtilsPath() has been renamed Shell.getWinutilsPath(). (stevel)
2015-10-14 20:25:47 +01:00
Steve Loughran
c59af2fdf8
HADOOP-10775. Shell operations to fail with meaningful errors on windows if winutils.exe not found. (stevel)
2015-10-13 21:49:28 +01:00
Jason Lowe
a0bca2b5ad
YARN-261. Ability to fail AM attempts. Contributed by Andrey Klochkov and Rohith Sharma K S
2015-10-09 14:17:38 +00:00
Jason Lowe
8d226225d0
YARN-3943. Use separate threshold configurations for disk-full detection and disk-not-full detection. Contributed by Zhihai Xu
2015-10-08 22:25:34 +00:00
Wangda Tan
30ac69c6bd
YARN-4176. Resync NM nodelabels with RM periodically for distributed nodelabels. (Bibin A Chundatt via wangda)
2015-10-05 15:47:21 -07:00
Xuan
8f08532bde
YARN-1897. CLI and core support for signal container functionality. Contributed by Ming Ma
2015-10-02 18:50:47 -07:00
Jason Lowe
fdf02d1f26
YARN-3619. ContainerMetrics unregisters during getMetrics and leads to ConcurrentModificationException. Contributed by Zhihai Xu
2015-10-02 20:20:31 +00:00
Jason Lowe
854d25b0c3
YARN-3727. For better error recovery, check if the directory exists before using it for localization. Contributed by Zhihai Xu
2015-09-30 14:59:44 +00:00
Rohith Sharma K S
8ed0d4b744
YARN-4152. NodeManager crash with NPE when LogAggregationService#stopContainer called for absent container. (Bibin A Chundatt via rohithsharmaks)
2015-09-24 11:24:14 +05:30
Jian He
c57eac5dfe
YARN-3868. Recovery support for container resizing. Contributed by Meng Ding
2015-09-23 13:29:38 -07:00
Jian He
c3dc1af072
YARN-1644. RM-NM protocol changes and NodeStatusUpdater implementation to support container resizing. Contributed by Meng Ding
2015-09-23 13:29:37 -07:00
Jian He
c59ae4eeb1
YARN-1643. Make ContainersMonitor support changing monitoring size of an allocated container. Contributed by Meng Ding and Wangda Tan
2015-09-23 13:29:37 -07:00
Jian He
5f5a968d65
YARN-3867. ContainerImpl changes to support container resizing. Contributed by Meng Ding
2015-09-23 13:29:37 -07:00
Jian He
ffd820c27a
YARN-1645. ContainerManager implementation to support container resizing. Contributed by Meng Ding & Wangda Tan
2015-09-23 13:29:37 -07:00
Jason Lowe
c890c51a91
YARN-4095. Avoid sharing AllocatorPerContext object in LocalDirAllocator between ShuffleHandler and LocalDirsHandlerService. Contributed by Zhihai Xu
2015-09-23 15:42:01 +00:00
Xuan
34ef1a092b
YARN-4149. yarn logs -am should provide an option to fetch all the log
...
files. Contributed by Varun Vasudev
2015-09-15 14:36:30 -07:00
Wangda Tan
77666105b4
YARN-4106. NodeLabels for NM in distributed mode is not updated even after clusterNodelabel addition in RM. (Bibin A Chundatt via wangda)
2015-09-10 09:30:09 -07:00
Zhihai Xu
16b9037dc1
YARN-4096. App local logs are leaked if log aggregation fails to initialize for the app. Contributed by Jason Lowe.
2015-09-08 12:29:54 -07:00
Jian He
6f72f1e600
YARN-2884. Added a proxy service in NM to proxy the the communication between AM and RM. Contributed by Kishore Chaliparambil
2015-09-08 09:35:46 +08:00
Varun Vasudev
1dbd8e34a7
YARN-3591. Resource localization on a bad disk causes subsequent containers failure. Contributed by Lavkesh Lahngir.
2015-09-07 11:32:12 +05:30
Rohith Sharma K S
095ab9ab5f
YARN-4073. Removed unused ApplicationACLsManager in ContainerManagerImpl constructor. (Naganarasimha G R via rohithsharmaks)
2015-09-02 14:13:33 +05:30
Xuan
37e1c3d82a
YARN-221. NM should provide a way for AM to tell it not to aggregate
...
logs. Contributed by Ming Ma
2015-08-22 16:25:24 -07:00
Wangda Tan
fc07464d1a
YARN-2923. Support configuration based NodeLabelsProvider Service in Distributed Node Label Configuration Setup. (Naganarasimha G R)
2015-08-20 11:51:03 -07:00
Karthik Kambatla
def12933b3
YARN-3534. Collect memory/cpu usage on the node. (Inigo Goiri via kasha)
2015-08-16 06:24:16 -07:00
Jian He
38aed1a94e
YARN-4005. Completed container whose app is finished is possibly not removed from NMStateStore. Contributed by Jun Gong
2015-08-13 14:46:08 -07:00
Jason Lowe
469cfcd695
YARN-3965. Add startup timestamp to nodemanager UI. Contributed by Hong Zhiguo
2015-08-03 15:53:32 +00:00
Varun Vasudev
3e6fce91a4
YARN-3853. Add docker container runtime support to LinuxContainterExecutor. Contributed by Sidharta Seethana.
2015-07-27 11:57:40 -07:00
Jason Lowe
ff9c13e0a7
YARN-3925. ContainerLogsUtils#getContainerLogFile fails to read container log files from full disks. Contributed by zhihai xu
2015-07-24 22:14:39 +00:00
Wangda Tan
06e5dd2c84
YARN-3956. Fix TestNodeManagerHardwareUtils fails on Mac (Varun Vasudev via wangda)
2015-07-22 12:01:41 -07:00
Varun Vasudev
63d0365088
YARN-2194. Addendum patch to fix failing unit test in TestPrivilegedOperationExecutor. Contributed by Sidharta Seethana.
2015-07-09 11:51:59 +05:30
Varun Vasudev
c40bdb56a7
YARN-2194. Fix bug causing CGroups functionality to fail on RHEL7. Contributed by Wei Yan.
2015-07-07 16:59:29 +05:30
Jason Lowe
b5cdf78e8e
YARN-3793. Several NPEs when deleting local files on NM recovery. Contributed by Varun Saxena
2015-07-01 21:13:32 +00:00
Jian He
62e583c7dc
YARN-3695. ServerProxy (NMProxy, etc.) shouldn't retry forever for non network exception. Contributed by Raju Bairishetti
2015-06-29 13:37:32 -07:00
Jason Lowe
40b256949a
YARN-3850. NM fails to read files from full disks which can lead to container logs being lost and other issues. Contributed by Varun Saxena
2015-06-26 15:47:07 +00:00
Karthik Kambatla
5ebf2817e5
YARN-3842. NMProxy should retry on NMNotYetReadyException. (Robert Kanter via kasha)
2015-06-22 17:47:47 -07:00
Xuan
6c7a9d502a
YARN-3834. Scrub debug logging of tokens during resource localization. Contributed by Chris Nauroth
2015-06-21 17:13:44 -07:00
Devaraj K
126321eded
YARN-3747. TestLocalDirsHandlerService should delete the created test
...
directory logDir2. Contributed by David Moore.
2015-06-08 15:32:13 +05:30
Junping Du
d7e7f6aa03
YARN-41. The RM should handle the graceful shutdown of the NM. Contributed by Devaraj K.
2015-06-04 04:59:27 -07:00
cnauroth
4102e5882e
YARN-3626. On Windows localized resources are not moved to the front of the classpath when they should be. Contributed by Craig Welch.
2015-05-27 14:31:49 -07:00
Vinod Kumar Vavilapalli
500a1d9c76
YARN-160. Enhanced NodeManager to automatically obtain cpu/memory values from underlying OS when configured to do so. Contributed by Varun Vasudev.
2015-05-26 11:38:35 -07:00
Vinod Kumar Vavilapalli
53fafcf061
YARN-3684. Changed ContainerExecutor's primary lifecycle methods to use a more extensible mechanism of context objects. Contributed by Sidharta Seethana.
2015-05-21 15:50:23 -07:00
Wangda Tan
b37da52a1c
YARN-3565. NodeHeartbeatRequest/RegisterNodeManagerRequest should use NodeLabel object instead of String. (Naganarasimha G R via wangda)
2015-05-19 16:34:17 -07:00
Ravi Prakash
c97f32e7b9
YARN-3302. TestDockerContainerExecutor should run automatically if it can detect docker in the usual place (Ravindra Kumar Naik via raviprak)
2015-05-19 10:28:11 -07:00
Xuan
0f95921447
YARN-3626. On Windows localized resources are not moved to the front of the classpath when they should be. Contributed by Craig Welch
2015-05-13 13:10:53 -07:00
Karthik Kambatla
70fb37cd79
YARN-1287. Consolidate MockClocks. (Sebastian Wong and Anubhav Dhoot via kasha)
2015-05-09 14:34:54 -07:00
Xuan
6471d18bc7
YARN-1912. ResourceLocalizer started without any jvm memory control.
...
Contributed by Masatake Iwasaki
2015-05-08 20:01:21 -07:00
Xuan
333f9a896d
YARN-3602. TestResourceLocalizationService.testPublicResourceInitializesLocalDir fails Intermittently due to IOException from cleanup. Contributed by zhihai xu
2015-05-08 17:52:54 -07:00
Xuan
088156de43
YARN-2331. Distinguish shutdown during supervision vs. shutdown for
...
rolling upgrade. Contributed by Jason Lowe
2015-05-08 15:10:43 -07:00
Robert (Bobby) Evans
bcf2890502
YARN-644: Basic null check is not performed on passed in arguments before using them in ContainerManagerImpl.startContainer
2015-05-08 11:11:01 -05:00
Akira Ajisaka
b167fe7605
YARN-1832. Fix wrong MockLocalizerStatus#equals implementation. Contributed by Hong Zhiguo.
2015-05-08 11:14:45 +09:00
Akira Ajisaka
918af8efff
YARN-3577. Misspelling of threshold in log4j.properties for tests. Contributed by Brahma Reddy Battula.
2015-05-07 13:33:03 +09:00
Robert Kanter
b72507810a
YARN-3491. PublicLocalizer#addResource is too slow. (zxu via rkanter)
2015-05-06 14:19:06 -07:00
Robert Kanter
ac7d152901
YARN-3363. add localization and container launch time to ContainerMetrics at NM to show these timing information for each active container. (zxu via rkanter)
2015-05-01 16:39:21 -07:00
Vinod Kumar Vavilapalli
1b3b9e5c31
YARN-2619. Added NodeManager support for disk io isolation through cgroups. Contributed by Varun Vasudev and Wei Yan.
2015-04-30 21:41:07 -07:00
Karthik Kambatla
47279c3228
YARN-3464. Race condition in LocalizerRunner kills localizer before localizing all resources. (Zhihai Xu via kasha)
2015-04-26 09:13:46 -07:00
Vinod Kumar Vavilapalli
a100be685c
YARN-3366. Enhanced NodeManager to support classifying/shaping outgoing network bandwidth traffic originating from YARN containers Contributed by Sidharta Seethana.
2015-04-22 17:26:13 -07:00
Jian He
674c7ef649
YARN-3503. Expose disk utilization percentage and bad local and log dir counts in NM metrics. Contributed by Varun Vasudev
2015-04-21 20:57:02 -07:00
Jian He
1b89a3e173
YARN-3354. Add node label expression in ContainerTokenIdentifier to support RM recovery. Contributed by Wangda Tan
2015-04-15 13:57:06 -07:00
Junping Du
838b06ac87
YARN-3443. Create a 'ResourceHandler' subsystem to ease addition of support for new resource types on the NM. Contributed by Sidharta Seethana.
2015-04-13 18:35:56 -07:00
Vinod Kumar Vavilapalli
b21c72777a
YARN-3365. Enhanced NodeManager to support using the 'tc' tool via container-executor for outbound network traffic control. Contributed by Sidharta Seethana.
2015-04-02 16:53:59 -07:00
Wangda Tan
2a945d24f7
YARN-2495. Allow admin specify labels from each NM (Distributed configuration for node label). (Naganarasimha G R via wangda)
2015-03-30 12:05:21 -07:00
Ravi Prakash
e0ccea33c9
YARN-3288. Document and fix indentation in the DockerContainerExecutor code
2015-03-28 08:00:41 -07:00
Tsuyoshi Ozawa
82eda771e0
YARN-3384. TestLogAggregationService.verifyContainerLogs fails after YARN-2777. Contributed by Naganarasimha G R.
2015-03-24 00:25:30 +09:00
Junping Du
d81109e588
YARN-3269. Yarn.nodemanager.remote-app-log-dir could not be configured to fully qualified path. Contributed by Xuan Gong
2015-03-20 13:41:22 -07:00
Ravi Prakash
56085203c4
YARN-3339. TestDockerContainerExecutor should pull a single image and not the entire centos repository. (Ravindra Kumar Naik via raviprak)
2015-03-16 16:17:58 -07:00
Vinod Kumar Vavilapalli
863079bb87
YARN-3154. Added additional APIs in LogAggregationContext to avoid aggregating running logs of application when rolling is enabled. Contributed by Xuan Gong.
2015-03-12 13:32:29 -07:00
Jian He
21101c01f2
YARN-2190. Added CPU and memory limit options to the default container executor for Windows containers. Contributed by Chuan Liu
2015-03-06 14:18:11 -08:00
Allen Wittenauer
d4ac6822e1
YARN-2980. Move health check script related functionality to hadoop-common (Varun Saxena via aw)
2015-02-24 11:25:26 -08:00
Junping Du
c33ae271c2
YARN-2799. Cleanup TestLogAggregationService based on the change in YARN-90. Contributed by Zhihai Xu
2015-02-20 09:43:39 -08:00
cnauroth
6804d68901
YARN-2899. Run TestDockerContainerExecutorWithMocks on Linux only. Contributed by Ming Ma.
2015-02-13 21:58:54 -08:00
Junping Du
04f5ef18f7
YARN-2079. Recover NonAggregatingLogHandler state upon nodemanager restart. (Contributed by Jason Lowe)
2015-02-12 11:46:47 -08:00
Jason Lowe
b379972ab3
YARN-3074. Nodemanager dies when localizer runner tries to write to a full disk. Contributed by Varun Saxena
2015-02-11 16:33:43 +00:00
Jason Lowe
3f5431a22f
YARN-2809. Implement workaround for linux kernel panic when removing cgroup. Contributed by Nathan Roberts
2015-02-10 17:27:21 +00:00
Arun C. Murthy
02f154a001
YARN-1537. Fix race condition in TestLocalResourcesTrackerImpl.testLocalResourceCache. Contributed by Xuan Gong.
2015-02-05 23:58:52 -08:00
Xuan
b73e776abc
YARN-3056. Add verification for containerLaunchDuration in
...
TestNodeManagerMetrics. Contributed by zhihai Xu
2015-02-03 15:12:54 -08:00
Robert Kanter
f7a77819a1
YARN-3022. Expose Container resource information from NodeManager for monitoring (adhoot via ranter)
2015-02-03 10:39:41 -08:00
Jian He
4e15fc0841
YARN-3011. Possible IllegalArgumentException in ResourceLocalizationService might lead NM to crash. Contributed by Varun Saxena
2015-01-27 13:31:22 -08:00
Jason Lowe
902c6ea7e4
YARN-3088. LinuxContainerExecutor.deleteAsUser can throw NPE if native executor returns an error. Contributed by Eric Payne
2015-01-26 15:40:21 +00:00
Xuan
0d6bd62102
YARN-3024. LocalizerRunner should give DIE action when all resources are
...
localized. Contributed by Chengbing Liu
2015-01-25 19:37:57 -08:00
Tsuyoshi Ozawa
3aab354e66
YARN-3082. Non thread safe access to systemCredentials in NodeHeartbeatResponse processing. Contributed by Anubhav Dhoot.
2015-01-23 16:04:18 +09:00
Karthik Kambatla
84198564ba
YARN-2984. Metrics for container's actual memory usage. (kasha)
2015-01-17 05:44:04 +05:30
Junping Du
5d1cca34fa
YARN-3064. TestRMRestart/TestContainerResourceUsage/TestNodeManagerResync failure with allocation timeout. (Contributed by Jian He)
2015-01-16 00:08:36 -08:00
Jian He
cc2a745f7e
YARN-2997. Fixed NodeStatusUpdater to not send alreay-sent completed container statuses on heartbeat. Contributed by Chengbing Liu
2015-01-08 11:12:54 -08:00
Karthik Kambatla
954fb8581e
YARN-2675. containersKilled metrics is not updated when the container is killed during localization. (Zhihai Xu via kasha)
2014-12-19 16:02:20 -08:00
cnauroth
e996a1bfd4
HADOOP-11321. copyToLocal cannot save a file to an SMB share unless the user has Full Control permissions. Contributed by Chris Nauroth.
2014-12-16 15:29:22 -08:00
Steve Loughran
3681de2039
YARN-2912 Jersey Tests failing with port in use. (varun saxena via stevel)
2014-12-12 17:11:07 +00:00
Karthik Kambatla
db73cc9124
YARN-2931. PublicLocalizer may fail until directory is initialized by LocalizeRunner. (Anubhav Dhoot via kasha)
2014-12-08 22:26:18 -08:00
Junping Du
e65b7c5ff6
YARN-1156. Enhance NodeManager AllocatedGB and AvailableGB metrics for aggregation of decimal values. (Contributed by Tsuyoshi OZAWA)
2014-12-03 04:11:18 -08:00
Karthik Kambatla
233b61e495
YARN-2679. Add metric for container launch duration. (Zhihai Xu via kasha)
2014-11-21 14:22:21 -08:00
Jason Lowe
49c38898b0
YARN-2816. NM fail to start with NPE during container recovery. Contributed by Zhihai Xu
2014-11-14 21:25:59 +00:00
Karthik Kambatla
a04143039e
YARN-2236. [YARN-1492] Shared Cache uploader service on the Node Manager. (Chris Trezzo and Sanjin Lee via kasha)
2014-11-12 09:31:05 -08:00
Ravi Prakash
53f64ee516
YARN-1964. Create Docker analog of the LinuxContainerExecutor in YARN
2014-11-11 21:28:11 -08:00
Xuan
5c9a51f140
YARN-2841. RMProxy should retry EOFException. Contributed by Jian He
2014-11-10 18:25:01 -08:00
Arun C. Murthy
43cd07b408
YARN-2830. Add backwords compatible ContainerId.newInstance constructor. Contributed by Jonathan Eagles.
2014-11-09 14:57:37 -08:00
Jason Lowe
c3d475070a
YARN-2825. Container leak on NM. Contributed by Jian He
2014-11-07 23:16:37 +00:00
Vinod Kumar Vavilapalli
c5a46d4c8c
YARN-1922. Fixed NodeManager to kill process-trees correctly in the presence of races between the launch and the stop-container call and when root processes crash. Contributed by Billie Rinaldi.
2014-11-03 16:38:55 -08:00
Vinod Kumar Vavilapalli
58e9f24e0f
YARN-2788. Fixed backwards compatiblity issues with log-aggregation feature that were caused when adding log-upload-time via YARN-2703. Contributed by Xuan Gong.
2014-11-03 13:16:29 -08:00
Vinod Kumar Vavilapalli
5c0381c96a
YARN-2790. Fixed a NodeManager bug that was causing log-aggregation to fail beyond HFDS delegation-token expiry even when RM is a proxy-user (YARN-2704). Contributed by Jian He.
2014-11-01 16:32:35 -07:00
Zhijie Shen
1cd088fd9d
YARN-2711. Fixed TestDefaultContainerExecutor#testContainerLaunchError failure on Windows. Contributed by Varun Vasudev.
2014-10-31 17:43:30 -07:00
Jason Lowe
73e626ad91
YARN-2755. NM fails to clean up usercache_DEL_<timestamp> dirs after YARN-661. Contributed by Siqi Li
2014-10-30 15:10:27 +00:00
Zhijie Shen
8984e9b177
YARN-2741. Made NM web UI serve logs on the drive other than C: on Windows. Contributed by Craig Welch.
2014-10-28 14:11:19 -07:00
Vinod Kumar Vavilapalli
a16d022ca4
YARN-2704. Changed ResourceManager to optionally obtain tokens itself for the sake of localization and log-aggregation for long-running services. Contributed by Jian He.
2014-10-27 15:49:47 -07:00
Zhijie Shen
f81dc3f995
YARN-2703. Added logUploadedTime into LogValue for better display. Contributed by Xuan Gong.
2014-10-24 14:10:46 -07:00
Jian He
3b12fd6cfb
YARN-2198. Remove the need to run NodeManager as privileged account for Windows Secure Container Executor. Contributed by Remus Rusanu
2014-10-22 15:57:46 -07:00
Jason Lowe
6f2028bd15
YARN-90. NodeManager should identify failed disks becoming good again. Contributed by Varun Vasudev
2014-10-21 17:31:13 +00:00
Jian He
4fa1fb3193
Missing file for YARN-2701
2014-10-20 19:57:45 -07:00
Jian He
2839365f23
YARN-2701. Potential race condition in startLocalizer when using LinuxContainerExecutor. Contributed by Xuan Gong
2014-10-20 18:45:47 -07:00
Jian He
0af1a2b5bc
YARN-2312. Deprecated old ContainerId#getId API and updated MapReduce to use ContainerId#getContainerId instead. Contributed by Tsuyoshi OZAWA
2014-10-15 15:22:07 -07:00
Karthik Kambatla
cc93e7e683
YARN-2566. DefaultContainerExecutor should pick a working directory randomly. (Zhihai Xu via kasha)
2014-10-13 16:32:01 -07:00
Zhijie Shen
4aed2d8e91
YARN-2651. Spun off LogRollingInterval from LogAggregationContext. Contributed by Xuan Gong.
2014-10-13 10:54:09 -07:00
Zhijie Shen
cb81bac002
YARN-2583. Modified AggregatedLogDeletionService to be able to delete rolling aggregated logs. Contributed by Xuan Gong.
2014-10-10 00:11:30 -07:00
cnauroth
d3afd730ac
YARN-2662. TestCgroupsLCEResourcesHandler leaks file descriptors. Contributed by Chris Nauroth.
2014-10-09 22:44:44 -07:00
Vinod Kumar Vavilapalli
34cdcaad71
YARN-2468. Enhanced NodeManager to support log handling APIs (YARN-2569) for use by long running services. Contributed by Xuan Gong.
2014-10-03 12:15:40 -07:00
Jason Lowe
29f520052e
YARN-2624. Resource Localization fails on a cluster due to existing cache directories. Contributed by Anubhav Dhoot
2014-10-02 17:39:34 +00:00
Jian He
3ef1cf187f
YARN-2617. Fixed NM to not send duplicate container status whose app is not running. Contributed by Jun Gong
2014-10-02 10:04:09 -07:00
junping_du
c7cee9b455
YARN-1979. TestDirectoryCollection fails when the umask is unusual. (Contributed by Vinod Kumar Vavilapalli and Tsuyoshi OZAWA)
2014-10-02 08:02:59 -07:00
junping_du
0708827a93
YARN-2613. Support retry in NMClient for rolling-upgrades. (Contributed by Jian He)
2014-10-01 16:50:30 -07:00
Zhijie Shen
52bbe0f11b
YARN-2630. Prevented previous AM container status from being acquired by the current restarted AM. Contributed by Jian He.
2014-10-01 15:38:11 -07:00
Vinod Kumar Vavilapalli
ba7f31c2ee
YARN-1972. Added a secure container-executor for Windows. Contributed by Remus Rusanu.
2014-10-01 10:14:41 -07:00
Jian He
5391919b09
YARN-668. Changed NMTokenIdentifier/AMRMTokenIdentifier/ContainerTokenIdentifier to use protobuf object as the payload. Contributed by Junping Du.
2014-09-26 17:48:41 -07:00
Zhijie Shen
c86674a3a4
YARN-2581. Passed LogAggregationContext to NM via ContainerTokenIdentifier. Contributed by Xuan Gong.
2014-09-24 17:50:26 -07:00
Jian He
0a641496c7
YARN-1372. Ensure all completed containers are reported to the AMs across RM restart. Contributed by Anubhav Dhoot
2014-09-22 10:30:53 -07:00
Vinod Kumar Vavilapalli
9f6891d9ef
YARN-2531. Added a configuration for admins to be able to override app-configs and enforce/not-enforce strict control of per-container cpu usage. Contributed by Varun Vasudev.
2014-09-16 10:14:46 -07:00
Vinod Kumar Vavilapalli
4be95175cd
YARN-2440. Enabled Nodemanagers to limit the aggregate cpu usage across all containers to a preconfigured limit. Contributed by Varun Vasudev.
2014-09-10 19:22:52 -07:00
Jason Lowe
3fa5f728c4
YARN-2431. NM restart: cgroup is not removed for reacquired containers. Contributed by Jason Lowe
2014-09-04 21:11:27 +00:00
Jason Lowe
b03653f9a5
YARN-2462. TestNodeManagerResync#testBlockNewContainerRequestsOnStartAndResync should have a test timeout. Contributed by Eric Payne
2014-08-29 20:15:40 +00:00
Allen Wittenauer
7e75226e68
YARN-2424. LCE should support non-cgroups, non-secure mode (Chris Douglas via aw)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1619421 13f79535-47bb-0310-9956-ffa450edef68
2014-08-21 14:57:11 +00:00
Junping Du
c2febdcbaa
YARN-1337. Recover containers upon nodemanager restart. (Contributed by Jason Lowe)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1617448 13f79535-47bb-0310-9956-ffa450edef68
2014-08-12 10:56:13 +00:00
Junping Du
b8f151231b
YARN-1354. Recover applications upon nodemanager restart. (Contributed by Jason Lowe)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1615550 13f79535-47bb-0310-9956-ffa450edef68
2014-08-04 13:25:37 +00:00
Zhijie Shen
1d6e178144
YARN-2347. Consolidated RMStateVersion and NMDBSchemaVersion into Version in yarn-server-common. Contributed by Junping Du.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1614838 13f79535-47bb-0310-9956-ffa450edef68
2014-07-31 09:27:43 +00:00
Devarajulu K
2050e0dad6
YARN-1342. Recover container tokens upon nodemanager restart. Contributed by Jason Lowe.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1612995 13f79535-47bb-0310-9956-ffa450edef68
2014-07-24 05:02:00 +00:00
Junping Du
537c361f5b
YARN-2013. The diagnostics is always the ExitCodeException stack when the container crashes. (Contributed by Tsuyoshi OZAWA)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1612449 13f79535-47bb-0310-9956-ffa450edef68
2014-07-22 03:01:58 +00:00
Jason Darrell Lowe
8a87085820
YARN-2045. Data persisted in NM should be versioned. Contributed by Junping Du
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1612285 13f79535-47bb-0310-9956-ffa450edef68
2014-07-21 14:43:59 +00:00
Junping Du
403ec8ea80
YARN-1341. Recover NMTokens upon nodemanager restart. (Contributed by Jason Lowe)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1611512 13f79535-47bb-0310-9956-ffa450edef68
2014-07-17 23:33:22 +00:00
Jian He
6d7dbd4fed
YARN-1367. Changed NM to not kill containers on NM resync if RM work-preserving restart is enabled. Contributed by Anubhav Dhoot
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1608334 13f79535-47bb-0310-9956-ffa450edef68
2014-07-07 04:37:59 +00:00
Karthik Kambatla
58e398f31d
YARN-2204. Explicitly enable vmem check in TestContainersMonitor#testContainerKillOnMemoryOverflow. (Anubhav Dhoot via kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1607231 13f79535-47bb-0310-9956-ffa450edef68
2014-07-02 02:05:37 +00:00
Vinod Kumar Vavilapalli
e285b98f0f
YARN-2152. Added missing information into ContainerTokenIdentifier so that NodeManagers can report the same to RM when RM restarts. Contributed Jian He.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1605205 13f79535-47bb-0310-9956-ffa450edef68
2014-06-24 21:43:22 +00:00
Thomas Graves
1f9a0fd927
YARN-2072. RM/NM UIs and webservices are missing vcore information. (Nathan Roberts via tgraves)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1605162 13f79535-47bb-0310-9956-ffa450edef68
2014-06-24 19:34:34 +00:00
Junping Du
072360d128
YARN-1339. Recover DeletionService state upon nodemanager restart. (Contributed by Jason Lowe)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1603036 13f79535-47bb-0310-9956-ffa450edef68
2014-06-17 01:02:16 +00:00
Bikas Saha
ecfd43a2f1
YARN-2091. Add more values to ContainerExitStatus and pass it from NM to RM and then to app masters (Tsuyoshi OZAWA via bikas)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1601762 13f79535-47bb-0310-9956-ffa450edef68
2014-06-10 20:08:33 +00:00
Vinod Kumar Vavilapalli
23c325ad47
YARN-2115. Replaced RegisterNodeManagerRequest's ContainerStatus with a new NMContainerStatus which has more information that is needed for work-preserving RM-restart. Contributed by Jian He.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1598790 13f79535-47bb-0310-9956-ffa450edef68
2014-05-31 00:20:50 +00:00
Junping Du
66598697a6
YARN-1338. Recover localized resource cache state upon nodemanager restart (Contributed by Jason Lowe)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1598640 13f79535-47bb-0310-9956-ffa450edef68
2014-05-30 15:37:27 +00:00
Junping Du
b29434a5c8
YARN-1362. Distinguish between nodemanager shutdown for decommission vs shutdown for restart. (Contributed by Jason Lowe)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1594421 13f79535-47bb-0310-9956-ffa450edef68
2014-05-14 00:20:53 +00:00
Junping Du
72ba2b35d8
YARN-766. TestNodeManagerShutdown in branch-2 should use Shell to form the output path and a format issue in trunk. (Contributed by Siddharth Seth)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1593660 13f79535-47bb-0310-9956-ffa450edef68
2014-05-10 03:45:01 +00:00
Ivan Mitic
4810e2b849
YARN-1865. ShellScriptBuilder does not check for some error conditions. Contributed by Remus Rusanu.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1588693 13f79535-47bb-0310-9956-ffa450edef68
2014-04-19 18:55:07 +00:00
Junping Du
0d087f7941
YARN-1750. TestNodeStatusUpdater#testNMRegistration is incorrect in test case. (Wangda Tan via junping_du)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1588343 13f79535-47bb-0310-9956-ffa450edef68
2014-04-17 19:06:48 +00:00
Vinod Kumar Vavilapalli
c6b70f4760
YARN-1933. Fixed test issues with TestAMRestart and TestNodeHealthService. Contributed by Jian He.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1587104 13f79535-47bb-0310-9956-ffa450edef68
2014-04-13 21:51:38 +00:00
Jian He
ed78328d50
YARN-1903. Set exit code and diagnostics when container is killed at NEW/LOCALIZING state. Contributed by Zhijie Shen
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1586522 13f79535-47bb-0310-9956-ffa450edef68
2014-04-11 01:26:36 +00:00
Karthik Kambatla
245012a9d9
YARN-1757. NM Recovery. Auxiliary service support. (Jason Lowe via kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1585783 13f79535-47bb-0310-9956-ffa450edef68
2014-04-08 17:15:58 +00:00
Vinod Kumar Vavilapalli
06579878df
YARN-1775. Enhanced ProcfsBasedProcessTree to optionally add the ability to use smaps for obtaining used memory information. Contributed by Rajesh Balamohan.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1580087 13f79535-47bb-0310-9956-ffa450edef68
2014-03-22 00:01:59 +00:00
Jian He
6a89e57b8d
YARN-1206. Fixed AM container log to show on NM web page after application finishes if log-aggregation is disabled. Contributed by Rohith Sharmaks
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1578614 13f79535-47bb-0310-9956-ffa450edef68
2014-03-17 21:49:06 +00:00
Jonathan Turner Eagles
7944eab64a
YARN-1136. Replace junit.framework.Assert with org.junit.Assert (Chen He via jeagles)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1578539 13f79535-47bb-0310-9956-ffa450edef68
2014-03-17 20:03:35 +00:00
Vinod Kumar Vavilapalli
96e0ca2d27
YARN-1824. Improved NodeManager and clients to be able to handle cross platform application submissions. Contributed by Jian He.
...
MAPREDUCE-4052. Improved MapReduce clients to use NodeManagers' ability to handle cross platform application submissions. Contributed by Jian He.
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1578135 13f79535-47bb-0310-9956-ffa450edef68
2014-03-16 18:32:05 +00:00
Vinod Kumar Vavilapalli
8aab8533a1
YARN-1800. Fixed NodeManager to gracefully handle RejectedExecutionException in the public-localizer thread-pool. Contributed by Varun Vasudev.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1576545 13f79535-47bb-0310-9956-ffa450edef68
2014-03-11 23:33:56 +00:00
Vinod Kumar Vavilapalli
0b1304d098
YARN-1781. Modified NodeManagers to allow admins to specify max disk utilization for local disks so as to be able to offline full disks. Contributed by Varun Vasudev.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1575463 13f79535-47bb-0310-9956-ffa450edef68
2014-03-08 00:52:06 +00:00
Vinod Kumar Vavilapalli
1c4047b0e4
YARN-1783. Fixed a bug in NodeManager's status-updater that was losing completed container statuses when NodeManager is forced to resync by the ResourceManager. Contributed by Jian He.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1575437 13f79535-47bb-0310-9956-ffa450edef68
2014-03-07 22:36:47 +00:00
Vinod Kumar Vavilapalli
d07f855892
YARN-1686. Fixed NodeManager to properly handle any errors during re-registration after a RESYNC and thus avoid hanging. Contributed by Rohith Sharma.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1571474 13f79535-47bb-0310-9956-ffa450edef68
2014-02-24 22:41:24 +00:00
Sanford Ryza
9024ad4aa0
YARN-1697. NodeManager reports negative running containers (Sandy Ryza)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1567356 13f79535-47bb-0310-9956-ffa450edef68
2014-02-11 20:14:30 +00:00
Karthik Kambatla
d57c6e0fe7
YARN-1672. YarnConfiguration is missing a default for yarn.nodemanager.log.retain-seconds (Naren Koneru via kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1565866 13f79535-47bb-0310-9956-ffa450edef68
2014-02-08 01:55:33 +00:00
Jian He
332c2704d4
YARN-1293. Fixed TestContainerLaunch#testInvalidEnvSyntaxDiagnostics failure caused by non-English system locale. Contributed by Tsuyoshi OZAWA.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1556318 13f79535-47bb-0310-9956-ffa450edef68
2014-01-07 19:00:21 +00:00
Jason Darrell Lowe
785c12056c
YARN-1409. NonAggregatingLogHandler can throw RejectedExecutionException. Contributed by Tsuyoshi OZAWA
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1556282 13f79535-47bb-0310-9956-ffa450edef68
2014-01-07 17:11:09 +00:00
Bikas Saha
d1fe9e4142
YARN-1053. Diagnostic message from ContainerExitEvent is ignored in ContainerImpl (Omkar Vinit Joshi via bikas)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1543973 13f79535-47bb-0310-9956-ffa450edef68
2013-11-20 22:25:59 +00:00
Sanford Ryza
1a76ccbbc4
YARN-1401. With zero sleep-delay-before-sigkill.ms, no signal is ever sent (Gera Shegalov via Sandy Ryza)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1542038 13f79535-47bb-0310-9956-ffa450edef68
2013-11-14 19:52:14 +00:00
Jonathan Turner Eagles
656e859f5c
YARN-1386. NodeManager mistakenly loses resources and relocalizes them (Jason Lowe via jeagles)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1541375 13f79535-47bb-0310-9956-ffa450edef68
2013-11-13 03:17:23 +00:00
Chris Nauroth
218ef59d89
YARN-1357. TestContainerLaunch.testContainerEnvVariables fails on Windows. Contributed by Chuan Liu.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1537293 13f79535-47bb-0310-9956-ffa450edef68
2013-10-30 20:49:12 +00:00
Sanford Ryza
5a87d89366
Add missing file TestCgroupsLCEResourcesHandler for YARN-1284.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1530493 13f79535-47bb-0310-9956-ffa450edef68
2013-10-09 05:07:41 +00:00
Vinod Kumar Vavilapalli
44f7ee7192
YARN-1278. Fixed NodeManager to not delete local resources for apps on resync command from RM - a bug caused by YARN-1149. Contributed by Hitesh Shah.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1529657 13f79535-47bb-0310-9956-ffa450edef68
2013-10-06 18:32:16 +00:00
Vinod Kumar Vavilapalli
0a887a0910
YARN-1254. Fixed NodeManager to not pollute container's credentials. Contributed by Omkar Vinit Joshi.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1529382 13f79535-47bb-0310-9956-ffa450edef68
2013-10-05 04:25:44 +00:00
Alejandro Abdelnur
80c5bffc4b
YARN-1253. Changes to LinuxContainerExecutor to run containers as a single dedicated user in non-secure mode. (rvs via tucu)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1529325 13f79535-47bb-0310-9956-ffa450edef68
2013-10-04 21:59:43 +00:00
Hitesh Shah
d3841bd499
YARN-1149. NM throws InvalidStateTransitonException: Invalid event: APPLICATION_LOG_HANDLING_FINISHED at RUNNING. Contributed by Xuan Gong.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1529043 13f79535-47bb-0310-9956-ffa450edef68
2013-10-04 00:43:26 +00:00
Bikas Saha
8ebf37f369
YARN-1256. NM silently ignores non-existent service in StartContainerRequest (Xuan Gong via bikas)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1529039 13f79535-47bb-0310-9956-ffa450edef68
2013-10-04 00:23:35 +00:00
Vinod Kumar Vavilapalli
74d20250ff
YARN-1070. Fixed race conditions in NodeManager during container-kill. Contributed by Zhijie Shen.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1527827 13f79535-47bb-0310-9956-ffa450edef68
2013-10-01 00:18:09 +00:00
Jonathan Turner Eagles
5a15c392a1
YARN-819. ResourceManager and NodeManager should check for a minimum allowed version (Robert Parker via jeagles)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1526660 13f79535-47bb-0310-9956-ffa450edef68
2013-09-26 19:55:26 +00:00
Siddharth Seth
10a4289ebf
YARN-1229. Define constraints on Auxiliary Service names. Change ShuffleHandler service name from mapreduce.shuffle to mapreduce_shuffle. Contributed by Xuan Gong.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1526065 13f79535-47bb-0310-9956-ffa450edef68
2013-09-25 00:35:03 +00:00
Jason Darrell Lowe
baea09be61
YARN-1189. NMTokenSecretManagerInNM is not being told when applications have finished. Contributed by Omkar Vinit Joshi
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1523158 13f79535-47bb-0310-9956-ffa450edef68
2013-09-14 00:18:29 +00:00
Jason Darrell Lowe
2950c9ff6e
YARN-1194. TestContainerLogsPage fails with native builds. Contributed by Roman Shaposhnik
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1522968 13f79535-47bb-0310-9956-ffa450edef68
2013-09-13 15:14:07 +00:00
Chris Nauroth
f152a7e788
YARN-1078. TestNodeManagerResync, TestNodeManagerShutdown, and TestNodeStatusUpdater fail on Windows. Contributed by Chuan Liu.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1522644 13f79535-47bb-0310-9956-ffa450edef68
2013-09-12 15:58:34 +00:00
Vinod Kumar Vavilapalli
0f91d8485a
YARN-910. Augmented auxiliary services to listen for container starts and completions in addition to application events. Contributed by Alejandro Abdelnur.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1521298 13f79535-47bb-0310-9956-ffa450edef68
2013-09-09 21:48:09 +00:00
Bikas Saha
49afc64cd4
YARN-1065. NM should provide AuxillaryService data to the container (Xuan Gong via bikas)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1520135 13f79535-47bb-0310-9956-ffa450edef68
2013-09-04 20:42:11 +00:00
Vinod Kumar Vavilapalli
ff69557040
YARN-1077. Fixed TestContainerLaunch test failure on Windows. Contributed by Chuan Liu.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1519333 13f79535-47bb-0310-9956-ffa450edef68
2013-09-02 03:10:39 +00:00
Vinod Kumar Vavilapalli
a759abcd89
YARN-649. Added a new NM web-service to serve container logs in plain text over HTTP. Contributed by Sandy Ryza.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1519326 13f79535-47bb-0310-9956-ffa450edef68
2013-09-02 00:09:17 +00:00
Vinod Kumar Vavilapalli
4047ad72cf
YARN-602. Fixed NodeManager to not let users override some mandatory environmental variables. Contributed by Kenji Kikushima.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1518077 13f79535-47bb-0310-9956-ffa450edef68
2013-08-28 05:13:21 +00:00
Arun Murthy
472d8bc983
YARN-1056. Remove dual use of string 'resourcemanager' in yarn.resourcemanager.connect.{max.wait.secs|retry_interval.secs}. Contributed by Karthik Kambatla.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1514135 13f79535-47bb-0310-9956-ffa450edef68
2013-08-15 02:35:48 +00:00
Vinod Kumar Vavilapalli
ac933234ac
YARN-906. Fixed a bug in NodeManager where cancelling ContainerLaunch at KILLING state causes that the container to hang. Contributed by Zhijie Shen.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1509924 13f79535-47bb-0310-9956-ffa450edef68
2013-08-03 00:49:09 +00:00
Vinod Kumar Vavilapalli
5b7889f9a7
YARN-903. Changed ContainerManager to suppress unnecessary warnings when stopping already stopped containers. Contributed by Omkar Vinit Joshi.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1509560 13f79535-47bb-0310-9956-ffa450edef68
2013-08-02 06:54:22 +00:00
Vinod Kumar Vavilapalli
813efd25a1
YARN-966. Fixed ContainerLaunch to not fail quietly when there are no localized resources due to some other failure. Contributed by Zhijie Shen.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1508688 13f79535-47bb-0310-9956-ffa450edef68
2013-07-30 23:58:17 +00:00
Vinod Kumar Vavilapalli
3ca892dc0a
Reverting YARN-245 to fix a critical bug.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1508277 13f79535-47bb-0310-9956-ffa450edef68
2013-07-30 03:06:54 +00:00
Sanford Ryza
9eb06b31d6
YARN-932. TestResourceLocalizationService.testLocalizationInit can fail on JDK7. (Karthik Kambatla via Sandy Ryza)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1508209 13f79535-47bb-0310-9956-ffa450edef68
2013-07-29 22:06:28 +00:00
Vinod Kumar Vavilapalli
412e19f8e3
YARN-245. Fixed NodeManager to handle duplicate responses from ResourceManager. Contributed by Mayank Bansal.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1508157 13f79535-47bb-0310-9956-ffa450edef68
2013-07-29 18:15:05 +00:00
Vinod Kumar Vavilapalli
db1e0c1eef
YARN-688. Fixed NodeManager to properly cleanup containers when it is shut down. Contributed by Jian He.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1506814 13f79535-47bb-0310-9956-ffa450edef68
2013-07-25 04:14:25 +00:00
Vinod Kumar Vavilapalli
3ca3137179
YARN-926. Modified ContainerManagerProtcol APIs to take in requests for multiple containers. Contributed by Jian He.
...
MAPREDUCE-5412. Update MR app to use multiple containers API of ContainerManager after YARN-926. Contributed by Jian He.
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1506391 13f79535-47bb-0310-9956-ffa450edef68
2013-07-24 03:41:03 +00:00
Vinod Kumar Vavilapalli
7a29bccd7a
YARN-814. Improving diagnostics when containers fail during launch due to various reasons like invalid env etc. Contributed by Jian He.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1504732 13f79535-47bb-0310-9956-ffa450edef68
2013-07-19 00:28:18 +00:00
Vinod Kumar Vavilapalli
c6facfb10d
YARN-912. Move client facing exceptions to yarn-api module. Contributed by Mayank Bansal.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1504032 13f79535-47bb-0310-9956-ffa450edef68
2013-07-17 07:31:33 +00:00
Vinod Kumar Vavilapalli
8b9c1e68ab
YARN-62. Modified NodeManagers to avoid AMs from abusing container tokens for repetitive container launches. Contributed by Omkar Vinit Joshi.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1503986 13f79535-47bb-0310-9956-ffa450edef68
2013-07-17 04:24:44 +00:00
Vinod Kumar Vavilapalli
6d12709b19
YARN-820. Fixed an invalid state transition in NodeManager caused by failing resource localization. Contributed by Mayank Bansal.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1503947 13f79535-47bb-0310-9956-ffa450edef68
2013-07-16 23:44:56 +00:00
Vinod Kumar Vavilapalli
af0d2fc3e3
YARN-661. Fixed NM to cleanup users' local directories correctly when starting up. Contributed by Omkar Vinit Joshi.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1503942 13f79535-47bb-0310-9956-ffa450edef68
2013-07-16 23:31:02 +00:00
Bikas Saha
33b5a81509
YARN-513. Create common proxy client for communicating with RM (Xuan Gong & Jian He via bikas)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1503933 13f79535-47bb-0310-9956-ffa450edef68
2013-07-16 22:47:40 +00:00
Vinod Kumar Vavilapalli
4466d8653f
YARN-523. Modified a test-case to validate container diagnostics on localization failures. Contributed by Jian He.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1503532 13f79535-47bb-0310-9956-ffa450edef68
2013-07-16 00:57:41 +00:00
Chris Nauroth
1db4319072
YARN-909. Disable TestLinuxContainerExecutorWithMocks on Windows. Contributed by Chuan Liu.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1503357 13f79535-47bb-0310-9956-ffa450edef68
2013-07-15 17:27:31 +00:00
Chris Nauroth
31fff48ec4
YARN-894. NodeHealthScriptRunner timeout checking is inaccurate on Windows. Contributed by Chuan Liu.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1501016 13f79535-47bb-0310-9956-ffa450edef68
2013-07-08 23:40:43 +00:00
Hitesh Shah
575495b5b1
YARN-861. TestContainerManager is failing. Contributed by Vinod Kumar Vavilapalli.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1495160 13f79535-47bb-0310-9956-ffa450edef68
2013-06-20 20:18:04 +00:00
Siddharth Seth
ab647cfcdb
YARN-848. Fix NodeManager to register with RM using the fully qualified hostname. Contributed by Hitesh Shah.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1494385 13f79535-47bb-0310-9956-ffa450edef68
2013-06-18 23:49:23 +00:00
Vinod Kumar Vavilapalli
243bcd367f
YARN-694. Starting to use NMTokens to authenticate all communication with NodeManagers. Contributed by Omkar Vinit Joshi.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1494369 13f79535-47bb-0310-9956-ffa450edef68
2013-06-18 23:19:49 +00:00
Vinod Kumar Vavilapalli
f4d80e91ae
YARN-841. Move Auxiliary service to yarn-api, annotate and document it. Contributed by Vinod Kumar Vavilapalli.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1494031 13f79535-47bb-0310-9956-ffa450edef68
2013-06-18 06:20:37 +00:00
Chris Nauroth
b02dc5b464
YARN-839. TestContainerLaunch.testContainerEnvVariables fails on Windows. Contributed by Chuan Liu.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1493937 13f79535-47bb-0310-9956-ffa450edef68
2013-06-17 21:05:00 +00:00
Vinod Kumar Vavilapalli
b9efe6bd4a
HADOOP-9649. Promoted YARN service life-cycle libraries into Hadoop Common for usage across all Hadoop projects. Contributed by Zhijie Shen.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1493650 13f79535-47bb-0310-9956-ffa450edef68
2013-06-17 06:39:33 +00:00
Arun Murthy
77e134d13f
YARN-387. Renamed YARN protocols for consistency. Contributed by Vinod K V.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1493623 13f79535-47bb-0310-9956-ffa450edef68
2013-06-17 02:27:18 +00:00
Vinod Kumar Vavilapalli
29a3708724
YARN-781. Exposing LOGDIR in all containers' environment which should be used by containers for logging purposes. Contributed by Jian He.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1493428 13f79535-47bb-0310-9956-ffa450edef68
2013-06-15 23:09:24 +00:00
Vinod Kumar Vavilapalli
e9f753a11e
YARN-792. Moved NodeHealthStatus from yarn.api.record to yarn.server.api.record. Contributed by Jian He.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1492943 13f79535-47bb-0310-9956-ffa450edef68
2013-06-14 03:17:03 +00:00
Vinod Kumar Vavilapalli
f0eb4bc342
YARN-692. Creating NMToken master key on RM and sharing it with NM as a part of RM-NM heartbeat. Contributed by Omkar Vinit Joshi.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1492907 13f79535-47bb-0310-9956-ffa450edef68
2013-06-14 00:06:42 +00:00
Vinod Kumar Vavilapalli
c0cd68d8fb
YARN-773. Moved YarnRuntimeException from package api.yarn to api.yarn.exceptions. Contributed by Jian He.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1492903 13f79535-47bb-0310-9956-ffa450edef68
2013-06-13 23:25:14 +00:00
Vinod Kumar Vavilapalli
0928502029
YARN-530. Defined Service model strictly, implemented AbstractService for robust subclassing and migrated yarn-common services. Contributed by Steve Loughran.
...
YARN-117. Migrated rest of YARN to the new service model. Contributed by Steve Louhran.
MAPREDUCE-5298. Moved MapReduce services to YARN-530 stricter lifecycle. Contributed by Steve Loughran.
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1492718 13f79535-47bb-0310-9956-ffa450edef68
2013-06-13 15:54:38 +00:00
Siddharth Seth
c6c41abf68
YARN-737. Throw some specific exceptions directly instead of wrapping them in YarnException. Contributed by Jian He.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1491896 13f79535-47bb-0310-9956-ffa450edef68
2013-06-11 18:47:16 +00:00
Kihwal Lee
78063b3a76
YARN-742. Log aggregation causes a lot of redundant setPermission calls. Contributed by Jason Lowe.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1489596 13f79535-47bb-0310-9956-ffa450edef68
2013-06-04 20:49:34 +00:00
Vinod Kumar Vavilapalli
a83fb61ac0
YARN-635. Renamed YarnRemoteException to YarnException. Contributed by Siddharth Seth.
...
MAPREDUCE-5301. Updated MR code to work with YARN-635 changes of renaming YarnRemoteException to YarnException. Contributed by Siddharth Seth
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1489283 13f79535-47bb-0310-9956-ffa450edef68
2013-06-04 04:05:50 +00:00
Vinod Kumar Vavilapalli
d33534c4fb
YARN-748. Moved BuilderUtils from yarn-common to yarn-server-common for eventual retirement. Contributed by Jian He.
...
MAPREDUCE-5297. Updated MR App since BuilderUtils is no longer public after YARN-748. Contributed by Jian He.
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1489257 13f79535-47bb-0310-9956-ffa450edef68
2013-06-04 00:34:18 +00:00
Vinod Kumar Vavilapalli
a2c4233004
YARN-717. Put object creation factories for Token in the class itself and remove useless derivations for specific tokens. Contributed by Jian He.
...
MAPREDUCE-5289. Updated MR App to use Token directly after YARN-717. Contributed by Jian He.
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1488616 13f79535-47bb-0310-9956-ffa450edef68
2013-06-01 21:43:17 +00:00
Siddharth Seth
39f019f413
YARN-735. Make ApplicationAttemptId, ContainerId and NodeId immutable. Contributed by Jian He.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1488439 13f79535-47bb-0310-9956-ffa450edef68
2013-06-01 00:14:31 +00:00
Siddharth Seth
2692675fc3
YARN-684. ContainerManager.startContainer should use ContainerTokenIdentifier instead of the entire Container. Contributed by Vinod Kumar Vavilapalli.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1488085 13f79535-47bb-0310-9956-ffa450edef68
2013-05-31 04:13:16 +00:00
Vinod Kumar Vavilapalli
982753dc8e
YARN-716. Making ApplicationID immutable. Contributed by Siddharth Seth.
...
MAPREDUCE-5282. Updating MR App to use immutable ApplicationID after YARN-716. Contributed by Siddharth Seth.
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1487994 13f79535-47bb-0310-9956-ffa450edef68
2013-05-30 20:18:39 +00:00
Siddharth Seth
b16c5638b5
YARN-719. Move RMIdentifier from Container to ContainerTokenIdentifier. Contributed by Vinod Kumar Vavilapalli.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1487741 13f79535-47bb-0310-9956-ffa450edef68
2013-05-30 04:59:05 +00:00
Vinod Kumar Vavilapalli
5420f287cc
YARN-578. Fixed NM to use SecureIOUtils for reading and aggregating logs. Contributed by Omkar Vinit Joshi.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1487672 13f79535-47bb-0310-9956-ffa450edef68
2013-05-29 23:14:59 +00:00
Jason Darrell Lowe
31c96829c7
YARN-512. Log aggregation root directory check is more expensive than it needs to be. Contributed by Maysam Yabandeh
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1487498 13f79535-47bb-0310-9956-ffa450edef68
2013-05-29 14:25:04 +00:00
Vinod Kumar Vavilapalli
259edf8dca
YARN-571. Remove user from ContainerLaunchContext. Contributed by Omkar Vinit Joshi.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1485928 13f79535-47bb-0310-9956-ffa450edef68
2013-05-24 03:22:54 +00:00
Vinod Kumar Vavilapalli
1fac1ac98c
YARN-708. Moved RecordFactory classes to hadoop-yarn-api, and put some miscellaneous fixes to the interfaces. Contributed by Siddharth Seth.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1485478 13f79535-47bb-0310-9956-ffa450edef68
2013-05-22 22:51:04 +00:00
Vinod Kumar Vavilapalli
ca80246731
YARN-617. Made ContainerTokens to be used for validation at NodeManager also in unsecure mode to prevent AMs from faking resource requirements in unsecure mode. Contributed by Omkar Vinit Joshi.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1483667 13f79535-47bb-0310-9956-ffa450edef68
2013-05-17 06:36:33 +00:00
Vinod Kumar Vavilapalli
68148989bf
YARN-590. Added an optional mesage to be returned by ResourceMaanger when RM asks an RM to shutdown/resync etc so that NMs can log this message locally for better debuggability. Contributed by Mayank Bansal.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1481234 13f79535-47bb-0310-9956-ffa450edef68
2013-05-10 23:37:44 +00:00
Vinod Kumar Vavilapalli
a35c7fd80b
YARN-663. Changed ResourceTracker API and LocalizationProtocol API to throw YarnRemoteException and IOException. Contributed by Xuan Gong.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1481215 13f79535-47bb-0310-9956-ffa450edef68
2013-05-10 21:58:59 +00:00
Vinod Kumar Vavilapalli
2638bc67a4
YARN-634. Modified YarnRemoteException to be not backed by PB and introduced a separate SerializedException record. Contributed by Siddharth Seth.
...
MAPREDUCE-5239. Updated MR App to reflect YarnRemoteException changes after YARN-634. Contributed by Siddharth Seth.
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1481205 13f79535-47bb-0310-9956-ffa450edef68
2013-05-10 21:49:28 +00:00
Vinod Kumar Vavilapalli
9eb53b9c84
YARN-632. Changed ContainerManager api to throw IOException and YarnRemoteException. Contributed by Xuan Gong.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1479740 13f79535-47bb-0310-9956-ffa450edef68
2013-05-07 00:42:48 +00:00
Vinod Kumar Vavilapalli
92b7165a71
YARN-629. Make YarnRemoteException not be rooted at IOException. Contributed by Xuan Gong.
...
MAPREDUCE-5204. Handling YarnRemoteException separately from IOException in MR app after YARN-629. Contributed by Xuan Gong.
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1479680 13f79535-47bb-0310-9956-ffa450edef68
2013-05-06 19:04:31 +00:00
Vinod Kumar Vavilapalli
46b2898a1a
YARN-618. Modified RM_INVALID_IDENTIFIER to be -1 instead of zero. Contributed by Jian He.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1478230 13f79535-47bb-0310-9956-ffa450edef68
2013-05-01 23:23:20 +00:00
Suresh Srinivas
27d4592771
YARN-506. Move to common utils FileUtil#setReadable/Writable/Executable and FileUtil#canRead/Write/Execute. Contributed by Ivan Mitic.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1477408 13f79535-47bb-0310-9956-ffa450edef68
2013-04-29 23:09:28 +00:00
Vinod Kumar Vavilapalli
635f96e74e
YARN-562. Missed files from previous commit.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1476038 13f79535-47bb-0310-9956-ffa450edef68
2013-04-26 04:42:34 +00:00
Vinod Kumar Vavilapalli
fbb55784d9
YARN-562. Modified NM to reject any containers allocated by a previous ResourceManager. Contributed by Jian He.
...
MAPREDUCE-5167. Update MR App after YARN-562 to use the new builder API for the container. Contributed by Jian He.
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1476034 13f79535-47bb-0310-9956-ffa450edef68
2013-04-26 03:50:20 +00:00
Vinod Kumar Vavilapalli
a2f97fd9cb
YARN-605. Fix failing unit test in TestNMWebServices when versionInfo has parantheses like when running on a git checkout. Contributed by Hitesh Shah.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1471608 13f79535-47bb-0310-9956-ffa450edef68
2013-04-24 19:42:47 +00:00
Vinod Kumar Vavilapalli
27e8c86999
YARN-561. Modified NodeManager to set key information into the environment of every container that it launches. Contributed by Xuan Gong.
...
MAPREDUCE-5175. Updated MR App to not set envs that will be set by NMs anyways after YARN-561. Contributed by Xuan Gong.
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1471156 13f79535-47bb-0310-9956-ffa450edef68
2013-04-23 21:39:51 +00:00
Vinod Kumar Vavilapalli
ffd7dbb34a
YARN-583. Moved application level local resources to be localized under the filecache sub-directory under application directory. Contributed by Omkar Vinit Joshi.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1470812 13f79535-47bb-0310-9956-ffa450edef68
2013-04-23 04:37:45 +00:00
Vinod Kumar Vavilapalli
c570309b07
YARN-547. Fixed race conditions in public and private resource localization which used to cause duplicate downloads. Contributed by Omkar Vinit Joshi.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1470076 13f79535-47bb-0310-9956-ffa450edef68
2013-04-19 22:35:43 +00:00
Vinod Kumar Vavilapalli
44bf8525a5
YARN-493. Fixed some shell related flaws in YARN on Windows. Contributed by Chris Nauroth.
...
HADOOP-9486. Promoted Windows and Shell related utils from YARN to Hadoop Common. Contributed by Chris Nauroth.
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1469667 13f79535-47bb-0310-9956-ffa450edef68
2013-04-19 02:14:58 +00:00
Vinod Kumar Vavilapalli
e4c55e17fe
YARN-486. Changed NM's startContainer API to accept Container record given by RM as a direct parameter instead of as part of the ContainerLaunchContext record. Contributed by Xuan Gong.
...
MAPREDUCE-5139. Update MR AM to use the modified startContainer API after YARN-486. Contributed by Xuan Gong.
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1467063 13f79535-47bb-0310-9956-ffa450edef68
2013-04-11 19:28:51 +00:00
Vinod Kumar Vavilapalli
4234bc87b3
YARN-539. Addressed memory leak of LocalResource objects NM when a resource localization fails. Contributed by Omkar Vinit Joshi.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1466756 13f79535-47bb-0310-9956-ffa450edef68
2013-04-11 02:08:11 +00:00
Vinod Kumar Vavilapalli
2e3b56f6e9
YARN-495. Changed NM reboot behaviour to be a simple resync - kill all containers and re-register with RM. Contributed by Jian He.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1466752 13f79535-47bb-0310-9956-ffa450edef68
2013-04-11 02:00:47 +00:00
Vinod Kumar Vavilapalli
3a54a5653b
YARN-112. Fixed a race condition during localization that fails containers. Contributed by Omkar Vinit Joshi.
...
MAPREDUCE-5138. Fix LocalDistributedCacheManager after YARN-112. Contributed by Omkar Vinit Joshi.
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1466196 13f79535-47bb-0310-9956-ffa450edef68
2013-04-09 19:56:10 +00:00
Vinod Kumar Vavilapalli
b96d18bd23
YARN-99. Modify private distributed cache to localize files such that no local directory hits unix file count limits and thus prevent job failures. Contributed by Omkar Vinit Joshi.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1465853 13f79535-47bb-0310-9956-ffa450edef68
2013-04-09 01:33:38 +00:00
Bikas Saha
16fedf5473
YARN-479. NM retry behavior for connection to RM should be similar for lost heartbeats (Jian He via bikas)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1465731 13f79535-47bb-0310-9956-ffa450edef68
2013-04-08 19:17:16 +00:00
Eli Collins
2d02b7741d
Revert YARN-516 per HADOOP-9357.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1464181 13f79535-47bb-0310-9956-ffa450edef68
2013-04-03 19:42:12 +00:00
Vinod Kumar Vavilapalli
3e9200ddde
YARN-101. Fix NodeManager heartbeat processing to not lose track of completed containers in case of dropped heartbeats. Contributed by Xuan Gong.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1464105 13f79535-47bb-0310-9956-ffa450edef68
2013-04-03 16:57:07 +00:00
Vinod Kumar Vavilapalli
e67e3ff05d
YARN-467. Modify public distributed cache to localize files such that no local directory hits unix file count limits and thus prevent job failures. Contributed by Omkar Vinit Joshi.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1463823 13f79535-47bb-0310-9956-ffa450edef68
2013-04-03 05:00:28 +00:00
Vinod Kumar Vavilapalli
17be263e9b
YARN-516. Fix failure in TestContainerLocalizer caused by HADOOP-9357. Contributed by Andrew Wang.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1463362 13f79535-47bb-0310-9956-ffa450edef68
2013-04-02 02:16:35 +00:00
Vinod Kumar Vavilapalli
7ca9fe73a0
YARN-309. Changed NodeManager to obtain heart-beat interval from the ResourceManager. Contributed by Xuan Gong.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1463346 13f79535-47bb-0310-9956-ffa450edef68
2013-04-02 01:04:07 +00:00
Alejandro Abdelnur
967b62f5d2
YARN-24. Nodemanager fails to start if log aggregation enabled and namenode unavailable. (sandyr via tucu)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1461891 13f79535-47bb-0310-9956-ffa450edef68
2013-03-27 23:14:05 +00:00
Siddharth Seth
66e90b205a
YARN-440. Flatten RegisterNodeManagerResponse. Contributed by Xuan Gong.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1461256 13f79535-47bb-0310-9956-ffa450edef68
2013-03-26 18:20:36 +00:00
Siddharth Seth
18e08a8f51
YARN-439. Flatten NodeHeartbeatResponse. Contributed by Xuan Gong.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1460811 13f79535-47bb-0310-9956-ffa450edef68
2013-03-25 18:28:50 +00:00
Siddharth Seth
6a482a88b8
YARN-71. Fix the NodeManager to clean up local-dirs on restart. Contributed by Xuan Gong.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1460808 13f79535-47bb-0310-9956-ffa450edef68
2013-03-25 18:23:59 +00:00
Hitesh Shah
b15b44e722
YARN-470. Support a way to disable resource monitoring on the NodeManager. Contributed by Siddharth Seth.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1460001 13f79535-47bb-0310-9956-ffa450edef68
2013-03-22 21:28:31 +00:00
Hitesh Shah
0cda8267d5
YARN-491. TestContainerLogsPage fails on Windows. Contributed by Chris Nauroth.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1459526 13f79535-47bb-0310-9956-ffa450edef68
2013-03-21 21:04:59 +00:00
Jason Darrell Lowe
67699c2d18
YARN-200. yarn log does not output all needed information, and is in a binary format. Contributed by Ravi Prakash
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1458466 13f79535-47bb-0310-9956-ffa450edef68
2013-03-19 19:29:15 +00:00
Hitesh Shah
2ba66a54f5
YARN-196. Nodemanager should be more robust in handling connection failure to ResourceManager when a cluster is started. Contributed by Xuan Gong.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1457038 13f79535-47bb-0310-9956-ffa450edef68
2013-03-15 17:58:58 +00:00
Thomas Graves
5f2c518c95
YARN-443. allow OS scheduling priority of NM to be different than the containers it launches (tgraves)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1454411 13f79535-47bb-0310-9956-ffa450edef68
2013-03-08 14:46:26 +00:00
Suresh Srinivas
638801cce1
HADOOP-8952. Enhancements to support Hadoop on Windows Server and Windows Azure environments. Contributed by Ivan Mitic, Chuan Liu, Ramya Sunil, Bikas Saha, Kanna Karanam, John Gordon, Brandon Li, Chris Nauroth, David Lao, Sumadhur Reddy Bolli, Arpit Agarwal, Ahmed El Baz, Mike Liddell, Jing Zhao, Thejas Nair, Steve Maine, Ganeshan Iyer, Raja Aluri, Giridharan Kesavan, Ramya Bharathi Nimmagadda.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1453486 13f79535-47bb-0310-9956-ffa450edef68
2013-03-06 19:15:18 +00:00
Jason Darrell Lowe
5889f54ad1
YARN-345. Many InvalidStateTransitonException errors for ApplicationImpl in Node Manager. Contributed by Robert Parker
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1452548 13f79535-47bb-0310-9956-ffa450edef68
2013-03-04 21:47:39 +00:00
Robert Joseph Evans
0057600a82
YARN-426. Failure to download a public resource prevents further downloads (Jason Lowe via bobby)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1450807 13f79535-47bb-0310-9956-ffa450edef68
2013-02-27 15:30:10 +00:00
Tsz-wo Sze
a63d50f79a
YARN-377. Use the new StringUtils methods added by HADOOP-9252 and fix TestContainersMonitor. Contributed by Chris Nauroth
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1443796 13f79535-47bb-0310-9956-ffa450edef68
2013-02-07 23:53:49 +00:00
Hitesh Shah
d7f9f9d177
YARN-330. Fix flakey test: TestNodeManagerShutdown#testKillContainersOnShutdown. Contributed by Sandy Ryza
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1433138 13f79535-47bb-0310-9956-ffa450edef68
2013-01-14 21:27:39 +00:00
Thomas Graves
ce9b0a05b0
YARN-334. Maven RAT plugin is not checking all source files (tgraves)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1432931 13f79535-47bb-0310-9956-ffa450edef68
2013-01-14 14:34:51 +00:00
Arun Murthy
4539263971
YARN-2. Enhanced CapacityScheduler to account for CPU alongwith memory for multi-dimensional resource scheduling. Contributed by Arun C. Murthy.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1430682 13f79535-47bb-0310-9956-ffa450edef68
2013-01-09 05:08:23 +00:00
Vinod Kumar Vavilapalli
2cd41855d5
YARN-170. Change NodeManager stop to be reentrant. Contributed by Sandy Ryza.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1429796 13f79535-47bb-0310-9956-ffa450edef68
2013-01-07 13:43:52 +00:00
Jason Darrell Lowe
16cf201042
YARN-293. Node Manager leaks LocalizerRunner object for every Container. Contributed by Robert Joseph Evans
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1428095 13f79535-47bb-0310-9956-ffa450edef68
2013-01-02 22:41:03 +00:00
Luke Lu
1a49c85438
YARN-223. Update process tree instead of getting new process trees. (Radim Kolar via llu)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1424244 13f79535-47bb-0310-9956-ffa450edef68
2012-12-20 00:20:53 +00:00
Alejandro Abdelnur
5032a694ed
YARN-3. Add support for CPU isolation/monitoring of containers. (adferguson via tucu)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1423706 13f79535-47bb-0310-9956-ffa450edef68
2012-12-18 22:58:32 +00:00
Thomas White
235749a8ab
YARN-72. NM should handle cleaning up containers when it shuts down. Contributed by Sandy Ryza.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1416484 13f79535-47bb-0310-9956-ffa450edef68
2012-12-03 12:12:11 +00:00
Jason Darrell Lowe
6db6e00649
YARN-212. NM state machine ignores an APPLICATION_CONTAINER_FINISHED event when it shouldn't. Contributed by Nathan Roberts
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1408812 13f79535-47bb-0310-9956-ffa450edef68
2012-11-13 15:59:13 +00:00
Robert Joseph Evans
8a1f333707
YARN-186. Coverage fixing LinuxContainerExecutor (Aleksey Gorshkov via bobby)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1407171 13f79535-47bb-0310-9956-ffa450edef68
2012-11-08 17:22:52 +00:00
Vinod Kumar Vavilapalli
a5ebccd7bc
YARN-30. Fixed tests verifying web-services to work on JDK7. Contributed by Thomas Graves.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1396391 13f79535-47bb-0310-9956-ffa450edef68
2012-10-09 22:54:11 +00:00
Siddharth Seth
1e30e49bf6
YARN-33. Change LocalDirsHandlerService to validate the configured local and log dirs. (Contributed by Mayank Bansal)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1395844 13f79535-47bb-0310-9956-ffa450edef68
2012-10-09 02:26:24 +00:00
Robert Joseph Evans
41ebf4fb24
YARN-106. Nodemanager needs to set permissions of local directories (jlowe via bobby)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1391649 13f79535-47bb-0310-9956-ffa450edef68
2012-09-28 20:50:44 +00:00