Haibo Chen
c5d256c760
YARN-6674 Add memory cgroup settings for opportunistic containers. (Miklos Szegedi via Haibo Chen)
2017-08-03 09:52:35 -07:00
Eric Payne
48899134d2
YARN-6846. Nodemanager can fail to fully delete application local directories when applications are killed. Contributed by Jason Lowe.
2017-08-02 10:59:33 -05:00
Subru Krishnan
70b1a757f1
YARN-6511. Federation: transparently spanning application across multiple sub-clusters. (Botong Huang via Subru).
...
(cherry picked from commit 8c988d235eaf0972783985b1ab24680d029aea79)
2017-08-01 17:28:26 -07:00
Subru Krishnan
bed1832c93
YARN-3666. Federation Intercepting and propagating AM- home RM communications. (Botong Huang via Subru).
...
(cherry picked from commit 2399eb8200609246cb623c74450ca4a2032063cc)
2017-08-01 17:28:25 -07:00
Subru Krishnan
859aa1f9d6
YARN-5531. UnmanagedAM pool manager for federating application across clusters. (Botong Huang via Subru).
...
(cherry picked from commit 73bb2102ce4b82b3a3bed91319f7c8f067ddc3e8)
2017-08-01 17:28:24 -07:00
Subru Krishnan
80e1904000
YARN-5411. Create a proxy chain for ApplicationClientProtocol in the Router. (Giovanni Matteo Fumarola via Subru).
...
(cherry picked from commit 4846069061b6baa06da3b524b9e36567dd368388)
2017-08-01 17:28:24 -07:00
Akira Ajisaka
b38a1eea8e
HADOOP-14245. Use Mockito.when instead of Mockito.stub. Contributed by Andras Bokor.
2017-08-01 15:15:43 +09:00
Arun Suresh
890e14c02a
YARN-6870. Fix floating point inaccuracies in resource availability check in AllocationBasedResourceUtilizationTracker. (Brook Zhou via asuresh)
2017-07-28 16:32:43 -07:00
Akira Ajisaka
38c6fa5c7a
HADOOP-11875. [JDK9] Adding a second copy of Hamlet without _ as a one-character identifier.
2017-07-28 09:10:52 +09:00
Jian He
ac9489f7fc
YARN-6804. Allow custom hostname for docker containers in native services. Contributed by Billie Rinaldi
2017-07-25 09:57:14 -07:00
Brahma Reddy Battula
2054324d47
Revert "YARN-6804. [YARN core changes] Allow custom hostname for docker containers in native services. Contributed by Billie Rinaldi"
...
This reverts commit 4a771d9010
.
2017-07-23 12:56:18 +08:00
Jian He
4a771d9010
YARN-6804. [YARN core changes] Allow custom hostname for docker containers in native services. Contributed by Billie Rinaldi
2017-07-21 11:31:33 -07:00
Jason Lowe
c8df3668ec
YARN-6837. Null LocalResource visibility or resource type can crash the nodemanager. Contributed by Jinjiang Ling
2017-07-20 11:03:04 -05:00
Ray Chiang
f5f14a2ad6
YARN-6798. Fix NM startup failure with old state store due to version mismatch. (Botong Huang via rchiang)
2017-07-18 12:35:08 -07:00
Arun Suresh
5b007921cd
YARN-6706. Refactor ContainerScheduler to make oversubscription change easier. (Haibo Chen via asuresh)
2017-07-17 14:11:14 -07:00
Jason Lowe
ebc048cc05
YARN-6805. NPE in LinuxContainerExecutor due to null PrivilegedOperationException exit code. Contributed by Jason Lowe
2017-07-13 17:44:47 -05:00
Jason Lowe
0ffca5d347
Revert "YARN-6805. NPE in LinuxContainerExecutor due to null PrivilegedOperationException exit code. Contributed by Jason Lowe"
...
This reverts commit f76f5c0919
.
2017-07-13 17:42:38 -05:00
Jason Lowe
f76f5c0919
YARN-6805. NPE in LinuxContainerExecutor due to null PrivilegedOperationException exit code. Contributed by Jason Lowe
2017-07-13 17:38:17 -05:00
Jason Lowe
7576a688ea
YARN-6708. Nodemanager container crash after ext3 folder limit. Contributed by Bibin A Chundatt
2017-07-06 09:40:09 -05:00
Andrew Wang
af2773f609
Updating version for 3.0.0-beta1 development
2017-06-29 17:57:40 -07:00
Arun Suresh
49aa60e50d
YARN-6127. Add support for work preserving NM restart when AMRMProxy is enabled. (Botong Huang via asuresh).
2017-06-22 11:27:13 -07:00
Daniel Templeton
a062374c39
YARN-6679. Reduce Resource instance overhead via non-PBImpl
...
(Contributed by Daryn Sharp via Daniel Templeton)
2017-06-08 10:08:08 -07:00
Daniel Templeton
73ecb19312
YARN-6208. Improve the log when FinishAppEvent sent to the NodeManager which didn't run the application
...
(Contributed by Akira Ajisaka via Daniel Templeton)
2017-06-02 08:50:19 -07:00
Varun Vasudev
547f18cb96
YARN-6366. Refactor the NodeManager DeletionService to support additional DeletionTask types. Contributed by Shane Kumpf.
2017-05-31 16:15:35 +05:30
Andrew Wang
16ad896d5c
Update maven version for 3.0.0-alpha4 development
2017-05-26 14:09:44 -07:00
Daniel Templeton
d81372dfad
YARN-6646. Modifier 'static' is redundant for inner enums
...
(Contributed by ZhangBing Lin via Daniel Templeton)
2017-05-26 12:05:48 -07:00
Jason Lowe
aea42930bb
YARN-6641. Non-public resource localization on a bad disk causes subsequent containers failure. Contributed by Kuhu Shukla
2017-05-26 09:37:56 -05:00
Haibo Chen
47474fffac
YARN-6555. Store application flow context in NM state store for work-preserving restart. (Rohith Sharma K S via Haibo Chen)
2017-05-25 21:15:27 -07:00
Varun Vasudev
1a56a3db59
Addendum patch to fix Docker sanitization.
2017-05-25 14:53:57 +05:30
Akira Ajisaka
bc28da65fb
YARN-6141. ppc64le on Linux doesn't trigger __linux get_executable codepath. Contributed by Sonia Garudi and Ayappan.
2017-05-25 17:06:26 +09:00
Eric Payne
19482e0d09
YARN-6618. TestNMLeveldbStateStoreService#testCompactionCycle can fail if compaction occurs more than once. Contributed by Jason Lowe.
2017-05-19 11:24:54 -05:00
Varun Vasudev
b46cd315f7
Sanitize arguments before launching Docker containers.
2017-05-18 10:37:16 +05:30
Varun Vasudev
e120ee865a
Validate docker image name before launching container.
2017-05-18 10:29:34 +05:30
Chris Douglas
b23fcc86c6
YARN-6577. Remove unused ContainerLocalization classes. Contributed by ZhangBing Lin
2017-05-17 20:40:06 -07:00
Robert Kanter
18c494a00c
YARN-6447. Provide container sandbox policies for groups (gphillips via rkanter)
2017-05-16 18:02:39 -07:00
Jian He
8236130b2c
YARN-6306. NMClient API change for container upgrade. Contributed by Arun Suresh
2017-05-16 10:48:46 -07:00
Daniel Templeton
74a61438ca
YARN-6475. Fix some long function checkstyle issues
...
(Contributed by Soumabrata Chakraborty via Daniel Templeton)
2017-05-10 10:46:50 -07:00
Daniel Templeton
a2f680493f
YARN-5301. NM mount cpu cgroups failed on some systems
...
(Contributed by Miklos Szegedi via Daniel Templeton)
2017-05-09 12:05:46 -07:00
Subru Krishnan
749e5c09b9
YARN-6281. Cleanup when AMRMProxy fails to initialize a new interceptor chain. (Botong Huang via Subru)
...
(cherry picked from commit 57a9afbd45b7ef8e6021cc58f96bd0074bf1389d)
2017-05-08 16:55:47 -07:00
Subru Krishnan
cd9ff27ffc
YARN-6234. Support multiple attempts on the node when AMRMProxy is enabled. (Giovanni Matteo Fumarola via Subru).
2017-05-08 16:41:30 -07:00
Jason Lowe
424887ecb7
YARN-3839. Quit throwing NMNotYetReadyException. Contributed by Manikandan R
2017-05-08 17:14:37 -05:00
Subru Krishnan
d6eed5acca
YARN-6565. Fix memory leak and finish app trigger in AMRMProxy. (Botong Huang via Subru).
2017-05-05 16:27:49 -07:00
Sidharta S
fd5cb2c946
YARN-6374. Improve test coverage and add utility classes for common Docker operations. Contributed by Shane Kumpf
2017-05-03 14:49:30 -07:00
Robert Kanter
68e45f554b
YARN-6472. Improve Java sandbox regex (gphillips via rkanter)
2017-04-28 11:01:50 -07:00
Haibo Chen
8ac50e1322
YARN-6500. Do not mount inaccessible cgroups directories in CgroupsLCEResourcesHandler. (Miklos Szegedi via Haibo Chen)
2017-04-24 11:37:52 -07:00
Daniel Templeton
46940d92e2
YARN-6302. Fail the node if Linux Container Executor is not configured properly
...
(Contributed by Miklos Szegedi via Daniel Templeton)
2017-04-19 12:23:49 -07:00
Daniel Templeton
dd43b895c2
YARN-6202. Configuration item Dispatcher.DISPATCHER_EXIT_ON_ERROR_KEY is disregarded
...
(Contributed by Yufei Gu via Daniel Templeton)
2017-04-19 11:44:55 -07:00
Daniel Templeton
41ac190dba
YARN-6438. Code can be improved in ContainersMonitorImpl.java
...
(Contributed by Miklos Szegedi via Daniel Templeton)
2017-04-19 10:53:08 -07:00
Karthik Kambatla
8a1d7480f7
YARN-6433. Only accessible cgroup mount directories should be selected for a controller. (Miklos Szegedi via kasha)
2017-04-14 15:07:14 -07:00
Eric Payne
a16ab2be91
YARN-6450. TestContainerManagerWithLCE requires override for each new test added to ContainerManagerTest? Contributed by Jason Lowe.
2017-04-12 10:20:08 -05:00
Daniel Templeton
e9ac61cc0e
YARN-6372. Add default value for NM disk validator (Contributed by Yufei Gu via Daniel Templeton)
2017-04-10 14:56:42 -07:00
Robert Kanter
63f7322522
YARN-6368. Decommissioning an NM results in a -1 exit code (miklos.szegedi@cloudera.com via rkanter)
2017-04-07 14:28:07 -07:00
Ming Ma
0116c3c957
YARN-5797. Add metrics to the node manager for cleaning the PUBLIC and PRIVATE caches. (Chris Trezzo via mingma)
2017-04-06 16:54:43 -07:00
Jason Lowe
1b081ca27e
YARN-6288. Exceptions during aggregated log writes are mishandled. Contributed by Akira Ajisaka
2017-04-06 16:24:36 -05:00
Jason Lowe
e8071aa249
YARN-6403. Invalid local resource request can raise NPE and make NM exit. Contributed by Tao Yang
2017-04-05 12:30:45 -05:00
Ming Ma
2d5c09b848
YARN-6004. Refactor TestResourceLocalizationService#testDownloadingResourcesOnContainer so that it is less than 150 lines. (Chris Trezzo via mingma)
2017-04-04 17:56:21 -07:00
Chris Douglas
6eba79232f
HADOOP-14271. Correct spelling of 'occurred' and variants. Contributed by Yeliang Cang
2017-04-03 20:13:14 -07:00
Varun Saxena
a4b5aa8493
YARN-6377. NMTimelinePublisher#serviceStop does not stop timeline clients (Haibo Chen via Varun Saxena)
2017-04-02 04:54:12 +05:30
Akira Ajisaka
82fb9ce8df
YARN-6329. Remove unnecessary TODO comment from AppLogAggregatorImpl.java. Contributed by victor bertschinger.
2017-03-29 10:36:24 +09:00
Ravi Prakash
d4f73e7e27
HDFS-11574. Spelling mistakes in the Java source. Contributed by Hu Xiaodong.
2017-03-24 09:38:17 -07:00
Jason Lowe
4a8e304502
YARN-6217. TestLocalCacheDirectoryManager test timeout is too aggressive. Contributed by Miklos Szegedi
2017-03-17 14:06:35 -05:00
Jason Lowe
7114baddb6
YARN-4051. ContainerKillEvent lost when container is still recovering and application finishes. Contributed by sandflee
2017-03-16 09:30:10 -05:00
Junping Du
34424e98a6
YARN-6314. Potential infinite redirection on YARN log redirection web service. Contributed by Xuan Gong.
...
(cherry picked from commit 5a9dda796f0e73060ada794ad5752cc6a237ab2e)
2017-03-14 02:58:07 -07:00
Sidharta S
e96a0b8c92
YARN-5669. Add support for docker pull command (Contribtued by luhuichun)
2017-03-09 16:22:19 -08:00
Robert Kanter
287ba4ffa6
YARN-6297. TestAppLogAggregatorImp.verifyFilesUploaded() should check # of filed uploaded with that of files expected (haibochen via rkanter)
2017-03-08 10:45:33 -08:00
Robert Kanter
6f6dfe0202
YARN-5280. Allow YARN containers to run with Java Security Manager (gphillips via rkanter)
2017-03-01 14:53:47 -08:00
Sangjin Lee
4fa1afdb88
YARN-4675. Reorganize TimelineClient and TimelineClientImpl into separate classes for ATSv1.x and ATSv2. Contributed by Naganarasimha G R.
2017-02-16 11:41:04 -08:00
Junping Du
ce2d5bfa5f
YARN-6174. Log files pattern should be same for both running and finished container. Contributed by Xuan Gong.
2017-02-15 09:05:14 -08:00
Junping Du
464ff479ce
YARN-6113. Re-direct NM Web Service to get container logs for finished applications. Contributed by Xuan Gong.
2017-02-13 06:12:54 -08:00
Junping Du
9dbfab1284
YARN-6108. Improve AHS webservice to accept NM address as a parameter to get container logs. Contributed by Xuan Gong.
2017-02-07 02:13:14 -08:00
Junping Du
327c9980aa
YARN-6100. Improve YARN webservice to output aggregated container logs. Contributed by Xuan Gong.
2017-02-02 00:41:18 -08:00
Jason Lowe
9e19f758c1
YARN-5641. Localizer leaves behind tarballs after container is complete. Contributed by Eric Badger
2017-01-25 21:41:43 +00:00
Junping Du
8528d85a68
YARN-6099. Improve webservice to list aggregated log files. Contributed by Xuan Gong.
2017-01-24 15:26:53 -08:00
Jason Lowe
a33ce45e35
YARN-5547. NMLeveldbStateStore should be more tolerant of unknown keys. Contributed by Ajith S
2017-01-24 16:17:36 +00:00
Andrew Wang
5d8b80ea9b
Preparing for 3.0.0-alpha3 development
2017-01-19 15:50:07 -08:00
Subru Krishnan
4d1f3d9020
YARN-6016. Fix minor bugs in handling of local AMRMToken in AMRMProxy. (Botong Huang via Subru).
2017-01-17 14:48:03 -08:00
Daniel Templeton
e6f13fe5d1
YARN-5849. Automatically create YARN control group for pre-mounted cgroups (Contributed by Miklos Szegedi via Daniel Templeton)
2017-01-11 16:00:01 -08:00
Junping Du
4db119b7b5
YARN-6079. Fix simple spelling errors in yarn test code. Contributed by vijay.
2017-01-10 18:20:27 -08:00
Varun Saxena
f59e36b4ce
YARN-6068. Log aggregation get failed when NM restart even with recovery (Junping Du via Varun Saxena)
2017-01-09 11:17:19 +05:30
Naganarasimha
a59df15757
YARN-6015. AsyncDispatcher thread name can be set to improved debugging. Contributed by Ajith S.
2017-01-07 08:11:58 +05:30
Junping Du
8fadd69047
YARN-5923. Unable to access logs for a running application if YARN_ACL_ENABLE is enabled. Contributed by Xuan Gong.
2017-01-03 15:03:38 -08:00
Varun Vasudev
972da46cb4
YARN-5719. Enforce a C standard for native container-executor. Contributed by Chris Douglas.
2016-12-28 14:59:57 +05:30
Robert Kanter
9262797e86
YARN-5257. Fix unreleased resources and null dereferences (yufeigu via rkanter)
2016-12-27 14:14:08 -08:00
Arun Suresh
ac1e5d4f77
YARN-5938. Refactoring OpportunisticContainerAllocator to use SchedulerRequestKey instead of Priority and other misc fixes (asuresh)
2016-12-27 12:40:44 -08:00
Junping Du
736f54b727
YARN-4990. Re-direction of a particular log file within in a container in NM UI does not redirect properly. Contributed by Xuan Gong.
2016-12-21 14:14:42 -08:00
Sunil G
575773a357
YARN-5877. Allow all env's from yarn.nodemanager.env-whitelist to get overridden during launch. Contributed by Bibin A Chundatt.
2016-12-20 11:39:06 +05:30
Arun Suresh
2273a74c1f
YARN-5646. Add documentation and update config parameter names for scheduling of OPPORTUNISTIC containers. (Konstantinos Karanasos via asuresh)
2016-12-16 08:14:34 -08:00
Robert Kanter
4b149a1e77
YARN-4457. Cleanup unchecked types for EventHandler (templedf via rkanter)
2016-12-09 17:28:25 -08:00
Arun Suresh
b0aace21b1
YARN-5982. Simplify opportunistic container parameters and metrics. (Konstantinos Karanasos via asuresh)
2016-12-09 16:41:25 -08:00
Robert Kanter
72fe546841
YARN-5963. Spelling errors in logging and exceptions for node manager, client, web-proxy, common, and app history code (gsohn via rkanter)
2016-12-07 15:07:25 -08:00
Daniel Templeton
62b42ef5dd
YARN-5725. Test uncaught exception in TestContainersMonitorResourceChange.testContainersResourceChange
...
when setting IP and host (Contributed by Miklos Szegedi via Daniel Templeton)
2016-11-28 15:19:52 -08:00
Akira Ajisaka
209e805430
HADOOP-13506. Redundant groupid warning in child projects. Contributed by Kai Sasaki.
2016-11-28 14:34:57 +09:00
Naganarasimha
3541ed8068
YARN-4330. MiniYARNCluster is showing multiple Failed to instantiate default resource calculator warning messages. Contributed by Varun Saxena
2016-11-23 14:12:23 +05:30
Jason Lowe
009452bb6d
YARN-5859. TestResourceLocalizationService#testParallelDownloadAttemptsForPublicResource sometimes fails. Contributed by Eric Badger
2016-11-21 16:36:08 +00:00
Jason Lowe
59bfcbf357
YARN-5836. Malicious AM can kill containers of other apps running in any node its containers are running. Contributed by Botong Huang
2016-11-16 22:21:03 +00:00
Daniel Templeton
264ddb13ff
YARN-5736 Addendum. Fixes segfault due to unterminated string. (Contributed by Miklos Szegedi via Daniel Templeton)
2016-11-15 13:01:57 -08:00
Mingliang Liu
5af572b644
HADOOP-13427. Eliminate needless uses of FileSystem#{exists(), isFile(), isDirectory()}. Contributed by Steve Loughran and Mingliang Liu
2016-11-15 10:57:00 -08:00
Arun Suresh
3219b7b4ac
YARN-4597. Introduce ContainerScheduler and a SCHEDULED state to NodeManager container lifecycle. (asuresh)
2016-11-15 07:56:25 -08:00
Naganarasimha
7ffb9943b8
YARN-4355. NPE while processing localizer heartbeat. Contributed by Varun Saxena & Jonathan Hung.
2016-11-15 15:41:56 +05:30
Naganarasimha
43aef303bf
Reverted due to issue YARN-5765. Revert "YARN-5287. LinuxContainerExecutor fails to set proper permission. Contributed by Ying Zhang"
...
This reverts commit 131d58a24e
.
server-nodemanager/src/main/native/container-executor/test/test-
container-executor.c
2016-11-15 13:27:37 +05:30
Karthik Kambatla
3a98419532
YARN-5834. TestNodeStatusUpdater.testNMRMConnectionConf compares nodemanager wait time to the incorrect value. (Chang Li via kasha)
2016-11-10 14:08:51 -08:00
Naganarasimha
de3a5f8d08
YARN-5856. Unnecessary duplicate start container request sent to NM State store. Contributed by Varun Saxena.
2016-11-10 05:42:30 +05:30
Jian He
bcc15c6290
YARN-5611. Provide an API to update lifetime of an application. Contributed by Rohith Sharma K S
2016-11-09 16:08:05 -08:00
Robert Kanter
09f43fa9c0
YARN-5736. YARN container executor config does not handle white space (miklos.szegedi@cloudera.com via rkanter)
2016-11-09 13:34:40 +01:00
Arun Suresh
283fa33feb
YARN-5823. Update NMTokens in case of requests with only opportunistic containers. (Konstantinos Karanasos via asuresh)
2016-11-09 00:11:25 -08:00
Andrew Wang
e1c6ef2efa
HADOOP-13789. Hadoop Common includes generated test protos in both jar and test-jar. Contributed by Sean Busbey.
2016-11-08 16:33:55 -08:00
Jason Lowe
3f93ac0733
YARN-5356. NodeManager should communicate physical resource capability to ResourceManager. Contributed by Inigo Goiri
2016-11-08 22:01:26 +00:00
Arun Suresh
f38a6d03a1
YARN-5377. Fix TestQueuingContainerManager.testKillMultipleOpportunisticContainers. (Konstantinos Karanasos via asuresh)
2016-11-07 22:10:03 -08:00
Arun Suresh
0aafc122d4
YARN-2995. Enhance UI to show cluster resource utilization of various container Execution types. (Konstantinos Karanasos via asuresh)
2016-11-04 07:31:54 -07:00
Varun Vasudev
9ee0e3172e
YARN-5822. Log ContainerRuntime initialization error in LinuxContainerExecutor. Contributed by Sidharta Seethana.
2016-11-03 22:33:43 +05:30
Jason Lowe
76893a4100
YARN-5001. Aggregated Logs root directory is created with wrong group if nonexistent. Contributed by Haibo Chen
2016-11-01 20:23:24 +00:00
Varun Saxena
310aa4686e
YARN-5805. Add isDebugEnabled check for debug logs in nodemanager (Bibin A Chundatt via Varun Saxena)
2016-11-01 14:51:31 +05:30
Daniel Templeton
f3eb4c3c73
YARN-5793. Trim configuration values in DockerLinuxContainerRuntime (Contributed by Tianyin Xu via Daniel Templeton)
2016-10-31 13:51:07 -07:00
Arun Suresh
aa3cab1eb2
YARN-5799. Fix Opportunistic Allocation to set the correct value of Node Http Address. (asuresh)
2016-10-29 02:03:57 -07:00
Jason Lowe
1b79c417dc
YARN-5767. Fix the order that resources are cleaned up from the local Public/Private caches. Contributed by Chris Trezzo
2016-10-28 15:58:04 +00:00
Jason Lowe
7146359bfd
YARN-5027. NM should clean up app log dirs after NM restart. Contributed by sandflee
2016-10-28 15:48:58 +00:00
Jason Lowe
c017171da0
HADOOP-13770. Shell.checkIsBashSupported swallowed an interrupted exception. Contributed by Wei-Chiu Chuang
2016-10-28 14:57:50 +00:00
Robert Kanter
5877f20f9c
HADOOP-10075. Update jetty dependency to version 9 (rkanter)
2016-10-27 16:09:00 -07:00
Robert Kanter
9449519a25
YARN-5776. Checkstyle: MonitoringThread.Run method length is too long (miklos.szegedi@cloudera.com via rkanter)
2016-10-27 14:36:38 -07:00
Jason Lowe
7e3c327d31
YARN-4831. Recovered containers will be killed after NM stateful restart. Contributed by Siqi Li
2016-10-27 20:42:52 +00:00
Robert Kanter
6fbfb501f2
YARN-4456. Clean up Lint warnings in nodemanager (templedf via rkanter)
2016-10-27 12:37:01 -07:00
Varun Saxena
7b4e9ec3b0
YARN-5686. DefaultContainerExecutor random working dir algorigthm skews results (Vrushali C via Varun Saxena)
2016-10-27 14:02:29 +05:30
Varun Saxena
5c39a419a6
YARN-5752. TestLocalResourcesTrackerImpl#testLocalResourceCache times out (Eric Badger via Varun Saxena)
2016-10-27 13:41:49 +05:30
Rohith Sharma K S
b110c4b5e8
YARN-4555. TestDefaultContainerExecutor#testContainerLaunchError fails on non-english locale environment. Contributed by Takashi Ohnishi.
2016-10-27 11:34:07 +05:30
Karthik Kambatla
de6faae97c
YARN-5388. Deprecate and remove DockerContainerExecutor. (Daniel Templeton via kasha)
2016-10-25 13:35:47 -07:00
Robert Kanter
f5d9235914
YARN-5466. DefaultContainerExecutor needs JavaDocs (templedf via rkanter)
2016-10-17 14:29:09 -07:00
Sangjin Lee
1f304b0c7f
YARN-5699. Retrospect yarn entity fields which are publishing in events info fields. Contributed by Rohith Sharma K S.
2016-10-15 13:54:40 -07:00
Chris Douglas
cf3f43e95b
YARN-5717. Add tests for container-executor is_feature_enabled. Contributed by Sidharta Seethana
2016-10-13 20:49:07 -07:00
Varun Vasudev
0992708d79
YARN-5704. Provide config knobs to control enabling/disabling new/work in progress features in container-executor. Contributed by Sidharta Seethana.
2016-10-05 15:31:38 +05:30
Arun Suresh
10be45986c
YARN-5486. Update OpportunisticContainerAllocatorAMService::allocate method to handle OPPORTUNISTIC container requests. (Konstantinos Karanasos via asuresh)
2016-09-29 15:11:41 -07:00
Varun Vasudev
bc2656f09f
YARN-5662. Provide an option to enable ContainerMonitor. Contributed by Jian He.
2016-09-28 15:18:18 +05:30
Arun Suresh
4815d024c5
YARN-5609. Expose upgrade and restart API in ContainerManagementProtocol. Contributed by Arun Suresh
2016-09-26 08:46:54 -07:00
Arun Suresh
2f163cd5cf
Revert "YARN-5609. Expose upgrade and restart API in ContainerManagementProtocol. Contributed by Arun Suresh"
...
This reverts commit fe644bafe7
.
2016-09-26 08:36:59 -07:00
Jian He
fe644bafe7
YARN-5609. Expose upgrade and restart API in ContainerManagementProtocol. Contributed by Arun Suresh
2016-09-26 22:41:16 +08:00
Arun Suresh
3552c2b99d
YARN-5637. Changes in NodeManager to support Container rollback and commit. (asuresh)
2016-09-18 10:55:18 -07:00
Arun Suresh
f67237cbe7
YARN-5657. Fix TestDefaultContainerExecutor. (asuresh)
2016-09-17 09:32:05 -07:00
Naganarasimha
4174b9756c
YARN-5642. Typos in 9 log messages. Contributed by Mehran Hassani
2016-09-17 10:35:39 +05:30
Arun Suresh
40b5a59b72
YARN-5620. Core changes in NodeManager to support re-initialization of Containers with new launchContext. (asuresh)
2016-09-15 07:15:11 -07:00
Jason Lowe
e793309735
YARN-5630. NM fails to start after downgrade from 2.8 to 2.7. Contributed by Jason Lowe
2016-09-13 14:41:27 +00:00
Varun Vasudev
e6fcfe28e3
YARN-5576. Allow resource localization while container is running. Contributed by Jian He.
2016-09-06 20:01:45 +05:30
Varun Vasudev
08f55ccbb0
YARN-5596. Fix failing unit test in TestDockerContainerRuntime. Contributed by Sidharta Seethana.
2016-09-01 14:08:51 +05:30
Arun Suresh
d6d9cff21b
YARN-5221. Expose UpdateResourceRequest API to allow AM to request for change in container properties. (asuresh)
2016-08-30 15:52:29 -07:00
Karthik Kambatla
cde3a00526
YARN-5373. NPE listing wildcard directory in containerLaunch. (Daniel Templeton via kasha)
2016-08-26 11:04:33 -07:00
Junping Du
9ef632f3b0
YARN-5557. Add localize API to the ContainerManagementProtocol. Contributed by Jian He.
2016-08-26 09:04:44 -07:00
Varun Vasudev
bfb4d95059
YARN-5430. Return container's ip and host from NM ContainerStatus call. Contributed by Jian He.
2016-08-26 16:30:18 +05:30
Varun Vasudev
525d52bf7c
YARN-5042. Mount /sys/fs/cgroup into Docker containers as read only mount. Contributed by luhuichun.
2016-08-25 14:18:26 +05:30
Varun Vasudev
7f05ff7a4e
YARN-5455. Update Javadocs for LinuxContainerExecutor. Contributed by Daniel Templeton.
2016-08-17 15:34:58 +05:30
Ray Chiang
aea3e65749
YARN-5137. Make DiskChecker pluggable in NodeManager. (Yufei Gu via rchiang)
2016-08-10 19:23:29 -07:00
Varun Vasudev
522ddbde79
YARN-5394. Remove bind-mount /etc/passwd for Docker containers. Contributed by Zhankun Tang.
2016-08-09 16:04:09 +05:30
Arun Suresh
82c9e06101
YARN-5457. Refactor DistributedScheduling framework to pull out common functionality. (asuresh)
2016-08-09 00:42:29 -07:00
Varun Saxena
37d939a453
YARN-4910. Fix incomplete log info in ResourceLocalizationService (Jun Gong via Varun Saxena)
2016-08-08 22:12:53 +05:30
Naganarasimha
131d58a24e
YARN-5287. LinuxContainerExecutor fails to set proper permission. Contributed by Ying Zhang
2016-08-07 01:08:13 +05:30
Wangda Tan
3f100d76ff
YARN-4888. Changes in scheduler to identify resource-requests explicitly by allocation-id. (Subru Krishnan via wangda)
2016-08-05 10:43:35 -07:00
Varun Vasudev
08e33381f5
YARN-5459. Add support for docker rm. Contributed by Shane Kumpf.
2016-08-04 14:08:34 +05:30
Jason Lowe
db646540f0
YARN-5462. TestNodeStatusUpdater.testNodeStatusUpdaterRetryAndNMShutdown fails intermittently. Contributed by Eric Badger
2016-08-03 19:17:25 +00:00
Varun Vasudev
d848184e90
YARN-5460. Change container runtime type logging in DelegatingLinuxContainerRuntime to debug. Contributed by Shane Kumpf.
2016-08-03 12:02:39 +05:30
Chris Nauroth
b913677365
YARN-5456. container-executor support for FreeBSD, NetBSD, and others if conf path is absolute. Contributed by Allen Wittenauer.
2016-08-02 22:24:34 -07:00
Varun Vasudev
7fc70c6422
YARN-5458. Rename DockerStopCommandTest to TestDockerStopCommand. Contributed by Shane Kumpf.
2016-08-02 16:56:45 +05:30
Varun Vasudev
2e7c2a13a8
YARN-5443. Add support for docker inspect command. Contributed by Shane Kumpf.
2016-08-02 16:55:10 +05:30
Varun Vasudev
95694b70cd
YARN-5444. Fix failing unit tests in TestLinuxContainerExecutorWithMocks. Contributed by Yufei Gu.
2016-08-01 16:08:38 +05:30
Arun Suresh
e5766b1dbe
YARN-5113. Refactoring and other clean-up for distributed scheduling. (Konstantinos Karanasos via asuresh)
2016-07-31 11:48:25 -07:00
Chris Nauroth
ef501b1a0b
YARN-5121. fix some container-executor portability issues. Contributed by Allen Wittenauer.
2016-07-30 08:26:19 -07:00
Akira Ajisaka
54fe17a607
HADOOP-9427. Use JUnit assumptions to skip platform-specific tests. Contributed by Gergely Novák.
2016-07-27 19:41:09 +09:00
Arun Suresh
8fbe6ece24
YARN-5350. Distributed Scheduling: Ensure sort order of allocatable nodes returned by the RM is not lost. (asuresh)
2016-07-19 23:03:58 -07:00
Andrew Wang
da456ffd62
Preparing for 3.0.0-alpha2 development
2016-07-15 19:04:17 -07:00
Varun Saxena
a72cb3825a
YARN-5383. Fix findbugs and checkstyle issues in ContainerExecutor. (Vrushali C via Varun Saxena)
2016-07-15 23:17:04 +05:30
Naganarasimha
b5ee7dbd8d
YARN-5380. NMTimelinePublisher should use getMemorySize instead of getMemory. Contributed by Vrushali C
2016-07-15 14:34:48 +05:30
Varun Vasudev
54bf14f80b
YARN-5303. Clean up ContainerExecutor JavaDoc. Contributed by Daniel Templeton.
2016-07-14 19:28:11 +05:30
Varun Vasudev
e5e558b0a3
YARN-4759. Fix signal handling for docker containers. Contributed by Shane Kumpf.
2016-07-14 19:27:16 +05:30
Varun Vasudev
58e1850801
YARN-5298. Mount usercache and NM filecache directories into Docker container. Contributed by Sidharta Seethana.
2016-07-14 16:11:07 +05:30
Rohith Sharma K S
dbe97aa768
YARN-5299. Log Docker run command when container fails. Contributed by Varun Vasudev.
2016-07-14 11:48:10 +05:30
Jian He
819224dcf9
YARN-5270. Solve miscellaneous issues caused by YARN-4844. Contributed by Wangda Tan
2016-07-11 22:36:20 -07:00
Sangjin Lee
6cf6ab7b78
Made a number of miscellaneous fixes for javac, javadoc, and checstyle warnings.
2016-07-10 08:46:05 -07:00
Vrushali
6d943038f6
Cleanup changes during rebase with trunk (Vrushali C)
2016-07-10 08:46:04 -07:00
Varun Saxena
1ff6833bba
YARN-5243. fix several rebase and other miscellaneous issues before merge. (Sangjin Lee via Varun Saxena)
2016-07-10 08:46:03 -07:00
Sangjin Lee
39cce4e629
YARN-3816. [Aggregation] App-level aggregation and accumulation for YARN system metrics (Li Lu via sjlee)
2016-07-10 08:45:56 -07:00
Sangjin Lee
84c35ac6c4
YARN-4711. NM is going down with NPE's due to single thread processing of events by Timeline client (Naganarasimha G R via sjlee)
2016-07-10 08:45:55 -07:00
Varun Saxena
6f6cc647d6
YARN-4712. CPU Usage Metric is not captured properly in YARN-2928. (Naganarasimha G R via Varun Saxena)
2016-07-10 08:45:54 -07:00
Sangjin Lee
0d02ab8729
YARN-3367. Replace starting a separate thread for post entity with event loop in TimelineClient (Naganarasimha G R via sjlee)
2016-07-10 08:45:53 -07:00
Sangjin Lee
960af7d471
YARN-4409. Fix javadoc and checkstyle issues in timelineservice code (Varun Saxena via sjlee)
2016-07-10 08:45:53 -07:00
Naganarasimha
6934b05c71
YARN-4238. createdTime and modifiedTime is not reported while publishing entities to ATSv2. (Varun Saxena via Naganarasimha G R)
2016-07-10 08:45:52 -07:00
Li Lu
89e5c44f9e
YARN-4356. Ensure the timeline service v.2 is disabled cleanly and has no
...
impact when it's turned off. Contributed by Sangjin Lee.
2016-07-10 08:45:48 -07:00
Sangjin Lee
8d9476ec5f
YARN-4058. Miscellaneous issues in NodeManager project (Naganarasimha G R via sjlee)
2016-07-10 08:45:43 -07:00
Junping Du
477a30f536
YARN-3045. Implement NM writing container lifecycle events to Timeline Service v2. Contributed by Naganarasimha G R.
2016-07-10 08:45:42 -07:00
Sangjin Lee
22e7ae5771
YARN-3792. Test case failures in TestDistributedShell and some issue fixes related to ATSV2 (Naganarasimha G R via sjlee)
...
(cherry picked from commit 84f37f1c7eefec6d139cbf091c50d6c06f734323)
2016-07-10 08:45:38 -07:00
Zhijie Shen
f3c661e8dd
YARN-3044. Made RM write app, attempt and optional container lifecycle events to timeline service v2. Contributed by Naganarasimha G R.
2016-07-10 08:45:37 -07:00
Junping Du
d275677e24
YARN-3634. TestMRTimelineEventHandling and TestApplication are broken. Contributed by Sangjin Lee.
...
(cherry picked from commit b059dd4882fd759e4762cc11c019be4b68fb74c1)
2016-07-10 08:45:36 -07:00
Sangjin Lee
fc8485d893
YARN-3551. Consolidate data model change according to the backend implementation (Zhijie Shen via sale)
...
(cherry picked from commit 557a3950bddc837469244835f5577899080115d8)
2016-07-10 08:45:35 -07:00
Junping Du
47f35a30bb
YARN-3391. Clearly define flow ID/ flow run / flow version in API and storage. Contributed by Zhijie Shen
...
(cherry picked from commit 68c6232f8423e55b4d152ef3d1d66aeb2d6a555e)
2016-07-10 08:45:33 -07:00
Zhijie Shen
5712b8f9fd
YARN-3334. NM uses timeline client to publish container metrics to new timeline service. Contributed by Junping Du.
2016-07-10 08:45:33 -07:00
Junping Du
d67c9bdb4d
YARN-3040. Make putEntities operation be aware of the app's context. Contributed by Zhijie Shen
2016-07-10 08:45:32 -07:00
Junping Du
2188a07e5b
YARN-3333. Rename TimelineAggregator etc. to TimelineCollector. Contributed by Sangjin Lee
2016-07-10 08:45:31 -07:00
Zhijie Shen
9b56364080
YARN-3039. Implemented the app-level timeline aggregator discovery service. Contributed by Junping Du.
2016-07-10 08:45:31 -07:00
Zhijie Shen
bf54d32750
YARN-3210. Refactored timeline aggregator according to new code organization proposed in YARN-3166. Contributed by Li Lu.
...
(cherry picked from commit d3ff7f06cbc66d3a23c2551e7d4c752689f46afe)
2016-07-10 08:45:30 -07:00
Zhijie Shen
9d57c9c015
YARN-3087. Made the REST server of per-node aggregator work alone in NM daemon. Conntributed by Li Lu.
...
(cherry picked from commit 41a08ad404d4278fe598d6c222b2ae0e84bae0df)
2016-07-10 08:45:29 -07:00
Zhijie Shen
7c8abec0a8
YARN-3030. Set up TS aggregator with basic request serving structure and lifecycle. Contributed by Sangjin Lee.
...
(cherry picked from commit f26941b39028ac30c77547e2be2d657bb5bf044a)
2016-07-10 08:45:28 -07:00
Vinod Kumar Vavilapalli
4c9e1aeb94
YARN-5224. Added new web-services /containers/{containerid}/logs & /containers/{containerid}/logs/{filename} and using them in "yarn logs" CLI to get logs of finished containers of a running application. Contributed by Xuan Gong.
2016-07-06 14:10:54 -07:00
Jian He
d792a90206
YARN-5296. NMs going OutOfMemory because ContainerMetrics leak in ContainerMonitorImpl. Contributed by Junping Du
2016-07-05 18:05:16 -07:00
Vinod Kumar Vavilapalli
ce9c006430
YARN-5214. Fixed locking in DirectoryCollection to avoid hanging NMs when various code-paths hit slow disks. Contributed by Junping Du.
2016-07-05 16:07:28 -07:00
Ray Chiang
0a5def155e
YARN-4568. Fix message when NodeManager runs into errors initializing the recovery directory. (rchiang)
2016-07-01 15:53:46 -07:00
Arun Suresh
99e5dd68d0
YARN-5171. Extend DistributedSchedulerProtocol to notify RM of containers allocated by the Node. (Inigo Goiri via asuresh)
2016-06-22 19:04:54 -07:00
Tsuyoshi Ozawa
5d58858bb6
HADOOP-9613. [JDK8] Update jersey version to latest 1.x release.
2016-06-21 08:05:32 +09:00
Sangjin Lee
5107a967fa
YARN-4958. The file localization process should allow for wildcards to reduce the application footprint in the state store (Daniel Templeton via sjlee)
2016-06-20 09:56:53 -07:00
Junping Du
d0162f2040
YARN-5246. NMWebAppFilter web redirects drop query parameters. Contributed by Varun Vasudev.
2016-06-19 17:44:54 -07:00
Junping Du
c35fa4a0e5
YARN-5251. Yarn CLI to obtain App logs for last 'n' bytes fails. Contributed by Xuan Gong.
2016-06-17 08:24:24 -07:00
Junping Du
5dfc38ff57
YARN-5237. Fix missing log files issue in rolling log aggregation. Contributed by Xuan Gong.
2016-06-15 16:17:54 -07:00
Wangda Tan
c77a1095dc
YARN-1942. Deprecate toString/fromString methods from ConverterUtils and move them to records classes like ContainerId/ApplicationId, etc. (wangda)
2016-06-14 15:06:38 -07:00
Arun Suresh
7dae2b3bc4
YARN-5212. Run existing ContainerManager tests using QueuingContainerManagerImpl. (Konstantinos Karanasos via asuresh)
2016-06-12 10:05:37 -07:00
Vinod Kumar Vavilapalli
9378d9428f
YARN-5191. Renamed the newly added “download=true” option for getting logs via NMWebServices and AHSWebServices to be a better "format" option. (Xuan Gong via vinodkv)
2016-06-09 12:30:58 -07:00
Naganarasimha
1500a0a300
YARN-4308. ContainersAggregated CPU resource utilization reports negative usage in first few heartbeats. Contributed by Sunil G
2016-06-09 05:41:09 +05:30
Arun Suresh
3344ba70e0
YARN-5204. Properly report status of killed/stopped queued containers. (Konstantinos Karanasos via asuresh)
2016-06-08 08:31:32 -07:00
Arun Suresh
76f0800c21
YARN-5176. More test cases for queuing of containers at the NM. (Konstantinos Karanasos via asuresh)
2016-06-07 17:16:18 -07:00
Xuan
58be55b6e0
YARN-5199. Close LogReader in in AHSWebServices#getStreamingOutput and
...
FileInputStream in NMWebServices#getLogs. Contributed by Xuan Gong
2016-06-07 16:07:02 -07:00
Rohith Sharma K S
bddea5fe5f
YARN-5118. Tests fails with localizer port bind exception. Contributed by Brahma Reddy Battula.
2016-06-07 11:20:15 +05:30
Ming Ma
4a1cedc010
MAPREDUCE-5044. Have AM trigger jstack on task attempts that timeout before killing them. (Eric Payne and Gera Shegalov via mingma)
2016-06-06 14:30:51 -07:00
Jian He
99cc439e29
YARN-5190. Registering/unregistering container metrics in ContainerMonitorImpl and ContainerImpl causing uncaught exception in ContainerMonitorImpl. Contributed by Junping Du
2016-06-03 11:10:42 -07:00
Arun Suresh
dc26601d8f
YARN-5180. Allow ResourceRequest to specify an enforceExecutionType flag. (asuresh)
2016-06-02 09:01:02 -07:00
Xuan
0bc05e40fa
YARN-5088. Improve "yarn log" command-line to read the last K bytes for the log files. Contributed by Xuan Gong
2016-06-01 13:44:21 -07:00
Varun Vasudev
86fb58b7dc
YARN-4007. Add support for different network setups when launching the docker container. Contributed by Sidharta Seethana.
2016-05-29 21:32:51 +05:30
Varun Vasudev
42f90ab885
YARN-4844. Add getMemorySize/getVirtualCoresSize to o.a.h.y.api.records.Resource. Contributed by Wangda Tan.
2016-05-29 21:24:16 +05:30
Arun Suresh
4fc09a897b
YARN-5117. QueuingContainerManager does not start GUARANTEED Container even if Resources are available. (Konstantinos Karanasos via asuresh)
2016-05-27 17:49:15 -07:00
Arun Suresh
aa975bc781
YARN-5127. Expose ExecutionType in Container api record. (Hitesh Sharma via asuresh)
2016-05-27 14:06:32 -07:00
Carlo Curino
013532a95e
YARN-4957. Add getNewReservation in ApplicationClientProtocol (Sean Po via curino)
2016-05-25 16:55:49 -07:00
Jason Lowe
1ba31fe9e9
YARN-4459. container-executor should only kill process groups. Contributed by Jun Gong
2016-05-25 21:24:55 +00:00
Robert Kanter
e07519b8db
YARN-4766. NM should not aggregate logs older than the retention policy (haibochen via rkanter)
2016-05-25 10:25:44 -07:00
Varun Vasudev
9a31e5dfef
YARN-5115. Avoid setting CONTENT-DISPOSITION header in the container-logs web-service. Contributed by Xuan Gong.
2016-05-25 19:01:12 +05:30
Varun Vasudev
c380a22031
YARN-857. Localization failures should be available in container diagnostics. Contributed by Vinod Kumar Vavilapalli.
2016-05-25 19:01:12 +05:30
Jason Lowe
d1df0266cf
YARN-5103. With NM recovery enabled, restarting NM multiple times results in AM restart. Contributed by Junping Du
2016-05-23 15:15:27 +00:00
Junping Du
6161d9ba52
YARN-5112. Excessive log warnings for directory permission issue on NM recovery. Contributed by Jian He.
...
(cherry picked from commit 867cd2f5a2e5966d6a7c5f5accb2fce78f9e7778)
2016-05-22 16:04:02 -07:00
Varun Vasudev
0287c49107
YARN-5016. Add support for a minimum retry interval for container retries. Contributed by Jun Gong.
2016-05-20 12:28:14 +05:30
Junping Du
22fcd819f0
YARN-5076. YARN web interfaces lack XFS protection. Contributed by Jonathan Maron.
...
(cherry picked from commit 2703ec68712279494d67b0d76b7ac10e7a1628be)
2016-05-19 14:15:21 -07:00
Akira Ajisaka
d4274c64bc
YARN-5107. TestContainerMetrics fails. (aajisaka)
2016-05-19 19:53:59 +09:00
Arun Suresh
1597630681
YARN-5110. Fix OpportunisticContainerAllocator to insert complete HostAddress in issued ContainerTokenIds. (Konstantinos Karanasos via asuresh)
2016-05-18 18:46:00 -07:00
Jian He
fa3bc3405d
YARN-4832. NM side resource value should get updated if change applied in RM side. Contributed by Junping Du
2016-05-17 12:52:19 -07:00
Jason Lowe
81effb7dcd
YARN-4325. Nodemanager log handlers fail to send finished/failed events in some cases. Contributed by Junping Du
2016-05-16 15:40:23 +00:00
Arun Suresh
f45bc5a83e
YARN-4738. Notify the RM about the status of OPPORTUNISTIC containers (Konstantinos Karanasos via asuresh)
2016-05-15 17:54:34 -07:00
Arun Suresh
f0ac18d001
YARN-2888. Corrective mechanisms for rebalancing NM container queues. (asuresh)
2016-05-13 13:38:36 -07:00
Andrew Wang
3c5c57af28
HADOOP-13142. Change project version from 3.0.0 to 3.0.0-alpha1.
2016-05-12 18:27:28 -07:00
Andrew Wang
ca5613af91
Revert "Update project version to 3.0.0-alpha1-SNAPSHOT."
...
This reverts commit 6b53802cba
.
2016-05-12 15:32:45 -07:00
Andrew Wang
6b53802cba
Update project version to 3.0.0-alpha1-SNAPSHOT.
2016-05-12 11:05:05 -07:00
Sangjin Lee
0bbe01f8d5
YARN-4577. Enable aux services to have their own custom classpath/jar file (Xuan Gong via sale)
2016-05-12 10:11:23 -07:00
Arun Suresh
d464f4d1c4
YARN-5049. Extend NMStateStore to save queued container information. (Konstantinos Karanasos via asuresh)
2016-05-11 19:10:17 -07:00
Arun Suresh
d971bf252f
YARN-5073. Refactor startContainerInternal() in ContainerManager to remove unused parameter. (Konstantinos Karanasos via asuresh)
2016-05-10 22:31:38 -07:00
Varun Vasudev
72b047715c
YARN-4595. Add support for configurable read-only mounts when launching Docker containers. Contributed by Billie Rinaldi.
2016-05-05 13:01:54 +05:30
Wangda Tan
7bd418e48c
YARN-4984. LogAggregationService shouldn't swallow exception in handling createAppDir() which cause thread leak. (Junping Du via wangda)
2016-05-04 11:38:55 -07:00
Junping Du
e61d431275
YARN-4920. ATS/NM should support a link to dowload/get the logs in text format. Contributed by Xuan Gong.
2016-05-04 10:35:49 -07:00
Robert Kanter
9d4d30243b
Remove parent's env vars from child processes
2016-04-29 09:25:51 -07:00
Varun Vasudev
0f25a1bb52
YARN-3998. Add support in the NodeManager to re-launch containers. Contributed by Jun Gong.
2016-04-29 16:09:07 +05:30
Jian He
4a8508501b
YARN-5009. NMLeveldbStateStoreService database can grow substantially leading to longer recovery times. Contributed by Jason Lowe
2016-04-28 21:54:11 -07:00
Arun Suresh
341888a0aa
YARN-4412. Create ClusterMonitor to compute ordered list of preferred NMs for OPPORTUNITIC containers. (asuresh)
2016-04-26 20:12:12 -07:00
Karthik Kambatla
1a3f1482e2
YARN-4795. ContainerMetrics drops records. (Daniel Templeton via kasha)
2016-04-26 06:15:36 -07:00
Arun Suresh
c282a08f38
YARN-2885. Create AMRMProxy request interceptor and ContainerAllocator to distribute OPPORTUNISTIC containers to appropriate Nodes (asuresh)
...
(cherry picked from commit 2bf025278a318b0452fdc9ece4427b4c42124e39)
2016-04-24 22:38:33 -07:00
Jing Zhao
63e5412f1a
HDFS-9427. HDFS should not default to ephemeral ports. Contributed by Xiaobing Zhou.
2016-04-22 15:14:40 -07:00
Karthik Kambatla
c8172f5f14
YARN-2883. Queuing of container requests in the NM. (Konstantinos Karanasos and Arun Suresh via kasha)
2016-04-20 09:55:50 -07:00
Jason Lowe
3150ae8108
YARN-4924. NM recovery race can lead to container not cleaned up. Contributed by sandflee
2016-04-14 19:17:14 +00:00
Robert Kanter
0d9194df00
YARN-4541. Change log message in LocalizedResource#handle() to DEBUG (rchiang via rkanter)
2016-04-13 17:45:18 -07:00
Naganarasimha
437e9d6475
YARN-4810. NM applicationpage cause internal error 500. Contributed by Bibin A Chundatt.
2016-04-12 17:59:46 +05:30
Vinod Kumar Vavilapalli
44bbc50d91
YARN-4168. Fixed a failing test TestLogAggregationService.testLocalFileDeletionOnDiskFull. Contributed by Takashi Ohnishi.
2016-04-11 12:11:14 -07:00
Akira Ajisaka
1ff27f9d12
YARN-4630. Remove useless boxing/unboxing code. Contributed by Kousuke Saruta.
2016-04-11 14:55:03 +09:00
Karthik Kambatla
e82f961a39
YARN-4756. Unnecessary wait in Node Status Updater during reboot. (Eric Badger via kasha)
2016-04-07 17:05:29 -07:00
Varun Vasudev
b41e65e5bc
YARN-4906. Capture container start/finish time in container metrics. Contributed by Jian He.
2016-04-06 13:41:33 +05:30
Junping Du
0005816743
YARN-4916. TestNMProxy.tesNMProxyRPCRetry fails. Contributed by Tibor Kiss.
2016-04-05 09:01:08 -07:00
naganarasimha
5092c94195
YARN-4746. yarn web services should convert parse failures of appId, appAttemptId and containerId to 400. Contributed by Bibin A Chundatt
2016-04-04 16:25:03 +05:30
Jian He
0dd9bcab97
YARN-4811. Generate histograms in ContainerMetrics for actual container resource usage
2016-03-31 14:28:13 -07:00
Allen Wittenauer
0a74610d1c
HADOOP-11393. Revert HADOOP_PREFIX, go back to HADOOP_HOME (aw)
2016-03-31 07:51:05 -07:00
Jason Lowe
948b758070
YARN-4773. Log aggregation performs extraneous filesystem operations when rolling log aggregation is disabled. Contributed by Jun Gong
2016-03-28 23:00:56 +00:00
Jian He
55ae143923
YARN-4117. End to end unit test with mini YARN cluster for AMRMProxy Service. Contributed by Giovanni Matteo Fumarola
2016-03-27 20:22:12 -07:00
Eric Payne
92b7e0d413
YARN-4686. MiniYARNCluster.start() returns before cluster is completely started. Contributed by Eric Badger.
2016-03-18 16:12:47 +00:00
Robert Kanter
22ca176dfe
TestResourceLocalizationService.testPublicResourceInitializesLocalDir fails Intermittently due to IllegalArgumentException from cleanup (templedf via rkanter)
2016-03-15 10:05:10 -07:00
Vinod Kumar Vavilapalli
b2661765a5
YARN-4762. Fixed CgroupHandler's creation and usage to avoid NodeManagers crashing when LinuxContainerExecutor is enabled. (Sidharta Seethana via vinodkv)
2016-03-07 11:08:17 -08:00
Jason Lowe
059caf9989
YARN-4744. Too many signal to container failure in case of LCE. Contributed by Sidharta Seethana
2016-03-07 15:40:01 +00:00
Varun Vasudev
8ed2e060e8
YARN-4245. Generalize config file handling in container-executor. Contributed by Sidharta Seethana.
2016-03-07 16:18:35 +05:30
Varun Vasudev
e51a8c1056
YARN-4737. Add CSRF filter support in YARN. Contributed by Jonathan Maron.
2016-03-07 15:26:44 +05:30
Haohui Mai
0fa54d45b1
HADOOP-12813. Migrate TestRPC and related codes to rebase on ProtobufRpcEngine. Contributed by Kai Zheng.
2016-02-29 11:41:00 -08:00
Jason Lowe
c58a6d53c5
YARN-4731. container-executor should not follow symlinks in recursive_unlink_children. Contributed by Colin Patrick McCabe
2016-02-29 15:24:35 +00:00
Ming Ma
7f3139e54d
YARN-4720. Skip unnecessary NN operations in log aggregation. (Jun Gong via mingma)
2016-02-26 08:40:05 -08:00
Robert Kanter
d7fdec1e6b
YARN-4579. Allow DefaultContainerExecutor container log directory permissions to be configurable (rchiang via rkanter)
2016-02-25 16:36:38 -08:00
Robert Kanter
954dd57043
YARN-4697. NM aggregation thread pool is not bound by limits (haibochen via rkanter)
2016-02-24 15:00:24 -08:00
Jason Lowe
d284e187b8
YARN-2046. Out of band heartbeats are sent only on container kill and possibly too early. Contributed by Ming Ma
2016-02-23 20:49:09 +00:00
Varun Vasudev
140cb5d745
YARN-4709. NMWebServices produces incorrect JSON for containers. Contributed by Varun Saxena.
2016-02-23 12:29:25 +05:30
Varun Vasudev
fa00d3e205
YARN-4655. Log uncaught exceptions/errors in various thread pools in YARN. Contributed by Sidharta Seethana.
2016-02-11 12:06:42 +05:30
Jason Lowe
fa328e2d39
YARN-4594. container-executor fails to remove directory tree when chmod required. Contributed by Colin Patrick McCabe
2016-02-03 17:21:12 +00:00
Varun Vasudev
1cd55e0c17
YARN-4649. Add additional logging to some NM state store operations. Contributed by Sidharta Seethana
2016-02-02 09:25:23 +05:30
Wangda Tan
9875325d5c
YARN-4340. Add list API to reservation system. (Sean Po via wangda)
2016-02-02 10:17:33 +08:00
Rohith Sharma K S
ac68666803
YARN-4543. Fix random test failure in TestNodeStatusUpdater.testStopReentrant. (Akihiro Suda via rohithsharmaks)
2016-01-29 12:29:54 +05:30
Jason Lowe
61382ff8fa
YARN-4643. Container recovery is broken with delegating container runtime. Contributed by Sidharta Seethana
2016-01-28 18:59:35 +00:00
Vinod Kumar Vavilapalli (I am also known as @tshooter.)
2085e60a96
YARN-3542. Refactored existing CPU cgroups support to use the newer and integrated ResourceHandler mechanism, and also deprecated the old LCEResourceHandler inteface hierarchy. Contributed by Varun Vasudev.
2016-01-25 16:19:36 -08:00
Jian He
992dd2f783
YARN-4520. Finished app info is unnecessarily persisted in NM state-store if container is acquired but not lunched on this node. Contributed by sandflee
2016-01-25 15:36:14 -08:00
Xuan
618bfd6ac2
YARN-4496. Improve HA ResourceManager Failover detection on the client.
...
Contributed by Jian He
2016-01-22 18:20:38 -08:00
Jason Lowe
46e5ea81e0
YARN-4598. Invalid event: RESOURCE_FAILED at CONTAINER_CLEANEDUP_AFTER_KILL. Contributed by tangshangwen
2016-01-22 21:55:01 +00:00
Varun Vasudev
b41a7e89d1
YARN-4578. Directories that are mounted in docker containers need to be more restrictive/container-specific. Contributed by Sidharta Seethana.
2016-01-22 14:43:14 +05:30
Wangda Tan
89d1fd5dac
HADOOP-12356. Fix computing CPU usage statistics on Windows. (Inigo Goiri via wangda)
2016-01-19 21:27:38 +08:00
Karthik Kambatla
d40859fab1
YARN-4526. Make SystemClock singleton so AppSchedulingInfo could use it. (kasha)
2016-01-18 10:58:14 +01:00
Colin Patrick Mccabe
b2c155f810
HADOOP-12712. Fix some cmake plugin and native build warnings (cmccabe)
2016-01-15 12:29:18 -08:00
Colin Patrick Mccabe
b1ed28fa77
HADOOP-8887. Use a Maven plugin to build the native code using CMake (cmccabe)
2016-01-14 11:02:34 -08:00
Varun Vasudev
3ddb92bd30
YARN-4553. Add cgroups support for docker containers. Contributed by Sidharta Seethana.
2016-01-14 14:29:29 +05:30
Jason Lowe
13de8359a1
YARN-4414. Nodemanager connection errors are retried at multiple levels. Contributed by Chang Li
2016-01-12 15:56:15 +00:00
Akira Ajisaka
52b77577c4
YARN-4544. All the log messages about rolling monitoring interval are shown with WARN level. Contributed by Takashi Ohnishi.
2016-01-08 01:12:46 +09:00
Steve Loughran
07d1cb612c
YARN-4550. Some tests in TestContainerLanch fails on non-english locale environment. (Takashi Ohnishi via stevel)
2016-01-07 14:30:20 +00:00
rohithsharmaks
791c1639ae
YARN-4393. Fix intermittent test failure for TestResourceLocalizationService#testFailedDirsResourceRelease (Varun Saxana via rohithsharmaks)
2016-01-07 09:38:47 +05:30
Rohith Sharma K S
f9e36dea96
YARN-4530. LocalizedResource trigger a NPE Cause the NodeManager exit. (tangshangwen via rohithsharmaks)
2016-01-04 09:58:13 +05:30
Gera Shegalov
2c17b81569
YARN-2934. Improve handling of container's stderr. (Naganarasimha G R via gera)
2015-12-24 23:48:05 -08:00
Uma Mahesh
0f82b5d878
YARN-4480. Clean up some inappropriate imports. (Kai Zheng via umamahesh)
2015-12-19 23:10:13 -08:00
Vinod Kumar Vavilapalli (I am also known as @tshooter.)
4e7d32c0db
YARN-1856. Added cgroups based memory monitoring for containers as another alternative to custom memory-monitoring. Contributed by Varun Vasudev.
2015-12-17 12:13:03 -08:00
Wangda Tan
79c41b1d83
YARN-4293. ResourceUtilization should be a part of yarn node CLI. (Sunil G via wangda)
2015-12-16 13:18:19 -08:00
Jian He
915cd6c3f4
YARN-4402. TestNodeManagerShutdown And TestNodeManagerResync fails with bind exception. Contributed by Brahma Reddy Battula
2015-12-14 14:59:01 -08:00
Wangda Tan
dfcbbddb09
YARN-4309. Add container launch related debug information to container logs when a container fails. (Varun Vasudev via wangda)
2015-12-14 11:13:22 -08:00
rohithsharmaks
15c3e7ffe3
YARN-4431. Not necessary to do unRegisterNM() if NM get stop due to failed to connect to RM. (Junpin Du via rohithsharmaks)
2015-12-09 10:50:43 +05:30
Steve Loughran
65f395226b
HADOOP-12321. Make JvmPauseMonitor an AbstractService. (Sunil G via Stevel) [includes HDFS-8947 MAPREDUCE-6462 and YARN-4072]
2015-12-06 17:43:35 +00:00
Jian He
9f77ccad73
YARN-3840. Resource Manager web ui issue when sorting application by id (with application having id > 9999). Contributed by Mohammad Shahid Khan and Varun Saxena
2015-12-03 12:48:50 -08:00
Junping Du
62e9348bc1
YARN-4408. Fix issue that NodeManager still reports negative running containers. Contributed by Robert Kanter.
2015-12-03 06:36:37 -08:00
Tsuyoshi Ozawa
0656d2dc83
YARN-4380. TestResourceLocalizationService.testDownloadingResourcesOnContainerKill fails intermittently. Contributed by Varun Saxena.
2015-11-26 01:10:02 +09:00
Jason Lowe
4ac6799d4a
YARN-4132. Separate configs for nodemanager to resourcemanager connection timeout and retries. Contributed by Chang Li
2015-11-24 22:35:37 +00:00
Karthik Kambatla
52948bb20b
YARN-3980. Plumb resource-utilization info in node heartbeat through to the scheduler. (Inigo Goiri via kasha)
2015-11-24 13:47:17 +05:30
Jian He
fcd7888029
Revert "YARN-3840. Resource Manager web ui issue when sorting application by id (with application having id > 9999) Contributed by Mohammad Shahid Khan"
...
This reverts commit 8fbea531d7
.
Conflicts:
hadoop-yarn-project/CHANGES.txt
2015-11-16 20:18:44 -08:00
Junping Du
855d52927b
YARN-4354. Public resource localization fails with NPE. Contributed by Jason Lowe.
2015-11-15 04:43:57 -08:00
Jian He
8fbea531d7
YARN-3840. Resource Manager web ui issue when sorting application by id (with application having id > 9999) Contributed by Mohammad Shahid Khan
2015-11-09 10:43:45 -08:00
Jason Lowe
e2267de207
YARN-2902. Killing a container that is localizing can orphan resources in the DOWNLOADING state. Contributed by Varun Saxena
2015-10-29 16:34:25 +00:00
Akira Ajisaka
8a68630dd1
YARN-4300. [JDK8] Fix javadoc errors caused by wrong tags. (aajisaka)
2015-10-27 10:28:01 +09:00
Wangda Tan
6f606214e7
YARN-4169. Fix racing condition of TestNodeStatusUpdaterForLabels. (Naganarasimha G R via wangda)
2015-10-26 16:36:34 -07:00
Rohith Sharma K S
5acdde4744
YARN-2729. Support script based NodeLabelsProvider Interface in Distributed Node Label Configuration Setup. (Naganarasimha G R via rohithsharmaks)
2015-10-26 15:42:42 +05:30
Tsuyoshi Ozawa
ce60b4fc8b
YARN-3528. Tests with 12345 as hard-coded port break jenkins. Contributed by Brahma Reddy Battula.
2015-10-26 16:45:11 +09:00
Allen Wittenauer
1aa735c188
YARN-3724. Use POSIX nftw(3) instead of fts(3) (Alan Burlison via aw)
2015-10-25 21:43:23 -07:00
Jonathan Eagles
f8adeb712d
YARN-4009. CORS support for ResourceManager REST API. ( Varun Vasudev via jeagles)
2015-10-23 10:34:08 -05:00
Varun Vasudev
151eab2a12
YARN-4267. Add additional logging to container launch implementations in container-executor. Contributed by Sidharta Seethana.
2015-10-19 20:56:01 +05:30
Varun Vasudev
e39ae0e676
YARN-4262. Allow whitelisted users to run privileged docker containers. Contributed by Sidharta Seethana.
2015-10-19 20:19:22 +05:30
Steve Loughran
e2d59e2c7b
YARN-4155. TestLogAggregationService.testLogAggregationServiceWithInterval failing. (Bibin A Chundatt via stevel)
2015-10-18 12:03:57 +01:00
Varun Vasudev
63020c54c1
YARN-4258. Add support for controlling capabilities for docker containers. Contributed by Sidharta Seethana.
2015-10-15 13:41:20 +05:30
Steve Loughran
dfa7848463
HADOOP-12478. Shell.getWinUtilsPath() has been renamed Shell.getWinutilsPath(). (stevel)
2015-10-14 20:25:47 +01:00
Varun Vasudev
da1ee078f9
YARN-4255. container-executor does not clean up docker operation command files. Contributed by Sidharta Seethana.
2015-10-14 15:25:21 +05:30
Varun Vasudev
fb9c1519b3
YARN-4252. Log container-executor invocation details when exit code is non-zero. Contributed by Sidharta Seethana.
2015-10-14 15:18:08 +05:30
Varun Vasudev
8d59293089
YARN-4253. Standardize on using PrivilegedOperationExecutor for all invocations of container-executor in LinuxContainerExecutor. Contributed by Sidharta Seethana.
2015-10-14 14:58:10 +05:30
Steve Loughran
c59af2fdf8
HADOOP-10775. Shell operations to fail with meaningful errors on windows if winutils.exe not found. (stevel)
2015-10-13 21:49:28 +01:00
Varun Vasudev
5b6bae0094
YARN-4017. container-executor overuses PATH_MAX. Contributed by Sidharta Seethana
2015-10-13 12:48:30 +05:30
Jason Lowe
a0bca2b5ad
YARN-261. Ability to fail AM attempts. Contributed by Andrey Klochkov and Rohith Sharma K S
2015-10-09 14:17:38 +00:00
Jason Lowe
8d226225d0
YARN-3943. Use separate threshold configurations for disk-full detection and disk-not-full detection. Contributed by Zhihai Xu
2015-10-08 22:25:34 +00:00
Wangda Tan
30ac69c6bd
YARN-4176. Resync NM nodelabels with RM periodically for distributed nodelabels. (Bibin A Chundatt via wangda)
2015-10-05 15:47:21 -07:00
Vinod Kumar Vavilapalli
f1c19b9365
HADOOP-12441. Fixed shell-kill command behaviour to work correctly on some Linux distributions after HADOOP-12317. Contributed by Wangda Tan.
2015-10-05 10:56:55 -07:00
Xuan
8f08532bde
YARN-1897. CLI and core support for signal container functionality. Contributed by Ming Ma
2015-10-02 18:50:47 -07:00
Jason Lowe
fdf02d1f26
YARN-3619. ContainerMetrics unregisters during getMetrics and leads to ConcurrentModificationException. Contributed by Zhihai Xu
2015-10-02 20:20:31 +00:00
Jason Lowe
854d25b0c3
YARN-3727. For better error recovery, check if the directory exists before using it for localization. Contributed by Zhihai Xu
2015-09-30 14:59:44 +00:00
Rohith Sharma K S
8ed0d4b744
YARN-4152. NodeManager crash with NPE when LogAggregationService#stopContainer called for absent container. (Bibin A Chundatt via rohithsharmaks)
2015-09-24 11:24:14 +05:30
Jian He
c57eac5dfe
YARN-3868. Recovery support for container resizing. Contributed by Meng Ding
2015-09-23 13:29:38 -07:00
Jian He
c3dc1af072
YARN-1644. RM-NM protocol changes and NodeStatusUpdater implementation to support container resizing. Contributed by Meng Ding
2015-09-23 13:29:37 -07:00
Jian He
c59ae4eeb1
YARN-1643. Make ContainersMonitor support changing monitoring size of an allocated container. Contributed by Meng Ding and Wangda Tan
2015-09-23 13:29:37 -07:00
Jian He
5f5a968d65
YARN-3867. ContainerImpl changes to support container resizing. Contributed by Meng Ding
2015-09-23 13:29:37 -07:00
Jian He
ffd820c27a
YARN-1645. ContainerManager implementation to support container resizing. Contributed by Meng Ding & Wangda Tan
2015-09-23 13:29:37 -07:00
Jian He
83a18add10
YARN-1449. AM-NM protocol changes to support container resizing. Contributed by Meng Ding & Wangda Tan)
2015-09-23 13:29:36 -07:00
Jason Lowe
c890c51a91
YARN-4095. Avoid sharing AllocatorPerContext object in LocalDirAllocator between ShuffleHandler and LocalDirsHandlerService. Contributed by Zhihai Xu
2015-09-23 15:42:01 +00:00
Xuan
34ef1a092b
YARN-4149. yarn logs -am should provide an option to fetch all the log
...
files. Contributed by Varun Vasudev
2015-09-15 14:36:30 -07:00
Jason Lowe
8c1cdb17a0
YARN-4158. Remove duplicate close for LogWriter in AppLogAggregatorImpl#uploadLogsForContainers. Contributed by Zhihai Xu
2015-09-15 20:21:33 +00:00
Varun Vasudev
486d5cb803
YARN-4136. LinuxContainerExecutor loses info when forwarding ResourceHandlerException. Contributed by Bibin A Chundatt.
2015-09-11 14:37:48 +05:30
Wangda Tan
77666105b4
YARN-4106. NodeLabels for NM in distributed mode is not updated even after clusterNodelabel addition in RM. (Bibin A Chundatt via wangda)
2015-09-10 09:30:09 -07:00
Zhihai Xu
16b9037dc1
YARN-4096. App local logs are leaked if log aggregation fails to initialize for the app. Contributed by Jason Lowe.
2015-09-08 12:29:54 -07:00
Jian He
6f72f1e600
YARN-2884. Added a proxy service in NM to proxy the the communication between AM and RM. Contributed by Kishore Chaliparambil
2015-09-08 09:35:46 +08:00
Varun Vasudev
1dbd8e34a7
YARN-3591. Resource localization on a bad disk causes subsequent containers failure. Contributed by Lavkesh Lahngir.
2015-09-07 11:32:12 +05:30
Rohith Sharma K S
095ab9ab5f
YARN-4073. Removed unused ApplicationACLsManager in ContainerManagerImpl constructor. (Naganarasimha G R via rohithsharmaks)
2015-09-02 14:13:33 +05:30
Xuan
b71c6006f5
YARN-221. Addendum patch to compilation issue which is caused by missing
...
AllContainerLogAggregationPolicy. Contributed by Xuan Gong
2015-08-23 16:46:30 -07:00
Xuan
37e1c3d82a
YARN-221. NM should provide a way for AM to tell it not to aggregate
...
logs. Contributed by Ming Ma
2015-08-22 16:25:24 -07:00
Wangda Tan
fc07464d1a
YARN-2923. Support configuration based NodeLabelsProvider Service in Distributed Node Label Configuration Setup. (Naganarasimha G R)
2015-08-20 11:51:03 -07:00
Zhihai Xu
14215c8ef8
YARN-4057. If ContainersMonitor is not enabled, only print related log info one time. Contributed by Jun Gong.
2015-08-18 11:36:02 -07:00
Karthik Kambatla
13604bd5f1
YARN-4055. Report node resource utilization in heartbeat. (Inigo Goiri via kasha)
2015-08-16 15:08:53 -07:00
Karthik Kambatla
def12933b3
YARN-3534. Collect memory/cpu usage on the node. (Inigo Goiri via kasha)
2015-08-16 06:24:16 -07:00
Jian He
38aed1a94e
YARN-4005. Completed container whose app is finished is possibly not removed from NMStateStore. Contributed by Jun Gong
2015-08-13 14:46:08 -07:00
Junping Du
cfee02b3bd
YARN-4019. Add JvmPauseMonitor to ResourceManager and NodeManager. Contributed by Robert Kanter.
2015-08-06 06:49:45 -07:00
Xuan
c3364ca8e7
YARN-4004. container-executor should print output of docker logs if the
...
docker container exits with non-0 exit status. Contributed by Varun
Vasudev
2015-08-03 18:10:11 -07:00
Jason Lowe
469cfcd695
YARN-3965. Add startup timestamp to nodemanager UI. Contributed by Hong Zhiguo
2015-08-03 15:53:32 +00:00
Xuan
f170934215
YARN-3982. container-executor parsing of container-executor.cfg broken
...
in trunk and branch-2. Contributed by Varun Vasudev
2015-07-27 23:45:58 -07:00
Varun Vasudev
3e6fce91a4
YARN-3853. Add docker container runtime support to LinuxContainterExecutor. Contributed by Sidharta Seethana.
2015-07-27 11:57:40 -07:00
Varun Vasudev
f36835ff9b
YARN-3852. Add docker container support to container-executor. Contributed by Abin Shahab.
2015-07-27 10:14:51 -07:00
Jason Lowe
ff9c13e0a7
YARN-3925. ContainerLogsUtils#getContainerLogFile fails to read container log files from full disks. Contributed by zhihai xu
2015-07-24 22:14:39 +00:00
Wangda Tan
06e5dd2c84
YARN-3956. Fix TestNodeManagerHardwareUtils fails on Mac (Varun Vasudev via wangda)
2015-07-22 12:01:41 -07:00
Colin Patrick Mccabe
419c51d233
YARN-3844. Make hadoop-yarn-project Native code -Wall-clean (Alan Burlison via Colin P. McCabe)
2015-07-17 11:38:59 -07:00
Akira Ajisaka
19295b36d9
YARN-3381. Fix typo InvalidStateTransitonException. Contributed by Brahma Reddy Battula.
2015-07-13 17:52:13 +09:00
Zhijie Shen
1ea36299a4
YARN-3116. RM notifies NM whether a container is an AM container or normal task container. Contributed by Giovanni Matteo Fumarola.
2015-07-10 18:58:10 -07:00
Karthik Kambatla
527c40e4d6
YARN-1012. Report NM aggregated container resource utilization in heartbeat. (Inigo Goiri via kasha)
2015-07-09 09:35:14 -07:00
Varun Vasudev
63d0365088
YARN-2194. Addendum patch to fix failing unit test in TestPrivilegedOperationExecutor. Contributed by Sidharta Seethana.
2015-07-09 11:51:59 +05:30
Varun Vasudev
c40bdb56a7
YARN-2194. Fix bug causing CGroups functionality to fail on RHEL7. Contributed by Wei Yan.
2015-07-07 16:59:29 +05:30
Jason Lowe
b5cdf78e8e
YARN-3793. Several NPEs when deleting local files on NM recovery. Contributed by Varun Saxena
2015-07-01 21:13:32 +00:00
Colin Patrick Mccabe
d0cc0380b5
YARN-3827. Migrate YARN native build to new CMake framework (Alan Burlison via Colin P. McCabe)
2015-06-30 16:24:19 -07:00
Jian He
62e583c7dc
YARN-3695. ServerProxy (NMProxy, etc.) shouldn't retry forever for non network exception. Contributed by Raju Bairishetti
2015-06-29 13:37:32 -07:00
Jason Lowe
40b256949a
YARN-3850. NM fails to read files from full disks which can lead to container logs being lost and other issues. Contributed by Varun Saxena
2015-06-26 15:47:07 +00:00
Jason Lowe
8d58512d6e
YARN-3832. Resource Localization fails on a cluster due to existing cache directories. Contributed by Brahma Reddy Battula
2015-06-24 16:37:39 +00:00
Karthik Kambatla
5ebf2817e5
YARN-3842. NMProxy should retry on NMNotYetReadyException. (Robert Kanter via kasha)
2015-06-22 17:47:47 -07:00
Xuan
6c7a9d502a
YARN-3834. Scrub debug logging of tokens during resource localization. Contributed by Chris Nauroth
2015-06-21 17:13:44 -07:00
Devaraj K
126321eded
YARN-3747. TestLocalDirsHandlerService should delete the created test
...
directory logDir2. Contributed by David Moore.
2015-06-08 15:32:13 +05:30
Junping Du
d7e7f6aa03
YARN-41. The RM should handle the graceful shutdown of the NM. Contributed by Devaraj K.
2015-06-04 04:59:27 -07:00
Jason Lowe
e13b671aa5
YARN-3585. NodeManager cannot exit on SHUTDOWN event triggered and NM recovery is enabled. Contributed by Rohith Sharmaks
2015-06-03 19:44:07 +00:00
Robert Kanter
6aec13cb33
YARN-3713. Remove duplicate function call storeContainerDiagnostics in ContainerDiagnosticsUpdateTransition (zxu via rkanter)
2015-05-29 15:34:37 -07:00
Andrew Wang
7673d4f205
HADOOP-11885. hadoop-dist dist-layout-stitching.sh does not work with dash. (wang)
2015-05-29 14:54:18 -07:00
cnauroth
4102e5882e
YARN-3626. On Windows localized resources are not moved to the front of the classpath when they should be. Contributed by Craig Welch.
2015-05-27 14:31:49 -07:00
Vinod Kumar Vavilapalli
500a1d9c76
YARN-160. Enhanced NodeManager to automatically obtain cpu/memory values from underlying OS when configured to do so. Contributed by Varun Vasudev.
2015-05-26 11:38:35 -07:00
Junping Du
132d909d4a
YARN-3594. WintuilsProcessStubExecutor.startStreamReader leaks streams. Contributed by Lars Francke.
2015-05-22 04:23:25 -07:00
Vinod Kumar Vavilapalli
53fafcf061
YARN-3684. Changed ContainerExecutor's primary lifecycle methods to use a more extensible mechanism of context objects. Contributed by Sidharta Seethana.
2015-05-21 15:50:23 -07:00
Jian He
6329bd00fa
YARN-3654. ContainerLogsPage web UI should not have meta-refresh. Contributed by Xuan Gong
2015-05-20 17:20:21 -07:00
Wangda Tan
b37da52a1c
YARN-3565. NodeHeartbeatRequest/RegisterNodeManagerRequest should use NodeLabel object instead of String. (Naganarasimha G R via wangda)
2015-05-19 16:34:17 -07:00
Colin Patrick Mccabe
470c87dbc6
HADOOP-11970. Replace uses of ThreadLocal<Random> with JDK7 ThreadLocalRandom (Sean Busbey via Colin P. McCabe)
2015-05-19 10:50:15 -07:00
Ravi Prakash
c97f32e7b9
YARN-3302. TestDockerContainerExecutor should run automatically if it can detect docker in the usual place (Ravindra Kumar Naik via raviprak)
2015-05-19 10:28:11 -07:00
Junping Du
03a293aed6
YARN-3505 addendum: fix an issue in previous patch.
2015-05-15 06:39:39 -07:00
Ravi Prakash
53fe4eff09
YARN-1519. Check in container-executor if sysconf is implemented before using it (Radim Kolar and Eric Payne via raviprak)
2015-05-14 15:55:37 -07:00
Junping Du
15ccd967ee
YARN-3505. Node's Log Aggregation Report with SUCCEED should not cached in RMApps. Contributed by Xuan Gong.
2015-05-14 10:58:12 -07:00
Jason Lowe
711d77cc54
YARN-3641. NodeManager: stopRecoveryStore() shouldn't be skipped when exceptions happen in stopping NM's sub-services. Contributed by Junping Du
2015-05-13 21:06:47 +00:00
Xuan
0f95921447
YARN-3626. On Windows localized resources are not moved to the front of the classpath when they should be. Contributed by Craig Welch
2015-05-13 13:10:53 -07:00
Devaraj K
5c2f05cd9b
YARN-3629. NodeID is always printed as "null" in node manager
...
initialization log. Contributed by nijel.
2015-05-12 22:20:25 +05:30
Devaraj K
8badd82ce2
YARN-3513. Remove unused variables in ContainersMonitorImpl and add debug
...
log for overall resource usage by all containers. Contributed by
Naganarasimha G R.
2015-05-12 16:54:38 +05:30
Karthik Kambatla
70fb37cd79
YARN-1287. Consolidate MockClocks. (Sebastian Wong and Anubhav Dhoot via kasha)
2015-05-09 14:34:54 -07:00
Xuan
6471d18bc7
YARN-1912. ResourceLocalizer started without any jvm memory control.
...
Contributed by Masatake Iwasaki
2015-05-08 20:01:21 -07:00
Xuan
333f9a896d
YARN-3602. TestResourceLocalizationService.testPublicResourceInitializesLocalDir fails Intermittently due to IOException from cleanup. Contributed by zhihai xu
2015-05-08 17:52:54 -07:00
Jason Lowe
25e2b02122
YARN-3476. Nodemanager can fail to delete local logs if log aggregation fails. Contributed by Rohith
2015-05-08 22:45:52 +00:00
Xuan
088156de43
YARN-2331. Distinguish shutdown during supervision vs. shutdown for
...
rolling upgrade. Contributed by Jason Lowe
2015-05-08 15:10:43 -07:00
Robert (Bobby) Evans
bcf2890502
YARN-644: Basic null check is not performed on passed in arguments before using them in ContainerManagerImpl.startContainer
2015-05-08 11:11:01 -05:00
Devaraj K
241a72af0d
YARN-2784. Make POM project names consistent. Contributed by Rohith.
2015-05-08 19:23:53 +05:30
Akira Ajisaka
b167fe7605
YARN-1832. Fix wrong MockLocalizerStatus#equals implementation. Contributed by Hong Zhiguo.
2015-05-08 11:14:45 +09:00
Akira Ajisaka
918af8efff
YARN-3577. Misspelling of threshold in log4j.properties for tests. Contributed by Brahma Reddy Battula.
2015-05-07 13:33:03 +09:00
Robert Kanter
b72507810a
YARN-3491. PublicLocalizer#addResource is too slow. (zxu via rkanter)
2015-05-06 14:19:06 -07:00
Junping Du
3810242062
YARN-3396. Handle URISyntaxException in ResourceLocalizationService. (Contributed by Brahma Reddy Battula)
2015-05-05 10:18:23 -07:00
Wangda Tan
71f4de220c
YARN-3375. NodeHealthScriptRunner.shouldRun() check is performing 3 times for starting NodeHealthScriptRunner (Devaraj K via wangda)
2015-05-04 15:49:19 -07:00
Jason Lowe
8f65c793f2
YARN-3097. Logging of resource recovery on NM restart has redundancies. Contributed by Eric Payne
2015-05-04 15:31:15 +00:00
Robert Kanter
ac7d152901
YARN-3363. add localization and container launch time to ContainerMetrics at NM to show these timing information for each active container. (zxu via rkanter)
2015-05-01 16:39:21 -07:00
Vinod Kumar Vavilapalli
1b3b9e5c31
YARN-2619. Added NodeManager support for disk io isolation through cgroups. Contributed by Varun Vasudev and Wei Yan.
2015-04-30 21:41:07 -07:00
Karthik Kambatla
47279c3228
YARN-3464. Race condition in LocalizerRunner kills localizer before localizing all resources. (Zhihai Xu via kasha)
2015-04-26 09:13:46 -07:00
Jason Lowe
5e093f0d40
YARN-3537. NPE when NodeManager.serviceInit fails and stopRecoveryStore invoked. Contributed by Brahma Reddy Battula
2015-04-24 22:02:53 +00:00
Xuan
0b3f8957a8
YARN-3516. killing ContainerLocalizer action doesn't take effect when
...
private localizer receives FETCH_FAILURE status. Contributed by zhihai
xu
2015-04-23 16:40:40 -07:00
Vinod Kumar Vavilapalli
a100be685c
YARN-3366. Enhanced NodeManager to support classifying/shaping outgoing network bandwidth traffic originating from YARN containers Contributed by Sidharta Seethana.
2015-04-22 17:26:13 -07:00
Jian He
674c7ef649
YARN-3503. Expose disk utilization percentage and bad local and log dir counts in NM metrics. Contributed by Varun Vasudev
2015-04-21 20:57:02 -07:00
Junping Du
1db355a875
YARN-1402. Update related Web UI and CLI with exposing client API to check log aggregation status. Contributed by Xuan Gong.
2015-04-17 13:18:59 -07:00
Jian He
1b89a3e173
YARN-3354. Add node label expression in ContainerTokenIdentifier to support RM recovery. Contributed by Wangda Tan
2015-04-15 13:57:06 -07:00
Junping Du
838b06ac87
YARN-3443. Create a 'ResourceHandler' subsystem to ease addition of support for new resource types on the NM. Contributed by Sidharta Seethana.
2015-04-13 18:35:56 -07:00
Junping Du
92431c9617
YARN-1376. NM need to notify the log aggregation status to RM through Node heartbeat. Contributed by Xuan Gong.
2015-04-10 08:56:18 -07:00
Karthik Kambatla
6495940eae
YARN-3465. Use LinkedHashMap to preserve order of resource requests. (Zhihai Xu via kasha)
2015-04-09 00:07:49 -07:00
Tsuyoshi Ozawa
dd852f5b8c
YARN-3457. NPE when NodeManager.serviceInit fails and stopRecoveryStore called. Contributed by Bibin A Chundatt.
2015-04-08 15:56:18 +09:00
Wangda Tan
bad070fe15
YARN-2901. Add errors and warning metrics page to RM, NM web UI. (Varun Vasudev via wangda)
2015-04-02 17:23:20 -07:00
Vinod Kumar Vavilapalli
b21c72777a
YARN-3365. Enhanced NodeManager to support using the 'tc' tool via container-executor for outbound network traffic control. Contributed by Sidharta Seethana.
2015-04-02 16:53:59 -07:00
Tsuyoshi Ozawa
c69ba81497
YARN-3424. Change logs for ContainerMonitorImpl's resourse monitoring from info to debug. Contributed by Anubhav Dhoot.
2015-04-01 17:44:25 +09:00
Karthik Kambatla
2daa478a64
YARN-3428. Debug log resources to be localized for a container. (kasha)
2015-03-31 17:34:47 -07:00
Wangda Tan
2a945d24f7
YARN-2495. Allow admin specify labels from each NM (Distributed configuration for node label). (Naganarasimha G R via wangda)
2015-03-30 12:05:21 -07:00
Vinod Kumar Vavilapalli
c358368f51
YARN-3304. Cleaning up ResourceCalculatorProcessTree APIs for public use and removing inconsistencies in the default values. Contributed by Junping Du and Karthik Kambatla.
2015-03-30 10:09:40 -07:00
Ravi Prakash
e0ccea33c9
YARN-3288. Document and fix indentation in the DockerContainerExecutor code
2015-03-28 08:00:41 -07:00
Tsuyoshi Ozawa
82eda771e0
YARN-3384. TestLogAggregationService.verifyContainerLogs fails after YARN-2777. Contributed by Naganarasimha G R.
2015-03-24 00:25:30 +09:00
Junping Du
d81109e588
YARN-3269. Yarn.nodemanager.remote-app-log-dir could not be configured to fully qualified path. Contributed by Xuan Gong
2015-03-20 13:41:22 -07:00
Karthik Kambatla
20b49224eb
YARN-3351. AppMaster tracking URL is broken in HA. (Anubhav Dhoot via kasha)
2015-03-18 16:30:33 -07:00
Ravi Prakash
56085203c4
YARN-3339. TestDockerContainerExecutor should pull a single image and not the entire centos repository. (Ravindra Kumar Naik via raviprak)
2015-03-16 16:17:58 -07:00
Tsuyoshi Ozawa
3da9a97cfb
YARN-1453. [JDK8] Fix Javadoc errors caused by incorrect or illegal tags in doc comments. Contributed by Akira AJISAKA, Andrew Purtell, and Allen Wittenauer.
2015-03-16 23:19:05 +09:00
Vinod Kumar Vavilapalli
863079bb87
YARN-3154. Added additional APIs in LogAggregationContext to avoid aggregating running logs of application when rolling is enabled. Contributed by Xuan Gong.
2015-03-12 13:32:29 -07:00
Jian He
21101c01f2
YARN-2190. Added CPU and memory limit options to the default container executor for Windows containers. Contributed by Chuan Liu
2015-03-06 14:18:11 -08:00
Karthik Kambatla
53947f37c7
YARN-3122. Metrics for container's actual CPU usage. (Anubhav Dhoot via kasha)
2015-03-04 17:33:30 -08:00
Konstantin V Shvachko
8ca0d957c4
YARN-3255. RM, NM, JobHistoryServer, and WebAppProxyServer's main() should support generic options. Contributed by Konstantin Shvachko.
2015-02-26 17:12:19 -08:00
Allen Wittenauer
d4ac6822e1
YARN-2980. Move health check script related functionality to hadoop-common (Varun Saxena via aw)
2015-02-24 11:25:26 -08:00
Xuan
f56c65bb3e
YARN-3237. AppLogAggregatorImpl fails to log error cause. Contributed by
...
Rushabh S Shah
2015-02-20 14:02:40 -08:00
Junping Du
c33ae271c2
YARN-2799. Cleanup TestLogAggregationService based on the change in YARN-90. Contributed by Zhihai Xu
2015-02-20 09:43:39 -08:00
Tsuyoshi Ozawa
447bd7b5a6
YARN-3203. Correct a log message in AuxServices. Contributed by Brahma Reddy Battula.
2015-02-16 23:55:58 +09:00
Junping Du
ab0b958a52
YARN-2749. Fix some testcases from TestLogAggregationService fails in trunk. (Contributed by Xuan Gong)
2015-02-15 06:46:32 -08:00
cnauroth
6804d68901
YARN-2899. Run TestDockerContainerExecutorWithMocks on Linux only. Contributed by Ming Ma.
2015-02-13 21:58:54 -08:00
Jason Lowe
1a0f508b63
YARN-2847. Linux native container executor segfaults if default banned user detected. Contributed by Olaf Flebbe
2015-02-13 20:20:55 +00:00
Akira Ajisaka
6a49e58cb8
YARN-3191. Log object should be initialized with its own class. Contributed by Rohith.
2015-02-12 17:58:54 -08:00
Junping Du
04f5ef18f7
YARN-2079. Recover NonAggregatingLogHandler state upon nodemanager restart. (Contributed by Jason Lowe)
2015-02-12 11:46:47 -08:00
Jason Lowe
b379972ab3
YARN-3074. Nodemanager dies when localizer runner tries to write to a full disk. Contributed by Varun Saxena
2015-02-11 16:33:43 +00:00
Jason Lowe
3f5431a22f
YARN-2809. Implement workaround for linux kernel panic when removing cgroup. Contributed by Nathan Roberts
2015-02-10 17:27:21 +00:00
Jason Lowe
4eb5f7fa32
YARN-3090. DeletionService can silently ignore deletion task failures. Contributed by Varun Saxena
2015-02-10 16:54:21 +00:00
Jason Lowe
4c484320b4
YARN-3089. LinuxContainerExecutor does not handle file arguments to deleteAsUser. Contributed by Eric Payne
2015-02-06 20:39:01 +00:00
Arun C. Murthy
02f154a001
YARN-1537. Fix race condition in TestLocalResourcesTrackerImpl.testLocalResourceCache. Contributed by Xuan Gong.
2015-02-05 23:58:52 -08:00
Xuan
b73e776abc
YARN-3056. Add verification for containerLaunchDuration in
...
TestNodeManagerMetrics. Contributed by zhihai Xu
2015-02-03 15:12:54 -08:00
Robert Kanter
f7a77819a1
YARN-3022. Expose Container resource information from NodeManager for monitoring (adhoot via ranter)
2015-02-03 10:39:41 -08:00
Akira Ajisaka
342efa110a
HADOOP-9907. Webapp http://hostname:port/metrics link is not working. (aajisaka)
2015-01-30 02:49:10 +09:00
Allen Wittenauer
9dd0b7a2ab
YARN-2428. LCE default banned user list should have yarn (Varun Saxena via aw)
2015-01-29 09:30:26 -08:00
Jian He
4e15fc0841
YARN-3011. Possible IllegalArgumentException in ResourceLocalizationService might lead NM to crash. Contributed by Varun Saxena
2015-01-27 13:31:22 -08:00
Jason Lowe
902c6ea7e4
YARN-3088. LinuxContainerExecutor.deleteAsUser can throw NPE if native executor returns an error. Contributed by Eric Payne
2015-01-26 15:40:21 +00:00
Xuan
0d6bd62102
YARN-3024. LocalizerRunner should give DIE action when all resources are
...
localized. Contributed by Chengbing Liu
2015-01-25 19:37:57 -08:00
Tsuyoshi Ozawa
3aab354e66
YARN-3082. Non thread safe access to systemCredentials in NodeHeartbeatResponse processing. Contributed by Anubhav Dhoot.
2015-01-23 16:04:18 +09:00
Karthik Kambatla
84198564ba
YARN-2984. Metrics for container's actual memory usage. (kasha)
2015-01-17 05:44:04 +05:30
Junping Du
5d1cca34fa
YARN-3064. TestRMRestart/TestContainerResourceUsage/TestNodeManagerResync failure with allocation timeout. (Contributed by Jian He)
2015-01-16 00:08:36 -08:00
Jian He
cc2a745f7e
YARN-2997. Fixed NodeStatusUpdater to not send alreay-sent completed container statuses on heartbeat. Contributed by Chengbing Liu
2015-01-08 11:12:54 -08:00
Zhijie Shen
41a548a916
YARN-2937. Fixed new findbugs warnings in hadoop-yarn-nodemanager. Contributed by Varun Saxena.
2014-12-23 20:32:36 -08:00
Jian He
808cba3821
YARN-2952. Fixed incorrect version check in StateStore. Contributed by Rohith Sharmaks
2014-12-19 16:56:30 -08:00
Karthik Kambatla
954fb8581e
YARN-2675. containersKilled metrics is not updated when the container is killed during localization. (Zhihai Xu via kasha)
2014-12-19 16:02:20 -08:00
cnauroth
e996a1bfd4
HADOOP-11321. copyToLocal cannot save a file to an SMB share unless the user has Full Control permissions. Contributed by Chris Nauroth.
2014-12-16 15:29:22 -08:00
Steve Loughran
3681de2039
YARN-2912 Jersey Tests failing with port in use. (varun saxena via stevel)
2014-12-12 17:11:07 +00:00
Karthik Kambatla
db73cc9124
YARN-2931. PublicLocalizer may fail until directory is initialized by LocalizeRunner. (Anubhav Dhoot via kasha)
2014-12-08 22:26:18 -08:00
Harsh J
a31e016491
YARN-2891. Failed Container Executor does not provide a clear error message. Contributed by Dustin Cote. (harsh)
2014-12-04 03:17:15 +05:30
Jason Lowe
03ab24aa01
MAPREDUCE-5932. Provide an option to use a dedicated reduce-side shuffle log. Contributed by Gera Shegalov
2014-12-03 17:02:14 +00:00
Junping Du
e65b7c5ff6
YARN-1156. Enhance NodeManager AllocatedGB and AvailableGB metrics for aggregation of decimal values. (Contributed by Tsuyoshi OZAWA)
2014-12-03 04:11:18 -08:00
Karthik Kambatla
233b61e495
YARN-2679. Add metric for container launch duration. (Zhihai Xu via kasha)
2014-11-21 14:22:21 -08:00
Jason Lowe
49c38898b0
YARN-2816. NM fail to start with NPE during container recovery. Contributed by Zhihai Xu
2014-11-14 21:25:59 +00:00
Jason Lowe
33ea5ae92b
YARN-2846. Incorrect persist exit code for running containers in reacquireContainer() that interrupted by NodeManager restart. Contributed by Junping Du
2014-11-13 16:11:04 +00:00
Zhijie Shen
be7bf956e9
YARN-2794. Fixed log messages about distributing system-credentials. Contributed by Jian He.
2014-11-12 11:07:57 -08:00
Karthik Kambatla
a04143039e
YARN-2236. [YARN-1492] Shared Cache uploader service on the Node Manager. (Chris Trezzo and Sanjin Lee via kasha)
2014-11-12 09:31:05 -08:00
Ravi Prakash
53f64ee516
YARN-1964. Create Docker analog of the LinuxContainerExecutor in YARN
2014-11-11 21:28:11 -08:00
Karthik Kambatla
061bc293c8
YARN-2735. diskUtilizationPercentageCutoff and diskUtilizationSpaceCutoff are initialized twice in DirectoryCollection. (Zhihai Xu via kasha)
2014-11-11 10:31:39 -08:00
Xuan
5c9a51f140
YARN-2841. RMProxy should retry EOFException. Contributed by Jian He
2014-11-10 18:25:01 -08:00
Arun C. Murthy
43cd07b408
YARN-2830. Add backwords compatible ContainerId.newInstance constructor. Contributed by Jonathan Eagles.
2014-11-09 14:57:37 -08:00
Jason Lowe
c3d475070a
YARN-2825. Container leak on NM. Contributed by Jian He
2014-11-07 23:16:37 +00:00
cnauroth
06b797947c
YARN-2803. MR distributed cache not working correctly on Windows after NodeManager privileged account changes. Contributed by Craig Welch.
2014-11-07 12:29:39 -08:00
Vinod Kumar Vavilapalli
c5a46d4c8c
YARN-1922. Fixed NodeManager to kill process-trees correctly in the presence of races between the launch and the stop-container call and when root processes crash. Contributed by Billie Rinaldi.
2014-11-03 16:38:55 -08:00
Vinod Kumar Vavilapalli
58e9f24e0f
YARN-2788. Fixed backwards compatiblity issues with log-aggregation feature that were caused when adding log-upload-time via YARN-2703. Contributed by Xuan Gong.
2014-11-03 13:16:29 -08:00
Jason Lowe
6157ace547
YARN-2730. DefaultContainerExecutor runs only one localizer at a time. Contributed by Siqi Li
2014-11-03 20:37:47 +00:00
Vinod Kumar Vavilapalli
5c0381c96a
YARN-2790. Fixed a NodeManager bug that was causing log-aggregation to fail beyond HFDS delegation-token expiry even when RM is a proxy-user (YARN-2704). Contributed by Jian He.
2014-11-01 16:32:35 -07:00
Zhijie Shen
1cd088fd9d
YARN-2711. Fixed TestDefaultContainerExecutor#testContainerLaunchError failure on Windows. Contributed by Varun Vasudev.
2014-10-31 17:43:30 -07:00
Xuan
86ff28dea0
YARN-2701. Addendum patch. Potential race condition in startLocalizer when using LinuxContainerExecutor. Contributed by Xuan Gong
2014-10-31 14:36:25 -07:00
Jason Lowe
73e626ad91
YARN-2755. NM fails to clean up usercache_DEL_<timestamp> dirs after YARN-661. Contributed by Siqi Li
2014-10-30 15:10:27 +00:00
Zhijie Shen
8984e9b177
YARN-2741. Made NM web UI serve logs on the drive other than C: on Windows. Contributed by Craig Welch.
2014-10-28 14:11:19 -07:00
Vinod Kumar Vavilapalli
a16d022ca4
YARN-2704. Changed ResourceManager to optionally obtain tokens itself for the sake of localization and log-aggregation for long-running services. Contributed by Jian He.
2014-10-27 15:49:47 -07:00
Zhijie Shen
f81dc3f995
YARN-2703. Added logUploadedTime into LogValue for better display. Contributed by Xuan Gong.
2014-10-24 14:10:46 -07:00
Jian He
3b12fd6cfb
YARN-2198. Remove the need to run NodeManager as privileged account for Windows Secure Container Executor. Contributed by Remus Rusanu
2014-10-22 15:57:46 -07:00
cnauroth
6637e3cf95
YARN-2720. Windows: Wildcard classpath variables not expanded against resources contained in archives. Contributed by Craig Welch.
2014-10-21 12:33:21 -07:00
Jason Lowe
6f2028bd15
YARN-90. NodeManager should identify failed disks becoming good again. Contributed by Varun Vasudev
2014-10-21 17:31:13 +00:00
Jian He
4fa1fb3193
Missing file for YARN-2701
2014-10-20 19:57:45 -07:00
Jian He
2839365f23
YARN-2701. Potential race condition in startLocalizer when using LinuxContainerExecutor. Contributed by Xuan Gong
2014-10-20 18:45:47 -07:00
Jian He
0fd0ebae64
YARN-2682. Updated WindowsSecureContainerExecutor to not use DefaultContainerExecutor#getFirstApplicationDir and use getWorkingDir() instead. Contributed by Zhihai Xu
2014-10-16 18:14:34 -07:00
Jian He
0af1a2b5bc
YARN-2312. Deprecated old ContainerId#getId API and updated MapReduce to use ContainerId#getContainerId instead. Contributed by Tsuyoshi OZAWA
2014-10-15 15:22:07 -07:00
Karthik Kambatla
cc93e7e683
YARN-2566. DefaultContainerExecutor should pick a working directory randomly. (Zhihai Xu via kasha)
2014-10-13 16:32:01 -07:00
Jason Lowe
a56ea01002
YARN-2377. Localization exception stack traces are not passed as diagnostic info. Contributed by Gera Shegalov
2014-10-13 18:31:16 +00:00
Zhijie Shen
4aed2d8e91
YARN-2651. Spun off LogRollingInterval from LogAggregationContext. Contributed by Xuan Gong.
2014-10-13 10:54:09 -07:00
Zhijie Shen
cb81bac002
YARN-2583. Modified AggregatedLogDeletionService to be able to delete rolling aggregated logs. Contributed by Xuan Gong.
2014-10-10 00:11:30 -07:00
cnauroth
d3afd730ac
YARN-2662. TestCgroupsLCEResourcesHandler leaks file descriptors. Contributed by Chris Nauroth.
2014-10-09 22:44:44 -07:00
Vinod Kumar Vavilapalli
34cdcaad71
YARN-2468. Enhanced NodeManager to support log handling APIs (YARN-2569) for use by long running services. Contributed by Xuan Gong.
2014-10-03 12:15:40 -07:00
Jason Lowe
29f520052e
YARN-2624. Resource Localization fails on a cluster due to existing cache directories. Contributed by Anubhav Dhoot
2014-10-02 17:39:34 +00:00
Jian He
3ef1cf187f
YARN-2617. Fixed NM to not send duplicate container status whose app is not running. Contributed by Jun Gong
2014-10-02 10:04:09 -07:00
junping_du
c7cee9b455
YARN-1979. TestDirectoryCollection fails when the umask is unusual. (Contributed by Vinod Kumar Vavilapalli and Tsuyoshi OZAWA)
2014-10-02 08:02:59 -07:00
junping_du
0708827a93
YARN-2613. Support retry in NMClient for rolling-upgrades. (Contributed by Jian He)
2014-10-01 16:50:30 -07:00
Zhijie Shen
52bbe0f11b
YARN-2630. Prevented previous AM container status from being acquired by the current restarted AM. Contributed by Jian He.
2014-10-01 15:38:11 -07:00
Vinod Kumar Vavilapalli
ba7f31c2ee
YARN-1972. Added a secure container-executor for Windows. Contributed by Remus Rusanu.
2014-10-01 10:14:41 -07:00
Jian He
5391919b09
YARN-668. Changed NMTokenIdentifier/AMRMTokenIdentifier/ContainerTokenIdentifier to use protobuf object as the payload. Contributed by Junping Du.
2014-09-26 17:48:41 -07:00
Zhijie Shen
c86674a3a4
YARN-2581. Passed LogAggregationContext to NM via ContainerTokenIdentifier. Contributed by Xuan Gong.
2014-09-24 17:50:26 -07:00
Allen Wittenauer
034df0e2eb
YARN-2161. Fix build on macosx: YARN parts (Binglin Chang via aw)
2014-09-24 08:47:55 -07:00
junping_du
a9a55db065
YARN-2584. TestContainerManagerSecurity fails on trunk. (Contributed by Jian He)
2014-09-22 22:45:06 -07:00
Jian He
0a641496c7
YARN-1372. Ensure all completed containers are reported to the AMs across RM restart. Contributed by Anubhav Dhoot
2014-09-22 10:30:53 -07:00
Vinod Kumar Vavilapalli
9f6891d9ef
YARN-2531. Added a configuration for admins to be able to override app-configs and enforce/not-enforce strict control of per-container cpu usage. Contributed by Varun Vasudev.
2014-09-16 10:14:46 -07:00
cnauroth
9d4ec97c95
YARN-2549. TestContainerLaunch fails due to classpath problem with hamcrest classes. Contributed by Chris Nauroth.
2014-09-15 11:53:57 -07:00
Vinod Kumar Vavilapalli
4be95175cd
YARN-2440. Enabled Nodemanagers to limit the aggregate cpu usage across all containers to a preconfigured limit. Contributed by Varun Vasudev.
2014-09-10 19:22:52 -07:00
Jason Lowe
3fa5f728c4
YARN-2431. NM restart: cgroup is not removed for reacquired containers. Contributed by Jason Lowe
2014-09-04 21:11:27 +00:00
Jason Lowe
b03653f9a5
YARN-2462. TestNodeManagerResync#testBlockNewContainerRequestsOnStartAndResync should have a test timeout. Contributed by Eric Payne
2014-08-29 20:15:40 +00:00
Hitesh Shah
3de66011c2
YARN-2450. Fix typos in log messages. Contributed by Ray Chiang.
2014-08-29 11:16:36 -07:00
Allen Wittenauer
7e75226e68
YARN-2424. LCE should support non-cgroups, non-secure mode (Chris Douglas via aw)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1619421 13f79535-47bb-0310-9956-ffa450edef68
2014-08-21 14:57:11 +00:00
Junping Du
c2febdcbaa
YARN-1337. Recover containers upon nodemanager restart. (Contributed by Jason Lowe)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1617448 13f79535-47bb-0310-9956-ffa450edef68
2014-08-12 10:56:13 +00:00
Junping Du
b8f151231b
YARN-1354. Recover applications upon nodemanager restart. (Contributed by Jason Lowe)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1615550 13f79535-47bb-0310-9956-ffa450edef68
2014-08-04 13:25:37 +00:00
Jian He
a41c314373
YARN-2343. Improve NMToken expire exception message. Contributed by Li Lu
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1615270 13f79535-47bb-0310-9956-ffa450edef68
2014-08-01 23:44:48 +00:00
Xuan Gong
e52f67e389
YARN-1994. Expose YARN/MR endpoints on multiple interfaces. Contributed by Craig Welch, Milan Potocnik,and Arpit Agarwal
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1614981 13f79535-47bb-0310-9956-ffa450edef68
2014-07-31 20:06:02 +00:00
Zhijie Shen
1d6e178144
YARN-2347. Consolidated RMStateVersion and NMDBSchemaVersion into Version in yarn-server-common. Contributed by Junping Du.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1614838 13f79535-47bb-0310-9956-ffa450edef68
2014-07-31 09:27:43 +00:00
Aaron Myers
5d4677b57b
YARN-1796. container-executor shouldn't require o-r permissions. Contributed by Aaron T. Myers.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1613548 13f79535-47bb-0310-9956-ffa450edef68
2014-07-26 01:51:35 +00:00
Devarajulu K
2050e0dad6
YARN-1342. Recover container tokens upon nodemanager restart. Contributed by Jason Lowe.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1612995 13f79535-47bb-0310-9956-ffa450edef68
2014-07-24 05:02:00 +00:00
Junping Du
537c361f5b
YARN-2013. The diagnostics is always the ExitCodeException stack when the container crashes. (Contributed by Tsuyoshi OZAWA)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1612449 13f79535-47bb-0310-9956-ffa450edef68
2014-07-22 03:01:58 +00:00
Jason Darrell Lowe
1ad2d7b405
YARN-2321. NodeManager web UI can incorrectly report Pmem enforcement. Contributed by Leitao Guo
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1612411 13f79535-47bb-0310-9956-ffa450edef68
2014-07-21 21:55:06 +00:00
Jason Darrell Lowe
8a87085820
YARN-2045. Data persisted in NM should be versioned. Contributed by Junping Du
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1612285 13f79535-47bb-0310-9956-ffa450edef68
2014-07-21 14:43:59 +00:00
Junping Du
403ec8ea80
YARN-1341. Recover NMTokens upon nodemanager restart. (Contributed by Jason Lowe)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1611512 13f79535-47bb-0310-9956-ffa450edef68
2014-07-17 23:33:22 +00:00
Jian He
6d7dbd4fed
YARN-1367. Changed NM to not kill containers on NM resync if RM work-preserving restart is enabled. Contributed by Anubhav Dhoot
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1608334 13f79535-47bb-0310-9956-ffa450edef68
2014-07-07 04:37:59 +00:00
Steve Loughran
d1f54f4f4b
YARN-2065 AM cannot create new containers after restart
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1607441 13f79535-47bb-0310-9956-ffa450edef68
2014-07-02 18:35:10 +00:00
Karthik Kambatla
58e398f31d
YARN-2204. Explicitly enable vmem check in TestContainersMonitor#testContainerKillOnMemoryOverflow. (Anubhav Dhoot via kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1607231 13f79535-47bb-0310-9956-ffa450edef68
2014-07-02 02:05:37 +00:00
Vinod Kumar Vavilapalli
e285b98f0f
YARN-2152. Added missing information into ContainerTokenIdentifier so that NodeManagers can report the same to RM when RM restarts. Contributed Jian He.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1605205 13f79535-47bb-0310-9956-ffa450edef68
2014-06-24 21:43:22 +00:00
Thomas Graves
1f9a0fd927
YARN-2072. RM/NM UIs and webservices are missing vcore information. (Nathan Roberts via tgraves)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1605162 13f79535-47bb-0310-9956-ffa450edef68
2014-06-24 19:34:34 +00:00
Haohui Mai
22cbcd16cb
HADOOP-10717. Revert r1603571.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1603620 13f79535-47bb-0310-9956-ffa450edef68
2014-06-18 20:37:25 +00:00
Steve Loughran
bae5837d3b
HADOOP-10717. Missing JSP support in Jetty
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1603534 13f79535-47bb-0310-9956-ffa450edef68
2014-06-18 17:08:45 +00:00
Jason Darrell Lowe
98238a8d4a
YARN-2167. LeveldbIterator should get closed in NMLeveldbStateStoreService#loadLocalizationState() within finally block. Contributed by Junping Du
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1603039 13f79535-47bb-0310-9956-ffa450edef68
2014-06-17 02:12:03 +00:00
Junping Du
072360d128
YARN-1339. Recover DeletionService state upon nodemanager restart. (Contributed by Jason Lowe)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1603036 13f79535-47bb-0310-9956-ffa450edef68
2014-06-17 01:02:16 +00:00
Jian He
95897ca14b
YARN-1885. Fixed a bug that RM may not send application-clean-up signal to NMs where the completed applications previously ran in case of RM restart. Contributed by Wangda Tan
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1603028 13f79535-47bb-0310-9956-ffa450edef68
2014-06-16 23:56:12 +00:00
Bikas Saha
ecfd43a2f1
YARN-2091. Add more values to ContainerExitStatus and pass it from NM to RM and then to app masters (Tsuyoshi OZAWA via bikas)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1601762 13f79535-47bb-0310-9956-ffa450edef68
2014-06-10 20:08:33 +00:00
Vinod Kumar Vavilapalli
23c325ad47
YARN-2115. Replaced RegisterNodeManagerRequest's ContainerStatus with a new NMContainerStatus which has more information that is needed for work-preserving RM-restart. Contributed by Jian He.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1598790 13f79535-47bb-0310-9956-ffa450edef68
2014-05-31 00:20:50 +00:00
Junping Du
66598697a6
YARN-1338. Recover localized resource cache state upon nodemanager restart (Contributed by Jason Lowe)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1598640 13f79535-47bb-0310-9956-ffa450edef68
2014-05-30 15:37:27 +00:00
Junping Du
b29434a5c8
YARN-1362. Distinguish between nodemanager shutdown for decommission vs shutdown for restart. (Contributed by Jason Lowe)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1594421 13f79535-47bb-0310-9956-ffa450edef68
2014-05-14 00:20:53 +00:00
Junping Du
72ba2b35d8
YARN-766. TestNodeManagerShutdown in branch-2 should use Shell to form the output path and a format issue in trunk. (Contributed by Siddharth Seth)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1593660 13f79535-47bb-0310-9956-ffa450edef68
2014-05-10 03:45:01 +00:00
Haohui Mai
d777cb8996
HADOOP-10563. Remove the dependency of jsp in trunk. Contributed by Haohui Mai.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1592069 13f79535-47bb-0310-9956-ffa450edef68
2014-05-02 22:26:02 +00:00
Ivan Mitic
4810e2b849
YARN-1865. ShellScriptBuilder does not check for some error conditions. Contributed by Remus Rusanu.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1588693 13f79535-47bb-0310-9956-ffa450edef68
2014-04-19 18:55:07 +00:00
Jason Darrell Lowe
cda8646cfa
YARN-1940. deleteAsUser() terminates early without deleting more files on error. Contributed by Rushabh S Shah
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1588546 13f79535-47bb-0310-9956-ffa450edef68
2014-04-18 19:24:13 +00:00
Junping Du
0d087f7941
YARN-1750. TestNodeStatusUpdater#testNMRegistration is incorrect in test case. (Wangda Tan via junping_du)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1588343 13f79535-47bb-0310-9956-ffa450edef68
2014-04-17 19:06:48 +00:00
Zhijie Shen
44b6261bfa
YARN-1892. Improved some logs in the scheduler. Contributed by Jian He.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1587717 13f79535-47bb-0310-9956-ffa450edef68
2014-04-15 20:37:44 +00:00
Vinod Kumar Vavilapalli
c6b70f4760
YARN-1933. Fixed test issues with TestAMRestart and TestNodeHealthService. Contributed by Jian He.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1587104 13f79535-47bb-0310-9956-ffa450edef68
2014-04-13 21:51:38 +00:00
Jian He
ed78328d50
YARN-1903. Set exit code and diagnostics when container is killed at NEW/LOCALIZING state. Contributed by Zhijie Shen
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1586522 13f79535-47bb-0310-9956-ffa450edef68
2014-04-11 01:26:36 +00:00
Karthik Kambatla
245012a9d9
YARN-1757. NM Recovery. Auxiliary service support. (Jason Lowe via kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1585783 13f79535-47bb-0310-9956-ffa450edef68
2014-04-08 17:15:58 +00:00
Vinod Kumar Vavilapalli
06579878df
YARN-1775. Enhanced ProcfsBasedProcessTree to optionally add the ability to use smaps for obtaining used memory information. Contributed by Rajesh Balamohan.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1580087 13f79535-47bb-0310-9956-ffa450edef68
2014-03-22 00:01:59 +00:00
Jian He
6a89e57b8d
YARN-1206. Fixed AM container log to show on NM web page after application finishes if log-aggregation is disabled. Contributed by Rohith Sharmaks
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1578614 13f79535-47bb-0310-9956-ffa450edef68
2014-03-17 21:49:06 +00:00
Jonathan Turner Eagles
7944eab64a
YARN-1136. Replace junit.framework.Assert with org.junit.Assert (Chen He via jeagles)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1578539 13f79535-47bb-0310-9956-ffa450edef68
2014-03-17 20:03:35 +00:00
Vinod Kumar Vavilapalli
96e0ca2d27
YARN-1824. Improved NodeManager and clients to be able to handle cross platform application submissions. Contributed by Jian He.
...
MAPREDUCE-4052. Improved MapReduce clients to use NodeManagers' ability to handle cross platform application submissions. Contributed by Jian He.
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1578135 13f79535-47bb-0310-9956-ffa450edef68
2014-03-16 18:32:05 +00:00
Christopher Douglas
53790d3300
YARN-1771. Reduce the number of NameNode operations during localization of
...
public resources using a cache. Contributed by Sangjin Lee
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1577391 13f79535-47bb-0310-9956-ffa450edef68
2014-03-14 00:30:35 +00:00
Vinod Kumar Vavilapalli
8aab8533a1
YARN-1800. Fixed NodeManager to gracefully handle RejectedExecutionException in the public-localizer thread-pool. Contributed by Varun Vasudev.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1576545 13f79535-47bb-0310-9956-ffa450edef68
2014-03-11 23:33:56 +00:00
Vinod Kumar Vavilapalli
0b1304d098
YARN-1781. Modified NodeManagers to allow admins to specify max disk utilization for local disks so as to be able to offline full disks. Contributed by Varun Vasudev.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1575463 13f79535-47bb-0310-9956-ffa450edef68
2014-03-08 00:52:06 +00:00
Vinod Kumar Vavilapalli
1c4047b0e4
YARN-1783. Fixed a bug in NodeManager's status-updater that was losing completed container statuses when NodeManager is forced to resync by the ResourceManager. Contributed by Jian He.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1575437 13f79535-47bb-0310-9956-ffa450edef68
2014-03-07 22:36:47 +00:00
Vinod Kumar Vavilapalli
d07f855892
YARN-1686. Fixed NodeManager to properly handle any errors during re-registration after a RESYNC and thus avoid hanging. Contributed by Rohith Sharma.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1571474 13f79535-47bb-0310-9956-ffa450edef68
2014-02-24 22:41:24 +00:00
Vinod Kumar Vavilapalli
990cffdcfa
YARN-1553. Modified YARN and MR to stop using HttpConfig.isSecure() and
...
instead rely on the http policy framework. And also fix some bugs related
to https handling in YARN web-apps. Contributed by Haohui Mai.
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1568501 13f79535-47bb-0310-9956-ffa450edef68
2014-02-14 20:01:02 +00:00
Sanford Ryza
9024ad4aa0
YARN-1697. NodeManager reports negative running containers (Sandy Ryza)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1567356 13f79535-47bb-0310-9956-ffa450edef68
2014-02-11 20:14:30 +00:00
Karthik Kambatla
d57c6e0fe7
YARN-1672. YarnConfiguration is missing a default for yarn.nodemanager.log.retain-seconds (Naren Koneru via kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1565866 13f79535-47bb-0310-9956-ffa450edef68
2014-02-08 01:55:33 +00:00
Jason Darrell Lowe
3497e76e19
YARN-1575. Public localizer crashes with "Localized unkown resource". Contributed by Jason Lowe
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1561110 13f79535-47bb-0310-9956-ffa450edef68
2014-01-24 18:54:48 +00:00
Jason Darrell Lowe
a6ea460a91
MAPREDUCE-5672. Provide optional RollingFileAppender for container log4j (syslog). Contributed by Gera Shegalov
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1558948 13f79535-47bb-0310-9956-ffa450edef68
2014-01-16 22:56:09 +00:00
Karthik Kambatla
752441434c
YARN-1608. LinuxContainerExecutor has a few DEBUG messages at INFO level (kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1558874 13f79535-47bb-0310-9956-ffa450edef68
2014-01-16 18:42:25 +00:00
Vinod Kumar Vavilapalli
7c8b654ba5
YARN-888. Cleaned up POM files so that non-leaf modules don't include any dependencies and thus compact the dependency list for leaf modules. Contributed by Alejandro Abdelnur.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1557801 13f79535-47bb-0310-9956-ffa450edef68
2014-01-13 18:34:02 +00:00
Jian He
332c2704d4
YARN-1293. Fixed TestContainerLaunch#testInvalidEnvSyntaxDiagnostics failure caused by non-English system locale. Contributed by Tsuyoshi OZAWA.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1556318 13f79535-47bb-0310-9956-ffa450edef68
2014-01-07 19:00:21 +00:00
Jason Darrell Lowe
785c12056c
YARN-1409. NonAggregatingLogHandler can throw RejectedExecutionException. Contributed by Tsuyoshi OZAWA
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1556282 13f79535-47bb-0310-9956-ffa450edef68
2014-01-07 17:11:09 +00:00
Vinod Kumar Vavilapalli
5a1b33507b
YARN-1145. Fixed a potential file-handle leak in the web interface for displaying aggregated logs. Contributed by Rohith Sharma.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1551326 13f79535-47bb-0310-9956-ffa450edef68
2013-12-16 19:27:48 +00:00
Arun Murthy
ef1f556cef
YARN-1438. Ensure container diagnostics includes exception from container launch. Contributed by Steve Loughran.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1548336 13f79535-47bb-0310-9956-ffa450edef68
2013-12-06 00:25:02 +00:00
Jing Zhao
2214871d91
HDFS-5545. Allow specifying endpoints for listeners in HttpServer. Contributed by Haohui Mai.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1546151 13f79535-47bb-0310-9956-ffa450edef68
2013-11-27 18:20:14 +00:00
Bikas Saha
d1fe9e4142
YARN-1053. Diagnostic message from ContainerExitEvent is ignored in ContainerImpl (Omkar Vinit Joshi via bikas)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1543973 13f79535-47bb-0310-9956-ffa450edef68
2013-11-20 22:25:59 +00:00
Vinod Kumar Vavilapalli
cfa783141f
YARN-1210. Changed RM to start new app-attempts on RM restart only after ensuring that previous AM exited or after expiry time. Contributed by Omkar Vinit Joshi.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1543310 13f79535-47bb-0310-9956-ffa450edef68
2013-11-19 05:17:20 +00:00
Sanford Ryza
1a76ccbbc4
YARN-1401. With zero sleep-delay-before-sigkill.ms, no signal is ever sent (Gera Shegalov via Sandy Ryza)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1542038 13f79535-47bb-0310-9956-ffa450edef68
2013-11-14 19:52:14 +00:00
Jonathan Turner Eagles
656e859f5c
YARN-1386. NodeManager mistakenly loses resources and relocalizes them (Jason Lowe via jeagles)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1541375 13f79535-47bb-0310-9956-ffa450edef68
2013-11-13 03:17:23 +00:00
Chris Nauroth
218ef59d89
YARN-1357. TestContainerLaunch.testContainerEnvVariables fails on Windows. Contributed by Chuan Liu.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1537293 13f79535-47bb-0310-9956-ffa450edef68
2013-10-30 20:49:12 +00:00
Sanford Ryza
defa7af4d3
YARN-1109. Demote NodeManager "Sending out status for container" logs to debug (haosdent via Sandy Ryza)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1536573 13f79535-47bb-0310-9956-ffa450edef68
2013-10-29 00:54:12 +00:00
Sanford Ryza
6a55ed333a
YARN-1295. In UnixLocalWrapperScriptBuilder, using bash -c can cause Text file busy errors. (Sandy Ryza)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1532532 13f79535-47bb-0310-9956-ffa450edef68
2013-10-15 21:14:47 +00:00
Sanford Ryza
5a87d89366
Add missing file TestCgroupsLCEResourcesHandler for YARN-1284.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1530493 13f79535-47bb-0310-9956-ffa450edef68
2013-10-09 05:07:41 +00:00
Sanford Ryza
726c3538a7
YARN-1284. LCE: Race condition leaves dangling cgroups entries for killed containers. (Alejandro Abdelnur via Sandy Ryza)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1530492 13f79535-47bb-0310-9956-ffa450edef68
2013-10-09 05:05:17 +00:00
Vinod Kumar Vavilapalli
21181b6553
YARN-1277. Added a policy based configuration for http/https in common HttpServer and using the same in YARN - related
...
to per project https config support via HADOOP-10022. Contributed by Suresh Srinivas and Omkar Vinit Joshi.
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1529662 13f79535-47bb-0310-9956-ffa450edef68
2013-10-06 18:43:36 +00:00
Vinod Kumar Vavilapalli
44f7ee7192
YARN-1278. Fixed NodeManager to not delete local resources for apps on resync command from RM - a bug caused by YARN-1149. Contributed by Hitesh Shah.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1529657 13f79535-47bb-0310-9956-ffa450edef68
2013-10-06 18:32:16 +00:00
Vinod Kumar Vavilapalli
52b0ce3565
YARN-1274. Fixed NodeManager's LinuxContainerExecutor to create user, app-dir and log-dirs correctly even when there are no resources to localize for the container. Contributed by Siddharth Seth.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1529555 13f79535-47bb-0310-9956-ffa450edef68
2013-10-06 00:14:55 +00:00