lujiefsi
ae95caa60e
YARN-10976. Fix resource leak due to Files.walk ( #3552 )
...
Signed-off-by: Akira Ajisaka <aajisaka@apache.org>
2021-10-18 15:24:15 +09:00
Viraj Jasani
d336227e5c
HADOOP-17963. Replace Guava VisibleForTesting by Hadoop's own annotation in hadoop-yarn-project modules ( #3541 )
2021-10-14 18:03:01 +09:00
Viraj Jasani
1151edf12e
HADOOP-17956. Replace all default Charset usage with UTF-8 ( #3529 )
...
Signed-off-by: Akira Ajisaka <aajisaka@apache.org>
2021-10-14 13:07:24 +09:00
Szilard Nemeth
8c271caa81
YARN-6862. Nodemanager resource usage metrics sometimes are negative. Contributed by Benjamin Teke
2021-10-12 17:35:06 +02:00
Tamas Domok
0ba6f35dc2
YARN-10355. Refactor NM ContainerLaunch.java#orderEnvByDependencies
...
Co-authored-by: Tamas Domok <tdomok@cloudera.com>
2021-08-04 15:27:46 +02:00
Tamas Domok
a5811dda7b
YARN-10874. Refactor NM ContainerLaunch#getEnvDependencies's unit tests ( #3248 )
...
Co-authored-by: Tamas Domok <tdomok@cloudera.com>
2021-08-03 16:19:04 +02:00
minni31
d78b300ed4
YARN-10841. Fix token reset synchronization for UAM response token. ( #3194 )
...
YARN-10841. Fix token reset synchronization for UAM response token. Contributed by Minni Mittal
2021-07-29 14:55:39 +05:30
Artem Smotrakov
ba325a8ada
HADOOP-17793. Better token validation ( #3189 )
...
Signed-off-by: Akira Ajisaka <aajisaka@apache.org>
2021-07-10 13:42:31 +09:00
Viraj Jasani
618c9218ee
HADOOP-17788. Replace IOUtils#closeQuietly usages by Hadoop's own utility ( #3171 )
...
Reviewed-by: Steve Loughran <stevel@apache.org>
Reviewed-by: Akira Ajisaka <aajisaka@apache.org>
Signed-off-by: Takanobu Asanuma <tasanuma@apache.org>
2021-07-08 16:03:40 +09:00
Jim Brennan
7c7d02edbd
YARN-10824. Title not set for JHS and NM webpages. Contributed by Bilwa S T.
2021-06-25 20:32:08 +00:00
Viraj Jasani
81d7069316
YARN-10805. Replace Guava Lists usage by Hadoop's own Lists in hadoop-yarn-project ( #3075 )
2021-06-09 15:15:47 +09:00
Hideyuki Furue
9c7b8cf54e
Fix container-executor
...
Signed-off-by: Akira Ajisaka <aajisaka@apache.org>
2021-06-06 07:05:29 +09:00
Viraj Jasani
996d31f2dc
HADOOP-17721. Replace Guava Sets usage by Hadoop's own Sets in hadoop-yarn-project ( #3033 )
...
Signed-off-by: Takanobu Asanuma <tasanuma@apache.org>
2021-05-25 18:10:20 +09:00
Peter Bacsko
eb72628e15
YARN-10258. Add metrics for 'ApplicationsRunning' in NodeManager. Contributed by ANANDA G B.
2021-05-17 21:07:29 +02:00
Wei-Chiu Chuang
f1e1809029
HADOOP-17653. Do not use guava's Files.createTempDir(). ( #2945 )
...
Reviewed-by: Steve Loughran <stevel@apache.org>
Signed-off-by: Akira Ajisaka <aajisaka@apache.org>
2021-05-02 11:10:32 +09:00
Eric Badger
803ac4b1a0
YARN-10707. Support custom resources in ResourceUtilization, and update Node GPU Utilization to use. Contributed by Qi Zhu
2021-04-29 17:42:13 +00:00
Jim Brennan
1cbe35946b
YARN-10743. Add a policy for not aggregating for containers which are killed because exceeding container log size limit. Contributed by Qi Zhu.
2021-04-23 21:35:09 +00:00
Viraj Jasani
9a261b0a16
HADOOP-17642. Remove appender EventCounter to avoid instantiation ( #2922 )
...
Signed-off-by: Akira Ajisaka <aajisaka@apache.org>
2021-04-17 20:47:44 +09:00
Akira Ajisaka
8ad77a26c9
YARN-10716. Fix typo in ContainerRuntime. Contributed by xishuhai.
2021-03-30 17:46:51 +09:00
Jack Jiang
d8ec8ab965
HADOOP-17599. Remove NULL checks before instanceof ( #2804 )
2021-03-23 08:46:11 -07:00
Eric Badger
181b9ddc51
YARN-10703. Fix potential null pointer error of gpuNodeResourceUpdateHandler in NodeResourceMonitorImpl. Contributed by Qi Zhu.
2021-03-18 17:45:09 +00:00
Peter Bacsko
38495af325
YARN-10692. Add Node GPU Utilization and apply to NodeMetrics. Contributed by Qi Zhu.
2021-03-18 12:46:11 +01:00
Jim Brennan
5f067cf0f3
[YARN-10687] Add option to disable/enable free disk space checking and percentage checking for full and not-full disks. Contributed by Qi Zhu.
2021-03-12 17:17:31 +00:00
Eric Badger
9d088639bf
YARN-10664. Allow parameter expansion in NM_ADMIN_USER_ENV. Contributed by Jim
...
Brennan.
2021-03-05 19:46:40 +00:00
Jim Brennan
3fcc7a55b6
[YARN-10626] Log resource allocation in NM log at container start time. Contributed by Eric Badger
2021-02-16 16:45:35 +00:00
Szilard Nemeth
cacc870389
YARN-10593. Fix incorrect string comparison in GpuDiscoverer. Contributed by Peter Bacsko
2021-02-10 13:09:32 +01:00
Jim Brennan
c22c77af43
[YARN-10607] User environment is unable to prepend PATH when mapreduce.admin.user.env also sets PATH. Contributed by Eric Badger.
2021-02-05 17:33:01 +00:00
belugabehr
21a3fc3d2d
HADOOP-17482: Remove Commons Logger from FileSystem Class ( #2633 )
2021-02-01 09:40:01 -08:00
Eric Badger
768e2f42ba
YARN-10562. Follow up changes for YARN-9833. Contributed by Jim Brennan.
2021-01-13 23:41:52 +00:00
Eric Payne
d8fadec944
YARN-4589: Diagnostics for localization timeouts is lacking. Contributed by Chang Li (lichangleo) and Jim Brennan (Jim_Brennan)
2021-01-13 17:01:28 +00:00
Eric Badger
4c5d88e230
YARN-10540. Node page is broken in YARN UI1 and UI2 including RMWebService api
...
for nodes. Contributed by Jim Brennan.
2020-12-21 23:10:31 +00:00
adamantal
0a45bd034e
YARN-10520. Deprecated the residual nested class for the LCEResourceHandler ( #2527 ). Contributed by Wanqiang Ji
2020-12-09 12:55:24 +01:00
Eric Badger
7d3c8ef606
YARN-10495. make the rpath of container-executor configurable. Contributed by
...
angerszhu.
2020-12-08 00:53:24 +00:00
Ahmed Hussein
569b20e31c
YARN-10468. Fix TestNodeStatusUpdater timeouts and broken conditions ( #2461 )
2020-11-24 11:09:30 -08:00
Eric E Payne
55339c2bdd
YARN-10479. RMProxy should retry on SocketTimeout Exceptions. Contributed by Jim Brennan (Jim_Brennan)
2020-11-05 21:50:46 +00:00
Jim Brennan
8ee6bc2518
YARN-10471. Prevent logs for any container from becoming larger than a configurable size. Contributed by Eric Payne
2020-10-29 19:49:18 +00:00
Szilard Nemeth
31477a224f
YARN-10460. Upgrading to JUnit 4.13 causes tests in TestNodeStatusUpdater to fail. Contributed by Peter Bacsko
2020-10-19 13:17:51 +02:00
Ayush Saxena
1e3a6efcef
HADOOP-17288. Use shaded guava from thirdparty. ( #2342 ). Contributed by Ayush Saxena.
2020-10-17 12:01:18 +05:30
Jim Brennan
deb35a32ba
YARN-10455. TestNMProxy.testNMProxyRPCRetry is not consistent. Contributed by Ahmed Hussein
2020-10-08 18:46:47 +00:00
Adam Antal
a1f7e760df
YARN-10393. MR job live lock caused by completed state container leak in heartbeat between node manager and RM. Contributed by zhenzhao wang and Jim Brennan
2020-10-05 10:09:14 +02:00
Jim Brennan
90894ea641
YARN-10430. Log improvements in NodeStatusUpdaterImpl. Contributed by Bilwa S T.
2020-09-14 21:19:14 +00:00
Szilard Nemeth
4ffe26f9b8
[UI1] Provide a way to hide Tools section in Web UIv1. Contributed by Andras Gyori
2020-08-31 16:00:39 +02:00
Jim Brennan
4a7deae478
YARN-10391. --module-gpu functionality is broken in container-executor. Contributed by Eric Badger
2020-08-17 16:41:15 +00:00
bibinchundatt
81da221c75
YARN-10388. RMNode updatedCapability flag not set while RecommissionNodeTransition. Contributed by Pranjal Protim Borah
2020-08-07 08:36:52 +05:30
Brahma Reddy Battula
eac558380f
YARN-10229. [Federation] Client should be able to submit application to RM directly using normal client conf. Contributed by Bilwa S T.
2020-08-03 12:54:36 +05:30
bibinchundatt
5323e83edf
YARN-10359. Log container report only if list is not empty. Contributed by Bilwa S T.
2020-08-01 13:03:46 +05:30
Eric E Payne
e0c9653166
YARN-1529: Add Localization overhead metrics to NM. Contributed by Jim_Brennan.
2020-07-30 15:30:22 +00:00
Eric Badger
ac5f21dbef
YARN-4771. Some containers can be skipped during log aggregation after NM
...
restart. Contributed by Jason Lowe and Jim Brennan.
2020-07-24 22:35:16 +00:00
Eric Badger
736bed6d6d
[YARN-10353] Log vcores used and cumulative cpu in containers monitor.
...
Contributed by Jim Brennan
2020-07-20 18:08:27 +00:00
Ayush Saxena
6bcb24d269
HADOOP-17100. Replace Guava Supplier with Java8+ Supplier in Hadoop. Contributed by Ahmed Hussein.
2020-07-18 14:33:43 +05:30
Ahmed Hussein
1f71c4ae71
HADOOP-17099. Replace Guava Predicate with Java8+ Predicate
...
Signed-off-by: Jonathan Eagles <jeagles@gmail.com>
2020-07-15 11:39:06 -05:00
Xiaoyu Yao
f91a8ad88b
HADOOP-17079. Optimize UGI#getGroups by adding UGI#getGroupsSet. ( #2085 )
2020-07-09 11:33:37 -07:00
Akira Ajisaka
639acb6d89
HADOOP-17111. Replace Guava Optional with Java8+ Optional. Contributed by Ahmed Hussein.
2020-07-06 16:08:36 +09:00
Eric Yang
e8dc862d38
YARN-9809. Added node manager health status to resource manager registration call.
...
Contributed by Eric Badger via eyang
2020-06-30 11:39:16 -07:00
Vinayakumar B
e154084770
HADOOP-17046. Support downstreams' existing Hadoop-rpc implementations using non-shaded protobuf classes ( #2026 )
2020-06-12 23:16:33 +05:30
Ahmed Hussein
f5081a9a5d
YARN-10256. Refactor TestContainerSchedulerQueuing.testContainerUpdateExecTypeGuaranteedToOpportunistic (Ahmed Hussein via jeagles)
...
Signed-off-by: Jonathan Eagles <jeagles@gmail.com>
2020-05-04 10:13:51 -05:00
Wilfred Spiegelenburg
2214005c0f
YARN-10063. Add container-executor arguments --http/--https to usage. Contributed by Siddharth Ahuja
2020-04-08 11:30:03 +10:00
Brahma Reddy Battula
8914cf9167
Preparing for 3.4.0 development
2020-03-29 23:24:25 +05:30
Eric E Payne
ede05b19d1
YARN-942. TestContainerSchedulerQueuing.testKillOnlyRequiredOpportunisticContainers fails sporadically Contributed by Ahmed Hussein (ahussein)
2020-03-10 14:17:45 +00:00
Szilard Nemeth
44afe1154d
YARN-9419. Log a warning if GPU isolation is enabled but LinuxContainerExecutor is disabled. Contribued by Andras Gyori
2020-03-09 16:08:24 +01:00
bibinchundatt
3859fa76d0
YARN-6924. Metrics for Federation AMRMProxy. Contributed by Young Chen
2020-03-07 09:34:42 +05:30
Eric Badger
2649f8b327
YARN-10173. Make pid file generation timeout configurable in case of reacquired
...
container. Contributed by Adam Antal.
2020-03-04 23:31:57 +00:00
Szilard Nemeth
3f1aad05f0
YARN-10147. FPGA plugin can't find the localized aocx file. Contributed by Peter Bacsko
2020-02-19 15:54:25 +01:00
Vinayakumar B
7dac7e1d13
HADOOP-16596. [pb-upgrade] Use shaded protobuf classes from hadoop-thirdparty dependency ( #1635 ). Contributed by Vinayakumar B.
2020-02-07 14:51:24 +05:30
Szilard Nemeth
71b2c2ffe9
YARN-10101. Support listing of aggregated logs for containers belonging to an application attempt. Contributed by Adam Antal
2020-02-06 12:25:06 +01:00
Prabhu Joseph
825db8fe2a
YARN-10107. Fix GpuResourcePlugin#getNMResourceInfo to honor Auto Discovery Enabled
...
Contributed by Szilard Nemeth.
2020-01-29 13:30:00 +05:30
Akira Ajisaka
14c2c3d69d
YARN-9512. [JDK11] TestAuxServices#testCustomizedAuxServiceClassPath fails because of ClassCastException. ( #1806 )
2020-01-16 12:21:50 +09:00
Eric Yang
2301b25899
YARN-10019. Improved container-executor exec() calls.
...
Contributed by Peter Bacsko
2020-01-10 19:04:04 -05:00
Eric E Payne
93233a7d6e
YARN-9018. Add functionality to AuxiliaryLocalPathHandler to return all locations to read for a given path. Contributed by Kuhu Shukla (kshukla)
2020-01-09 17:18:44 +00:00
Szilard Nemeth
631dbbc6f2
YARN-9923. Introduce HealthReporter interface to support multiple health checker files. Contributed by Adam Antal
2019-12-15 17:28:04 +01:00
Eric Badger
8dffd8dc89
YARN-9561. Add C changes for the new RuncContainerRuntime. Contributed by Eric Badger
2019-12-09 01:25:10 +00:00
Eric Badger
ab2cc5ac92
Revert "YARN-9561. Add C changes for the new RuncContainerRuntime. Contributed by Eric Badger"
...
This reverts commit 76bb297db9
.
2019-12-09 01:25:10 +00:00
Eric Badger
76bb297db9
YARN-9561. Add C changes for the new RuncContainerRuntime. Contributed by Eric Badger
2019-12-06 18:33:58 +00:00
Wanqiang Ji
c48de9aa2d
YARN-9958. Remove the invalid lock in ContainerExecutor ( #1704 )
2019-12-04 14:35:41 +09:00
Szilard Nemeth
828ab400ee
YARN-9362. Code cleanup in TestNMLeveldbStateStoreService. Contributed by Denes Gerencser
2019-11-26 21:45:12 +01:00
Eric Badger
6c20512b68
Revert "YARN-9561. Add C changes for the new RuncContainerRuntime. Contributed by Eric Badger"
...
This reverts commit 289bbca870
.
2019-11-25 17:46:42 +00:00
Szilard Nemeth
4c1a1287bc
YARN-9968. Public Localizer is exiting in NodeManager due to NullPointerException. Contributed by Tarun Parimi
2019-11-22 12:58:10 +01:00
Abhishek Modi
dc3f4fc2f4
YARN-9965. Fix NodeManager failing to start on subsequent times when Hdfs Auxillary Jar is set (addendum). Contributed by Prabhu Joseph.
2019-11-19 08:08:40 +05:30
Eric Badger
0e22e9ab83
YARN-9562. Add Java changes for the new RuncContainerRuntime. Contributed by Eric Badger
2019-11-18 21:56:31 +00:00
Eric Badger
289bbca870
YARN-9561. Add C changes for the new RuncContainerRuntime. Contributed by Eric Badger
2019-11-18 21:54:16 +00:00
Abhishek Modi
516377bfa6
YARN-9965. Fix NodeManager failing to start when Hdfs Auxillary Jar is set. Contributed by Prabhu Joseph.
2019-11-11 16:01:44 +05:30
Szilard Nemeth
31f172fd96
YARN-9677. Make FpgaDevice and GpuDevice classes more similar to each other. Contributed by kevin su
2019-11-09 14:04:14 +01:00
Eric Badger
eef34f2d87
YARN-9914. Use separate configs for free disk space checking for full and not-full disks. Contributed by Jim Brennan
2019-10-25 17:13:04 +00:00
Eric Yang
ac6b6a6a85
HADOOP-16614. Add aarch64 support for dependent leveldbjni.
...
Contributed by liusheng
close #1546
2019-10-24 11:45:57 -04:00
Eric Yang
72b1bed998
YARN-9884. Make container-executor mount logic modular
...
Contributed by Eric Badger
2019-10-18 19:10:32 -04:00
Eric Yang
31e0122f4d
YARN-9860. Enable service mode for Docker containers on YARN
...
Contributed by Prabhu Joseph and Shane Kumpf
2019-10-10 19:02:02 -04:00
Szilard Nemeth
6f1ab95168
YARN-9128. Use SerializationUtils from apache commons to serialize / deserialize ResourceMappings. Contributed by Zoltan Siegl
2019-10-09 13:26:26 +02:00
Abhishek Modi
4d3c580b03
YARN-9859. Refactoring of OpportunisticContainerAllocator. Contributed by Abhishek Modi.
2019-09-30 23:40:15 +05:30
Szilard Nemeth
bec0864394
YARN-9808. Zero length files in container log output haven't got a header. Contributed by Adam Antal
2019-09-25 10:28:34 +02:00
Duo Zhang
bfe1dac29d
HADOOP-16560. [YARN] use protobuf-maven-plugin to generate protobuf classes ( #1496 )
...
Signed-off-by: Vinayakumar B <vinayakumarb@apache.org>
2019-09-24 08:50:36 +08:00
Sunil G
c474e24c0b
YARN-9833. Race condition when DirectoryCollection.checkDirs() runs during container launch. Contributed by Peter Bacsko.
2019-09-17 19:41:38 +05:30
HUAN-PING SU
4239007349
YARN-9787. Typo in analysesErrorMsg. Contributed by kevin su.
...
Signed-off-by: Wei-Chiu Chuang <weichiu@apache.org>
Reviewed-by: Surendra Singh Lilhore <surendralilhore@apache.org>
2019-09-13 10:36:45 -07:00
Abhishek Modi
b939ae98e6
YARN-8174. Add containerId to ResourceLocalizationService fetch failure log statement. Contributed by Prabhu Joseph.
2019-09-02 10:45:59 +05:30
Szilard Nemeth
e8fa192f07
YARN-9217. Nodemanager will fail to start if GPU is misconfigured on the node or GPU drivers missing. Contributed by Peter Bacsko
2019-08-21 16:44:22 +02:00
bibinchundatt
e684b17e6f
YARN-5857. TestLogAggregationService.testFixedSizeThreadPool fails intermittently on trunk. Contributed by Bilwa S T.
2019-08-21 17:14:42 +05:30
Szilard Nemeth
2216ec54e5
YARN-9100. Add tests for GpuResourceAllocator and do minor code cleanup. Contributed by Peter Bacsko
2019-08-16 09:13:20 +02:00
Szilard Nemeth
2a05e0ff3b
YARN-9749. TestAppLogAggregatorImpl#testDFSQuotaExceeded fails on trunk. Contributed by Adam Antal
2019-08-16 08:52:09 +02:00
Adam Antal
22c4f38c4b
YARN-9679. Regular code cleanup in TestResourcePluginManager ( #1122 )
2019-08-15 17:32:05 +02:00
HUAN-PING SU
167acd87da
YARN-9683. Remove reapDockerContainerNoPid left behind by YARN-9074 ( #1212 ) Contributed by Kevin Su.
...
Reviewed-by: Eric Yang <eyang@apache.org>
Reviewed-by: Adam Antal <adam.antal@cloudera.com>
2019-08-14 10:42:29 -07:00
Adam Antal
c89bdfacc8
YARN-9676. Add DEBUG and TRACE level messages to AppLogAggregatorImpl… ( #1261 )
...
* YARN-9676. Add DEBUG and TRACE level messages to AppLogAggregatorImpl and connected classes
* Using {} placeholder, and increasing loglevel if log aggregation failed.
2019-08-14 17:35:16 +02:00