hadoop

History

Steve Loughran 6574f27fa3 HADOOP-16570. S3A committers encounter scale issues. Contributed by Steve Loughran. This addresses two scale issues which has surfaced in large scale benchmarks of the S3A Committers. * Thread pools are not cleaned up. This now happens, with tests. * OOM on job commit for jobs with many thousands of tasks, each generating tens of (very large) files. Instead of loading all pending commits into memory as a single list, the list of files to load is the sole list which is passed around; .pendingset files are loaded and processed in isolation -and reloaded if necessary for any abort/rollback operation. The parallel commit/abort/revert operations now work at the .pendingset level, rather than that of individual pending commit files. The existing parallelized Tasks API is still used to commit those files, but with a null thread pool, so as to serialize the operations. Change-Id: I5c8240cd31800eaa83d112358770ca0eb2bca797		2019-10-04 18:54:22 +01:00
..
hadoop-aliyun	HADOOP-16331. Fix ASF License check in pom.xml	2019-05-29 17:25:13 +09:00
hadoop-archive-logs	HADOOP-16331. Fix ASF License check in pom.xml	2019-05-29 17:25:13 +09:00
hadoop-archives	HADOOP-16331. Fix ASF License check in pom.xml	2019-05-29 17:25:13 +09:00
hadoop-aws	HADOOP-16570. S3A committers encounter scale issues.	2019-10-04 18:54:22 +01:00
hadoop-azure	HADOOP-16578 : Avoid FileSystem API calls when FileSystem already exists	2019-10-01 17:38:11 -07:00
hadoop-azure-datalake	HADOOP-16605. Fix testcase testSSLChannelModeConfig	2019-10-03 11:13:55 +01:00
hadoop-datajoin	HADOOP-16331. Fix ASF License check in pom.xml	2019-05-29 17:25:13 +09:00
hadoop-distcp	HDFS-13660. DistCp job fails when new data is appended in the file while the DistCp copy job is running	2019-09-24 11:23:24 +01:00
hadoop-dynamometer	HDFS-14637. Namenode may not replicate blocks to meet the policy after enabling upgradeDomain. Contributed by Stephen O'Donnell.	2019-10-03 22:13:50 -07:00
hadoop-extras	HADOOP-16331. Fix ASF License check in pom.xml	2019-05-29 17:25:13 +09:00
hadoop-fs2img	HADOOP-16557. [pb-upgrade] Upgrade protobuf.version to 3.7.1 (#1432 )	2019-09-20 16:08:30 +05:30
hadoop-gridmix	HADOOP-16331. Fix ASF License check in pom.xml	2019-05-29 17:25:13 +09:00
hadoop-kafka	HADOOP-16331. Fix ASF License check in pom.xml	2019-05-29 17:25:13 +09:00
hadoop-openstack	HADOOP-16431. Remove useless log in IOUtils.java and ExceptionDiags.java.	2019-07-24 10:04:39 +09:00
hadoop-pipes	HADOOP-16331. Fix ASF License check in pom.xml	2019-05-29 17:25:13 +09:00
hadoop-resourceestimator	HADOOP-16331. Fix ASF License check in pom.xml	2019-05-29 17:25:13 +09:00
hadoop-rumen	HADOOP-16331. Fix ASF License check in pom.xml	2019-05-29 17:25:13 +09:00
hadoop-sls	YARN-9782. Avoid DNS resolution while running SLS. Contributed by Abhishek Modi.	2019-10-04 14:45:10 +05:30
hadoop-streaming	HADOOP-16331. Fix ASF License check in pom.xml	2019-05-29 17:25:13 +09:00
hadoop-tools-dist	HADOOP-16331. Fix ASF License check in pom.xml	2019-05-29 17:25:13 +09:00
pom.xml	HDFS-12345 Add Dynamometer to hadoop-tools, a tool for scale testing the HDFS NameNode with real metadata and workloads. Contributed by Erik Krogen.	2019-06-25 08:07:39 -07:00