hadoop/hadoop-common-project
Steve Loughran 936e9e15d0
MAPREDUCE-7435. Manifest Committer OOM on abfs (#5519)
This modifies the manifest committer so that the list of files
to rename is passed between stages as a file of
writable entries on the local filesystem, rather than
being held in memory.

The map of directories to create is still passed in memory.
This map is built across all tasks, so even if many tasks
create files, provided they all write into the same set of
directories the memory needed is O(directories); the
task count is not a factor.
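
The two paragraphs above can be illustrated with a minimal sketch. It is not
the committer's actual code: the class RenameEntrySpill, its Entry type and
the spill/replay methods are hypothetical names, and plain java.io streams
stand in for whatever serialization the committer really uses. The sketch
writes each rename entry to a local file as it arrives and keeps only the set
of destination directories in memory.

```java
import java.io.BufferedInputStream;
import java.io.BufferedOutputStream;
import java.io.DataInputStream;
import java.io.DataOutputStream;
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.ArrayList;
import java.util.HashSet;
import java.util.List;
import java.util.Set;
import java.util.function.Consumer;

/** Hypothetical illustration only; not part of the Hadoop codebase. */
public class RenameEntrySpill {

    /** One file to rename: source path and destination path. */
    public static final class Entry {
        public final String source;
        public final String dest;
        public Entry(String source, String dest) {
            this.source = source;
            this.dest = dest;
        }
    }

    /**
     * Write every entry to spillFile and record the parent directory of each
     * destination in dirs. Heap use grows with the number of distinct
     * directories, not with the number of entries.
     * @return the number of entries written
     */
    public static int spill(Iterable<Entry> entries, Path spillFile,
                            Set<String> dirs) throws IOException {
        int count = 0;
        try (DataOutputStream out = new DataOutputStream(
                new BufferedOutputStream(Files.newOutputStream(spillFile)))) {
            for (Entry e : entries) {
                out.writeUTF(e.source);
                out.writeUTF(e.dest);
                int slash = e.dest.lastIndexOf('/');
                // Paths with no parent component are treated as being in "/".
                dirs.add(slash > 0 ? e.dest.substring(0, slash) : "/");
                count++;
            }
        }
        return count;
    }

    /** Read back count entries one at a time and hand each to action. */
    public static void replay(Path spillFile, int count,
                              Consumer<Entry> action) throws IOException {
        try (DataInputStream in = new DataInputStream(
                new BufferedInputStream(Files.newInputStream(spillFile)))) {
            for (int i = 0; i < count; i++) {
                String source = in.readUTF();
                String dest = in.readUTF();
                action.accept(new Entry(source, dest));
            }
        }
    }

    public static void main(String[] args) throws IOException {
        List<Entry> entries = new ArrayList<>();
        for (int i = 0; i < 1000; i++) {
            entries.add(new Entry("/job/task-" + i + "/part-" + i,
                                  "/out/year=2023/part-" + i));
        }
        Path spillFile = Files.createTempFile("rename-entries", ".bin");
        Set<String> dirs = new HashSet<>();
        int count = spill(entries, spillFile, dirs);
        System.out.println(count + " entries spilled, "
                + dirs.size() + " directory to create");
        Files.delete(spillFile);
    }
}
```

In this sketch a later stage would call replay to stream the entries back
from disk, so the full list never has to sit on the heap at once.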

The _SUCCESS file reports heap size through gauges,
which should give a warning if memory problems arise.
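
As a rough sketch of what reporting heap size through gauges could look like,
the snippet below samples the JVM heap at a named stage. The class HeapGauges
and the gauge names are hypothetical; the names the committer actually writes
into _SUCCESS are not given here.

```java
import java.util.LinkedHashMap;
import java.util.Map;

/** Hypothetical illustration only; gauge names are invented for this sketch. */
public class HeapGauges {

    /** Sample the current JVM heap figures, in bytes, for the given stage. */
    public static Map<String, Long> snapshot(String stage) {
        Runtime rt = Runtime.getRuntime();
        long total = rt.totalMemory();
        long free = rt.freeMemory();
        Map<String, Long> gauges = new LinkedHashMap<>();
        gauges.put(stage + ".heap.used", total - free);
        gauges.put(stage + ".heap.total", total);
        gauges.put(stage + ".heap.max", rt.maxMemory());
        return gauges;
    }

    public static void main(String[] args) {
        // Print one gauge per line, as a report file might record them.
        snapshot("commit").forEach((k, v) -> System.out.println(k + "=" + v));
    }
}
```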

Contributed by Steve Loughran
2023-06-12 13:43:43 +01:00
hadoop-annotations HADOOP-18305. Preparing for 3.3.4 release: branch-3.3 version => 3.3.9 (#4482) 2022-06-22 13:09:50 +01:00
hadoop-auth HADOOP-18687. Remove json-smart dependency. (#5549 + #5524) 2023-05-09 17:34:36 +01:00
hadoop-auth-examples HADOOP-18305. Preparing for 3.3.4 release: branch-3.3 version => 3.3.9 (#4482) 2022-06-22 13:09:50 +01:00
hadoop-common MAPREDUCE-7435. Manifest Committer OOM on abfs (#5519) 2023-06-12 13:43:43 +01:00
hadoop-kms HADOOP-18305. Preparing for 3.3.4 release: branch-3.3 version => 3.3.9 (#4482) 2022-06-22 13:09:50 +01:00
hadoop-minikdc HADOOP-18329. Support for IBM Semeru JVM > 11.0.15.0 Vendor Name Changes (#4537) (#5208) 2022-12-12 17:28:56 +00:00
hadoop-nfs HADOOP-18584. [NFS GW] Fix regression after netty4 migration. (#5252) 2023-02-01 05:33:01 -08:00
hadoop-registry HADOOP-18519. Backport HDFS-15383 and HADOOP-17835 to branch-3.3 (#5112) 2022-11-07 13:29:50 -08:00
pom.xml HADOOP-11867. Add a high-performance vectored read API. (#3904) 2022-06-23 17:09:16 -05:00