hadoop/dev-support
Mukund Thakur 5c348c41ab HADOOP-11867. Add a high-performance vectored read API. (#3904)
part of HADOOP-18103.
Add support for multiple ranged vectored read api in PositionedReadable.
The default iterates through the ranges to read each synchronously,
but the intent is that FSDataInputStream subclasses can make more
efficient readers especially in object stores implementation.

Also added implementation in S3A where smaller ranges are merged and
sliced byte buffers are returned to the readers. All the merged ranged are
fetched from S3 asynchronously.

Contributed By: Owen O'Malley and Mukund Thakur

 Conflicts:
	hadoop-common-project/hadoop-common/src/main/java/org/apache/hadoop/fs/RawLocalFileSystem.java
	pom.xml
2022-06-23 17:09:16 -05:00
..
bin HADOOP-18240. Upgrade Yetus to 0.14.0 (#4328) 2022-05-25 17:32:19 +09:00
byteman HDDS-342. Add example byteman script to print out hadoop rpc traffic. 2018-08-22 14:48:22 -07:00
docker HADOOP-18048. [branch-3.3] Dockerfile_aarch64 build fails with fatal error: Python.h: No such file or directory (#3801) 2021-12-14 17:56:58 -08:00
git-jira-validation HADOOP-18251. Fix failure of extracting JIRA id from commit message in git_jira_fix_version_check.py. (#4344) 2022-05-26 03:28:55 +00:00
findHangingTest.sh
Jenkinsfile HADOOP-11867. Add a high-performance vectored read API. (#3904) 2022-06-23 17:09:16 -05:00
README.md
verify-xml.sh
win-paths-eg.cmd

This directory contains tools to help in the development and release of Apache Hadoop.

  • bin

    • releasedocmaker

      Build release notes for a given Hadoop project or subproject. This is called from Maven when -Preleasedocs is used. See BUILDING.txt for more information.

    • shelldocs

      Create documentation for the Unix Shell API. This is called from Maven when -Pshelldocs is used.

    • smart-apply-patch

      Intelligently apply a patch file to a source tree.

    • test-patch

      Test a patch against a source tree.

  • create-release.sh

    Helps REs create a release of Apache Hadoop for distribution.

  • determine-flaky-tests-hadoop.py

    Given a jenkins test job, this script examines all runs of the job done within specified period of time (number of days prior to the execution time of this script), and reports all failed tests.

  • docker

    Various helpers for the start-build-env.sh script, including the Dockerfile itself. See parent BUILDING.txt for more information.

  • findHangingTest.sh

    Finds hanging test from Jenkins build output.

Previously, the scripts test-patch.sh, smart-apply-patch.sh, releasedocmaker.py, and shelldocs.py were in this directory. They have been moved to the Apache Yetus project (https://yetus.apache.org). These scripts have been replaced with wrapper scripts located in the bin directory. Command line options are generally different than the previous versions that shipped with older versions of Apache Hadoop.

The wrapper scripts will download, verify (if GPG is installed), and cache a local copy of Apache Yetus in the hadoop/patchprocess directory. The version that is used may be overridden by setting the HADOOP_YETUS_VERSION environment variable. The cache directory may be overwritten by setting the HADOOP_PATCHPROCESS directory. If a local version of Apache Yetus is already installed, it may be used instead by setting the YETUS_HOME environment variable to point to that directory.