hadoop

Author	SHA1	Message	Date
Wei-Chiu Chuang	7db9895000	Post release update * Add jdiff xml files from 3.3.6 release. * Declare 3.3.6 as the latest stable release. * Copy release notes.	2023-06-26 15:59:39 +00:00
Steve Loughran	ab594ec77e	HADOOP-18724. Open file fails with NumberFormatException for S3AFileSystem (#5611 ) This: 1. Adds optLong, optDouble, mustLong and mustDouble methods to the FSBuilder interface to let callers explicitly passin long and double arguments. 2. The opt() and must() builder calls which take float/double values now only set long values instead, so as to avoid problems related to overloaded methods resulting in a ".0" being appended to a long value. 3. All of the relevant opt/must calls in the hadoop codebase move to the new methods 4. And the s3a code is resilient to parse errors in is numeric options -it will downgrade to the default. This is nominally incompatible, but the floating-point builder methods were never used: nothing currently expects floating point numbers. For anyone who wants to safely set numeric builder options across all compatible releases, convert the number to a string and then use the opt(String, String) and must(String, String) methods. Contributed by Steve Loughran	2023-05-16 13:41:17 +01:00
Wei-Chiu Chuang	99312bdfdb	HADOOP-18671. Add recoverLease(), setSafeMode(), isFileClosed() as interfaces to hadoop-common (#5553 ) (#5619 ) * HADOOP-18671. Add recoverLease(), setSafeMode(), isFileClosed() as interfaces to hadoop-common (#5553) The HDFS lease APIs have been replicated as interfaces in hadoop-common so other filesystems can also implement them. Applications which use the leasing APIs should migrate to the new interface where possible. Contributed by Stephen Wu (cherry picked from commit `0e46388474`) Conflicts: hadoop-hdfs-project/hadoop-hdfs-client/src/main/java/org/apache/hadoop/hdfs/DistributedFileSystem.java hadoop-hdfs-project/hadoop-hdfs-rbf/src/test/java/org/apache/hadoop/hdfs/server/federation/router/TestRouterRpc.java hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/TestDFSUpgrade.java hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/TestEncryptionZones.java hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/TestViewDistributedFileSystem.java hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/server/namenode/TestFSImageWithAcl.java hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/server/namenode/TestFSImageWithSnapshot.java hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/server/namenode/TestNameNodeRetryCacheMetrics.java hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/server/namenode/snapshot/TestFSImageWithOrderedSnapshotDeletion.java hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/server/namenode/snapshot/TestOrderedSnapshotDeletion.java hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/server/namenode/snapshot/TestSnapshotDeletion.java hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/tools/offlineImageViewer/TestOfflineImageViewer.java hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/tools/offlineImageViewer/TestOfflineImageViewerForErasureCodingPolicy.java Co-authored-by: Tak Lon (Stephen) Wu <taklwu@apache.org>	2023-05-09 06:20:56 +08:00
Sebastian Baunsgaard	919c3f615b	HADOOP-18660. Filesystem Spelling Mistake (#5475 ). Contributed by Sebastian Baunsgaard. Signed-off-by: Ayush Saxena <ayushsaxena@apache.org>	2023-04-25 19:59:54 +01:00
Nikita Eshkeev	7a32e7cc38	HADOOP-18597. Simplify single node instructions for creating directories for Map Reduce. (#5305 ) Signed-off-by: Ayush Saxena <ayushsaxena@apache.org>	2023-04-24 01:14:09 +05:30
Steve Loughran	a505940a2f	HADOOP-18470. Hadoop 3.3.5 release wrap-up (#5558 ) Post-release updates of the branches * Add jdiff xml files from 3.3.5 release. * Declare 3.3.5 as the latest stable release. * Copy release notes.	2023-04-18 10:12:41 +01:00
rdingankar	94b3c6dd90	HDFS-16917 Add transfer rate quantile metrics for DataNode reads (#5397 ) Co-authored-by: Ravindra Dingankar <rdingankar@linkedin.com>	2023-02-27 15:49:26 -08:00
Arnout Engelen	477f17be97	HADOOP-18627. Add stronger wording in 'secure mode' introduction (#5406 ) Make it more clear that when deploying Hadoop 'secure mode' is generally not optional. Contributed by Arnout Engelen	2023-02-17 16:31:21 +00:00
Steve Loughran	cd2401d2cc	HADOOP-18470. More in the 3.3.5 index.html about security (#5383 ) Expands on the comments in cluster config to tell people they shouldn't be running a cluster without a private VLAN in cloud, that Knox is good here, and unsecured clusters without a VLAN are just computation-as-a-service to crypto miners Contributed by Steve Loughran	2023-02-14 17:25:20 +00:00
Steve Loughran	36889005f7	HADOOP-18470. index.md update for 3.3.5 release	2022-12-05 16:22:40 +00:00
Steve Loughran	c70b8709cc	HADOOP-18442. Remove openstack support (#4855 ) The swift:// connector for openstack support has been removed. The hadoop-openstack jar remains, only now it is empty of code. This is to ensure that projects which declare the JAR a dependency will still have successful builds. Contributed by Steve Loughran	2022-10-07 12:03:08 +01:00
Mukund Thakur	0a11ce2546	HADOOP-18407. Improve readVectored() api spec (#4760 ) part of HADOOP-18103. Contributed By: Mukund Thakur	2022-08-31 11:15:10 -05:00
Steve Loughran	9c5228cf6b	HADOOP-18305. Release Hadoop 3.3.4: upstream changelog and jdiff files Add the r3.3.4 changelog, release notes and jdiff xml files. Change-Id: I98b0fed54da3b810c3f23fe5b12e673937916257	2022-08-05 14:02:28 +01:00
Masatake Iwasaki	ff13f9ee8b	Make upstream aware of 3.2.4 release. (cherry picked from commit e1637a57dfd41385dbce5de90620c48a45abb263)	2022-07-22 02:31:34 +00:00
Mukund Thakur	c517b086f2	HADOOP-18106: Handle memory fragmentation in S3A Vectored IO. (#4445 ) part of HADOOP-18103. Handling memory fragmentation in S3A vectored IO implementation by allocating smaller user range requested size buffers and directly filling them from the remote S3 stream and skipping undesired data in between ranges. This patch also adds aborting active vectored reads when stream is closed or unbuffer() is called. Contributed By: Mukund Thakur Conflicts: hadoop-common-project/hadoop-common/src/main/java/org/apache/hadoop/fs/RawLocalFileSystem.java	2022-06-23 17:34:29 -05:00
Mukund Thakur	9f03f87963	HADOOP-18104: S3A: Add configs to configure minSeekForVectorReads and maxReadSizeForVectorReads (#3964 ) Part of HADOOP-18103. Introducing fs.s3a.vectored.read.min.seek.size and fs.s3a.vectored.read.max.merged.size to configure min seek and max read during a vectored IO operation in S3A connector. These properties actually define how the ranges will be merged. To completely disable merging set fs.s3a.max.readsize.vectored.read to 0. Contributed By: Mukund Thakur	2022-06-23 17:11:04 -05:00
Mukund Thakur	5c348c41ab	HADOOP-11867. Add a high-performance vectored read API. (#3904 ) part of HADOOP-18103. Add support for multiple ranged vectored read api in PositionedReadable. The default iterates through the ranges to read each synchronously, but the intent is that FSDataInputStream subclasses can make more efficient readers especially in object stores implementation. Also added implementation in S3A where smaller ranges are merged and sliced byte buffers are returned to the readers. All the merged ranged are fetched from S3 asynchronously. Contributed By: Owen O'Malley and Mukund Thakur Conflicts: hadoop-common-project/hadoop-common/src/main/java/org/apache/hadoop/fs/RawLocalFileSystem.java pom.xml	2022-06-23 17:09:16 -05:00
Viraj Jasani	4ba463069b	HADOOP-18288. Total requests and total requests per sec served by RPC servers (#4485 ) Signed-off-by: Tao Li <tomscut@apache.org>	2022-06-23 17:30:01 +08:00
Steve Loughran	aeb2a2f860	HADOOP-17833. Improve Magic Committer performance (#3289 ) (#4470 ) Speed up the magic committer with key changes being * Writes under __magic always retain directory markers * File creation under __magic skips all overwrite checks, including the LIST call intended to stop files being created over dirs. * mkdirs under __magic probes the path for existence but does not look any further. Extra parallelism in task and job commit directory scanning Use of createFile and openFile with parameters which all for HEAD checks to be skipped. The committer can write the summary _SUCCESS file to the path `fs.s3a.committer.summary.report.directory`, which can be in a different file system/bucket if desired, using the job id as the filename. Also: HADOOP-15460. S3A FS to add `fs.s3a.create.performance` Application code can set the createFile() option fs.s3a.create.performance to true to disable the same safety checks when writing under magic directories. Use with care. The createFile option prefix `fs.s3a.create.header.` can be used to add custom headers to S3 objects when created. Contributed by Steve Loughran.	2022-06-21 10:49:37 +01:00
Steve Loughran	fe306ce57e	HADOOP-18198. Release 3.3.3: release notes and jdiff files. * Add the changelog and release notes * add all jdiff XML files * update the project pom with the new stable version Change-Id: Iaea846c3e451bbd446b45de146845a48953d580d	2022-05-17 19:00:09 +01:00
Ashutosh Gupta	277daca91f	HADOOP-17479. Fix the examples of hadoop config prefix (#4197 ) Signed-off-by: Akira Ajisaka <aajisaka@apache.org> (cherry picked from commit `40a8b9a6a5`)	2022-05-08 08:09:47 +09:00
Steve Loughran	75950e47e7	HADOOP-16202. Enhanced openFile(): hadoop-common changes. (#2584/1) This defines standard option and values for the openFile() builder API for opening a file: fs.option.openfile.read.policy A list of the desired read policy, in preferred order. standard values are adaptive, default, random, sequential, vector, whole-file fs.option.openfile.length How long the file is. fs.option.openfile.split.start start of a task's split fs.option.openfile.split.end end of a task's split These can be used by filesystem connectors to optimize their reading of the source file, including but not limited to * skipping existence/length probes when opening a file * choosing a policy for prefetching/caching data The hadoop shell commands which read files all declare "whole-file" and "sequential", as appropriate. Contributed by Steve Loughran. Change-Id: Ia290f79ea7973ce8713d4f90f1315b24d7a23da1	2022-04-27 19:23:10 +01:00
Ashutosh Gupta	261b11f7db	HADOOP-17564. Fix typo in UnixShellGuide.html (#4195 ) contributed by Ashutosh Gupta	2022-04-22 18:00:58 +01:00
Masatake Iwasaki	419d9718a8	Make upstream aware of 3.2.3 release. (cherry picked from commit `10876333ac`)	2022-03-28 08:03:39 +00:00
Steve Loughran	105e0dbd92	HADOOP-13704. Optimized S3A getContentSummary() Optimize the scan for s3 by performing a deep tree listing, inferring directory counts from the paths returned. Contributed by Ahmar Suhail. Change-Id: I26ffa8c6f65fd11c68a88d6e2243b0eac6ffd024	2022-03-22 13:31:13 +00:00
Chao Sun	b174aaed57	Make upstream aware of 3.3.2 release	2022-03-02 19:10:30 -08:00
GuoPhilipse	7512714475	HDFS-16449. Fix hadoop web site release notes and changelog not available (#3967 ) Reviewed-by: Ayush Saxena <ayushsaxena@apache.org> Signed-off-by: Akira Ajisaka <aajisaka@apache.org> (cherry picked from commit `b68964336d`)	2022-02-14 05:40:16 +09:00
Steve Loughran	8ccc586af6	HADOOP-17409. Remove s3guard from S3A module (#3534 ) Completely removes S3Guard support from the S3A codebase. If the connector is configured to use any metastore other than the null and local stores (i.e. DynamoDB is selected) the s3a client will raise an exception and refuse to initialize. This is to ensure that there is no mix of S3Guard enabled and disabled deployments with the same configuration but different hadoop releases -it must be turned off completely. The "hadoop s3guard" command has been retained -but the supported subcommands have been reduced to those which are not purely S3Guard related: "bucket-info" and "uploads". This is major change in terms of the number of files changed; before cherry picking subsequent s3a patches into older releases, this patch will probably need backporting first. Goodbye S3Guard, your work is done. Time to die. Contributed by Steve Loughran.	2022-01-18 18:04:48 +00:00
Wei-Chiu Chuang	350b51f287	Make upstream aware of 3.3.1 release	2022-01-04 14:48:49 -08:00
Steve Loughran	67eaf5aa9f	HADOOP-17979. Add Interface EtagSource to allow FileStatus subclasses to provide etags (#3633 ) Contributed by Steve Loughran Change-Id: I596205d788f623114c12962941445432e2036c34	2021-11-29 16:20:55 +00:00
smarthan	bc40a41064	HADOOP-18023. Allow cp command to run with multi threads. (#3721 ) (cherry picked from commit `932a78fe38`)	2021-11-29 12:47:02 +00:00
smarthan	cbb3ba135c	HADOOP-17998. Allow get command to run with multi threads. (#3645 ) (cherry picked from commit `63018dc73f`) Conflicts: hadoop-common-project/hadoop-common/src/main/java/org/apache/hadoop/fs/shell/CopyCommands.java	2021-11-22 12:14:32 +00:00
litao	026d5860cb	HDFS-16315. Add metrics related to Transfer and NativeCopy for DataNode (#3666 )	2021-11-17 11:06:53 +09:00
litao	340dee4469	HDFS-16319. Add metrics doc for ReadLockLongHoldCount and WriteLockLongHoldCount (#3653 ). Contributed by tomscut. Signed-off-by: Ayush Saxena <ayushsaxena@apache.org>	2021-11-14 20:12:13 +05:30
Rintaro Ikeda	92af6cd3bc	HADOOP-17919. Fix command line example in Hadoop Cluster Setup documentation. (#3453 ) (cherry picked from commit `607c20c612`)	2021-09-17 13:34:07 +00:00
jianghuazhu	7c663043b2	HDFS-16173.Improve CopyCommands#Put#executor queue configurability. (#3302 ) Co-authored-by: zhujianghua <zhujianghua@zhujianghuadeMacBook-Pro.local> Reviewed-by: Hui Fei <ferhui@apache.org> Reviewed-by: Viraj Jasani <vjasani@apache.org> (cherry picked from commit `4c94831364`)	2021-08-27 12:06:26 +08:00
Petre Bogdan Stolojan	f2cec5cb88	HADOOP-17139 Re-enable optimized copyFromLocal implementation in S3AFileSystem (#3101 ) This work * Defines the behavior of FileSystem.copyFromLocal in filesystem.md * Implements a high performance implementation of copyFromLocalOperation for S3 * Adds a contract test for the operation: AbstractContractCopyFromLocalTest * Implements the contract tests for Local and S3A FileSystems Contributed by: Bogdan Stolojan Change-Id: I25d502102775c3626c4264e5a14c649879730050	2021-08-02 11:58:36 +01:00
Viraj Jasani	ec3311975c	HADOOP-16290. Enable RpcMetrics units to be configurable (#3198 ) Signed-off-by: Akira Ajisaka <aajisaka@apache.org> (cherry picked from commit `e1d00addb5`)	2021-07-20 14:56:28 +08:00
Takanobu Asanuma	25138c98bf	HADOOP-17760. Delete hadoop.ssl.enabled and dfs.https.enable from docs and core-default.xml (#3099 ) Reviewed-by: Ayush Saxena <ayushsaxena@apache.org> (cherry picked from commit `9e7c7ad129`)	2021-06-17 10:00:36 +09:00
Xiaoyu Yao	3f9c9ccf46	HADOOP-17284. Support BCFKS keystores for Hadoop Credential Provider.… (#3010 ) * HADOOP-17284. Support BCFKS keystores for Hadoop Credential Provider. (#2334) (cherry picked from commit `4c5ad57818`)	2021-05-13 16:57:58 -07:00
Takanobu Asanuma	65bf544118	HADOOP-16954. Add -S option in "Count" command to show only Snapshot Counts. Contributed by hemanthboyina. (cherry picked from commit `b89d875f7b`)	2021-05-04 17:44:34 +01:00
lfengnan	43fac739bb	HDFS-15810. RBF: RBFMetrics's TotalCapacity out of bounds (#2910 ) Reviewed-by: Inigo Goiri <inigoiri@apache.org> Signed-off-by: Akira Ajisaka <aajisaka@apache.org> (cherry picked from commit `6e525ab81c`)	2021-05-02 19:19:55 +09:00
touchida	dca2bf9dd5	HDFS-15759. EC: Verify EC reconstruction correctness on DataNode (#2585 ) (cherry picked from commit `95e6892675`)	2021-04-08 17:20:08 +08:00
litao	62937d15bb	HDFS-15892. Add metric for editPendingQ in FSEditLogAsync (#2770 ) Signed-off-by: Takanobu Asanuma <tasanuma@apache.org> (cherry picked from commit `4bd04126d6`)	2021-04-02 10:57:05 +09:00
kwangsun	3aae563421	HADOOP-17952. Fix the wrong CIDR range example in Proxy User documentation. (#2780 ) Signed-off-by: Akira Ajisaka <aajisaka@apache.org> (cherry picked from commit `c8d327a4f1`)	2021-03-22 11:45:42 +09:00
He Xiaoqiao	7fb49a48d1	HADOOP-17585. Correct timestamp format in the docs for the touch command. Contributed by Stephen O'Donnell. (cherry picked from commit `b1dc6c40a0`)	2021-03-14 14:56:16 +00:00
Steve Loughran	469fcdaf8f	HADOOP-16721. Improve S3A rename resilience (#2742 ) The S3A connector's rename() operation now raises FileNotFoundException if the source doesn't exist; a FileAlreadyExistsException if the destination exists and is unsuitable for the source file/directory. When renaming to a path which does not exist, the connector no longer checks for the destination parent directory existing -instead it simply verifies that there is no file immediately above the destination path. This is needed to avoid race conditions with delete() and rename() calls working on adjacent subdirectories. Contributed by Steve Loughran.	2021-03-11 12:54:15 +00:00
Akira Ajisaka	4462da0a84	HADOOP-17546. Update Description of hadoop-http-auth-signature-secret in HttpAuthentication.md. Contributed by Ravuri Sushma sree. (cherry picked from commit `9fd2198daa`)	2021-03-04 14:56:54 +09:00
Steve Loughran	4423a7e736	HADOOP-16906. Abortable (#2684 ) Adds an Abortable.abort() interface for streams to enable output streams to be terminated; this is implemented by the S3A connector's output stream. It allows for commit protocols to be implemented which commit/abort work by writing to the final destination and using the abort() call to cancel any write which is not intended to be committed. Consult the specification document for information about the interface and its use. Contributed by Jungtaek Lim and Steve Loughran. Change-Id: I7fcc25e9dd8c10ce6c29f383529f3a2642a201ae	2021-02-17 11:29:19 +00:00
Steve Loughran	98e4d516ea	HADOOP-13327 Output Stream Specification. (#2587 ) This defines what output streams and especially those which implement Syncable are meant to do, and documents where implementations (HDFS; S3) don't. With tests. The file:// FileSystem now supports Syncable if an application calls FileSystem.setWriteChecksum(false) before creating a file -checksumming and Syncable.hsync() are incompatible. Contributed by Steve Loughran. Change-Id: I892d768de6268f4dd6f175b3fe3b7e5bcaa91194	2021-02-10 10:31:22 +00:00

1 2 3 4 5 ...

459 Commits