hadoop

Author	SHA1	Message	Date
Mehakmeet Singh	90b1e737d3	HADOOP-18242. ABFS Rename Failure when tracking metadata is in an incomplete state (#4517 ) ABFS rename fails intermittently when the Storage-blob tracking metadata is in an incomplete state. This surfaces as the error code 404 and an error message of "RenameDestinationParentPathNotFound" To mitigate this issue, when a request fails with this response. the ABFS client issues a HEAD call on the source file and then retries the rename operation again ABFS filesystem statistics track when this occurs with new counters rename_recovery metadata_incomplete_rename_failures rename_path_attempts This is very rare occurrence and appears to be triggered under certain heavy load conditions, just as with HADOOP-18163. Contributed by Mehakmeet Singh.	2022-07-02 01:49:14 +05:30
Mukund Thakur	7eb1c908a0	HADOOP-18322. Yetus build failure in branch-3.3. caused by HADOOP-18103	2022-06-30 15:05:38 -05:00
Masatake Iwasaki	75c739c458	Revert "HADOOP-17196. Fix C/C++ standard warnings (#2208 )" This reverts commit `b4a105a209`.	2022-06-30 00:57:52 +00:00
Mukund Thakur	c517b086f2	HADOOP-18106: Handle memory fragmentation in S3A Vectored IO. (#4445 ) part of HADOOP-18103. Handling memory fragmentation in S3A vectored IO implementation by allocating smaller user range requested size buffers and directly filling them from the remote S3 stream and skipping undesired data in between ranges. This patch also adds aborting active vectored reads when stream is closed or unbuffer() is called. Contributed By: Mukund Thakur Conflicts: hadoop-common-project/hadoop-common/src/main/java/org/apache/hadoop/fs/RawLocalFileSystem.java	2022-06-23 17:34:29 -05:00
Mukund Thakur	bfb7d020d1	HADOOP-18105 Implement buffer pooling with weak references (#4263 ) part of HADOOP-18103. Required for vectored IO feature. None of current buffer pool implementation is complete. ElasticByteBufferPool doesn't use weak references and could lead to memory leak errors and DirectBufferPool doesn't support caller preferences of direct and heap buffers and has only fixed length buffer implementation. Contributed By: Mukund Thakur	2022-06-23 17:11:13 -05:00
Mukund Thakur	bb5a17b177	HADOOP-18107 Adding scale test for vectored reads for large file (#4273 ) part of HADOOP-18103. Contributed By: Mukund Thakur	2022-06-23 17:11:09 -05:00
Mukund Thakur	9f03f87963	HADOOP-18104: S3A: Add configs to configure minSeekForVectorReads and maxReadSizeForVectorReads (#3964 ) Part of HADOOP-18103. Introducing fs.s3a.vectored.read.min.seek.size and fs.s3a.vectored.read.max.merged.size to configure min seek and max read during a vectored IO operation in S3A connector. These properties actually define how the ranges will be merged. To completely disable merging set fs.s3a.max.readsize.vectored.read to 0. Contributed By: Mukund Thakur	2022-06-23 17:11:04 -05:00
Mukund Thakur	5c348c41ab	HADOOP-11867. Add a high-performance vectored read API. (#3904 ) part of HADOOP-18103. Add support for multiple ranged vectored read api in PositionedReadable. The default iterates through the ranges to read each synchronously, but the intent is that FSDataInputStream subclasses can make more efficient readers especially in object stores implementation. Also added implementation in S3A where smaller ranges are merged and sliced byte buffers are returned to the readers. All the merged ranged are fetched from S3 asynchronously. Contributed By: Owen O'Malley and Mukund Thakur Conflicts: hadoop-common-project/hadoop-common/src/main/java/org/apache/hadoop/fs/RawLocalFileSystem.java pom.xml	2022-06-23 17:09:16 -05:00
Viraj Jasani	4ba463069b	HADOOP-18288. Total requests and total requests per sec served by RPC servers (#4485 ) Signed-off-by: Tao Li <tomscut@apache.org>	2022-06-23 17:30:01 +08:00
Igor Dvorzhak	d41e0a9cc3	HADOOP-18300. Upgrade Gson dependency to version 2.9.0 (#4454 ) Reviewed-by: Ayush Saxena <ayushsaxena@apache.org> Signed-off-by: Chris Nauroth <cnauroth@apache.org> (cherry picked from commit `77d1b194c7`)	2022-06-22 23:42:59 +00:00
Benjamin Teke	838b63d836	YARN-10974. Queue filter in CS UI v1 does not work as expected. Contributed by Chengbing Liu.	2022-06-22 18:20:09 +02:00
Steve Loughran	fb4e8172a0	MAPREDUCE-7391. TestLocalDistributedCacheManager failing after HADOOP-16202 (#4472 ) Fixing a mockito-based test which broke when HADOOP-16202 changed the methods being invoked. Contributed by Steve Loughran	2022-06-22 13:13:24 +01:00
Viraj Jasani	53a530aa88	MAPREDUCE-7371. DistributedCache alternative APIs should not use DistributedCache APIs internally (#3855 ) Contributed by Viraj Jasani	2022-06-22 13:13:05 +01:00
Steve Loughran	9ca4ac0af0	HADOOP-18305. Preparing for 3.3.4 release: branch-3.3 version => 3.3.9 (#4482 ) Updating the hadoop version of branch-3.3 to 3.3.9-SNAPSHOT pending agreement on what number its future release should take. Using 3.3.9-SNAPSHOT puts space in for other incremental releases, while avoiding creating JIRA release ordering and autocompletion confusion the way adding a 3.3.10 or higher version would do. Contributed by Steve Loughran	2022-06-22 13:09:50 +01:00
André Fonseca	49342cffdb	HADOOP-18159. Bump cos_api-bundle to 5.6.69 to update public-suffix-list.txt (#4444 ) Bump cos_api-bundle to 5.6.69 All copies of httpclient, including shaded ones in libraries used by the s3a, gs and cos cloud connectors, turn out to load their TLD list from the same resource mozilla/public-suffix-list.txt Updating the hadoop-cos dependency ensures that its version of public-suffix-list.txt is up to date -and so the s3a connector able to talk to s3 resources if the cos-api-bundle JAR is where the resource is loaded from. Contributed by André Fonseca	2022-06-21 20:17:08 +01:00
Steve Loughran	aeb2a2f860	HADOOP-17833. Improve Magic Committer performance (#3289 ) (#4470 ) Speed up the magic committer with key changes being * Writes under __magic always retain directory markers * File creation under __magic skips all overwrite checks, including the LIST call intended to stop files being created over dirs. * mkdirs under __magic probes the path for existence but does not look any further. Extra parallelism in task and job commit directory scanning Use of createFile and openFile with parameters which all for HEAD checks to be skipped. The committer can write the summary _SUCCESS file to the path `fs.s3a.committer.summary.report.directory`, which can be in a different file system/bucket if desired, using the job id as the filename. Also: HADOOP-15460. S3A FS to add `fs.s3a.create.performance` Application code can set the createFile() option fs.s3a.create.performance to true to disable the same safety checks when writing under magic directories. Use with care. The createFile option prefix `fs.s3a.create.header.` can be used to add custom headers to S3 objects when created. Contributed by Steve Loughran.	2022-06-21 10:49:37 +01:00
Viraj Jasani	7561dbd134	HDFS-16637. TestHDFSCLI#testAll consistently failing (#4466 ). Contributed by Viraj Jasani. Reviewed-by: Tao Li <tomscut@apache.org> Signed-off-by: Ayush Saxena <ayushsaxena@apache.org>	2022-06-21 13:44:30 +05:30
Ashutosh Gupta	4f860f8ac2	MAPREDUCE-7369. Fixed MapReduce tasks timing out when spends more time on MultipleOutputs#close (#4247 ) Contributed by Ravuri Sushma sree. Co-authored-by: Ashutosh Gupta <ashugpt@amazon.com> (cherry picked from commit `36c4be819f`) Conflicts: hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/main/java/org/apache/hadoop/mapred/TaskAttemptListenerImpl.java	2022-06-20 08:02:58 +00:00
slfan1989	43f4a0e92d	MAPREDUCE-7387. Fix TestJHSSecurity#testDelegationToken AssertionError due to HDFS-16563 (#4428 ). Contributed by fanshilun. Signed-off-by: Ayush Saxena <ayushsaxena@apache.org>	2022-06-20 12:16:33 +05:30
KevinWikant	33ab84f2e2	HDFS-16064. Determine when to invalidate corrupt replicas based on number of usable replicas (#4410 ) Co-authored-by: Kevin Wikant <wikak@amazon.com> Signed-off-by: Akira Ajisaka <aajisaka@apache.org> (cherry picked from commit `cfceaebde6`)	2022-06-20 11:24:45 +09:00
zhengchenyu	d7de378b22	YARN-11172. Fix TestClientRMTokens#testDelegationToken introduced by HDFS-16563. (#4408 ) Regression caused by HDFS-16563; the hdfs exception text was changed, but because it was a YARN test doing the check, Yetus didn't notice. Contributed by zhengchenyu	2022-06-17 19:51:56 +01:00
jianghuazhu	18a5e843bc	HDFS-16581.Print node status when executing printTopology. (#4321 ) Reviewed-by: Viraj Jasani <vjasani@apache.org> Signed-off-by: Tao Li <tomscut@apache.org>	2022-06-16 19:20:34 +08:00
xuzq	ee3ee98ee5	HDFS-16623. Avoid IllegalArgumentException in LifelineSender (#4409 ) * HDFS-16623. Avoid IllegalArgumentException in LifelineSender Co-authored-by: zengqiang.xu <zengqiang.xu@shopee.com> (cherry picked from commit `af5003a473`)	2022-06-10 19:02:47 +00:00
Steve Loughran	de9f994338	YARN-11173. remove redeclaration of os-maven-plugin.version from yarn-csi (#4417 ) This is a followup to HADOOP-18275 and its upgrade of os-maven-plugin.version When that change is merged in, this MUST follow it. Contributed by Steve Loughran Change-Id: I61d087041561eeb8c9c42b5b7d8f0bb63f296b15	2022-06-10 17:03:25 +01:00
Ashutosh Gupta	bdef321d52	HDFS-16576. Remove unused imports in HDFS project (#4389 ) Co-authored-by: Ashutosh Gupta <ashugpt@amazon.com> Reviewed-by: Tao Li <tomscut@apache.org> Signed-off-by: Akira Ajisaka <aajisaka@apache.org> (cherry picked from commit `6e11c94170`) Conflicts: hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/datanode/checker/AbstractFuture.java	2022-06-09 22:42:04 +09:00
slfan1989	a2f8a9e5d8	HDFS-16624. Fix flaky unit test TestDFSAdmin#testAllDatanodesReconfig (#4412 ) Reviewed-by: Viraj Jasani <vjasani@apache.org> Signed-off-by: Tao Li <tomscut@apache.org>	2022-06-09 09:59:34 +08:00
monthonk	7ec988d264	HADOOP-12020. Add s3a storage class option fs.s3a.create.storage.class (#3877 ) Adds a new option fs.s3a.create.storage.class which can be used to set the storage class for files created in AWS S3. Consult the documentation for details and instructions on how disable the relevant tests when testing against third-party stores. Contributed by Monthon Klongklaew Change-Id: I8cdebadf294a89fde08d98729ad96f251d58411c	2022-06-08 20:02:07 +01:00
Viraj Jasani	516a2a8e44	HDFS-16618. sync_file_range error should include more volume/file info (#4402 ) Signed-off-by: Tao Li <tomscut@apache.org>	2022-06-07 16:56:07 +08:00
Viraj Jasani	132fbbe228	HDFS-16595. Slow peer metrics - add median, mad and upper latency limits (#4357 ) (#4405 ) Reviewed-by: Tao Li <tomscut@apache.org> Signed-off-by: Wei-Chiu Chuang <weichiu@apache.org>	2022-06-07 06:41:16 +08:00
Steve Loughran	03c2941d4b	HADOOP-18275. Update os-maven-plugin to 1.7.0 (#4397 ) Contributed by Steve Loughran Change-Id: Ic4d442a37299dc8098b0bca3cc51beca6f058283	2022-06-06 13:20:00 +01:00
Renukaprasad C	0c15daa77a	HDFS-16563. Namenode WebUI prints sensitive information on Token expiry (#4241 ) Contributed by Renukaprasad C Change-Id: I5cd2cec1dd79917f810207821b3bdf4fe1a5d24c	2022-06-06 11:08:57 +01:00
Samrat	7223a337f6	HDFS-16608. Fix the link in TestClientProtocolForPipelineRecovery (#4379 ) Signed-off-by: Akira Ajisaka <aajisaka@apache.org> (cherry picked from commit `7f08ed0d1d`)	2022-06-06 18:02:44 +09:00
Stephen O'Donnell	7d6b133af3	HDFS-16610. Make fsck read timeout configurable (#4384 ) (cherry picked from commit `34a973a90e`) Conflicts: hadoop-hdfs-project/hadoop-hdfs/src/main/resources/hdfs-default.xml	2022-06-01 20:54:56 +01:00
Ashutosh Gupta	de4c975710	HADOOP-18238. Fix reentrancy check in SFTPFileSystem.close() (#4330 ) Contributed by Ashutosh Gupta Change-Id: I2742675add74259a93b3762a80c7ab5ee6d08c37	2022-05-30 17:34:45 +01:00
GuoPhilipse	dd9b8af9c4	HADOOP-18269. Misleading method name in DistCpOptions.(#4216 ) Contributed by guophilipse Change-Id: I5e75d030406997339c20e970483825e529d9cd10	2022-05-30 14:04:33 +01:00
slfan1989	91f19bf8fa	HADOOP-18244. Fix Hadoop-Common JavaDoc Error on branch-3.3 (#4327 ). Contributed by fanshilun.	2022-05-29 11:31:16 +05:30
Ashutosh Gupta	d921cc71fd	HDFS-16585.Add @VisibleForTesting in Dispatcher.java (#4337 ) Co-authored-by: Ashutosh Gupta <ashugpt@amazon.com> Reviewed-by: Tao Li <tomscut@apache.org> Reviewed-by: Ayush Saxena <ayushsaxena@apache.org> Signed-off-by: Wei-Chiu Chuang <weichiu@apache.org> (cherry picked from commit `bee538f785`)	2022-05-26 15:28:27 -07:00
Stephen O'Donnell	55ba3a7944	HDFS-16583. DatanodeAdminDefaultMonitor can get stuck in an infinite loop holding the write lock (#4332 ) Co-authored-by: S O'Donnell <sodonnell@cloudera.com> (cherry picked from commit `297f0f6d6a`)	2022-05-26 10:14:50 -07:00
Wei-Chiu Chuang	ba856bff95	HDFS-16456. EC: Decommission a rack with only on dn will fail when the rack number is equal with replication (#4126 ) (#4304 ) (cherry picked from commit `cee8c62498`) Conflicts: hadoop-common-project/hadoop-common/src/main/java/org/apache/hadoop/net/NetworkTopology.java (cherry picked from commit dd79aee635fdc61648e0c87bea1560dc35aee053) Co-authored-by: caozhiqiang <lfxy@163.com> Reviewed-by: Takanobu Asanuma <tasanuma@apache.org>	2022-05-27 00:50:40 +08:00
Ashutosh Gupta	e0732baeb8	YARN-11128. Fix comments in TestProportionalCapacityPreemptionPolicy* (#4271 ) Co-authored-by: Ashutosh Gupta <ashugpt@amazon.com> Reviewed-by: Hemanth Boyina <hemanthboyina@apache.org> Signed-off-by: Akira Ajisaka <aajisaka@apache.org> (cherry picked from commit `e3e9369c1d`)	2022-05-26 15:58:47 +09:00
Masatake Iwasaki	241fb6b2a7	HADOOP-18251. Fix failure of extracting JIRA id from commit message in git_jira_fix_version_check.py. (#4344 ) (cherry picked from commit `6b331dde31`)	2022-05-26 03:28:55 +00:00
Michael Stack	ae9d671232	HDFS-16586. Purge FsDatasetAsyncDiskService threadgroup; it causes BP… (#4347 ) Remove the ThreadGroup used by executor factories; they are unused and ThreadGroups auto-destroy when their Thread-member count goes to zero. This behavior is incompatible with the configuration we have on the per-volume executor which is set to let all threads die if no use inside the keepalive time.	2022-05-25 17:02:28 -07:00
Ashutosh Gupta	8c492a1d65	HADOOP-18240. Upgrade Yetus to 0.14.0 (#4328 ) Co-authored-by: Ashutosh Gupta <ashugpt@amazon.com> Reviewed-by: Chris Nauroth <cnauroth@apache.org> Signed-off-by: Akira Ajisaka <aajisaka@apache.org> (cherry picked from commit `84b0455cf8`)	2022-05-25 17:32:19 +09:00
jianghuazhu	fe6b050857	HDFS-16588. Backport HDFS-16584 to branch-3.3. (#4342 ). Contributed by JiangHua Zhu. Signed-off-by: litao <tomleescut@gmail.com> Signed-off-by: He Xiaoqiao <hexiaoqiao@apache.org>	2022-05-24 23:47:45 +08:00
Owen O'Malley	1f111d6a41	YARN-11162. Set the zk acl for nodes created by ZKConfigurationStore. (#4350 ) (cherry picked from commit `f390edaec4`)	2022-05-24 05:17:34 +00:00
Viraj Jasani	ab3a9cedc9	HDFS-16582. Expose aggregate latency of slow node as perceived by the reporting node (#4323 ) Reviewed-by: Wei-Chiu Chuang <weichiu@apache.org> Signed-off-by: Tao Li <tomscut@apache.org>	2022-05-21 09:47:18 +08:00
Ashutosh Gupta	57fe613299	HDFS-16453. Upgrade okhttp from 2.7.5 to 4.9.3 (#4229 ) Co-authored-by: Ashutosh Gupta <ashugpt@amazon.com> Signed-off-by: Akira Ajisaka <aajisaka@apache.org> (cherry picked from commit `fb910bd906`) Conflicts: hadoop-project/pom.xml	2022-05-21 03:17:15 +09:00
Szilard Nemeth	90ec4418c7	YARN-11141. Capacity Scheduler does not support ambiguous queue names when moving application across queues. Contributed by Andras Gyori	2022-05-18 14:34:08 +02:00
Szilard Nemeth	4f112e3138	YARN-11126. ZKConfigurationStore Java deserialisation vulnerability. Contributed by Tamas Domok	2022-05-18 14:25:35 +02:00
Szilard Nemeth	b4550b3356	YARN-10850. TimelineService v2 lists containers for all attempts when filtering for one. Contributed by Benjamin Teke	2022-05-18 14:08:41 +02:00

... 2 3 4 5 6 ...

25120 Commits