hadoop

Author	SHA1	Message	Date
Steve Loughran	b9b999f7b1	YARN-11441. Revert YARN-10495. This reverts commit `1022ed91ac`.	2023-02-17 15:07:41 +00:00
Bryan Beaudreault	aa6c51364a	HADOOP-18215. Enhance WritableName to be able to return aliases for classes that use serializers (#4215 )	2023-02-16 11:38:20 -08:00
Viraj Jasani	8c9c68c19e	HADOOP-18628. IPC Server Connection should log host name before returning VersionMismatch error (#5385 ) Contributed by Viraj Jasani	2023-02-15 18:23:44 +00:00
Steve Loughran	cd2401d2cc	HADOOP-18470. More in the 3.3.5 index.html about security (#5383 ) Expands on the comments in cluster config to tell people they shouldn't be running a cluster without a private VLAN in cloud, that Knox is good here, and unsecured clusters without a VLAN are just computation-as-a-service to crypto miners Contributed by Steve Loughran	2023-02-14 17:25:20 +00:00
Mehakmeet Singh	a2ceb09323	HADOOP-18596. Distcp -update to use modification time while checking for file skip. (#5387 ) Adding toggleable support for modification time during distcp -update between two stores with incompatible checksum comparison. Contributed by: Mehakmeet Singh <mehakmeet.singh.behl@gmail.com>	2023-02-14 15:17:27 +05:30
Steve Vaughan	d437571fe2	HDFS-16904. Close webhdfs during TestSymlinkHdfs teardown (#5372 ) This is a followup to the original patch, `08f58ecf07`, which it supercedes * Switch to org.apache.hadoop.io.IOUtils and closeStream. * Use cleanupWithLogger to include error logging Contributed by Steve Vaughan Jr	2023-02-13 14:47:00 +00:00
Viraj Jasani	5099077f4d	HDFS-16907. Add LastHeartbeatResponseTime for BP service actor (#5349 ) (#5358 ) * ADDENDUM Remove unused variables from testDataNodeMXBeanLastHeartbeats (#5373) Reviewed-by: Ayush Saxena <ayushsaxena@apache.org> Reviewed-by: Shilun Fan <slfan1989@apache.org> Signed-off-by: Tao Li <tomscut@apache.org>	2023-02-12 11:53:59 +05:30
Ayush Saxena	b6d732b96b	HDFS-15368. TestBalancerWithHANameNodes#testBalancerWithObserver failed occasionally. Contributed by Xiaoqiao He.	2023-02-12 01:57:34 +05:30
Masatake Iwasaki	d85c12ee69	HDFS-15674. TestBPOfferService#testMissBlocksWhenReregister fails on trunk. (#2467 )	2023-02-12 01:50:39 +05:30
Ahmed Hussein	c17734b747	HDFS-15654. TestBPOfferService#testMissBlocksWhenReregister fails intermittently (#2419 )	2023-02-12 01:50:26 +05:30
Owen O'Malley	9e7a9fd46d	HDFS-18324. Fix race condition in closing IPC connections. (#5371 )	2023-02-10 13:56:52 -08:00
huhaiyang	de08baded6	HADOOP-18625. Fix method name of RPC.Builder#setnumReaders (#5301 ) Changes method name of RPC.Builder#setnumReaders to setNumReaders() The original method is still there, just marked deprecated. It is the one which should be used when working with older branches. Contributed by Haiyang Hu	2023-02-09 13:29:47 +00:00
Steve Vaughan	b6680907f9	HDFS-16904. Close webhdfs during TestSymlinkHdfs teardown (#5342 ) Contributed by Steve Vaughan Jr	2023-02-09 13:29:00 +00:00
gardenia	752f6d8213	HADOOP-18621. Resource leak in CryptoOutputStream.close() (#5347 ) When closing we need to wrap the flush() in a try .. finally, otherwise when flush throws it will stop completion of the remainder of the close activities and in particular the close of the underlying wrapped stream object resulting in a resource leak. Contributed by Colm Dougan	2023-02-07 12:04:00 +00:00
Steve Vaughan	221221d6fb	HADOOP-18612. Avoid mixing canonical and non-canonical when performing comparisons (#5339 ) Contributed by Steve Vaughan Jr	2023-02-06 18:30:45 +00:00
Tao Li	10b42e9306	HDFS-16902. Add Namenode status to BPServiceActor metrics and improve logging in offerservice (#5334 ) Reviewed-by: Mingliang Liu <liuml07@apache.org> Reviewed-by: Shilun Fan <slfan1989@apache.org> Signed-off-by: Tao Li <tomscut@apache.org>	2023-02-03 10:02:45 +08:00
Steve Vaughan	7b6a69faaa	HADOOP-18279. Cancel fileMonitoringTimer even if trustManager isn't defined (#4789 ) Co-authored-by: Steve Vaughan Jr <s_vaughan@apple.com>	2023-02-01 13:33:34 -08:00
Viraj Jasani	f3fa4af5dc	HADOOP-18592. Sasl connection failure should log remote address. (#5294 ) Contributed by Viraj Jasani <vjasani@apache.org> Signed-off-by: Chris Nauroth <cnauroth@apache.org> Signed-off-by: Steve Loughran <stevel@apache.org> Signed-off-by: Mingliang Liu <liuml07@apache.org>	2023-02-01 10:16:42 -08:00
Wei-Chiu Chuang	4836f1ec37	HADOOP-18584. [NFS GW] Fix regression after netty4 migration. (#5252 ) Reviewed-by: Tsz-Wo Nicholas Sze <szetszwo@apache.org> (cherry picked from commit `9d47108b50`)	2023-02-01 05:33:01 -08:00
Masatake Iwasaki	004121f9cc	HADOOP-18598. maven site generation doesn't include javadocs. (#5319 ) Reviewed-by: Chris Nauroth <cnauroth@apache.org>	2023-01-31 19:49:21 +09:00
huhaiyang	80b42625cf	HDFS-16888. BlockManager#maxReplicationStreams, replicationStreamsHardLimit, blocksReplWorkMultiplier and PendingReconstructionBlocks#timeout should be volatile (#5296 ) Reviewed-by: Tao Li <tomscut@apache.org> Signed-off-by: Takanobu Asanuma <tasanuma@apache.org> (cherry picked from commit `88c8ac750d`)	2023-01-31 17:52:29 +09:00
Ayush Saxena	73f3196db5	HADOOP-18604. Add compile platform in the hadoop version output. (#5327 ). Contributed by Ayush Saxena. Signed-off-by: Chris Nauroth <cnauroth@apache.org>	2023-01-28 14:20:27 +05:30
Steve Loughran	0956994492	HADOOP-17717. Update wildfly openssl to 1.1.3.Final. (#5310 ) Contributed by Wei-Chiu Chuang	2023-01-27 11:59:22 +00:00
kevin wan	5cd006455d	HADOOP-18582. skip unnecessary cleanup logic in distcp (#5251 ) Co-authored-by: 万康 <mingge@xiaohongshu.com> Reviewed-by: Steve Loughran <stevel@apache.org> Signed-off-by: Ayush Saxena <ayushsaxena@apache.org> Signed-off-by: Chris Nauroth <cnauroth@apache.org> (cherry picked from commit `3b7b79b37a`)	2023-01-24 23:50:11 +00:00
Viraj Jasani	63443be5f4	HDFS-16891 Avoid the overhead of copy-on-write exception list while loading inodes sub sections in parallel (#5300 ) Reviewed-by: Stephen O'Donnell <sodonnell@apache.org> Signed-off-by: Chris Nauroth <cnauroth@apache.org> (cherry picked from commit `04f3573f6a`)	2023-01-18 21:14:17 +00:00
PJ Fanning	ada06aa22e	HADOOP-18575: followup: try to avoid repeatedly hitting exceptions when transformer factories do not support attributes (#5253 ) Part of HADOOP-18469 and the hardening of XML/XSL parsers. Followup to the main HADOOP-18575 patch, to improve performance when working with xml/xsl engines which don't support the relevant attributes. Include this change when backporting. Contributed by PJ Fanning.	2023-01-16 15:48:15 +00:00
Viraj Jasani	c5cf845d78	HDFS-16887 Log start and end of phase/step in startup progress (#5292 ) Signed-off-by: Chris Nauroth <cnauroth@apache.org> (cherry picked from commit `1263e024b9`)	2023-01-12 22:27:35 +00:00
skysiders	eef2fdcc29	MAPREDUCE-7375 JobSubmissionFiles don't set right permission after mkdirs (#4237 ) Signed-off-by: Chris Nauroth <cnauroth@apache.org> (cherry picked from commit `36bf54aba0`)	2023-01-12 21:49:15 +00:00
huangxiaoping	f5e9901e6d	HADOOP-18591. Fix a typo in Trash (#5291 ) Signed-off-by: Tao Li <tomscut@apache.org> Signed-off-by: Chris Nauroth <cnauroth@apache.org> (cherry picked from commit `a90e424d9f`)	2023-01-12 21:22:25 +00:00
Simbarashe Dzinamarira	dde22d522c	HDFS-16886: Fixes error in documentation for StateStoreRecordsOperations. (#5290 )	2023-01-11 11:47:57 -08:00
Chengbing Liu	af96e0f5b3	HDFS-16872. Fix log throttling by declaring LogThrottlingHelper as static members (#5246 ) Co-authored-by: Chengbing Liu <liuchengbing@qiyi.com> Signed-off-by: Erik Krogen <xkrogen@apache.org> (cherry picked from commit `4cf304de45`)	2023-01-10 10:04:05 -08:00
ZanderXu	a49378cb93	HDFS-16764. [SBN Read] ObserverNamenode should throw ObserverRetryOnActiveException instead of FileNotFoundException during processing of addBlock rpc (#4872 ) Signed-off-by: Erik Krogen <xkrogen@apache.org> Co-authored-by: zengqiang.xu <zengqiang.xu@shopee.com> (cherry picked from commit `8d221255f2`)	2023-01-10 10:03:45 -08:00
Dongjoon Hyun	b6fd322307	HADOOP-18590. Publish SBOM artifacts (#5281 ) Signed-off-by: Chris Nauroth <cnauroth@apache.org> (cherry picked from commit `6f99558c2e`)	2023-01-10 00:43:32 +00:00
PJ Fanning	f856611121	HADOOP-18587: upgrade to jettison 1.5.3 due to cve (#5270 ) Signed-off-by: Chris Nauroth <cnauroth@apache.org> (cherry picked from commit `b9eb760ed2`)	2023-01-06 23:41:18 +00:00
Ayush Saxena	f63f20259b	HADOOP-18586. Update the year to 2023. (#5265 ). Contributed by Ayush Saxena. Reviewed-by: Takanobu Asanuma <tasanuma@apache.org>	2023-01-01 22:45:23 +05:30
Chris Nauroth	1f270d8a5e	YARN-11388: Prevent resource leaks in TestClientRMService. (#5187 ) Signed-off-by: Shilun Fan <slfan1989@apache.org> (cherry picked from commit `6b67373d10`)	2022-12-28 19:02:02 +00:00
curie71	290dc7817c	YARN-11392 Audit Log missing in ClientRMService (#5250 ). Contributed by Beibei Zhao. Signed-off-by: Chris Nauroth <cnauroth@apache.org> (cherry picked from commit `9668a85d40`)	2022-12-28 00:00:18 +00:00
Bence Kosztolnik	7190fcf713	YARN-11395. RM UI, RMAttemptBlock can not render FINAL_SAVING. Contributed by Bence Kosztolnik - In the YARN-1345 remove of FINAL_SAVING was missed from RMAttemptBlock - Same issue was present after YARN-1345 in YARN-4411 - YARN-4411 logic was applied in this commit for FINAL_SAVING	2022-12-23 17:20:35 +01:00
susheel-gupta	f9fac84f43	YARN-10879. Incorrect WARN text in ACL check for application tag based placement (#5231 ) Change-Id: Id892e38fe4c834b1743a0df2f0a40146d3d5a878	2022-12-22 17:26:05 +01:00
Steve Loughran	cda1d45a61	HADOOP-18470. Update index md with section on ABFS prefetching	2022-12-19 13:03:57 +00:00
Steve Loughran	223046cb64	HADOOP-18561. Update commons-net to 3.9.0 (#5214 ) Addresses CVE-2021-37533, which only relates to FTP. Applications not using the ftp:// filesystem, which, as anyone who has used it will know is very minimal and so rarely used, is not a critical part of the project. Furthermore, the FTP-related issue is at worst information leakage if someone connects to a malicious server. This is a due diligence PR rather than an emergency fix. Contributed by Steve Loughran	2022-12-19 11:57:47 +00:00
PJ Fanning	29b6df563b	HADOOP-18575. Make XML transformer factory more lenient (#5224 ) Due diligence followup to HADOOP-18469. Add secure XML parser factories to XMLUtils (#4940) Contributed by P J Fanning	2022-12-18 12:26:11 +00:00
Steve Loughran	c59444b160	HADOOP-18577. Followup: javadoc fix (#5232 ) Fixes a javadoc error which came with HADOOP-18577. ABFS: Add probes of readahead fix (#5205) Part of the HADOOP-18521 ABFS readahead fix; MUST be included. Contributed by Steve Loughran	2022-12-18 12:20:41 +00:00
Chengbing Liu	bfc916e7b0	HADOOP-18567. LogThrottlingHelper: properly trigger dependent recorders in cases of infrequent logging (#5215 ) Signed-off-by: Erik Krogen <xkrogen@apache.org> Co-authored-by: Chengbing Liu <liuchengbing@qiyi.com> (cherry picked from commit `ca3526da92`)	2022-12-16 09:16:51 -08:00
Xing Lin	d43fa95043	HDFS-16852. Skip KeyProviderCache shutdown hook registration if already shutting down (#5160 ) Signed-off-by: Erik Krogen <xkrogen@apache.org> (cherry picked from commit `f7bdf6c667`)	2022-12-16 08:47:33 -08:00
Steve Loughran	daa33aafff	HADOOP-18577. ABFS: Add probes of readahead fix (#5205 ) Followup patch to HADOOP-18456 as part of HADOOP-18521, ABFS ReadBufferManager buffer sharing across concurrent HTTP requests Add probes of readahead fix aid in checking safety of hadoop ABFS client across different releases. * ReadBufferManager constructor logs the fact it is safe at TRACE * AbfsInputStream declares it is fixed in toString() by including fs.azure.capability.readahead.safe" in the result. The ABFS FileSystem hasPathCapability("fs.azure.capability.readahead.safe") probe returns true to indicate the client's readahead manager has been fixed to be safe when prefetching. All Hadoop releases for which probe this returns false and for which the probe "fs.capability.etags.available" returns true at risk of returning invalid data when reading ADLS Gen2/Azure storage data. Contributed by Steve Loughran.	2022-12-15 17:11:22 +00:00
Steve Loughran	65892a7759	HADOOP-18573. Improve error reporting on non-standard kerberos names (#5221 ) The kerberos RPC does not declare any restriction on characters used in kerberos names, though implementations MAY be more restrictive. If the kerberos controller supports use non-conventional principal names and the kerberos admin chooses to use them this can confuse some of the parsing. The obvious solution is for the enterprise admins to "not do that" as a lot of things break, bits of hadoop included. Harden the hadoop code slightly so at least we fail more gracefully, so people can then get in touch with their sysadmin and tell them to stop it.	2022-12-15 11:44:12 +00:00
Mehakmeet Singh	1009d2560f	HADOOP-18574. Changing log level of IOStatistics increment to make the DEBUG logs less noisy (#5223 ) Contributed by: Mehakmeet Singh	2022-12-15 11:43:49 +00:00
Steve Loughran	ba55f370a9	HADOOP-18526. Leak of S3AInstrumentation instances via hadoop Metrics references (#5144 ) This has triggered an OOM in a process which was churning through s3a fs instances; the increased memory footprint of IOStatistics amplified what must have been a long-standing issue with FS instances being created and not closed() * Makes sure instrumentation is closed when the FS is closed. * Uses a weak reference from metrics to instrumentation, so even if the FS wasn't closed (see HADOOP-18478), this back reference would not cause the S3AInstrumentation reference to be retained. * If S3AFileSystem is configured to log at TRACE it will log the calling stack of initialize(), so help identify where the instance is being created. This should help track down the cause of instance leakage. Contributed by Steve Loughran.	2022-12-14 18:23:04 +00:00
Steve Loughran	654082773c	HADOOP-18183. s3a audit logs to publish range start/end of GET requests. (#5110 ) The start and end of the range is set in a new audit param "rg", e.g "?rg=100-200" Contributed by Ankit Saurabh	2022-12-14 16:51:46 +00:00

1 2 3 4 5 ...

25161 Commits