hadoop

Author	SHA1	Message	Date
smarthan	cbb3ba135c	HADOOP-17998. Allow get command to run with multi threads. (#3645 ) (cherry picked from commit 63018dc73f4d29632e93be08d035ab9a7e73531c) Conflicts: hadoop-common-project/hadoop-common/src/main/java/org/apache/hadoop/fs/shell/CopyCommands.java	2021-11-22 12:14:32 +00:00
Abhishek Das	f456dc1837	HADOOP-17999. No-op implementation of setWriteChecksum and setVerifyChecksum in ViewFileSystem. Contributed by Abhishek Das. (#3639 ) (cherry picked from commit 54a1d78e16533e286455de62a545ee75cbc1eff5)	2021-11-16 22:40:24 -08:00
Mehakmeet Singh	bd077c3814	HADOOP-17953. S3A: Tests to lookup global or per-bucket configuration for encryption algorithm (#3525 ) Followup to S3-CSE work of HADOOP-13887 Contributed by Mehakmeet Singh	2021-10-21 12:03:50 +01:00
Szilard Nemeth	6f45666d0b	HADOOP-17857. Check real user ACLs in addition to proxied user ACLs. Contributed by Eric Payne (cherry picked from commit 5428d36b56fab319ab68258139d6133ded9bbafc)	2021-10-19 20:40:30 +00:00
Steve Loughran	b8f3e54ff7	HADOOP-17945. JsonSerialization raises EOFException reading JSON data stored on google GCS (#3501 ) Contributed By: Steve Loughran	2021-10-19 15:36:10 +05:30
Xing Lin	af920f138b	HADOOP-16532. Fix TestViewFsTrash to use the correct homeDir. Contributed by Xing Lin. (#3514 ) (cherry picked from commit 97c0f968792e1a45a1569a3184af7b114fc8c022)	2021-10-13 14:58:08 -07:00
Masatake Iwasaki	9e2936f8d1	HADOOP-17424. Replace HTrace with No-Op tracer (#3520 ) (cherry picked from commit 1a205cc3adffa568c814a5241e041b08e2fcd3eb) Conflicts: hadoop-hdfs-project/hadoop-hdfs-client/src/main/java/org/apache/hadoop/hdfs/DataStreamer.java hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/datanode/DataNode.java hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/tracing/TestTracing.java Co-authored-by: Siyao Meng <50227127+smengcl@users.noreply.github.com>	2021-10-12 00:07:09 +09:00
Viraj Jasani	77ee5a4266	HADOOP-17950. Provide replacement for deprecated APIs of commons-io IOUtils (#3515 ) Signed-off-by: Akira Ajisaka <aajisaka@apache.org> (cherry picked from commit 8071dbb9c6b4a654d5e1e7c8e3b4d2ca1a736d53)	2021-10-07 11:00:19 +09:00
Ahmed Hussein	2cdc6a245d	HADOOP-17930. implement non-guava Precondition checkState (#3522 ) Reviewed-by: Viraj Jasani <vjasani@apache.org> Signed-off-by: Takanobu Asanuma <tasanuma@apache.org> (cherry picked from commit c36f9402dc082a8903cf6e7fdca128658b11c59d)	2021-10-07 10:57:20 +09:00
Mehakmeet Singh	aee975a136	HADOOP-13887. Support S3 client side encryption (S3-CSE) using AWS-SDK (#2706 ) This (big!) patch adds support for client side encryption in AWS S3, with keys managed by AWS-KMS. Read the documentation in encryption.md very, very carefully before use and consider it unstable. S3-CSE is enabled in the existing configuration option "fs.s3a.server-side-encryption-algorithm": fs.s3a.server-side-encryption-algorithm=CSE-KMS fs.s3a.server-side-encryption.key=<KMS_KEY_ID> You cannot enable CSE and SSE in the same client, although you can still enable a default SSE option in the S3 console. * Filesystem list/get status operations subtract 16 bytes from the length of all files >= 16 bytes long to compensate for the padding which CSE adds. * The SDK always warns about the specific algorithm chosen being deprecated. It is critical to use this algorithm for ranged GET requests to work (i.e. random IO). Ignore. * Unencrypted files CANNOT BE READ. The entire bucket SHOULD be encrypted with S3-CSE. * Uploading files may be a bit slower as blocks are now written sequentially. * The Multipart Upload API is disabled when S3-CSE is active. Contributed by Mehakmeet Singh Change-Id: Ie1a27a036a39db66a67e9c6d33bc78d54ea708a0	2021-10-05 11:37:41 +01:00
Ahmed Hussein	31b44c519c	HADOOP-17929. implement non-guava Precondition checkArgument (#3473 ) Reviewed-by: Viraj Jasani <vjasani@apache.org> (cherry picked from commit 0c498f21dee7a5bbf91ad8afbfb372d08bacce6c)	2021-10-01 16:49:07 +08:00
Chao Sun	6931b70a00	HADOOP-17936. Fix test failure after reverting HADOOP-16878 from branch-3.3 (#3478 )	2021-09-27 13:56:44 -07:00
Chao Sun	ff26a7700d	Revert "HADOOP-16878. FileUtil.copy() to throw IOException if the source and destination are the same (#2383 )" This reverts commit 54c40cbf49f2ebf4bbc1976279a6eba7a2c5fe23.	2021-09-23 15:04:27 -07:00
Mehakmeet Singh	8e5620cd9e	HADOOP-17195. ABFS: OutOfMemory error while uploading huge files (#3446 ) Addresses the problem of processes running out of memory when there are many ABFS output streams queuing data to upload, especially when the network upload bandwidth is less than the rate data is generated. ABFS Output streams now buffer their blocks of data to "disk", "bytebuffer" or "array", as set in "fs.azure.data.blocks.buffer" When buffering via disk, the location for temporary storage is set in "fs.azure.buffer.dir" For safe scaling: use "disk" (default); for performance, when confident that upload bandwidth will never be a bottleneck, experiment with the memory options. The number of blocks a single stream can have queued for uploading is set in "fs.azure.block.upload.active.blocks". The default value is 20. Contributed by Mehakmeet Singh.	2021-09-22 11:19:16 +01:00
Neil	9700d98eac	HADOOP-17893. Improve PrometheusSink for Namenode TopMetrics (#3426 ) Signed-off-by: Akira Ajisaka <aajisaka@apache.org> (cherry picked from commit ae2c5ccfcf75a89c60ec6e4a339b46131f9134be)	2021-09-21 10:44:51 +09:00
Steve Loughran	9188fa8cce	HADOOP-17126. implement non-guava Precondition checkNotNull This adds a new class org.apache.hadoop.util.Preconditions which is * @Private/@Unstable * Intended to allow us to move off Google Guava * Is designed to be trivially backportable (i.e contains no references to guava classes internally) Please use this instead of the guava equivalents, where possible. Contributed by: Ahmed Hussein Change-Id: Ic392451bcfe7d446184b7c995734bcca8c07286e	2021-09-17 11:06:59 +01:00
Adam Binford	59a955dfa0	HADOOP-17804. Expose prometheus metrics only after a flush and dedupe with tag values (#3369 ) Signed-off-by: Akira Ajisaka <aajisaka@apache.org> (cherry picked from commit 4ced012f3301d0848680fdf0ef2972da9b3e1298)	2021-09-09 16:51:04 +09:00
Masatake Iwasaki	76393e1359	HADOOP-17899. Avoid using implicit dependency on junit-jupiter-api. (#3399 ) (cherry picked from commit ce7a5bfbd3cb55afda265d105ff10ba2e2874a3f)	2021-09-08 09:11:39 +00:00
Yellow Flash	09e8e5c5cb	HADOOP-17870. Http Filesystem to qualify relative paths. (#3338 ) Contributed by Yellowflash Change-Id: I217da06a1a2e5c0ca2b324f8e21baa0846f64858	2021-09-07 10:54:35 +01:00
Chris Nauroth	cc90b4f987	HADOOP-15129. Datanode caches namenode DNS lookup failure and cannot startup (#3348 ) Co-authored-by: Karthik Palaniappan Change-Id: Id079a5319e5e83939d5dcce5fb9ebe3715ee864f	2021-09-03 18:48:07 +00:00
jianghuazhu	7c663043b2	HDFS-16173.Improve CopyCommands#Put#executor queue configurability. (#3302 ) Co-authored-by: zhujianghua <zhujianghua@zhujianghuadeMacBook-Pro.local> Reviewed-by: Hui Fei <ferhui@apache.org> Reviewed-by: Viraj Jasani <vjasani@apache.org> (cherry picked from commit 4c94831364e9258247029c22a222a665771ab4c0)	2021-08-27 12:06:26 +08:00
jianghuazhu	2b2f8f575b	HDFS-16175.Improve the configurable value of Server #PURGE_INTERVAL_NANOS. (#3307 ) Co-authored-by: zhujianghua <zhujianghua@zhujianghuadeMacBook-Pro.local> Reviewed-by: Ayush Saxena <ayushsaxena@apache.org> (cherry picked from commit ad54f5195c8c01f333703c55cd70703109d75f29)	2021-08-25 17:35:50 +08:00
Bryan Beaudreault	2fda130260	HADOOP-17837: Add unresolved endpoint value to UnknownHostException (ADDENDUM) (#3276 ) (cherry picked from commit b0b867e977ab853d1dfc434195c486cf0ca32dab)	2021-08-06 21:57:46 +05:30
Bryan Beaudreault	7659b62682	HADOOP-17837: Add unresolved endpoint value to UnknownHostException (#3272 ) (cherry picked from commit 5e54d92e6ec866dc49a750110863a3fa8b2bcf7c)	2021-08-06 17:32:01 +08:00
Steve Loughran	26514b6534	HADOOP-17628. Distcp contract test is really slow with ABFS and S3A; timing out. (#3240 ) This patch cuts down the size of directory trees used for distcp contract tests against object stores, so making them much faster against distant/slow stores. On abfs, the test only runs with -Dscale (as was the case for s3a already), and has the larger scale test timeout. After every test case, the FileSystem IOStatistics are logged, to provide information about what IO is taking place and what it's performance is. There are some test cases which upload files of 1+ MiB; you can increase the size of the upload in the option "scale.test.distcp.file.size.kb" Set it to zero and the large file tests are skipped. Contributed by Steve Loughran.	2021-08-02 12:58:37 +01:00
Petre Bogdan Stolojan	f2cec5cb88	HADOOP-17139 Re-enable optimized copyFromLocal implementation in S3AFileSystem (#3101 ) This work * Defines the behavior of FileSystem.copyFromLocal in filesystem.md * Implements a high performance implementation of copyFromLocalOperation for S3 * Adds a contract test for the operation: AbstractContractCopyFromLocalTest * Implements the contract tests for Local and S3A FileSystems Contributed by: Bogdan Stolojan Change-Id: I25d502102775c3626c4264e5a14c649879730050	2021-08-02 11:58:36 +01:00
Viraj Jasani	ec3311975c	HADOOP-16290. Enable RpcMetrics units to be configurable (#3198 ) Signed-off-by: Akira Ajisaka <aajisaka@apache.org> (cherry picked from commit e1d00addb5b6d7240884536aaa57846af34a0dd5)	2021-07-20 14:56:28 +08:00
Abhishek Das	450dae7383	HADOOP-17028. ViewFS should initialize mounted target filesystems lazily. Contributed by Abhishek Das (#2260 ) (cherry picked from commit 1dd03cc4b573270dc960117c3b6c74bb78215caa)	2021-07-13 18:23:27 -07:00
Rafal Wojdyla	e3fb63f33f	HADOOP-17402. Add GCS config to the core-site (#2638 ) Contributed by Rafal Wojdyla	2021-07-07 22:43:31 +01:00
liangxs	24b780820c	HADOOP-17749. Remove lock contention in SelectorPool of SocketIOWithTimeout (#3080 ) (cherry picked from commit a5db6831bc674a24a3251cf1b20f22a4fd4fac9f)	2021-07-07 09:41:11 +08:00
Viraj Jasani	b8a98e4f82	HDFS-16075. Use empty array constants present in StorageType and DatanodeInfo to avoid creating redundant objects (#3115 ) Reviewed-by: Hui Fei <ferhui@apache.org> (cherry picked from commit c488abbc79cc1ad2596cbf509a0cde14acc5ad6b)	2021-06-21 10:28:05 +09:00
Takanobu Asanuma	25138c98bf	HADOOP-17760. Delete hadoop.ssl.enabled and dfs.https.enable from docs and core-default.xml (#3099 ) Reviewed-by: Ayush Saxena <ayushsaxena@apache.org> (cherry picked from commit 9e7c7ad129fcf466d9647e0672ecf7dd72213e72)	2021-06-17 10:00:36 +09:00
Steve Loughran	4ac9123619	HADOOP-17631. Configuration ${env.VAR:-FALLBACK} to eval FALLBACK when restrictSystemProps=true (#2977 ) Contributed by Steve Loughran. Change-Id: I9b82109eddeb659c01896152cf603d458e2a04cd	2021-06-08 22:05:00 +01:00
Steve Loughran	464bbd5b7c	HADOOP-17511. Add audit/telemetry logging to S3A connector (#2807 ) The S3A connector supports "an auditor", a plugin which is invoked at the start of every filesystem API call, and whose issued "audit span" provides a context for all REST operations against the S3 object store. The standard auditor sets the HTTP Referrer header on the requests with information about the API call, such as process ID, operation name, path, and even job ID. If the S3 bucket is configured to log requests, this information will be preserved there and so can be used to analyze and troubleshoot storage IO. Contributed by Steve Loughran. Change-Id: Ic0a105c194342ed2d529833ecc42608e8ba2f258	2021-05-25 12:55:38 +01:00
Vinayakumar B	dbf1ef4aff	HDFS-15790. Make ProtobufRpcEngineProtos and ProtobufRpcEngineProtos2 Co-Exist (#2767 ) (cherry picked from commit 2bbeae324029d7ad19aa21a9b8a663c7890776f9) Conflicts: hadoop-common-project/hadoop-common/src/main/java/org/apache/hadoop/ipc/ProtobufRpcEngine.java	2021-05-24 18:00:38 +08:00
Wei-Chiu Chuang	86c28f0639	Revert "HADOOP-17669. Backport HADOOP-17079, HADOOP-17505 to branch-3.3 (#2959 )" This reverts commit 4ffe5eb1ddb4aa260134005335a003ed6d270685.	2021-05-24 17:37:18 +08:00
Wei-Chiu Chuang	4ffe5eb1dd	HADOOP-17669. Backport HADOOP-17079, HADOOP-17505 to branch-3.3 (#2959 ) * HADOOP-17079. Optimize UGI#getGroups by adding UGI#getGroupsSet. Co-authored-by: Wei-Chiu Chuang <weichiu@apache.org> Change-Id: I0f31409923ece24a82dfba4c4610d8a38c52d9fb * HADOOP-17505. public interface GroupMappingServiceProvider needs default impl for getGroupsSet() (#2661). Contributed by Vinayakumar B. (cherry picked from commit c4c0683dff577a91ca978939e182ec0fee65b7c3) Co-authored-by: Xiaoyu Yao <xyao@apache.org> Co-authored-by: Vinayakumar B <vinayakumarb@apache.org>	2021-05-17 18:57:46 -07:00
Xiaoyu Yao	3f9c9ccf46	HADOOP-17284. Support BCFKS keystores for Hadoop Credential Provider.… (#3010 ) * HADOOP-17284. Support BCFKS keystores for Hadoop Credential Provider. (#2334) (cherry picked from commit 4c5ad57818a7e894b5bf430358e02a0bb8618769)	2021-05-13 16:57:58 -07:00
Mike	0f12f3e125	HADOOP-17036. TestFTPFileSystem failing as ftp server dir already exists. Contributed by Mikhail Pryakhin. (cherry picked from commit 017d24e9703e9447f88ba94df3a8aa0800de770b)	2021-05-07 14:20:29 +09:00
Viraj Jasani	be34c1222a	HADOOP-11616. Remove workaround for Curator's ChildReaper requiring Guava 15+ (#2973 ) Reviewed-by: Wei-Chiu Chuang <weichiu@apache.org> Signed-off-by: Akira Ajisaka <aajisaka@apache.org> (cherry picked from commit b93e448f9aa66689f1ce5059f6cdce8add130457)	2021-05-06 04:53:06 +09:00
Takanobu Asanuma	65bf544118	HADOOP-16954. Add -S option in "Count" command to show only Snapshot Counts. Contributed by hemanthboyina. (cherry picked from commit b89d875f7b1db4a98d37f13040eecc5afdf1a485)	2021-05-04 17:44:34 +01:00
kishendas	98aa4fc32c	HADOOP-17657: implement StreamCapabilities in SequenceFile.Writer and fall back to flush, if hflush is not supported (#2949 ) Co-authored-by: Kishen Das <kishen@cloudera.com> Reviewed-by: Steve Loughran <stevel@apache.org> (cherry picked from commit e571025f5b371ade25d1457f0186ba656bb71c5f)	2021-05-04 16:35:00 +08:00
Akira Ajisaka	72355c7b6e	HADOOP-17630. [JDK 15] TestPrintableString fails due to Unicode 13.0 support. (#2890 ) Reviewed-by: Wei-Chiu Chuang <weichiu@apache.org> (cherry picked from commit 156ecc89be3ae1f42bde9c22ab5ba96cf60df3c6)	2021-04-13 17:10:00 +09:00
touchida	dca2bf9dd5	HDFS-15759. EC: Verify EC reconstruction correctness on DataNode (#2585 ) (cherry picked from commit 95e68926750b55196cf9da53c25359c98ef58a4f)	2021-04-08 17:20:08 +08:00
Viraj Jasani	8b4b3d6fe6	HADOOP-17622. Avoid usage of deprecated IOUtils#cleanup API. (#2862 ) Signed-off-by: Takanobu Asanuma <tasanuma@apache.org> (cherry picked from commit 3f2682b92b540be3ce15642ab8be463df87a4e4e)	2021-04-06 14:18:31 +09:00
Borislav Iordanov	c365149e16	HADOOP-16524. Automatic keystore reloading for HttpServer2 Reapply of issue reverted first because it caused yarn failures. Signed-off-by: stack <stack@apache.org>	2021-03-31 10:50:28 -07:00
Stephen O'Donnell	56ef16468a	HADOOP-17222. Create socket address leveraging URI cache (#2817 ) Contributed by fanrui. Signed-off-by: Mingliang Liu <liuml07@apache.org> Signed-off-by: He Xiaoqiao <hexiaoqiao@apache.org>	2021-03-30 11:59:44 +01:00
Ayush Saxena	9c9b16c957	HADOOP-17531. DistCp: Reduce memory usage on copying huge directories. (#2808 ). Contributed by Ayush Saxena. * HADOOP-17531. DistCp: Reduce memory usage on copying huge directories. (#2732). * HADOOP-17531.Addendum: DistCp: Reduce memory usage on copying huge directories. (#2820) Signed-off-by: Steve Loughran <stevel@apache.org>	2021-03-27 09:25:25 +05:30
Xiaoyu Yao	67d52af225	HADOOP-16828. Zookeeper Delegation Token Manager fetch sequence number by batch. Contributed by Fengnan Li. (cherry picked from commit 6288e15118fab65a9a1452898e639313c6996769)	2021-03-25 14:44:02 +00:00
Ayush Saxena	27944772d3	HADOOP-17310. Touch command with -c option is broken. (#2393 ). Contributed by Ayush Saxena.	2021-03-19 00:13:31 +05:30

1 2 3 4 5 ...

2126 Commits