hadoop

Author	SHA1	Message	Date
Steve Loughran	762a83e044	HADOOP-17631. Configuration ${env.VAR:-FALLBACK} to eval FALLBACK when restrictSystemProps=true (#2977 ) Contributed by Steve Loughran.	2021-06-08 21:56:40 +01:00
Viraj Jasani	f4b24c68e7	HADOOP-17743. Replace Guava Lists usage by Hadoop's own Lists in hadoop-common, hadoop-tools and cloud-storage projects (#3072 )	2021-06-07 13:24:09 +09:00
Viraj Jasani	59fc4061cb	HADOOP-17152. Provide Hadoop's own Lists utility to reduce dependency on Guava (#3061 ) Reviewed-by: Wei-Chiu Chuang <weichiu@apache.org> Signed-off-by: Takanobu Asanuma <tasanuma@apache.org>	2021-06-03 18:56:00 +09:00
Steve Loughran	832a3c6a89	HADOOP-17511. Add audit/telemetry logging to S3A connector (#2807 ) The S3A connector supports "an auditor", a plugin which is invoked at the start of every filesystem API call, and whose issued "audit span" provides a context for all REST operations against the S3 object store. The standard auditor sets the HTTP Referrer header on the requests with information about the API call, such as process ID, operation name, path, and even job ID. If the S3 bucket is configured to log requests, this information will be preserved there and so can be used to analyze and troubleshoot storage IO. Contributed by Steve Loughran.	2021-05-25 10:25:41 +01:00
Vinayakumar B	2bbeae3240	HDFS-15790. Make ProtobufRpcEngineProtos and ProtobufRpcEngineProtos2 Co-Exist (#2767 )	2021-05-24 02:45:39 -07:00
Viraj Jasani	e4062ad027	HADOOP-17115. Replace Guava Sets usage by Hadoop's own Sets in hadoop-common and hadoop-tools (#2985 ) Signed-off-by: Sean Busbey <busbey@apache.org>	2021-05-20 10:47:04 -05:00
Hongbing Wang	f7247922b7	HDFS-16018. Optimize the display of hdfs "count -e" or "count -t" com… (#2994 )	2021-05-20 11:23:54 +08:00
Xiaoyu Yao	86729e130f	HADOOP-17699. Remove hardcoded SunX509 usage from SSLFactory. (#3016 )	2021-05-18 10:11:36 -07:00
Akira Ajisaka	35ca1dcb9d	HADOOP-17685. Fix junit deprecation warnings in hadoop-common module. (#2983 ) Signed-off-by: Takanobu Asanuma <tasanuma@apache.org>	2021-05-13 14:22:25 +09:00
Viraj Jasani	b93e448f9a	HADOOP-11616. Remove workaround for Curator's ChildReaper requiring Guava 15+ (#2973 ) Reviewed-by: Wei-Chiu Chuang <weichiu@apache.org> Signed-off-by: Akira Ajisaka <aajisaka@apache.org>	2021-05-06 04:52:02 +09:00
kishendas	e571025f5b	HADOOP-17657: implement StreamCapabilities in SequenceFile.Writer and fall back to flush, if hflush is not supported (#2949 ) Co-authored-by: Kishen Das <kishen@cloudera.com> Reviewed-by: Steve Loughran <stevel@apache.org>	2021-05-04 01:20:56 -07:00
Wei-Chiu Chuang	b2e54762a4	HDFS-15624. fix the function of setting quota by storage type (#2377 ) (#2955 ) 1. puts NVDIMM to the end of storage type enum to make sure compatibility. 2. adds check to make sure the software layout version is satisfied Co-authored-by: su xu <kevinbrandon@163.com> Co-authored-by: huangtianhua <huangtianhua223@gmail.com> Co-authored-by: YaYun-Wang <34060507+YaYun-Wang@users.noreply.github.com> Signed-off-by: Mingliang Liu <liuml07@apache.org> Signed-off-by: Ayush Saxena <ayushsaxena@apache.org> Signed-off-by: Vinayakumar B <vinayakumarb@apache.org> Change-Id: I3c58beef50730827a09b3c968e9ad637baa57d44	2021-04-28 23:54:39 -07:00
Wei-Chiu Chuang	90c6caf650	Revert "HDFS-15624. fix the function of setting quota by storage type (#2377 )" This reverts commit `394b9f7a5c`. Ref: HDFS-15995. Had to revert this commit, so we can commit HDFS-15566 (a critical bug preventing rolling upgrade to Hadoop 3.3) Will re-work this fix again later.	2021-04-26 11:27:15 +08:00
Viraj Jasani	9179638017	HADOOP-17524. Remove EventCounter and Log counters from JVM Metrics (#2909 ) Reviewed-by: Duo Zhang <zhangduo@apache.org> Signed-off-by: Akira Ajisaka <aajisaka@apache.org>	2021-04-15 18:04:46 +09:00
Akira Ajisaka	156ecc89be	HADOOP-17630. [JDK 15] TestPrintableString fails due to Unicode 13.0 support. (#2890 ) Reviewed-by: Wei-Chiu Chuang <weichiu@apache.org>	2021-04-13 17:08:49 +09:00
Viraj Jasani	3f2682b92b	HADOOP-17622. Avoid usage of deprecated IOUtils#cleanup API. (#2862 ) Signed-off-by: Takanobu Asanuma <tasanuma@apache.org>	2021-04-06 13:39:10 +09:00
Borislav Iordanov	2c482fbacf	HADOOP-16524. Automatic keystore reloading for HttpServer2 Reapply of issue reverted first because it caused yarn failures and then again because the commit message was incorrectly formatted (and yet again because of commit message format). Signed-off-by: stack <stack@apache.org>	2021-03-31 10:46:35 -07:00
stack	22961a615d	Revert "HADOOP-16524. Automatic keystore reloading for HttpServer2" This reverts commit `a2975d2153`.	2021-03-31 10:43:09 -07:00
stack	a2975d2153	HADOOP-16524. Automatic keystore reloading for HttpServer2 Reapply of issue reverted first because it caused yarn failures and then again because the commit message was incorrectly formatted.	2021-03-31 10:40:20 -07:00
stack	5183aaeda2	Revert "Hadoop 16524 - resubmission following some unit test fixes (#2693 )" Revert to fix the summary message. This reverts commit `9509bebf7f`.	2021-03-31 10:39:55 -07:00
Borislav Iordanov	9509bebf7f	Hadoop 16524 - resubmission following some unit test fixes (#2693 ) Signed-off-by: stack <stack@apache.org>	2021-03-31 10:07:42 -07:00
Ayush Saxena	f5c1557288	HADOOP-17531.Addendum: DistCp: Reduce memory usage on copying huge directories. (#2820 ). Contributed by Ayush Saxena. Signed-off-by: Steve Loughran <stevel@apache.org>	2021-03-27 03:01:41 +05:30
Akira Ajisaka	af1f9f43ea	HADOOP-17133. Implement HttpServer2 metrics (#2145 )	2021-03-25 12:09:43 -07:00
touchida	95e6892675	HDFS-15759. EC: Verify EC reconstruction correctness on DataNode (#2585 )	2021-03-24 16:56:09 +08:00
Ayush Saxena	03cfc85279	HADOOP-17531. DistCp: Reduce memory usage on copying huge directories. (#2732 ). Contributed by Ayush Saxena. Signed-off-by: Steve Loughran <stevel@apache.org>	2021-03-24 02:36:26 +05:30
Jim Brennan	299b8062f1	MAPREDUCE-7322. revisiting TestMRIntermediateDataEncryption. Contributed by Ahmed Hussein.	2021-03-15 20:13:17 +00:00
He Xiaoqiao	b1dc6c40a0	HADOOP-17585. Correct timestamp format in the docs for the touch command. Contributed by Stephen O'Donnell.	2021-03-14 18:09:50 +08:00
Chao Sun	176bd88890	HADOOP-16080. hadoop-aws does not work with hadoop-client-api. (#2522 ) Contributed by Chao Sun. (Cherry-picked via PR #2575)	2021-03-09 20:01:29 +00:00
Haoze Wu	ef7ab535c5	HADOOP-17552. Change ipc.client.rpc-timeout.ms from 0 to 120000 by default to avoid potential hang. (#2727 )	2021-03-06 22:26:16 +09:00
Ahmed Hussein	e04bcb3a06	MAPREDUCE-7320. organize test directories for ClusterMapReduceTestCase (#2722 ). Contributed by Ahmed Hussein	2021-02-26 13:42:33 -06:00
Mike	7b7c0019f4	HADOOP-17528. SFTP File System: close the connection pool when closing a FileSystem (#2701 ) Contributed by Mike Pryakhin.	2021-02-23 17:03:27 +00:00
Steve Loughran	78905d7e3f	HADOOP-16906. Abortable (#2684 ) Adds an Abortable.abort() interface for streams to enable output streams to be terminated; this is implemented by the S3A connector's output stream. It allows for commit protocols to be implemented which commit/abort work by writing to the final destination and using the abort() call to cancel any write which is not intended to be committed. Consult the specification document for information about the interface and its use. Contributed by Jungtaek Lim and Steve Loughran.	2021-02-11 17:37:20 +00:00
Steve Loughran	798df6d699	HADOOP-13327 Output Stream Specification. (#2587 ) This defines what output streams and especially those which implement Syncable are meant to do, and documents where implementations (HDFS; S3) don't. With tests. The file:// FileSystem now supports Syncable if an application calls FileSystem.setWriteChecksum(false) before creating a file -checksumming and Syncable.hsync() are incompatible. Contributed by Steve Loughran.	2021-02-10 10:28:59 +00:00
YaYun-Wang	394b9f7a5c	HDFS-15624. fix the function of setting quota by storage type (#2377 ) 1. puts NVDIMM to the end of storage type enum to make sure compatibility. 2. adds check to make sure the software layout version is satisfied Co-authored-by: su xu <kevinbrandon@163.com> Co-authored-by: huangtianhua <huangtianhua223@gmail.com> Signed-off-by: Mingliang Liu <liuml07@apache.org> Signed-off-by: Ayush Saxena <ayushsaxena@apache.org> Signed-off-by: Vinayakumar B <vinayakumarb@apache.org>	2021-02-02 22:44:34 -08:00
belugabehr	21a3fc3d2d	HADOOP-17482: Remove Commons Logger from FileSystem Class (#2633 )	2021-02-01 09:40:01 -08:00
Siyao Meng	1a205cc3ad	HADOOP-17424. Replace HTrace with No-Op tracer (#2645 )	2021-02-01 13:42:44 +09:00
Anton Kutuzov	91d4ba57c5	HDFS-15632. AbstractContractDeleteTest should set recursive peremeter to true for recursive test cases. Contributed by Anton Kutuzov.	2021-01-22 17:55:37 -08:00
stack	d4fd675a95	Revert "HADOOP-16524. Reloading SSL keystore for both DataNode and NameNode (#2470 )" This reverts commit `e306f59421`.	2021-01-11 08:54:55 -08:00
Borislav Iordanov	e306f59421	HADOOP-16524. Reloading SSL keystore for both DataNode and NameNode (#2470 ) Co-authored-by: Borislav Iordanov <biordanov@apple.com> Signed-off-by: stack <stack@apache.org>	2021-01-08 09:10:21 -08:00
dgzdot	b1abb10ea2	HADOOP-17430. Restore ability to set Text to empty byte array (#2545 ) Contributed by gaozhan.ding	2021-01-05 21:09:41 +00:00
Steve Loughran	99d08a19ba	HADOOP-17450. Add Public IOStatistics API. (#2577 ) This is the API and implementation classes of HADOOP-16830, which allows callers to query IO object instances (filesystems, streams, remote iterators, ...) and other classes for statistics on their I/O Usage: operation count and min/max/mean durations. New Packages org.apache.hadoop.fs.statistics. Public API, including: IOStatisticsSource IOStatistics IOStatisticsSnapshot (seralizable to java objects and json) +helper classes for logging and integration BufferedIOStatisticsInputStream implements IOStatisticsSource and StreamCapabilities BufferedIOStatisticsOutputStream implements IOStatisticsSource, Syncable and StreamCapabilities org.apache.hadoop.fs.statistics.impl Implementation classes for internal use. org.apache.hadoop.util.functional functional programming support for RemoteIterators and other operations which raise IOEs; all wrapper classes implement and propagate IOStatisticsSource Contributed by Steve Loughran.	2020-12-31 11:52:42 +00:00
Jim Brennan	6de1a8eb67	HADOOP-13571. ServerSocketUtil.getPort() should use loopback address, not 0.0.0.0. Contributed by Eric Badger	2020-12-11 20:16:56 +00:00
Ahmed Hussein	f94e927bfb	HADOOP-17392. Remote exception messages should not include the exception class (#2486 ). Contributed by Daryn Sharp and Ahmed Hussein	2020-12-03 10:55:51 -06:00
Steve Loughran	ac7045b75f	HADOOP-17313. FileSystem.get to support slow-to-instantiate FS clients. (#2396 ) This adds a semaphore to throttle the number of FileSystem instances which can be created simultaneously, set in "fs.creation.parallel.count". This is designed to reduce the impact of many threads in an application calling FileSystem.get() on a filesystem which takes time to instantiate -for example to an object where HTTPS connections are set up during initialization. Many threads trying to do this may create spurious delays by conflicting for access to synchronized blocks, when simply limiting the parallelism diminishes the conflict, so speeds up all threads trying to access the store. The default value, 64, is larger than is likely to deliver any speedup -but it does mean that there should be no adverse effects from the change. If a service appears to be blocking on all threads initializing connections to abfs, s3a or store, try a smaller (possibly significantly smaller) value. Contributed by Steve Loughran.	2020-11-25 14:31:02 +00:00
Ahmed Hussein	07050339e0	HADOOP-17367. Add InetAddress api to ProxyUsers.authorize (#2449 ). Contributed by Daryn Sharp and Ahmed Hussein	2020-11-19 14:37:14 -06:00
Liang-Chi Hsieh	34aa6137bd	HADOOP-17292. Using lz4-java in Lz4Codec (#2350 ) Contributed by Liang-Chi Hsieh.	2020-11-18 12:03:25 -08:00
Steve Loughran	e3c08f285a	HADOOP-17244. S3A directory delete tombstones dir markers prematurely. (#2310 ) This fixes the S3Guard/Directory Marker Retention integration so that when fs.s3a.directory.marker.retention=keep, failures during multipart delete are handled correctly, as are incremental deletes during directory tree operations. In both cases, when a directory marker with children is deleted from S3, the directory entry in S3Guard is not deleted, because it is still critical to representing the structure of the store. Contributed by Steve Loughran. Change-Id: I4ca133a23ea582cd42ec35dbf2dc85b286297d2f	2020-11-18 12:18:11 +00:00
Ahmed Hussein	ebe1d1fbf7	HADOOP-17362. reduce RPC calls doing ls on HAR file (#2444 ). Contributed by Daryn Sharp and Ahmed Hussein	2020-11-13 14:22:35 -06:00
Ahmed Hussein	5ce18101cb	HADOOP-17346. Fair call queue is defeated by abusive service principals (#2431 ) Co-authored-by: ahussein <ahmed.hussein@verizonmedia.com>	2020-11-12 13:13:12 -06:00
Doroszlai, Attila	6f10a0506f	HADOOP-17365. Contract test for renaming over existing file is too lenient (#2447 ) Contributed by Attila Doroszlai.	2020-11-11 21:20:09 +00:00

1 2 3 4 5 ...

2130 Commits