Steve Loughran
55a576906d
HADOOP-19131. Assist reflection IO with WrappedOperations class ( #6686 )
...
1. The class WrappedIO has been extended with more filesystem operations
- openFile()
- PathCapabilities
- StreamCapabilities
- ByteBufferPositionedReadable
All these static methods raise UncheckedIOExceptions rather than
checked ones.
2. The adjacent class org.apache.hadoop.io.wrappedio.WrappedStatistics
provides similar access to IOStatistics/IOStatisticsContext classes
and operations.
Allows callers to:
* Get a serializable IOStatisticsSnapshot from an IOStatisticsSource or
IOStatistics instance
* Save an IOStatisticsSnapshot to file
* Convert an IOStatisticsSnapshot to JSON
* Given an object which may be an IOStatisticsSource, return an object
whose toString() value is a dynamically generated, human readable summary.
This is for logging.
* Separate getters to the different sections of IOStatistics.
* Mean values are returned as a Map.Pair<Long, Long> of (samples, sum)
from which means may be calculated.
There are examples of the dynamic bindings to these classes in:
org.apache.hadoop.io.wrappedio.impl.DynamicWrappedIO
org.apache.hadoop.io.wrappedio.impl.DynamicWrappedStatistics
These use DynMethods and other classes in the package
org.apache.hadoop.util.dynamic which are based on the
Apache Parquet equivalents.
This makes re-implementing these in that library and others
which their own fork of the classes (example: Apache Iceberg)
3. The openFile() option "fs.option.openfile.read.policy" has
added specific file format policies for the core filetypes
* avro
* columnar
* csv
* hbase
* json
* orc
* parquet
S3A chooses the appropriate sequential/random policy as a
A policy `parquet, columnar, vector, random, adaptive` will use the parquet policy for
any filesystem aware of it, falling back to the first entry in the list which
the specific version of the filesystem recognizes
4. New Path capability fs.capability.virtual.block.locations
Indicates that locations are generated client side
and don't refer to real hosts.
Contributed by Steve Loughran
2024-08-14 14:43:00 +01:00
Jinhu Wu
b5e8269d9b
HADOOP-18458: AliyunOSSBlockOutputStream to support heap/off-heap buffer before uploading data to OSS ( #4912 )
2023-03-28 14:27:01 +08:00
陈爽-Jack Chen
f6605f1b3a
HADOOP-18438: AliyunOSSFileSystemStore deleteObjects interface should return the objects that failed to delete ( #4857 )
...
Merged to trunk, thank @chenshuang778 for your contribution
2022-12-20 13:57:49 +08:00
Jinhu Wu
3ec4b932c1
HADOOP-18313: AliyunOSSBlockOutputStream should not mark the temporary file for deletion ( #4502 )
...
HADOOP-18313: AliyunOSSBlockOutputStream should not mark the temporary file for deletion. Contributed by wujinhu.
2022-07-06 14:23:46 +08:00
Viraj Jasani
c7ec1897c4
HADOOP-18018. unguava: remove Preconditions from hadoop-tools modules ( #3688 )
2021-11-23 13:34:10 +09:00
Jinhu Wu
a9c51ea57d
HADOOP-17374. support listObjectV2 ( #3587 )
2021-11-03 21:47:41 -07:00
Chao Sun
176bd88890
HADOOP-16080. hadoop-aws does not work with hadoop-client-api. ( #2522 )
...
Contributed by Chao Sun.
(Cherry-picked via PR #2575 )
2021-03-09 20:01:29 +00:00
Ayush Saxena
1e3a6efcef
HADOOP-17288. Use shaded guava from thirdparty. ( #2342 ). Contributed by Ayush Saxena.
2020-10-17 12:01:18 +05:30
Weiwei Yang
6dfe00c71e
HADOOP-16840. AliyunOSS: getFileStatus throws FileNotFoundException in versioning bucket. Contributed by wujinhu.
2020-03-08 21:01:34 -07:00
Weiwei Yang
2d8282bb82
HADOOP-16306. AliyunOSS: Remove temporary files when upload small files to OSS. Contributed by wujinhu.
2019-05-14 14:05:39 -07:00
Weiwei Yang
568d3ab8b6
HADOOP-16191. AliyunOSS: improvements for copyFile/copyDirectory and logging. Contributed by wujinhu.
2019-03-19 09:55:32 +08:00
Weiwei Yang
f87b3b11c4
HADOOP-16030. AliyunOSS: bring fixes back from HADOOP-15671. Contributed by wujinhu.
2019-01-07 15:56:49 +08:00
Sunil G
1f425271a7
Revert "HADOOP-15759. AliyunOSS: Update oss-sdk version to 3.0.0. Contributed by Jinhu Wu."
...
This reverts commit e4fca6aae4
.
Revert "HADOOP-15671. AliyunOSS: Support Assume Roles in AliyunOSS. Contributed by Jinhu Wu."
This reverts commit 2b635125fb
.
2019-01-05 17:35:29 +09:00
Weiwei Yang
040a202b20
HADOOP-15323. AliyunOSS: Improve copy file performance for AliyunOSSFileSystemStore. Contributed wujinhu.
2019-01-03 21:37:49 +08:00
Weiwei Yang
5ff0cf86a9
HADOOP-15943. AliyunOSS: add missing owner & group attributes for oss FileStatus. Contributed by wujinhu.
2018-11-23 13:56:54 +08:00
Weiwei Yang
be0708c6eb
HADOOP-15919. AliyunOSS: Enable Yarn to use OSS. Contributed by wujinhu.
2018-11-19 14:00:54 +08:00
Sammi Chen
3fade865ce
HADOOP-15917. AliyunOSS: fix incorrect ReadOps and WriteOps in statistics. Contributed by Jinhu Wu.
2018-11-14 12:58:57 +08:00
Sammi Chen
7574d18538
HADOOP-15868. AliyunOSS: update document for properties of multiple part download, multiple part upload and directory copy. Contributed by Jinhu Wu.
2018-10-26 15:19:56 +08:00
Sammi Chen
2b635125fb
HADOOP-15671. AliyunOSS: Support Assume Roles in AliyunOSS. Contributed by Jinhu Wu.
2018-09-25 19:48:30 +08:00
Akira Ajisaka
3e3963b035
HADOOP-15552. Move logging APIs over to slf4j in hadoop-tools - Part2. Contributed by Ian Pickering.
2018-08-16 00:31:59 +09:00
Sammi Chen
0857f116b7
HADOOP-15607. AliyunOSS: fix duplicated partNumber issue in AliyunOSSBlockOutputStream. Contributed by Jinhu Wu.
2018-07-30 10:53:44 +08:00
Akira Ajisaka
2b2399d623
HADOOP-15495. Upgrade commons-lang version to 3.7 in hadoop-common-project and hadoop-tools. Contributed by Takanobu Asanuma.
2018-06-28 14:37:22 +09:00
Yiqun Lin
1312f9ae4c
HADOOP-15391. Add missing css file in hadoop-aws, hadoop-aliyun, hadoop-azure and hadoop-azure-datalake modules.
2018-04-18 16:04:00 +08:00
Sammi Chen
6542d17ea4
HADOOP-14999. AliyunOSS: provide one asynchronous multi-part based uploading mechanism. Contributed by Genmao Yu.
2018-03-30 20:23:05 +08:00
Sammi Chen
d67a5e2dec
HADOOP-15262. AliyunOSS: move files under a directory in parallel when rename a directory. Contributed by Jinhu Wu.
2018-03-19 15:02:37 +08:00
Sammi Chen
9195a6e302
HADOOP-15027. AliyunOSS: Support multi-thread pre-read to improve sequential read from Hadoop to Aliyun OSS performance. (Contributed by Jinhu Wu)
2018-01-17 15:55:59 +08:00
Kai Zheng
ce04340ec7
HADOOP-15104. AliyunOSS: change the default value of max error retry. Contributed by Jinhu Wu
2017-12-08 23:03:54 +08:00
Steve Loughran
c326fc89b0
HADOOP-15024 Support user agent configuration and include that & Hadoop version information to oss server.
...
Contributed by Sammi Chen.
2017-11-20 18:56:42 +00:00
Kai Zheng
18621af7ae
HADOOP-14993. AliyunOSS: Override listFiles and listLocatedStatus. Contributed Genmao Yu
2017-11-14 17:58:37 +08:00
Steve Loughran
27ab5f7385
HADOOP-14787. AliyunOSS: Implement the createNonRecursive
operator.
...
Contributed by Genmao Yu
2017-08-22 11:55:48 +01:00
Kai Zheng
267e19a09f
HADOOP-14194. Aliyun OSS should not use empty endpoint as default. Contributed by Genmao Yu
2017-08-21 13:36:28 +08:00
Akira Ajisaka
6e6e96583f
HADOOP-14466. Remove useless document from TestAliyunOSSFileSystemContract.java. Contributed by Chen Liang.
2017-06-01 13:08:01 +09:00
Akira Ajisaka
07e60f85d8
HADOOP-14458. Add missing imports to TestAliyunOSSFileSystemContract.java. Contributed by Mingliang Liu.
2017-05-30 15:11:10 +09:00
Akira Ajisaka
6a52b5e144
HADOOP-14180. FileSystem contract tests to replace JUnit 3 with 4. Contributed by Xiaobing Zhou and Mingliang Liu.
2017-05-25 15:36:44 +09:00
Mingliang Liu
86035c1644
HADOOP-14192. Aliyun OSS FileSystem contract test should implement getTestBaseDir(). Contributed by Mingliang Liu
2017-03-17 11:01:00 -07:00
Kai Zheng
cd3e59a3dc
HADOOP-14072. AliyunOSS: Failed to read from stream when seek beyond the download size. Contributed by Genmao Yu
2017-02-15 16:34:30 +08:00
Kai Zheng
01be4503c3
HADOOP-14069. AliyunOSS: listStatus returns wrong file info. Contributed by Fei Hui
2017-02-13 13:18:20 +08:00
Kai Zheng
5b151290ae
HADOOP-13768. AliyunOSS: handle the failure in the batch delete operation deleteDirs
. Contributed by Genmao Yu
2017-02-10 14:53:18 +08:00
Kai Zheng
a8a594b4c8
HADOOP-14065. AliyunOSS: oss directory filestatus should use meta time. Contributed by Fei Hui
2017-02-09 18:00:22 +08:00
Mingliang Liu
d9f73f1b7c
HADOOP-13723. AliyunOSSInputStream#read() should update read bytes stat correctly. Contributed by Mingliang Liu
2016-10-13 22:33:55 -07:00
Kai Zheng
c31b5e61b1
HADOOP-13701. AbstractContractRootDirectoryTest can fail when handling delete "/". Contributed by Genmao Yu
2016-10-08 17:35:59 +06:00
Kai Zheng
26d5df390c
HADOOP-13634. Some configuration in doc has been outdated. Contributed by Genmao Yu
2016-09-23 08:44:28 +08:00
Kai Zheng
22af6f8db3
HADOOP-13624. Rename TestAliyunOSSContractDispCp. Contributed by Genmao Yu
2016-09-21 14:02:44 +08:00
Kai Zheng
08b37603d9
HADOOP-13591. Unit test failure in TestOSSContractGetFileStatus and TestOSSContractRootDir. Contributed by Genmao Yu
2016-09-20 15:12:02 +08:00
Kai Zheng
9cd4760257
HADOOP-13609. Refine credential provider related codes for AliyunOss integration. Contributed by Genmao Yu
2016-09-20 11:39:02 +08:00
Kai Zheng
a1940464a4
HADOOP-13610. Clean up AliyunOss integration tests. Contributed by Genmao Yu
2016-09-18 19:10:48 +08:00
Mingfei
e671a0f52b
HADOOP-13481. User documents for Aliyun OSS FileSystem. Contributed by Genmao Yu.
2016-09-07 11:17:43 +08:00
Mingfei
d33e928fbe
HADOOP-13529. Do some code refactoring. Contributed by Genmao Yu.
2016-09-07 11:15:48 +08:00
Mingfei
cdb77110e7
HADOOP-13498. The number of multi-part upload part should not bigger than 10000. Contributed by Genmao Yu.
2016-09-07 11:15:48 +08:00
Mingfei
6bb741b9f8
HADOOP-13499. Support session credentials for authenticating with Aliyun. Contributed by Genmao Yu.
2016-09-07 11:15:48 +08:00