f365957c63
S3A to implement S3 Select through this API. The new openFile() API is asynchronous and is implemented across FileSystem and FileContext. The MapReduce V2 inputs have moved to this API, so jobs can set must/may options that are passed in to it. This is more useful for setting things like the S3A seek policy than for S3 Select, as the existing input formats and record readers can't handle S3 Select output, where the stream is shorter than the file length, and splitting plain text is suboptimal. Future work is needed there. In the meantime, all filesystem connectors are now free to add their own filesystem-specific configuration parameters, which can be set in jobs and used to set filesystem input stream options (seek policy, retry, encryption secrets, etc.). Contributed by Steve Loughran |
||
---|---|---|
hadoop-mapreduce-client-app | ||
hadoop-mapreduce-client-common | ||
hadoop-mapreduce-client-core | ||
hadoop-mapreduce-client-hs | ||
hadoop-mapreduce-client-hs-plugins | ||
hadoop-mapreduce-client-jobclient | ||
hadoop-mapreduce-client-nativetask | ||
hadoop-mapreduce-client-shuffle | ||
hadoop-mapreduce-client-uploader | ||
pom.xml |
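
The commit message above describes an asynchronous open call that accepts must/may options. The sketch below models that pattern using only the Java standard library. It is not Hadoop's code: `OpenFileOptionsSketch`, its `Builder`, and the key `fs.example.unknown` are hypothetical names made up for illustration, and the returned map stands in for an input stream. It assumes the usual contract for such builders, namely that an optional key the connector does not recognise is ignored while an unrecognised mandatory key is rejected. Raising the rejection directly from `build()` is a simplification chosen here.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.Set;
import java.util.concurrent.CompletableFuture;

// Stdlib-only model of a must/opt open-file builder. These are NOT Hadoop classes.
public class OpenFileOptionsSketch {

    /** Hypothetical builder: collects options, then "opens" the file asynchronously. */
    static final class Builder {
        private final String path;
        private final Set<String> supported; // keys this pretend connector understands
        private final Map<String, String> optional = new HashMap<>();
        private final Map<String, String> mandatory = new HashMap<>();

        Builder(String path, Set<String> supported) {
            this.path = path;
            this.supported = supported;
        }

        /** Optional ("may") option: silently dropped if the connector does not know it. */
        Builder opt(String key, String value) {
            optional.put(key, value);
            return this;
        }

        /** Mandatory ("must") option: the open is rejected if the connector does not know it. */
        Builder must(String key, String value) {
            mandatory.put(key, value);
            return this;
        }

        /** Validates mandatory keys, then completes asynchronously with the effective options. */
        CompletableFuture<Map<String, String>> build() {
            for (String key : mandatory.keySet()) {
                if (!supported.contains(key)) {
                    throw new IllegalArgumentException(
                        "Unsupported mandatory option for " + path + ": " + key);
                }
            }
            Map<String, String> effective = new HashMap<>(mandatory);
            optional.forEach((k, v) -> {
                if (supported.contains(k)) {
                    effective.put(k, v);
                }
            });
            // A real connector would return a future of an input stream here.
            return CompletableFuture.supplyAsync(() -> effective);
        }
    }

    public static void main(String[] args) {
        // The seek-policy key is used purely as a sample string in this model.
        Set<String> supported = Set.of("fs.s3a.experimental.input.fadvise");

        Map<String, String> opened = new Builder("s3a://bucket/data.csv", supported)
            .opt("fs.s3a.experimental.input.fadvise", "random")
            .opt("fs.example.unknown", "ignored")
            .build()
            .join(); // block until the asynchronous open completes
        System.out.println("effective options: " + opened);

        try {
            new Builder("s3a://bucket/data.csv", supported)
                .must("fs.example.unknown", "x")
                .build();
        } catch (IllegalArgumentException e) {
            System.out.println("rejected: " + e.getMessage());
        }
    }
}
```

The caller decides per option whether an unaware connector should fail the open or carry on. This lets one job configuration be shared across filesystems that understand different option sets.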