hadoop/hadoop-tools/hadoop-resourceestimator
Steve Loughran 9ca4ac0af0
HADOOP-18305. Preparing for 3.3.4 release: branch-3.3 version => 3.3.9 (#4482)
Updating the hadoop version of branch-3.3 to 3.3.9-SNAPSHOT
pending agreement on what number its future release should take.

Using 3.3.9-SNAPSHOT puts space in for other incremental releases,
while avoiding creating JIRA release ordering and autocompletion
confusion the way adding a 3.3.10 or higher version would do.

Contributed by Steve Loughran
2022-06-22 13:09:50 +01:00
..
src HADOOP-16232. Fix errors in the checkstyle configration xmls. Contributed by Wanqiang Ji. 2019-04-03 19:35:02 +09:00
pom.xml HADOOP-18305. Preparing for 3.3.4 release: branch-3.3 version => 3.3.9 (#4482) 2022-06-22 13:09:50 +01:00
README.md HADOOP-14840. Tool to estimate resource requirements of an application pipeline based on prior executions. (Rui Li via Subru). 2017-10-25 15:51:27 -07:00

Resource Estimator Service

Resource Estimator Service can parse the history logs of production jobs, extract their resource consumption skylines in the past runs and predict their resource requirements for the new run.

Current Status

  • Support Hadoop YARN ResourceManager logs.
  • In-memory store for parsed history resource skyline and estimation.
  • A Linear Programming based estimator.
  • Provides REST interface to parse logs, query history store and estimations.

Upcoming features

  • UI to query history and edit and save estimations.
  • Persisent store implementation for store (either DB based or distributed key-value like HBase).
  • Integrate directly with the Hadoop YARN Reservation System to make a recurring reservation based on the estimated resources.

Refer to the design document for more details.