Brahma Reddy Battula
4aae2d40a3
HADOOP-14410. Correct spelling of 'beginning' and variants. Contributed By Dongtao Zhang
2017-05-11 23:33:45 +05:30
Carlo Curino
5cb6e3e082
YARN-6473. Create ReservationInvariantChecker to validate ReservationSystem + Scheduler operations.
2017-05-10 14:49:25 -07:00
Daniel Templeton
e7654c4a1f
YARN-6571. Fix JavaDoc issues in SchedulingPolicy
...
(Contributed by Weiwei Yang via Daniel Templeton)
2017-05-10 14:34:13 -07:00
Karthik Kambatla
166be0ee95
YARN-3742. YARN RM will shut down if ZKClient creation times out. (Daniel Templeton via kasha)
2017-05-09 14:44:16 -07:00
Andrew Wang
543aac9f28
HADOOP-14386. Rewind trunk from Guava 21.0 back to Guava 11.0.2.
2017-05-09 09:22:53 -07:00
Daniel Templeton
9f0aea0ee2
YARN-6481. Yarn top shows negative container number in FS
...
(Contributed by Tao Jie via Daniel Templeton)
2017-05-02 13:06:47 -07:00
Subru Krishnan
6bf42e48ef
YARN-5331. Extend RLESparseResourceAllocation with period for supporting recurring reservations in YARN ReservationSystem. (Sangeetha Abdu Jyothi via Subru).
2017-05-01 18:48:36 -07:00
Subru Krishnan
a3a615eeab
YARN-4359. Update LowCost agents logic to take advantage of YARN-4358. (Jonathan Yaniv and Ishai Menache via Subru).
2017-05-01 16:01:07 -07:00
Naganarasimha
30fc580196
YARN-6519. Fix warnings from Spotbugs in hadoop-yarn-server-resourcemanager. Contributed by Weiwei Yang.
2017-05-01 20:15:27 +05:30
Daniel Templeton
2e52789edf
YARN-2962. ZKRMStateStore: Limit the number of znodes under a znode (Contributed by Varun Sexena via Daniel Templeton)
2017-04-28 13:30:28 -07:00
Wangda Tan
2ba21d6376
YARN-6392. Add submit time to Application Summary log. (Zhihai Xu via wangda)
2017-04-24 13:17:35 -07:00
Eric Payne
52adf71914
YARN-5617. AMs only intended to run one attempt can be run more than once? Contributed by Jason Lowe.
2017-04-24 12:03:28 -05:00
Daniel Templeton
dd43b895c2
YARN-6202. Configuration item Dispatcher.DISPATCHER_EXIT_ON_ERROR_KEY is disregarded
...
(Contributed by Yufei Gu via Daniel Templeton)
2017-04-19 11:44:55 -07:00
Sunil G
9ff25408bc
YARN-6164. Expose Queue Configurations per Node Label through YARN client api. Contributed by Benson Qiu.
2017-04-19 20:39:09 +05:30
Chris Douglas
af8e9842d2
YARN-6451. Add RM monitor validating metrics invariants. Contributed by Carlo Curino
2017-04-18 10:29:44 -07:00
Junping Du
6ed9d36224
YARN-6304. Skip rm.transitionToActive call to RM if RM is already active. Contributed by Rohith Sharma K S.
2017-04-17 15:51:10 -07:00
Karthik Kambatla
c3375175d6
YARN-6432. FairScheduler: Reserve preempted resources for corresponding applications. (Miklos Szegedi via kasha)
2017-04-12 14:21:20 -07:00
Wangda Tan
4d4ad0ebb7
YARN-6439. Fix ReservationSystem creation of default ReservationQueue. (Carlo Curino via wangda)
2017-04-11 14:56:18 -07:00
Naganarasimha
7d873c465b
YARN-6463. Correct spelling mistake in FileSystemRMStateStore. Contributed by Yeliang Cang.
2017-04-11 22:42:08 +05:30
Jason Lowe
0e065f2ede
YARN-6195. Export UsedCapacity and AbsoluteUsedCapacity to JMX. Contributed by Benson Qiu
2017-04-11 08:44:18 -05:00
Konstantinos Karanasos
7999318af1
YARN-6344. Add parameter for rack locality delay in CapacityScheduler. (kkaranasos)
2017-04-10 15:34:44 -07:00
Karthik Kambatla
2aa8967809
YARN-6298. Metric preemptCall is not used in new preemption. (Yufei Gu via kasha)
2017-04-07 17:31:59 -07:00
Rohith Sharma K S
1a9439e299
YARN-6424. TimelineCollector is not stopped when an app finishes in RM. Contributed by Varun Saxena.
2017-04-06 10:15:22 +05:30
Daniel Templeton
a2c57bb70d
YARN-6381. FSAppAttempt has several variables that should be final
...
(Contributed by Ameet Zaveri via Daniel Templeton)
2017-04-05 16:07:24 -07:00
Karthik Kambatla
b4c4f36594
YARN-6448. Continuous scheduling thread crashes while sorting nodes. (Yufei Gu via kasha)
2017-04-05 15:43:14 -07:00
Wangda Tan
87e2ef8c98
YARN-6406. Remove SchedulerRequestKeys when no more pending ResourceRequest. (Arun Suresh via wangda)
2017-04-05 11:28:30 -07:00
Varun Saxena
19b89c4c7b
YARN-6420. RM startup failure due to wrong order in nodelabel editlog (Bibin A Chundatt via Varun Saxena)
2017-04-05 09:38:35 +05:30
Wangda Tan
3fdae0a2b6
YARN-6109. Add an ability to convert ChildQueue to ParentQueue. (Xuan Gong via wangda)
2017-04-04 14:39:01 -07:00
Eric Payne
318bfb01bc
YARN-6354. LeveldbRMStateStore can parse invalid keys when recovering reservations. Contributed by Jason Lowe
2017-03-31 12:30:35 -05:00
Robert Kanter
9bae6720cb
YARN-6050. AMs can't be scheduled on racks or nodes (rkanter)
2017-03-27 17:23:13 -07:00
Wangda Tan
cd014d57aa
YARN-6339. Improve performance for createAndGetApplicationReport. (Yunjiong Zhao via wangda)
2017-03-27 13:29:09 -07:00
Akira Ajisaka
b63ae854f9
YARN-6379. Remove unused argument in ClientRMService. Contributed by Kai Sasaki.
2017-03-27 11:23:41 +09:00
Sunil G
cffea251be
YARN-5956. Refactor ClientRMService for unify error handling across apis. Contributed by Kai Sasaki.
2017-03-25 22:26:19 +05:30
Tsuyoshi Ozawa
84ddedc0b2
HADOOP-10101. Update guava dependency to the latest version. (ozawa)
2017-03-25 10:06:52 +09:00
Daniel Templeton
0a05c5c598
YARN-6284. hasAlreadyRun should be final in ResourceManager.StandByTransitionRunnable
...
(Contributed by Laura Adams via Daniel Templeton)
2017-03-21 14:15:40 -07:00
Varun Saxena
35034653d0
YARN-6353. Clean up OrderingPolicy javadoc (Daniel Templeton via Varun Saxena)
2017-03-20 23:29:09 +05:30
Sunil G
2d0e24eb01
YARN-6328. Fix a spelling mistake in CapacityScheduler. Contributed by Jin Yibo. This closes #202
2017-03-15 14:48:05 +05:30
Naganarasimha
0a3aa40fe7
YARN-6327. Removing queues from CapacitySchedulerQueueManager and ParentQueue should be done with iterator. Contributed by Jonathan Hung.
2017-03-15 01:22:25 +05:30
Ray Chiang
4db9cc70d0
YARN-6042. Dump scheduler and queue state information into FairScheduler DEBUG log. (Yufei Gu via rchiang)
2017-03-10 16:13:31 -08:00
Karthik Kambatla
a96afae125
YARN-6264. AM not launched when a single vcore is available on the cluster. (Yufei Gu via kasha)
2017-03-09 23:11:54 -08:00
Daniel Templeton
846a0cd678
YARN-1047. Expose # of pre-emptions as a queue counter (Contributed by Karthik Kambatla via Daniel Templeton)
2017-03-09 17:51:47 -08:00
Jason Lowe
d7762a5511
YARN-6165. Intra-queue preemption occurs even when preemption is turned off for a specific queue. Contributed by Eric Payne
2017-03-08 16:46:09 -06:00
Sunil G
1eb8186703
YARN-6207. Move application across queues should handle delayed event processing. Contributed by Bibin A Chundatt.
2017-03-08 12:04:30 +05:30
Daniel Templeton
e0c239cdbd
YARN-6287. RMCriticalThreadUncaughtExceptionHandler.rmContext should be final (Contributed by Corey Barker via Daniel Templeton)
2017-03-07 11:58:48 -08:00
Sunil G
c1386a90d7
YARN-6248. user is not removed from UsersManager’s when app is killed with pending container requests. Contributed by Eric Payne.
2017-03-04 13:46:35 +05:30
Junping Du
8db7a8c3ae
YARN-6271. yarn rmadin -getGroups returns information from standby RM. Contributed by Jian He.
2017-03-03 14:39:31 -08:00
Ray Chiang
2148b83993
YARN-6218. Fix TestAMRMClient when using FairScheduler. (Miklos Szegedi via rchiang)
2017-03-03 13:04:51 -08:00
Jason Lowe
e58fc76030
YARN-6263. NMTokenSecretManagerInRM.createAndGetNMToken is not thread safe. Contributed by Haibo Chen
2017-03-03 08:23:08 -06:00
Jian He
235203dffd
YARN-6153. KeepContainer does not work when AM retry window is set. Contributed by kyungwan nam
2017-02-28 13:23:36 -08:00
Xuan
e0bb867c3f
YARN-6189: Improve application status log message when RM restarted when
...
app is in NEW state. Contributed by Junping Du
2017-02-28 11:04:56 -08:00
Wangda Tan
eac6b4c35c
YARN-6216. Unify Container Resizing code paths with Container Updates making it scheduler agnostic. (Arun Suresh via wangda)
2017-02-28 10:35:50 -08:00
Naganarasimha
5f5b031d1f
YARN-5703. ReservationAgents are not correctly configured. Contributed by Manikandan R.
2017-02-27 20:38:29 +05:30
Karthik Kambatla
fbfe86deea
YARN-6172. FSLeafQueue demand update needs to be atomic. (Miklos Szegedi via kasha)
2017-02-26 20:36:33 -08:00
Karthik Kambatla
815d53506f
YARN-6215. FairScheduler preemption and update should not run concurrently. (Tao Jie via kasha)
2017-02-26 20:16:36 -08:00
Karthik Kambatla
694e680d20
YARN-6222. TestFairScheduler.testReservationMetrics is flaky. (Yufei Gu via kasha)
2017-02-23 15:21:52 -08:00
Sunil G
a207aa9930
YARN-6211. Synchronization improvement for moveApplicationAcrossQueues and updateApplicationPriority. Contributed by Bibin A Chundatt.
2017-02-23 14:19:07 +05:30
Karthik Kambatla
b10e962224
YARN-6194. Cluster capacity in SchedulingPolicy is updated only on allocation file reload. (Yufei Gu via kasha)
2017-02-22 15:58:49 -08:00
Karthik Kambatla
718ad9f6ee
YARN-6210. FairScheduler: Node reservations can interfere with preemption. (kasha)
2017-02-22 15:46:07 -08:00
Wangda Tan
1a6ca75f38
YARN-6143. Fix incompatible issue caused by YARN-3583. (Sunil G via wangda)
2017-02-22 11:17:09 -08:00
Karthik Kambatla
dbbfcf74ab
YARN-6193. FairScheduler might not trigger preemption when using DRF. (kasha)
2017-02-17 14:07:31 -08:00
Daniel Templeton
c7a36e6130
YARN-6125. The application attempt's diagnostic message should have a maximum size
...
(Contributed by Andras Piros via Daniel Templeton)
2017-02-17 13:40:58 -08:00
Daniel Templeton
9a92837786
YARN-6188. Fix OOM issue with decommissioningNodesWatcher in the case of clusters with
...
large number of nodes (Contributed by Ajay Jadhav via Daniel Templeton)
2017-02-17 13:13:46 -08:00
Karthik Kambatla
a77f432449
YARN-6171. ConcurrentModificationException on FSAppAttempt.containersToPreempt. (Miklos Szegedi via kasha)
2017-02-16 14:54:58 -08:00
Rohith Sharma K S
e63a7814d2
YARN-6200. addendum to fix compilation error caused by reverting YARN-5068.
2017-02-16 18:19:57 +05:30
Rohith Sharma K S
a393e84c6f
YARN-6200. Reverting since the same functionality achieved by YARN-1623.
...
Revert "YARN-5068. Expose scheduler queue to application master. (Harish Jaiprakash via rohithsharmaks)"
This reverts commit b7ac85259c
.
2017-02-16 18:00:25 +05:30
Karthik Kambatla
74dd142250
YARN-5798. Set UncaughtExceptionHandler for all FairScheduler threads. (Yufei Gu via kasha)
2017-02-16 00:03:09 -08:00
Karthik Kambatla
11be3f70e0
YARN-4212. FairScheduler: Can't create a DRF queue under a FAIR policy queue. (Yufei Gu via kasha)
2017-02-15 23:51:22 -08:00
Karthik Kambatla
6c25dbcdc0
YARN-6163. FS Preemption is a trickle for severely starved applications. (kasha)
2017-02-15 23:16:12 -08:00
Varun Saxena
b7613e0f40
YARN-6156. AM blacklisting to consider node label partition (Bibin A Chundatt via Varun Saxena)
2017-02-15 14:48:17 +05:30
Karthik Kambatla
353a9b2d91
YARN-6061. Addendum. Remove extraneous change.
2017-02-14 15:19:52 -08:00
Karthik Kambatla
652679aa8a
YARN-6061. Add an UncaughtExceptionHandler for critical threads in RM. (Yufei Gu via kasha)
2017-02-14 13:39:41 -08:00
Arun Suresh
aaf106fde3
YARN-5966. AMRMClient changes to support ExecutionType update. (asuresh)
2017-02-14 06:09:10 -08:00
Karthik Kambatla
646c6d6509
YARN-3933. FairScheduler: Multiple calls to completedContainer are not safe. (Shiwei Guo and Miklos Szegedi via kasha)
2017-02-13 11:26:30 -08:00
Karthik Kambatla
08f93978f3
YARN-6144. FairScheduler: preempted resources can become negative. (Miklos Szegedi via kasha)
2017-02-09 14:45:04 -08:00
Karthik Kambatla
9b85053583
YARN-6112. UpdateCallDuration is calculated only when debug logging is enabled. (Yufei Gu via kasha)
2017-02-09 13:09:32 -08:00
Wangda Tan
5fb723bb77
YARN-5889. Improve and refactor user-limit calculation in Capacity Scheduler. (Sunil G via wangda)
2017-02-09 10:23:50 -08:00
Daniel Templeton
87852b6ef4
YARN-6103. Log updates for ZKRMStateStore (Contributed by Daniel Sturman via Daniel Templeton)
2017-01-30 21:44:48 -08:00
Sunil G
165f07f51a
YARN-6123. [YARN-5864] Add a test to make sure queues of orderingPolicy will be updated when childQueues is added or removed. Contributed by Wangda Tan.
2017-01-27 18:37:51 +05:30
Daniel Templeton
f85b74ccf9
YARN-4975. Fair Scheduler: exception thrown when a parent queue marked 'parent' has configured child queues
...
(Contributed by Yufei Gu via Daniel Templeton)
2017-01-26 10:31:09 -08:00
Karthik Kambatla
abedb8a9d8
YARN-5830. FairScheduler: Avoid preempting AM containers. (Yufei Gu via kasha)
2017-01-25 12:17:28 -08:00
Rohith Sharma K S
0101267d9d
YARN-6082. Invalid REST api response for getApps since queueUsagePercentage is coming as INF. Contributed by Sunil G.
2017-01-24 10:59:55 +05:30
Wangda Tan
ce832059db
YARN-5864. Capacity Scheduler - Queue Priorities. (wangda)
2017-01-23 10:52:14 -08:00
Jason Lowe
69fa81679f
YARN-5910. Support for multi-cluster delegation tokens. Contributed by Jian He
2017-01-23 17:12:51 +00:00
Sunil G
3fa0d540df
YARN-6031. Application recovery has failed when node label feature is turned off during RM recovery. Contributed by Ying Zhang.
2017-01-23 12:19:35 +05:30
Wangda Tan
72054a817d
YARN-5556. CapacityScheduler: Support deleting queues without requiring a RM restart. (Naganarasimha G R via wangda)
2017-01-18 13:31:33 -08:00
Karthik Kambatla
e224c96234
YARN-5831. FairScheduler: Propagate allowPreemptionFrom flag all the way down to the app. (Yufei Gu via kasha)
2017-01-17 17:01:31 -08:00
Sunil G
d3170f9eba
YARN-6081. LeafQueue#getTotalPendingResourcesConsideringUserLimit should deduct reserved from pending to avoid unnecessary preemption of reserved container. Contributed by Wangda Tan.
2017-01-13 18:22:29 +05:30
Naganarasimha
a6b06f7179
YARN-6072. RM unable to start in secure mode. Contributed by Ajith S.
2017-01-12 06:40:41 +05:30
Daniel Templeton
7979939428
YARN-5554. MoveApplicationAcrossQueues does not check user permission on the target queue
...
(Contributed by Wilfred Spiegelenburg via Daniel Templeton)
2017-01-11 14:34:10 -08:00
Sunil G
9594c35dcb
YARN-5899. Debug log in AbstractCSQueue#canAssignToThisQueue needs improvement. Contributed by Ying Zhang.
2017-01-10 14:08:01 +05:30
Junping Du
945db55f2e
YARN-4148. When killing app, RM releases app's resource before they are released by NM. Contributed by Jason Lowe.
2017-01-09 18:14:46 -08:00
Wangda Tan
287d3d6804
YARN-3955. Support for application priority ACLs in queues of CapacityScheduler. (Sunil G via wangda)
2017-01-09 08:40:39 -08:00
Daniel Templeton
ac16400e1f
YARN-6022. Revert changes of AbstractResourceRequest (Contributed by Wangda Tan via Daniel Templeton)
2017-01-07 08:24:30 -08:00
Naganarasimha
a59df15757
YARN-6015. AsyncDispatcher thread name can be set to improved debugging. Contributed by Ajith S.
2017-01-07 08:11:58 +05:30
Jian He
020316458d
YARN-6009. Skip validating app timeout value on recovery. Contributed by Rohith Sharma K S
2017-01-06 18:19:09 -08:00
Arun Suresh
2977bc6a14
YARN-6040. Introduce api independent PendingAsk to replace usage of ResourceRequest within Scheduler classes. (Wangda Tan via asuresh)
2017-01-06 09:59:57 -08:00
Wangda Tan
0a55bd841e
YARN-5959. RM changes to support change of container ExecutionType. (Arun Suresh via wangda)
2017-01-05 10:31:05 -08:00
Rohith Sharma K S
e49e0a6e37
YARN-5988. RM unable to start in secure setup. Contributed by Ajith S.
2017-01-04 12:09:29 +05:30
Wangda Tan
f69a107aec
YARN-6025. Fix synchronization issues of AbstractYarnScheduler#nodeUpdate and its implementations. (Naganarasimha G R via wangda)
...
(cherry picked from commit e0f2379312
)
2017-01-03 14:53:36 -08:00
Wangda Tan
ab1faa4ba8
Revert "Fix synchronization issues of AbstractYarnScheduler#nodeUpdate and its implementations. (Naganarasimha G R via wangda)" to add correct JIRA number
...
This reverts commit e0f2379312
.
2017-01-03 14:53:13 -08:00
Wangda Tan
e0f2379312
Fix synchronization issues of AbstractYarnScheduler#nodeUpdate and its implementations. (Naganarasimha G R via wangda)
2017-01-03 14:46:41 -08:00
Daniel Templeton
165d01a73e
YARN-5931. Document timeout interfaces CLI and REST APIs (Contributed by Rohith Sharma K S via Daniel Templeton)
2016-12-30 08:41:52 -08:00
Robert Kanter
f216276d21
YARN-4882. Change the log level to DEBUG for recovering completed applications (templedf via rkanter)
2016-12-28 15:21:52 -08:00
Sunil G
9ca54f4810
YARN-5906. Update AppSchedulingInfo to use SchedulingPlacementSet. Contributed by Wangda Tan.
2016-12-28 22:48:19 +05:30
Wangda Tan
0840b4329b
YARN-5756. Add state-machine implementation for scheduler queues. (Xuan Gong via wangda)
2016-12-27 21:18:33 -08:00
Robert Kanter
1bbd023275
YARN-5962. Spelling errors in logging and exceptions for resource manager code (gsohn via rkanter)
2016-12-27 13:35:34 -08:00
Daniel Templeton
e216e8e233
HADOOP-13932. Fix indefinite article in comments (Contributed by LiXin Ge via Daniel Templeton)
2016-12-27 13:03:16 -08:00
Arun Suresh
ac1e5d4f77
YARN-5938. Refactoring OpportunisticContainerAllocator to use SchedulerRequestKey instead of Priority and other misc fixes (asuresh)
2016-12-27 12:40:44 -08:00
Karthik Kambatla
c3973e7080
YARN-5969. FairShareComparator: Cache value of getResourceUsage for better performance. (zhangshilong via kasha)
2016-12-27 12:30:35 -08:00
Karthik Kambatla
4d3f73acc0
YARN-6000. Make AllocationFileLoaderService.Listener public. (Tao Jie via kasha)
2016-12-23 11:40:56 -08:00
Sunil G
ef2dd7b78c
YARN-5650. Render Application Timeout value in web UI. Contributed by Akhil PB.
2016-12-19 11:44:03 +05:30
Arun Suresh
2273a74c1f
YARN-5646. Add documentation and update config parameter names for scheduling of OPPORTUNISTIC containers. (Konstantinos Karanasos via asuresh)
2016-12-16 08:14:34 -08:00
Jian He
ada876cd1d
Revert YARN-4126. RM should not issue delegation tokens in unsecure mode.
2016-12-13 20:49:54 -08:00
Robert Kanter
4b149a1e77
YARN-4457. Cleanup unchecked types for EventHandler (templedf via rkanter)
2016-12-09 17:28:25 -08:00
Jian He
a6410a542e
YARN-5709. Cleanup leader election configs and pluggability. Contribtued by Karthik Kambatla
2016-12-09 17:00:37 -08:00
Arun Suresh
b0aace21b1
YARN-5982. Simplify opportunistic container parameters and metrics. (Konstantinos Karanasos via asuresh)
2016-12-09 16:41:25 -08:00
Sunil G
74d0066d33
YARN-5970. Validate application update timeout request parameters. Contributed by Rohith Sharma K S.
2016-12-08 15:53:56 +05:30
Daniel Templeton
9f5d2c4fff
YARN-5136. Error in handling event type APP_ATTEMPT_REMOVED to the scheduler
...
(Contributed by Wilfred Spiegelenburg via Daniel Templeton)
2016-12-07 11:12:14 -08:00
Sunil
ab923a53fc
YARN-5965. Retrospect ApplicationReport#getApplicationTimeouts. Contributed by Rohith Sharma K S
2016-12-08 00:27:25 +05:30
Rohith Sharma K S
563480dccd
YARN-5932. Retrospect moveApplicationToQueue in align with YARN-5611. Contributed by Sunil G.
2016-12-07 10:39:14 +05:30
Naganarasimha
f3b8ff54ab
YARN-5921. Incorrect synchronization in RMContextImpl#setHAServiceState/getHAServiceState. Contributed by Varun Saxena
2016-12-06 06:53:38 +05:30
Jian He
f885160f4a
YARN-5746. The state of the parentQueue and its childQueues should be synchronized. Contributed by Xuan Gong
2016-12-02 16:17:31 -08:00
Daniel Templeton
5bd18c49bd
YARN-5929. Missing scheduling policy in the FS queue metric. (Contributed by Yufei Gu via Daniel Templeton)
2016-12-02 13:55:42 -08:00
Li Lu
69fb70c31a
YARN-5761. Separate QueueManager from Scheduler. (Xuan Gong via gtcarrera9)
2016-11-30 13:38:42 -08:00
Daniel Templeton
b3befc021b
YARN-4997. Update fair scheduler to use pluggable auth provider (Contributed by Tao Jie via Daniel Templeton)
2016-11-30 09:50:33 -08:00
Daniel Templeton
3b9d3acd20
YARN-5890. FairScheduler should log information about AM-resource-usage and max-AM-share for queues
...
(Contributed by Yufei Gu via Daniel Templeton)
2016-11-29 12:46:05 -08:00
Daniel Templeton
25f9872be6
YARN-5774. MR Job stuck in ACCEPTED status without any progress in Fair Scheduler
...
if set yarn.scheduler.minimum-allocation-mb to 0. (Contributed by Yufei Gu via Daniel Templeton)
2016-11-29 09:40:49 -08:00
Sunil
eb0a483ed0
YARN-4206. Add Application timeouts in Application report and CLI. Contributed by Rohith Sharma K S.
2016-11-24 18:18:42 +05:30
Daniel Templeton
10468529a9
YARN-4752. Improved preemption in FairScheduler. (kasha)
...
Contains:
YARN-5605. Preempt containers (all on one node) to meet the requirement of starved applications
YARN-5821. Drop left-over preemption-related code and clean up method visibilities in the Schedulable hierarchy
YARN-5783. Verify identification of starved applications.
YARN-5819. Verify fairshare and minshare preemption
YARN-5885. Cleanup YARN-4752 branch for merge
Change-Id: Iee0962377d019dd64dc69a020725d2eaf360858c
2016-11-23 19:48:59 -10:00
Jian He
1f12867a69
YARN-5649. Add REST endpoints for updating application timeouts. Contributed by Rohith Sharma K S
2016-11-23 16:25:39 -08:00
Arun Suresh
005850b28f
YARN-5918. Handle Opportunistic scheduling allocate request failure when NM is lost. (Bibin A Chundatt via asuresh)
2016-11-23 09:53:31 -08:00
Daniel Templeton
83cc7263af
YARN-5722. FairScheduler hides group resolution exceptions when assigning queue (Contributed by Wilfred Spiegelenburg via Daniel Templeton)
2016-11-22 11:16:00 -08:00
Rohith Sharma K S
a926f895c1
YARN-5865. Retrospect updateApplicationPriority api to handle state store exception in align with YARN-5611. Contributed by Sunil G.
2016-11-22 14:49:15 +05:30
Rohith Sharma K S
d65603517e
YARN-5375. invoke MockRM#drainEvents implicitly in MockRM methods to reduce test failures. Contributed by sandflee.
2016-11-16 15:14:00 +05:30
Arun Suresh
3219b7b4ac
YARN-4597. Introduce ContainerScheduler and a SCHEDULED state to NodeManager container lifecycle. (asuresh)
2016-11-15 07:56:25 -08:00
Rohith Sharma K S
b7070f3308
YARN-5874. RM -format-state-store and -remove-application-from-state-store commands fail with NPE. Contributed by Varun Saxena.
2016-11-15 10:58:25 +05:30
Rohith Sharma K S
296c5de0cf
YARN-5873. RM crashes with NPE if generic application history is enabled. Contributed by Varun Saxena.
2016-11-15 10:56:28 +05:30
Karthik Kambatla
79448d4ab1
YARN-4033. In FairScheduler, parent queues should also display queue status. (Siqi Li via kasha)
2016-11-13 21:38:36 -08:00
Jian He
fad9609d13
YARN-5825. ProportionalPreemptionalPolicy should use readLock over LeafQueue instead of synchronized block. Contributed by Sunil G
2016-11-11 15:16:21 -08:00
Wangda Tan
ede1a473f5
YARN-5552. Add Builder methods for common yarn API records. (Tao Jie via wangda)
2016-11-11 13:34:56 -08:00
Naganarasimha
503e73e849
YARN-5545. Fix issues related to Max App in capacity scheduler. Contributed by Bibin A Chundatt
2016-11-11 20:48:31 +05:30
Eric Payne
93eeb13164
YARN-4218. Metric for resource*time that was preempted. Contributed by Chang Li.
2016-11-10 22:35:12 +00:00
Karthik Kambatla
86ac1ad9fd
YARN-5453. FairScheduler#update may skip update demand resource of child queue/app if current demand reached maxResource. (sandflee via kasha)
2016-11-09 23:44:02 -08:00
Jian He
bcc15c6290
YARN-5611. Provide an API to update lifetime of an application. Contributed by Rohith Sharma K S
2016-11-09 16:08:05 -08:00
Naganarasimha
edbee9e609
YARN-4498. Application level node labels stats to be available in REST (addendum patch). Contributed by Bibin A Chundatt.
2016-11-10 05:00:05 +05:30
Daniel Templeton
59ee8b7a88
YARN-4329. [YARN-5437] Allow fetching exact reason as to why a submitted app
...
is in ACCEPTED state in Fair Scheduler (Contributed by Yufei Gu)
2016-11-09 13:11:37 -08:00
Arun Suresh
283fa33feb
YARN-5823. Update NMTokens in case of requests with only opportunistic containers. (Konstantinos Karanasos via asuresh)
2016-11-09 00:11:25 -08:00
Jason Lowe
3f93ac0733
YARN-5356. NodeManager should communicate physical resource capability to ResourceManager. Contributed by Inigo Goiri
2016-11-08 22:01:26 +00:00
Jian He
de3b4aac56
YARN-5716. Add global scheduler interface definition and update CapacityScheduler to use it. Contributed by Wangda Tan
2016-11-07 10:14:39 -08:00
Wangda Tan
c00b5d1e51
YARN-5804. New UI2 is not able to launch with jetty 9 upgrade post HADOOP-10075. (Sunil G via wangda)
...
(cherry picked from commit ddebaa6b7ae1031899139d153d57df5387d60542)
2016-11-06 13:13:31 -08:00
Wangda Tan
3de0da2a76
YARN-5698. [YARN-3368] Launch new YARN UI under hadoop web app port. (Sunil G via wangda)
2016-11-06 13:13:31 -08:00
Wangda Tan
817d5f4c96
YARN-4849. Addendum patch to fix javadocs. (Sunil G via wangda)
2016-11-06 13:13:31 -08:00
Wangda Tan
c85cc3b56e
YARN-4515. [YARN-3368] Support hosting web UI framework inside YARN RM. (Sunil G via wangda)
...
YARN-5000. [YARN-3368] App attempt page is not loading when timeline server is not started (Sunil G via wangda)
YARN-5038. [YARN-3368] Application and Container pages shows wrong values when RM is stopped. (Sunil G via wangda)
2016-11-06 13:13:31 -08:00
Jason Lowe
6bb741ff0e
YARN-5837. NPE when getting node status of a decommissioned node after an RM restart. Contributed by Robert Kanter
2016-11-04 22:20:21 +00:00
Arun Suresh
0aafc122d4
YARN-2995. Enhance UI to show cluster resource utilization of various container Execution types. (Konstantinos Karanasos via asuresh)
2016-11-04 07:31:54 -07:00
Sunil
19b3779ae7
YARN-5802. updateApplicationPriority api in scheduler should ensure to re-insert app to correct ordering policy. Contributred by Bibin A Chundatt
2016-11-04 16:07:28 +05:30
Jason Lowe
352cbaa7a5
YARN-4862. Handle duplicate completed containers in RMNodeImpl. Contributed by Rohith Sharma K S
2016-11-03 13:54:31 +00:00
Varun Saxena
7d2d8d25ba
YARN-5788. Apps not activiated and AM limit resource in UI and REST not updated after -replaceLabelsOnNode (Bibin A Chundatt via Varun Saxena)
2016-11-01 15:32:04 +05:30
Karthik Kambatla
733aa99313
YARN-4998. Minor cleanup to UGI use in AdminService. (Daniel Templeton via kasha)
2016-10-31 16:26:01 -07:00
Wangda Tan
90dd3a8148
YARN-2009. CapacityScheduler: Add intra-queue preemption for app priority support. (Sunil G via wangda)
2016-10-31 15:18:31 -07:00
Daniel Templeton
2528bea67f
YARN-4396. Log the trace information on FSAppAttempt#assignContainer (Contributed by Yiqun Li via Daniel Templeton)
2016-10-31 13:34:25 -07:00
Naganarasimha
e0bebbbcdd
YARN-4498. Application level node labels stats to be available in REST. Contributed by Bibin A Chundatt
2016-10-31 04:38:20 +05:30
Arun Suresh
aa3cab1eb2
YARN-5799. Fix Opportunistic Allocation to set the correct value of Node Http Address. (asuresh)
2016-10-29 02:03:57 -07:00
Varun Saxena
1c8ab41e8b
YARN-5773. RM recovery too slow due to LeafQueue#activateApplications (Bibin A Chundatt via Varun Saxena)
2016-10-29 13:47:39 +05:30
Jason Lowe
1eae719bce
YARN-4963. capacity scheduler: Make number of OFF_SWITCH assignments per heartbeat configurable. Contributed by Nathan Roberts
2016-10-28 17:30:15 +00:00
Karthik Kambatla
4df8ed63ed
YARN-4743. FairSharePolicy breaks TimSort assumption. (Zephyr Guo and Yufei Gu via kasha)
2016-10-27 17:45:48 -07:00
Robert Kanter
5877f20f9c
HADOOP-10075. Update jetty dependency to version 9 (rkanter)
2016-10-27 16:09:00 -07:00
Carlo Curino
892a8348fc
YARN-3432. Cluster metrics have wrong Total Memory when there is reserved memory on CS. (Brahma Reddy Battula via curino)
2016-10-27 15:15:49 -07:00
Daniel Templeton
b98fc8249f
YARN-4710. Reduce logging application reserved debug info in FSAppAttempt#assignContainer (Contributed by Yiqun Lin via Daniel Templeton)
2016-10-27 14:42:19 -07:00
Naganarasimha
b3c15e4ef7
YARN-5420. Delete org.apache.hadoop.yarn.server.resourcemanager.resource.Priority as its not necessary. Contributed by Sunil G.
2016-10-27 18:22:07 +05:30
Karthik Kambatla
a71fc81655
YARN-5754. Null check missing for earliest in FifoPolicy. (Yufei Gu via kasha)
2016-10-24 22:19:23 -07:00
Karthik Kambatla
754cb4e30f
YARN-5047. Refactor nodeUpdate across schedulers. (Ray Chiang via kasha)
2016-10-20 21:17:48 -07:00
Karthik Kambatla
a064865abf
YARN-4911. Bad placement policy in FairScheduler causes the RM to crash
2016-10-20 20:57:04 -07:00
Xuan
b733a6f862
YARN-5718. TimelineClient (and other places in YARN) shouldn't over-write HDFS client retry settings which could cause unexpected behavior. Contributed by Junping Du.
2016-10-18 11:06:47 -07:00
Varun Saxena
b154d3edce
YARN-5743. [Atsv2] Publish queue name and RMAppMetrics to ATS (Rohith Sharma K S via Varun Saxena)
2016-10-18 23:32:52 +05:30
Sangjin Lee
1f304b0c7f
YARN-5699. Retrospect yarn entity fields which are publishing in events info fields. Contributed by Rohith Sharma K S.
2016-10-15 13:54:40 -07:00
Karthik Kambatla
6476934ae5
YARN-5677. RM should transition to standby when connection is lost for an extended period. (Daniel Templeton via kasha)
2016-10-11 22:07:10 -07:00
Karthik Kambatla
6378845f9e
YARN-4464. Lower the default max applications stored in the RM and store. (Daniel Templeton via kasha)
2016-10-11 21:42:08 -07:00
Rohith Sharma K S
4d2f380d78
YARN-5101. YARN_APPLICATION_UPDATED event is parsed in ApplicationHistoryManagerOnTimelineStore#convertToApplicationReport with reversed order. Contributed by Sunil G.
2016-10-06 20:42:36 +05:30
Jian He
31f8da22d0
YARN-3139. Improve locks in AbstractYarnScheduler/CapacityScheduler/FairScheduler. Contributed by Wangda Tan
2016-10-04 17:23:13 -07:00
Karthik Kambatla
736d33cddd
YARN-4767. Network issues can cause persistent RM UI outage. (Daniel Templeton via kasha)
2016-10-03 14:35:57 -07:00
Karthik Kambatla
0da54e8848
YARN-5672. FairScheduler: Wrong queue name in log when adding application. (Wilfred Spiegelenburg via kasha)
2016-10-03 06:03:46 -07:00
Naganarasimha
6e130c308c
YARN-4855. Should check if node exists when replace nodelabels. Contributeed by Tao Jie
2016-10-03 02:02:26 -04:00
Karthik Kambatla
82857037b6
YARN-5678. Log demand as demand in FSLeafQueue and FSParentQueue. (Yufei Gu via kasha)
2016-10-02 22:09:43 -07:00
Subru Krishnan
3a3697deab
YARN-5384. Expose priority in ReservationSystem submission APIs. (Sean Po via Subru).
2016-09-30 19:41:43 -07:00
Arun Suresh
10be45986c
YARN-5486. Update OpportunisticContainerAllocatorAMService::allocate method to handle OPPORTUNISTIC container requests. (Konstantinos Karanasos via asuresh)
2016-09-29 15:11:41 -07:00
Jian He
2ae5a3a5bf
YARN-4205. Add a service for monitoring application life time out. Contributed by Rohith Sharma K S
2016-09-29 22:00:31 +08:00
Robert Kanter
bcb2528a51
YARN-5400. Light cleanup in ZKRMStateStore (templedf via rkanter)
2016-09-28 14:56:41 -07:00
Varun Saxena
9b0fd01d2e
YARN-5599. Publish AM launch command to ATS (Rohith Sharma K S via Varun Saxena)
2016-09-28 16:10:10 +05:30
Wangda Tan
1831be8e73
YARN-3142. Improve locks in AppSchedulingInfo. (Varun Saxena via wangda)
2016-09-27 11:54:55 -07:00
Naganarasimha
ebf528c443
YARN-5660. Wrong audit constants are used in Get/Put of priority in RMWebService. Contributed by Rohith Sharma K S
2016-09-27 16:37:59 +05:30
Akira Ajisaka
14a696f369
YARN-5663. Small refactor in ZKRMStateStore. Contributed by Oleksii Dymytrov.
2016-09-26 15:00:01 +09:00
Naganarasimha
d0372dc613
YARN-3692. Allow REST API to set a user generated message when killing an application. Contributed by Rohith Sharma K S
2016-09-23 06:30:49 +05:30
Arun Suresh
9f03b403ec
YARN-5656. Fix ReservationACLsTestBase. (Sean Po via asuresh)
2016-09-20 12:27:17 -07:00
Wangda Tan
e45307c9a0
Addendum patch for fix javadocs failure which is caused by YARN-3141. (wangda)
2016-09-20 11:21:01 -07:00
Jian He
2b66d9ec5b
YARN-3140. Improve locks in AbstractCSQueue/LeafQueue/ParentQueue. Contributed by Wangda Tan
2016-09-20 15:03:31 +08:00
Jason Lowe
7558dbbb48
YARN-5540. Scheduler spends too much time looking at empty priorities. Contributed by Jason Lowe
2016-09-19 20:31:35 +00:00
Jian He
b8a30f2f17
YARN-3141. Improve locks in SchedulerApplicationAttempt/FSAppAttempt/FiCaSchedulerApp. Contributed by Wangda Tan
2016-09-19 17:08:01 +08:00
Naganarasimha
4174b9756c
YARN-5642. Typos in 9 log messages. Contributed by Mehran Hassani
2016-09-17 10:35:39 +05:30
Karthik Kambatla
c0e492e50f
YARN-5616. Clean up WeightAdjuster. (Yufei Gu via kasha)
2016-09-06 22:40:20 -07:00
Karthik Kambatla
f6ea9be547
YARN-5264. Store all queue-specific information in FSQueue. (Yufei Gu via kasha)
2016-09-02 14:56:29 -07:00
Ray Chiang
378f624a39
YARN-5549. AMLauncher#createAMContainerLaunchContext() should not log the command to be launched indiscriminately. (Daniel Templeton via rchiang)
2016-09-02 11:14:35 -07:00
Varun Vasudev
05f5c0f631
YARN-5555. Scheduler UI: "% of Queue" is inaccurate if leaf queue is hierarchically nested. Contributed by Eric Payne.
2016-09-02 16:02:01 +05:30
Karthik Kambatla
74f4bae455
YARN-5566. Client-side NM graceful decom is not triggered when jobs finish. (Robert Kanter via kasha)
2016-09-01 14:44:01 -07:00
Arun Suresh
d6d9cff21b
YARN-5221. Expose UpdateResourceRequest API to allow AM to request for change in container properties. (asuresh)
2016-08-30 15:52:29 -07:00
Subru Krishnan
b930dc3ec0
YARN-5327. API changes required to support recurring reservations in the YARN ReservationSystem. (Sangeetha Abdu Jyothi via Subru).
2016-08-26 16:58:47 -07:00
Naganarasimha
46e02ab719
YARN-3940. Application moveToQueue should check NodeLabel permission. Contributed by Bibin A Chundatt
2016-08-26 20:19:11 +05:30
Naganarasimha
27c3b86252
YARN-5564. Fix typo in RM_SCHEDULER_RESERVATION_THRESHOLD_INCREMENT_MULTIPLE. Contributed by Ray Chiang
2016-08-26 08:47:21 +05:30
Wangda Tan
444b2ea7af
YARN-3388. Allocation in LeafQueue could get stuck because DRF calculator isn't well supported when computing user-limit. (Nathan Roberts via wangda)
2016-08-19 16:28:32 -07:00
Varun Saxena
091dd19e86
YARN-5533. JMX AM Used metrics for queue wrong when app submited to nodelabel partition (Bibin A Chundatt via Varun Saxena)
2016-08-19 17:30:17 +05:30
Varun Saxena
8aed374182
Revert "YARN-5533. JMX AM Used metrics for queue wrong when app submited to nodelabel partition (Bibin A Chundatt via Varun Saxena)"
...
This reverts commit 59557e85a4
.
2016-08-19 16:14:16 +05:30
Varun Saxena
59557e85a4
YARN-5533. JMX AM Used metrics for queue wrong when app submited to nodelabel partition (Bibin A Chundatt via Varun Saxena)
2016-08-19 15:01:48 +05:30
Junping Du
0da69c324d
YARN-4676. Automatic and Asynchronous Decommissioning Nodes Status Tracking. Contributed by Diniel Zhi.
...
(cherry picked from commit d464483bf7f0b3e3be3ba32cd6c3eee546747ab5)
Conflicts:
hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/main/java/org/apache/hadoop/yarn/server/resourcemanager/ResourceTrackerService.java
hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/main/java/org/apache/hadoop/yarn/server/resourcemanager/rmnode/RMNode.java
hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/main/java/org/apache/hadoop/yarn/server/resourcemanager/rmnode/RMNodeImpl.java
2016-08-18 07:27:23 -07:00
Karthik Kambatla
20f0eb871c
YARN-4702. FairScheduler: Allow setting maxResources for ad hoc queues. (Daniel Templeton via kasha)
2016-08-17 17:40:20 -07:00
Varun Saxena
23c6e3c4e4
YARN-5476. Non existent application reported as ACCEPTED by YarnClientImpl (Junping Du via Varun Saxena)
2016-08-12 20:37:58 +05:30
Naganarasimha
874577a67d
YARN-4833. For Queue AccessControlException client retries multiple times on both RM. Contributed by Bibin A Chundatt
2016-08-12 01:09:41 +05:30
Naganarasimha
89c0bff719
YARN-5495. Remove import wildcard in CapacityScheduler. Contributed by Ray Chiang.
2016-08-11 00:35:19 +05:30
Jason Lowe
5c95bb315b
YARN-5382. RM does not audit log kill request for active applications. Contributed by Vrushali C
2016-08-10 18:25:54 +00:00
Jason Lowe
e0b570dffb
YARN-5483. Optimize RMAppAttempt#pullJustFinishedContainers. Contributed by sandflee
2016-08-10 18:04:18 +00:00
Arun Suresh
82c9e06101
YARN-5457. Refactor DistributedScheduling framework to pull out common functionality. (asuresh)
2016-08-09 00:42:29 -07:00
Naganarasimha
d81b8163b4
YARN-4624. NPE in PartitionQueueCapacitiesInfo while accessing Schduler UI. Contributed by Brahma Reddy Battula
2016-08-06 01:13:36 +05:30
Wangda Tan
3f100d76ff
YARN-4888. Changes in scheduler to identify resource-requests explicitly by allocation-id. (Subru Krishnan via wangda)
2016-08-05 10:43:35 -07:00
Wangda Tan
e0d131f055
YARN-4091. Add REST API to retrieve scheduler activity. (Chen Ge and Sunil G via wangda)
2016-08-05 10:27:34 -07:00
Rohith Sharma K S
d9a354c2f3
YARN-5333. Some recovered apps are put into default queue when RM HA. Contributed by Jun Gong.
2016-08-05 21:35:49 +05:30
Jason Lowe
4d92aefd35
YARN-4280. CapacityScheduler reservations may not prevent indefinite postponement on a busy cluster. Contributed by Kuhu Shukla
2016-08-03 18:53:14 +00:00
Arun Suresh
e5766b1dbe
YARN-5113. Refactoring and other clean-up for distributed scheduling. (Konstantinos Karanasos via asuresh)
2016-07-31 11:48:25 -07:00
Subru Krishnan
4e756d7271
YARN-5203.Return ResourceRequest JAXB object in ResourceManager Cluster Applications REST API. Contributed by Ellen Hui.
2016-07-28 16:03:24 -07:00
Wangda Tan
d62e121ffc
YARN-5195. RM intermittently crashed with NPE while handling APP_ATTEMPT_REMOVED event when async-scheduling enabled in CapacityScheduler. (sandflee via wangda)
2016-07-26 21:22:59 -07:00
Wangda Tan
49969b16cd
YARN-5342. Improve non-exclusive node partition resource allocation in Capacity Scheduler. (Sunil G via wangda)
2016-07-26 18:14:09 -07:00
Arun Suresh
5aace38b74
YARN-5392. Replace use of Priority in the Scheduling infrastructure with an opaque ShedulerRequestKey. (asuresh and subru)
2016-07-26 14:54:03 -07:00
Chris Douglas
d383bfdcd4
YARN-5164. Use plan RLE to improve CapacityOverTimePolicy efficiency
2016-07-25 16:37:50 -07:00
Akira Ajisaka
c63afdbe14
YARN-4883. Make consistent operation name in AdminService. Contributed by Kai Sasaki.
2016-07-20 16:51:01 -07:00
Arun Suresh
cda0a280dd
YARN-5181. ClusterNodeTracker: add method to get list of nodes matching a specific resourceName. (kasha via asuresh)
2016-07-19 10:43:37 -07:00
Arun Suresh
5f2d33a551
Revert "YARN=5181. ClusterNodeTracker: add method to get list of nodes matching a specific resourceName. (kasha via asuresh)"
...
This reverts commit e905a42a2c
.
2016-07-19 10:43:19 -07:00
Ray Chiang
f5f1c81e7d
YARN-5272. Handle queue names consistently in FairScheduler. (Wilfred Spiegelenburg via rchiang)
2016-07-15 14:38:50 -07:00
Arun Suresh
e905a42a2c
YARN=5181. ClusterNodeTracker: add method to get list of nodes matching a specific resourceName. (kasha via asuresh)
2016-07-15 14:35:12 -07:00
Wangda Tan
24db9167f1
YARN-4484. Available Resource calculation for a queue is not correct when used with labels. (Sunil G via wangda)
2016-07-15 11:40:12 -07:00
Varun Saxena
06c56ff79b
YARN-5353. ResourceManager can leak delegation tokens when they are shared across apps. (Jason Lowe via Varun Saxena).
2016-07-13 07:55:34 +05:30
Jian He
819224dcf9
YARN-5270. Solve miscellaneous issues caused by YARN-4844. Contributed by Wangda Tan
2016-07-11 22:36:20 -07:00
Sangjin Lee
6cf6ab7b78
Made a number of miscellaneous fixes for javac, javadoc, and checstyle warnings.
2016-07-10 08:46:05 -07:00
Vrushali
6d943038f6
Cleanup changes during rebase with trunk (Vrushali C)
2016-07-10 08:46:04 -07:00
Varun Saxena
1ff6833bba
YARN-5243. fix several rebase and other miscellaneous issues before merge. (Sangjin Lee via Varun Saxena)
2016-07-10 08:46:03 -07:00
Sangjin Lee
702236129b
YARN-5095. flow activities and flow runs are populated with wrong timestamp when RM restarts w/ recovery enabled (Varun Saxena via sjlee)
2016-07-10 08:46:00 -07:00
Sangjin Lee
a1b6d7456f
YARN-5018. Online aggregation logic should not run immediately after collectors got started (Li Lu via sjlee)
2016-07-10 08:45:59 -07:00
Li Lu
c2055a97d5
YARN-3150. Documenting the timeline service v2. (Sangjin Lee and Vrushali C via gtcarrera9)
2016-07-10 08:45:57 -07:00
Varun Saxena
a3cf40e532
YARN-3461. Consolidate flow name/version/run defaults. (Sangjin Lee via Varun Saxena)
2016-07-10 08:45:55 -07:00
Sangjin Lee
960af7d471
YARN-4409. Fix javadoc and checkstyle issues in timelineservice code (Varun Saxena via sjlee)
2016-07-10 08:45:53 -07:00
Naganarasimha
06f0b50a28
YARN-4644. TestRMRestart fails and findbugs issue in YARN-2928 branch (Varun Saxena via Naganarasimha G R)
2016-07-10 08:45:52 -07:00
Li Lu
34f02f07d5
Rebase to latest trunk
2016-07-10 08:45:51 -07:00
Varun Saxena
829cceebc0
YARN-3586. RM to only get back addresses of Collectors that NM needs to know.
...
(Junping Du via Varun Saxena).
2016-07-10 08:45:50 -07:00
Varun Saxena
c4d7bbda5c
YARN-4460. [Bug fix] RM fails to start when SMP is enabled. (Li Lu via Varun Saxena)
2016-07-10 08:45:49 -07:00
Xuan
2e2dbf59d1
YARN-4392. ApplicationCreatedEvent event time resets after RM
...
restart/failover. Contributed by Naganarasimha G R and Xuan Gong
(cherry picked from commit 4546c7582b
)
Conflicts:
hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/main/java/org/apache/hadoop/yarn/server/resourcemanager/rmapp/RMAppImpl.java
hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/test/java/org/apache/hadoop/yarn/server/resourcemanager/rmapp/TestRMAppTransitions.java
2016-07-10 08:45:49 -07:00
Li Lu
89e5c44f9e
YARN-4356. Ensure the timeline service v.2 is disabled cleanly and has no
...
impact when it's turned off. Contributed by Sangjin Lee.
2016-07-10 08:45:48 -07:00
Sangjin Lee
10ec5586fb
YARN-4129. Refactor the SystemMetricPublisher in RM to better support newer events (Naganarasimha G R via sjlee)
2016-07-10 08:45:46 -07:00
Sangjin Lee
8d9476ec5f
YARN-4058. Miscellaneous issues in NodeManager project (Naganarasimha G R via sjlee)
2016-07-10 08:45:43 -07:00
Sangjin Lee
22e7ae5771
YARN-3792. Test case failures in TestDistributedShell and some issue fixes related to ATSV2 (Naganarasimha G R via sjlee)
...
(cherry picked from commit 84f37f1c7eefec6d139cbf091c50d6c06f734323)
2016-07-10 08:45:38 -07:00
Zhijie Shen
f3c661e8dd
YARN-3044. Made RM write app, attempt and optional container lifecycle events to timeline service v2. Contributed by Naganarasimha G R.
2016-07-10 08:45:37 -07:00
Sangjin Lee
dc1f306fdc
YARN-3562. unit tests failures and issues found from findbug from earlier ATS checkins (Naganarasimha G R via sjlee)
2016-07-10 08:45:35 -07:00
Sangjin Lee
11e8905d8d
YARN-3390. Reuse TimelineCollectorManager for RM (Zhijie Shen via sjlee)
...
(cherry picked from commit 58221188811e0f61d842dac89e1f4ad4fd8aa182)
2016-07-10 08:45:33 -07:00
Junping Du
47f35a30bb
YARN-3391. Clearly define flow ID/ flow run / flow version in API and storage. Contributed by Zhijie Shen
...
(cherry picked from commit 68c6232f8423e55b4d152ef3d1d66aeb2d6a555e)
2016-07-10 08:45:33 -07:00
Zhijie Shen
5712b8f9fd
YARN-3334. NM uses timeline client to publish container metrics to new timeline service. Contributed by Junping Du.
2016-07-10 08:45:33 -07:00
Junping Du
d67c9bdb4d
YARN-3040. Make putEntities operation be aware of the app's context. Contributed by Zhijie Shen
2016-07-10 08:45:32 -07:00
Junping Du
5e3d9a477b
YARN-3034. Implement RM starting its timeline collector. Contributed by Naganarasimha G R
2016-07-10 08:45:32 -07:00
Junping Du
2188a07e5b
YARN-3333. Rename TimelineAggregator etc. to TimelineCollector. Contributed by Sangjin Lee
2016-07-10 08:45:31 -07:00
Zhijie Shen
9b56364080
YARN-3039. Implemented the app-level timeline aggregator discovery service. Contributed by Junping Du.
2016-07-10 08:45:31 -07:00
Varun Saxena
5252562edf
YARN-5297. Avoid printing a stack trace when recovering an app after the RM restarts. (Junping Du via Varun Saxena).
2016-07-09 00:09:25 +05:30
Junping Du
30ee57ceb1
YARN-4939. The decommissioning Node should keep alive during NM restart. Contributed by sandflee.
2016-07-08 04:14:53 -07:00
Wangda Tan
04f6ebb66a
YARN-5294. Pass remote ip address down to YarnAuthorizationProvider. (Jian He via wangda)
2016-07-06 10:36:48 -07:00
Varun Saxena
8e672e3c71
YARN-5286. Add RPC port info in RM web service's response when getting app status. (Jun Gong via Varun Saxena).
2016-07-05 22:56:07 +05:30
Rohith Sharma K S
26b5e6116f
YARN-5262. Optimize sending RMNodeFinishedContainersPulledByAMEvent for every AM heartbeat.
2016-06-29 10:08:30 +05:30
Arun Suresh
99e5dd68d0
YARN-5171. Extend DistributedSchedulerProtocol to notify RM of containers allocated by the Node. (Inigo Goiri via asuresh)
2016-06-22 19:04:54 -07:00
Junping Du
d0162f2040
YARN-5246. NMWebAppFilter web redirects drop query parameters. Contributed by Varun Vasudev.
2016-06-19 17:44:54 -07:00
Karthik Kambatla
20f2799938
YARN-5077. Fix FSLeafQueue#getFairShare() for queues with zero fairshare. (Yufei Gu via kasha)
2016-06-17 22:24:42 -07:00
Wangda Tan
c77a1095dc
YARN-1942. Deprecate toString/fromString methods from ConverterUtils and move them to records classes like ContainerId/ApplicationId, etc. (wangda)
2016-06-14 15:06:38 -07:00
Arun Suresh
5279af7cd4
YARN-5082. Limit ContainerId increase in fair scheduler if the num of node app reserved reached the limit (sandflee via asuresh)
2016-06-10 22:33:42 -07:00
Rohith Sharma K S
e0f4620cc7
YARN-5197. RM leaks containers if running container disappears from node update. Contributed by Jason Lowe.
2016-06-11 10:22:27 +05:30
Wangda Tan
620325e816
YARN-4837. User facing aspects of 'AM blacklisting' feature need fixing. (vinodkv via wangda)
2016-06-07 15:06:42 -07:00
Arun Suresh
3a154f75ed
YARN-4525. Fix bug in RLESparseResourceAllocation.getRangeOverlapping(). (Ishai Menache and Carlo Curino via asuresh)
2016-06-06 21:18:32 -07:00
Arun Suresh
7a9b7372a1
YARN-5185. StageAllocaterGreedyRLE: Fix NPE in corner case. (Carlo Curino via asuresh)
2016-06-06 21:06:52 -07:00
Ming Ma
4a1cedc010
MAPREDUCE-5044. Have AM trigger jstack on task attempts that timeout before killing them. (Eric Payne and Gera Shegalov via mingma)
2016-06-06 14:30:51 -07:00
Arun Suresh
db54670e83
YARN-5165. Fix NoOvercommitPolicy to take advantage of RLE representation of plan. (Carlo Curino via asuresh)
2016-06-03 14:49:32 -07:00
Vinod Kumar Vavilapalli
f10ebc67f5
YARN-5098. Fixed ResourceManager's DelegationTokenRenewer to replace expiring system-tokens if RM stops and only restarts after a long time. Contributed by Jian He.
2016-06-03 13:00:07 -07:00
Jian He
097baaaeba
YARN-1815. Work preserving recovery of Unmanged AMs. Contributed by Subru Krishnan
2016-06-03 10:49:30 -07:00
Varun Vasudev
42f90ab885
YARN-4844. Add getMemorySize/getVirtualCoresSize to o.a.h.y.api.records.Resource. Contributed by Wangda Tan.
2016-05-29 21:24:16 +05:30
Karthik Kambatla
04ded558b0
YARN-5035. FairScheduler: Adjust maxAssign dynamically when assignMultiple is turned on. (kasha)
2016-05-26 14:41:07 -07:00
Karthik Kambatla
4f513a4a8e
YARN-4866. FairScheduler: AMs can consume all vcores leading to a livelock when using FAIR policy. (Yufei Gu via kasha)
2016-05-25 22:13:27 -07:00
Carlo Curino
013532a95e
YARN-4957. Add getNewReservation in ApplicationClientProtocol (Sean Po via curino)
2016-05-25 16:55:49 -07:00
Karthik Kambatla
f979d779e1
YARN-4878. Expose scheduling policy and max running apps over JMX for Yarn queues. (Yufei Gu via kasha)
2016-05-24 10:54:11 -07:00
Naganarasimha
b4078bd17b
YARN-3971. Skip RMNodeLabelsManager#checkRemoveFromClusterNodeLabelsOfQueue on nodelabel recovery. (addendum patch). Contributed by Bibin A chundatt
2016-05-24 08:06:53 +08:00
Karthik Kambatla
6d043aa4cf
YARN-4979. FSAppAttempt demand calculation considers demands at multiple locality levels different. (Zhihai Xu via kasha)
2016-05-23 14:29:28 -07:00
Jason Lowe
ac954486c5
YARN-5055. max apps per user can be larger than max per queue. Contributed by Eric Badger
2016-05-23 15:54:42 +00:00
Junping Du
22fcd819f0
YARN-5076. YARN web interfaces lack XFS protection. Contributed by Jonathan Maron.
...
(cherry picked from commit 2703ec68712279494d67b0d76b7ac10e7a1628be)
2016-05-19 14:15:21 -07:00
Jian He
feb90ffcca
YARN-4002. Make ResourceTrackerService#nodeHeartbeat more concurrent. Contributed by Rohith Sharma K S & Zhiguo Hong
2016-05-19 13:01:36 -07:00
Arun Suresh
1597630681
YARN-5110. Fix OpportunisticContainerAllocator to insert complete HostAddress in issued ContainerTokenIds. (Konstantinos Karanasos via asuresh)
2016-05-18 18:46:00 -07:00
Jian He
fa3bc3405d
YARN-4832. NM side resource value should get updated if change applied in RM side. Contributed by Junping Du
2016-05-17 12:52:19 -07:00
Arun Suresh
ccc93e7812
YARN-5075. Fix findbugs warnings in hadoop-yarn-common module. (asuresh)
2016-05-16 23:22:01 -07:00
Arun Suresh
f45bc5a83e
YARN-4738. Notify the RM about the status of OPPORTUNISTIC containers (Konstantinos Karanasos via asuresh)
2016-05-15 17:54:34 -07:00
Arun Suresh
f0ac18d001
YARN-2888. Corrective mechanisms for rebalancing NM container queues. (asuresh)
2016-05-13 13:38:36 -07:00
Jason Lowe
013000fbc2
YARN-5053. More informative diagnostics when applications killed by a user. Contributed by Eric Badger
2016-05-12 20:28:36 +00:00
Rohith Sharma K S
b7ac85259c
YARN-5068. Expose scheduler queue to application master. (Harish Jaiprakash via rohithsharmaks)
2016-05-12 15:17:49 +05:30
Karthik Kambatla
4b4e4c6ba8
YARN-4995. FairScheduler: Display per-queue demand on the scheduler page. (xupeng via kasha)
2016-05-11 17:36:21 -07:00
Junping Du
39f2bac38b
YARN-5029. RM needs to send update event with YarnApplicationState as Running to ATS/AHS. Contributed by Xuan Gong.
2016-05-11 09:28:35 -07:00
Naganarasimha
2750fb900f
YARN-4926. Change nodelabel rest API invalid reponse status to 400. Contributed by Bibin A Chundatt
2016-05-08 22:49:25 +05:30
Yongjun Zhang
47c41e7ac7
YARN-5048. DelegationTokenRenewer#skipTokenRenewal may throw NPE (Jian He via Yongjun Zhang)
2016-05-06 21:50:09 -07:00
Jason Lowe
b2ed6ae731
YARN-4747. AHS error 500 due to NPE when container start event is missing. Contributed by Varun Saxena
2016-05-06 22:59:39 +00:00
Wangda Tan
23248f63aa
getApplicationReport call may raise NPE for removed queues. (Jian He via wangda)
2016-05-06 15:30:45 -07:00
Jian He
bb62e05925
YARN-4390. Do surgical preemption based on reserved container in CapacityScheduler. Contributed by Wangda Tan
2016-05-05 12:56:21 -07:00
Jason Lowe
d0da13229c
YARN-4311. Removing nodes from include and exclude lists will not remove them from decommissioned nodes list. Contributed by Kuhu Shukla
2016-05-05 14:07:54 +00:00
Jason Lowe
ed54f5f1ff
YARN-5003. Add container resource to RM audit log. Contributed by Nathan Roberts
2016-05-03 20:03:41 +00:00
Jian He
dd80042c42
YARN-5008. LeveldbRMStateStore database can grow substantially leading to long recovery times. Contributed by Jason Lowe
2016-04-28 21:27:25 -07:00
Jian He
4beff01354
YARN-4983. JVM and UGI metrics disappear after RM transitioned to standby mode
2016-04-26 21:00:17 -07:00
Arun Suresh
341888a0aa
YARN-4412. Create ClusterMonitor to compute ordered list of preferred NMs for OPPORTUNITIC containers. (asuresh)
2016-04-26 20:12:12 -07:00
Karthik Kambatla
4b1dcbbe0c
YARN-1297. FairScheduler: Move some logs to debug and check if debug logging is enabled
2016-04-26 05:10:09 -07:00
Arun Suresh
c282a08f38
YARN-2885. Create AMRMProxy request interceptor and ContainerAllocator to distribute OPPORTUNISTIC containers to appropriate Nodes (asuresh)
...
(cherry picked from commit 2bf025278a318b0452fdc9ece4427b4c42124e39)
2016-04-24 22:38:33 -07:00
Wangda Tan
7cb3a3da96
YARN-4846. Fix random failures for TestCapacitySchedulerPreemption#testPreemptionPolicyShouldRespectAlreadyMarkedKillableContainers. (Bibin A Chundatt via wangda)
2016-04-22 11:40:32 -07:00
Karthik Kambatla
170c4fd4cd
YARN-4784. Fairscheduler: defaultQueueSchedulingPolicy should not accept FIFO. (Yufei Gu via kasha)
2016-04-20 23:58:12 -07:00
Wangda Tan
fdc46bfb37
YARN-4934. Reserved Resource for QueueMetrics needs to be handled correctly in few cases. (Sunil G via wangda)
2016-04-16 22:47:41 -07:00
Jason Lowe
69f3d428d5
YARN-4940. yarn node -list -all failed if RM start with decommissioned node. Contributed by sandflee
2016-04-15 20:36:45 +00:00
Jason Lowe
2a5da97f81
Revert "YARN-4311. Removing nodes from include and exclude lists will not remove them from decommissioned nodes list. Contributed by Kuhu Shukla"
...
This reverts commit 1cbcd4a491
.
2016-04-11 15:51:01 +00:00
Akira Ajisaka
1ff27f9d12
YARN-4630. Remove useless boxing/unboxing code. Contributed by Kousuke Saruta.
2016-04-11 14:55:03 +09:00
Karthik Kambatla
ff95fd547b
YARN-4927. TestRMHA#testTransitionedToActiveRefreshFail fails with FairScheduler. (Bibin A Chundatt via kasha)
2016-04-09 10:31:02 -07:00
Wangda Tan
ec06957941
YARN-3215. Respect labels in CapacityScheduler when computing headroom. (Naganarasimha G R via wangda)
2016-04-08 15:33:04 -07:00
Jian He
9cb0c963d2
YARN-4740. AM may not receive the container complete msg when it restarts. Contributed by Jun Gong
2016-04-08 11:20:35 -07:00
Jian He
93bacda08b
YARN-4769. Add support for CSRF header in the dump capacity scheduler logs and kill app buttons in RM web UI. Contributed by Varun Vasudev
2016-04-06 16:13:47 -07:00
Wangda Tan
21eb428448
YARN-4699. Scheduler UI and REST o/p is not in sync when -replaceLabelsOnNode is used to change label of a node. (Sunil G via wangda)
2016-04-05 16:24:11 -07:00
Jason Lowe
1cbcd4a491
YARN-4311. Removing nodes from include and exclude lists will not remove them from decommissioned nodes list. Contributed by Kuhu Shukla
2016-04-05 13:40:19 +00:00
Rohith Sharma K S
776b549e2a
YARN-4609. RM Nodes list page takes too much time to load. Contributed by Bibin A Chundatt
2016-04-05 14:47:25 +05:30
naganarasimha
5092c94195
YARN-4746. yarn web services should convert parse failures of appId, appAttemptId and containerId to 400. Contributed by Bibin A Chundatt
2016-04-04 16:25:03 +05:30
Rohith Sharma K S
1e6f92977d
YARN-4607. Pagination support for AppAttempt page TotalOutstandingResource Requests table. Contributed by Bibin A Chundatt
2016-04-04 08:09:29 +05:30
Wangda Tan
12b11e2e68
YARN-4634. Scheduler UI/Metrics need to consider cases like non-queue label mappings. (Sunil G via wangda)
2016-03-31 14:35:18 -07:00
Jian He
60e4116bf1
YARN-4822. Refactor existing Preemption Policy of CS for easier adding new approach to select preemption candidates. Contributed by Wangda Tan
2016-03-30 12:43:52 -07:00
Wangda Tan
fc055a3cbe
YARN-4865. Track Reserved resources in ResourceUsage and QueueCapacities. (Sunil G via wangda)
2016-03-29 17:07:55 -07:00
Jian He
524bc3c33a
YARN-998. Keep NM resource updated through dynamic resource config for RM/NM restart. Contributed by Junping Du
2016-03-28 11:12:33 -07:00
Arun Suresh
00bebb7e58
YARN-4823. Refactor the nested reservation id field in listReservation to simple string field. (subru via asuresh)
2016-03-25 15:54:38 -07:00
Arun Suresh
d82e797b65
YARN-4825. Remove redundant code in ClientRMService::listReservations. (subru via asuresh)
2016-03-24 09:59:55 -07:00
Junping Du
19b645c938
YARN-4820. ResourceManager web redirects in HA mode drops query parameters. Contributed by Varun Vasudev.
2016-03-23 19:34:30 -07:00
Junping Du
ca8106d2dd
YARN-4785. inconsistent value type of the type field for LeafQueueInfo in response of RM REST API.
2016-03-17 09:04:41 -07:00
Wangda Tan
ae14e5d07f
YARN-4108. CapacityScheduler: Improve preemption to only kill containers that would satisfy the incoming request. (Wangda Tan)
...
(cherry picked from commit 7e8c9beb41
)
2016-03-16 17:02:33 -07:00
Wangda Tan
fa7a43529d
Revert "CapacityScheduler: Improve preemption to only kill containers that would satisfy the incoming request. (Wangda Tan)"
...
This reverts commit 7e8c9beb41
.
2016-03-16 17:02:10 -07:00
Wangda Tan
7e8c9beb41
CapacityScheduler: Improve preemption to only kill containers that would satisfy the incoming request. (Wangda Tan)
2016-03-16 16:59:59 -07:00
Karthik Kambatla
3ef5500783
YARN-4560. Make scheduler error checking message more user friendly. (Ray Chiang via kasha)
2016-03-15 23:45:01 -07:00
Karthik Kambatla
20d389ce61
YARN-4719. Add a helper library to maintain node state and allows common queries. (kasha)
2016-03-14 14:19:05 -07:00
Wangda Tan
0233d4e0ee
YARN-4465. SchedulerUtils#validateRequest for Label check should happen only when nodelabel enabled. (Bibin A Chundatt via wangda)
2016-03-08 14:27:03 -08:00
Jian He
3c33158d1c
YARN-4764. Application submission fails when submitted queue is not available in scheduler xml. Contributed by Bibin A Chundatt
2016-03-08 13:07:57 -08:00
Varun Vasudev
e51a8c1056
YARN-4737. Add CSRF filter support in YARN. Contributed by Jonathan Maron.
2016-03-07 15:26:44 +05:30
Zhihai Xu
e1ccc9622b
YARN-4761. NMs reconnecting with changed capabilities can lead to wrong cluster resource calculations on fair scheduler. Contributed by Sangjin Lee
2016-03-06 19:46:09 -08:00
Rohith Sharma K S
19ee185907
YARN-4763. RMApps Page crashes with NPE. (Bibin A Chundatt via rohithsharmaks)
2016-03-05 13:02:57 +05:30
Jian He
5c465df904
YARN-4671. There is no need to acquire CS lock when completing a container. Contributed by Meng Ding
2016-03-01 13:14:12 -08:00
Karthik Kambatla
f9692770a5
YARN-4718. Rename variables in SchedulerNode to reduce ambiguity post YARN-1011. (Inigo Goiri via kasha)
2016-02-28 09:35:59 -08:00
Jason Lowe
6b0f813e89
YARN-4723. NodesListManager$UnknownNodeId ClassCastException. Contributed by Kuhu Shukla
2016-02-26 20:24:50 +00:00
Karthik Kambatla
c684f2b007
YARN-4729. SchedulerApplicationAttempt#getTotalRequiredResources can throw an NPE. (kasha)
2016-02-24 18:33:57 -08:00
Sangjin Lee
553b591ba0
YARN-4722. AsyncDispatcher logs redundant event queue sizes (Jason Lowe via sjlee)
2016-02-24 09:29:41 -08:00
Junping Du
9ed17f181d
YARN-3223. Resource update during NM graceful decommission. Contributed by Brook Zhou.
2016-02-23 03:30:26 -08:00
Junping Du
3fab88540f
YARN-4386. refreshNodesGracefully() should send recommission event to active RMNodes only. Contributed by Kuhu Shukla.
2016-02-22 07:04:19 -08:00
Sangjin Lee
7de70680fe
YARN-4690. Skip object allocation in FSAppAttempt#getResourceUsage when possible (Ming Ma via sjlee)
2016-02-17 20:55:21 -08:00
Karthik Kambatla
2ab4c476ed
YARN-4689. FairScheduler: Cleanup preemptContainer to be more readable. (Kai Sasaki via kasha)
2016-02-17 18:16:15 -08:00
Arun Suresh
23f937e3b7
YARN-2575. Create separate ACLs for Reservation create/update/delete/list ops (Sean Po via asuresh)
2016-02-11 10:47:43 -08:00
Jian He
d16b17b4d2
YARN-4138. Roll back container resource allocation after resource increase token expires. Contributed by Meng Ding
2016-02-11 10:06:27 +08:00
=
b706cbc1bc
YARN-4420. Add REST API for List Reservations (Sean Po via curino)
2016-02-10 10:19:26 -08:00
Arun Suresh
5cf5c41a89
YARN-4360. Improve GreedyReservationAgent to support "early" allocations, and performance improvements (curino via asuresh)
2016-02-10 09:11:15 -08:00
Devaraj K
565af873d5
YARN-4667. RM Admin CLI for refreshNodesResources throws NPE when nothing
...
is configured. Contributed by Naganarasimha G R.
2016-02-08 15:01:54 +05:30
Varun Vasudev
308d63f382
YARN-4307. Display blacklisted nodes for AM container in the RM web UI. Contributed by Naganarasimha G R.
2016-02-04 13:32:54 +05:30
Varun Vasudev
1adb64e09b
YARN-4625. Make ApplicationSubmissionContext and ApplicationSubmissionContextInfo more consistent. Contributed by Xuan Gong.
2016-02-03 16:26:28 +05:30
Wangda Tan
9875325d5c
YARN-4340. Add list API to reservation system. (Sean Po via wangda)
2016-02-02 10:17:33 +08:00
Jason Lowe
ed55950164
YARN-3102. Decommisioned Nodes not listed in Web UI. Contributed by Kuhu Shukla
2016-02-01 23:15:26 +00:00
Jason Lowe
772ea7b41b
YARN-4428. Redirect RM page to AHS page when AHS turned on and RM page is not available. Contributed by Chang Li
2016-01-29 21:48:54 +00:00
Jian He
f4a57d4a53
YARN-4617. LeafQueue#pendingOrderingPolicy should always use fixed ordering policy instead of using same as active applications ordering policy. Contributed by Rohith Sharma K S
2016-01-29 12:22:23 -08:00
Devaraj K
a277bdc9ed
YARN-4411. RMAppAttemptImpl#createApplicationAttemptReport throws
...
IllegalArgumentException. Contributed by Bibin A Chundatt and yarntime.
2016-01-29 13:51:37 +05:30
Jian He
7f46636495
YARN-4519. Potential deadlock of CapacityScheduler between decrease container and assign containers. Contributed by Meng Ding
2016-01-28 14:51:00 -08:00
Karthik Kambatla
fb238d7e5d
YARN-4462. FairScheduler: Disallow preemption from a queue. (Tao Jie via kasha)
2016-01-27 12:29:06 -08:00
rohithsharmaks
d6258b33a7
YARN-4497. RM might fail to restart when recovering apps whose attempts are missing. (Jun Gong via rohithsharmaks)
2016-01-22 20:27:38 +05:30
Akira Ajisaka
8f58f742ae
YARN-4605. Spelling mistake in the help message of "yarn applicationattempt" command. Contributed by Weiwei Yang.
2016-01-22 19:43:06 +09:00
Rohith Sharma K S
e30668106d
YARN-4584. RM startup failure when AM attempts greater than max-attempts. (Bibin A Chundatt via rohithsharmaks)
2016-01-22 10:14:46 +05:30
Jason Lowe
468a53b22f
YARN-4610. Reservations continue looking for one app causes other apps to starve. Contributed by Jason Lowe
2016-01-21 18:31:29 +00:00
Karthik Kambatla
4992398aee
YARN-4603. FairScheduler should mention user requested queuename in error message when failed in queue ACL check. (Tao Jie via kasha)
2016-01-21 17:40:59 +01:00
Wangda Tan
5ff5f67332
YARN-4557. Fix improper Queues sorting in PartitionedQueueComparator when accessible-node-labels=*. (Naganarasimha G R via wangda)
2016-01-21 11:21:06 +08:00
Xuan
890a2ebd1a
YARN-4559. Make leader elector and zk store share the same curator
...
client. Contributed by Jian He
2016-01-20 14:48:10 -08:00
Jian He
edc43a9097
YARN-4565. Fix a bug that leads to AM resource limit not hornored when sizeBasedWeight enabled for FairOrderingPolicy. Contributed by Wangda Tan
2016-01-18 21:04:36 -08:00
Wangda Tan
a44ce3f14f
YARN-4502. Fix two AM containers get allocated when AM restart. (Vinod Kumar Vavilapalli via wangda)
2016-01-19 09:30:04 +08:00
Wangda Tan
150f5ae034
Revert "YARN-4502. Fix two AM containers get allocated when AM restart. (Vinod Kumar Vavilapalli via wangda)"
...
This reverts commit 3fe5728563
.
Conflicts:
hadoop-yarn-project/CHANGES.txt
2016-01-19 09:27:36 +08:00
Jian He
f385851141
YARN-4596. SystemMetricPublisher should not swallow error messages from TimelineClient#putEntities. Contributed by Li Lu
2016-01-18 16:58:39 -08:00
Karthik Kambatla
d40859fab1
YARN-4526. Make SystemClock singleton so AppSchedulingInfo could use it. (kasha)
2016-01-18 10:58:14 +01:00
Wangda Tan
3fe5728563
YARN-4502. Fix two AM containers get allocated when AM restart. (Vinod Kumar Vavilapalli via wangda)
...
(cherry picked from commit 805a9ed85e
)
2016-01-18 17:06:05 +08:00
Wangda Tan
adf260a728
Revert "YARN-4502. Fix two AM containers get allocated when AM restart. (Vinod Kumar Vavilapalli via wangda)"
...
This reverts commit 805a9ed85e
.
2016-01-18 16:50:45 +08:00
Wangda Tan
b08ecf5c75
YARN-4304. AM max resource configuration per partition to be displayed/updated correctly in UI and in various partition related metrics. (Sunil G via wangda)
2016-01-18 11:11:32 +08:00
Wangda Tan
805a9ed85e
YARN-4502. Fix two AM containers get allocated when AM restart. (Vinod Kumar Vavilapalli via wangda)
2016-01-18 11:04:25 +08:00
Wangda Tan
9523648d57
YARN-4538. QueueMetrics pending cores and memory metrics wrong. (Bibin A Chundatt via wangda)
2016-01-18 10:57:14 +08:00
rohithsharmaks
f7736f464f
YARN-4389. Allow application to enable or disable am blacklisting. (Sunil G via rohithsharmaks)
2016-01-15 21:38:26 +05:30
Karthik Kambatla
9d04f26d4c
YARN-3446. FairScheduler headroom calculation should exclude nodes in the blacklist. (Zhihai Xu via kasha)
2016-01-14 08:33:23 -08:00
Karthik Kambatla
321072ba81
YARN-4551. Address the duplication between StatusUpdateWhenHealthy and StatusUpdateWhenUnhealthy transitions. (Sunil G via kasha)
2016-01-13 12:09:34 -08:00
Wangda Tan
c0537bcd2c
YARN-4571. Make app id/name available to the yarn authorizer provider for better auditing. (Jian He via wangda)
2016-01-13 13:18:31 +08:00
Akira Ajisaka
da1e3e3c57
YARN-4567. javadoc failing on java 8. Contributed by Steve Loughran. This closes #67 .
2016-01-12 15:12:17 +09:00
Wangda Tan
9e792da014
YARN-4582. Label-related invalid resource request exception should be able to properly handled by application. (Bibin A Chundatt via wangda)
2016-01-12 12:53:31 +08:00
Jian He
5fab4ec31c
Missing file for YARN-4580
2016-01-11 17:00:44 -08:00
Jian He
b8942be888
YARN-4537. Pull out priority comparison from fifocomparator and use compound comparator for FifoOrdering policy. Contributed by Rohith Sharma K S
2016-01-11 16:44:28 -08:00
Jian He
109e528ef5
YARN-4479. Change CS LeafQueue pendingOrderingPolicy to hornor recovered apps. Contributed by Rohith Sharma K S
2016-01-08 15:51:10 -08:00
Xuan
89022f8d4b
YARN-4438. Implement RM leader election with curator. Contributed by Jian He
2016-01-07 14:33:06 -08:00
Junping Du
c1462a67ff
YARN-4546. ResourceManager crash due to scheduling opportunity overflow. Contributed by Jason Lowe.
2016-01-06 05:49:24 -08:00
rohithsharmaks
6da6d87872
YARN-4535. Fix checkstyle error in CapacityScheduler.java (Naganarasimha G R via rohithsharmaks)
2016-01-05 12:09:57 +05:30
Wangda Tan
4e4b3a8465
YARN-4524. Cleanup AppSchedulingInfo. (Karthik Kambatla via wangda)
...
(cherry picked from commit 05fa852d7567b7590d6b53bbf925f8f424736514)
2015-12-30 15:39:34 -08:00
Wangda Tan
8310b2e9ff
YARN-4522. Queue acl can be checked at app submission. (Jian He via wangda)
2015-12-30 15:30:12 -08:00
Junping Du
223ce323bb
YARN-1382. Remove unusableRMNodesConcurrentSet (never used) in NodeListManager to get rid of memory leak. Contributed by Rohith Sharma K S.
2015-12-30 07:52:07 -08:00
Jian He
5273413411
YARN-3480. Remove attempts that are beyond max-attempt limit from state store. Contributed by Jun Gong
2015-12-29 15:58:39 -08:00
Wangda Tan
561abb9fee
YARN-4315. NaN in Queue percentage for cluster apps page. (Bibin A Chundatt via wangda)
2015-12-29 13:28:00 -08:00
Jian He
d0a22bae9b
YARN-4417. Make RM and Timeline-server REST APIs more consistent. Contributed by Wangda Tan
2015-12-28 15:52:45 -08:00
rohithsharmaks
8c180a13c8
YARN-4109. Exception on RM scheduler page loading with labels. (Mohammad Shahid Khan via rohithsharmaks)
2015-12-23 09:12:32 +05:30
Arun Suresh
e88422df45
YARN-4477. FairScheduler: Handle condition which can result in an infinite loop in attemptScheduling. (Tao Jie via asuresh)
2015-12-21 22:41:09 -08:00
Jian He
85c2466048
YARN-4164. Changed updateApplicationPriority API to return the updated application priority. Contributed by Rohith Sharma K S
2015-12-18 14:13:48 -08:00
Junping Du
1de56b0448
YARN-3226. UI changes for decommissioning node. Contributed by Sunil G.
2015-12-17 15:20:17 -08:00
Jason Lowe
91828fef6b
YARN-4461. Redundant nodeLocalityDelay log in LeafQueue. Contributed by Eric Payne
2015-12-16 23:22:31 +00:00
Wangda Tan
9b856d9787
YARN-4416. Deadlock due to synchronised get Methods in AbstractCSQueue. (Naganarasimha G R via wangda)
2015-12-16 13:22:37 -08:00
Wangda Tan
7faa406f27
YARN-4225. Add preemption status to yarn queue -status for capacity scheduler. (Eric Payne via wangda)
2015-12-16 13:19:40 -08:00
Wangda Tan
79c41b1d83
YARN-4293. ResourceUtilization should be a part of yarn node CLI. (Sunil G via wangda)
2015-12-16 13:18:19 -08:00
Junping Du
50bd067e1d
YARN-4452. NPE when submit Unmanaged application. Contributed by Naganarasimha G R.
2015-12-16 10:57:39 -08:00
Zhihai Xu
2aaed10327
YARN-4440. FSAppAttempt#getAllowedLocalityLevelByTime should init the lastScheduler time. Contributed by Lin Yiqun
2015-12-15 00:17:21 -08:00
Jian He
1cb3299b48
YARN-4403. (AM/NM/Container)LivelinessMonitor should use monotonic time when calculating period. Contributed by Junping Du
2015-12-14 13:51:23 -08:00
Wangda Tan
07b0fb996a
YARN-4418. AM Resource Limit per partition can be updated to ResourceUsage as well. (Sunil G via wangda)
2015-12-14 11:24:30 -08:00
Wangda Tan
6cb0af3c39
YARN-3946. Update exact reason as to why a submitted app is in ACCEPTED state to app's diagnostic message. (Naganarasimha G R via wangda)
2015-12-14 10:52:46 -08:00
Arun Suresh
7fb212e5e6
YARN-4358 addendum patch to fix javadoc error
2015-12-12 22:22:55 -08:00
rohithsharmaks
a5e2e1ecb0
YARN-4421. Remove dead code in RmAppImpl.RMAppRecoveredTransition. (Daniel Templeton via rohithsharmaks)
2015-12-09 11:31:51 +05:30
Wangda Tan
7e4715186d
YARN-4424. Fix deadlock in RMAppImpl. (Jian he via wangda)
2015-12-08 14:25:16 -08:00
=
c25a635459
YARN-4248. REST API for submit/update/delete Reservations. (curino)
2015-12-07 13:33:28 -08:00
Jonathan Eagles
4ff973f96a
YARN-4422. Generic AHS sometimes doesn't show started, node, or logs on App page (Eric Payne via jeagles)
2015-12-07 15:04:48 -06:00
Xuan
4546c7582b
YARN-4392. ApplicationCreatedEvent event time resets after RM
...
restart/failover. Contributed by Naganarasimha G R and Xuan Gong
2015-12-07 12:24:55 -08:00
Steve Loughran
65f395226b
HADOOP-12321. Make JvmPauseMonitor an AbstractService. (Sunil G via Stevel) [includes HDFS-8947 MAPREDUCE-6462 and YARN-4072]
2015-12-06 17:43:35 +00:00
Arun Suresh
742632e346
YARN-4358. Reservation System: Improve relationship between SharingPolicy and ReservationAgent. (Carlo Curino via asuresh)
2015-12-05 21:26:16 -08:00
Wangda Tan
a2c3bfc8c1
YARN-4292. ResourceUtilization should be a part of NodeInfo REST API. (Sunil G via wangda)
2015-12-03 14:28:32 -08:00
Jian He
6b9a5beb2b
YARN-4398. Remove unnecessary synchronization in RMStateStore. Contributed by Ning Ding
2015-12-02 11:07:18 -08:00
Tsuyoshi Ozawa
28dfe721b8
YARN-4387. Fix typo in FairScheduler log message. Contributed by Xin Wang.
2015-11-24 19:24:01 +09:00
Karthik Kambatla
52948bb20b
YARN-3980. Plumb resource-utilization info in node heartbeat through to the scheduler. (Inigo Goiri via kasha)
2015-11-24 13:47:17 +05:30
Jian He
8676a118a1
YARN-4349. Support CallerContext in YARN. Contributed by Wangda Tan
2015-11-23 17:19:48 -08:00
Jason Lowe
d36b6e045f
YARN-4344. NMs reconnecting with changed capabilities can lead to wrong cluster resource calculations. Contributed by Varun Vasudev
2015-11-23 20:30:26 +00:00
Arun Suresh
da1016365a
YARN-3454. Add efficient merge operation to RLESparseResourceAllocation (Carlo Curino via asuresh)
2015-11-21 09:59:41 -08:00
Wangda Tan
2346fa3141
YARN-3769. Consider user limit when calculating total pending resource for preemption policy in Capacity Scheduler. (Eric Payne via wangda)
2015-11-20 15:55:50 -08:00
Jason Lowe
060cdcbe5d
YARN-4374. RM capacity scheduler UI rounds user limit factor. Contributed by Chang Li
2015-11-20 23:12:29 +00:00
Arun Suresh
6a61928fb7
YARN-4184. Remove update reservation state api from state store as its not used by ReservationSystem (Sean Po via asuresh)
2015-11-17 15:50:34 -08:00
Wangda Tan
7f55a18071
YARN-4347. Resource manager fails with Null pointer exception. (Jian He via wangda)
2015-11-12 11:23:40 -08:00
Wangda Tan
796638d9bc
YARN-4287. Capacity Scheduler: Rack Locality improvement (Nathan Roberts via wangda)
2015-11-12 11:09:37 -08:00
Vinod Kumar Vavilapalli (I am also known as @tshooter.)
6351d3fa63
YARN-4183. Reverting the patch to fix behaviour change.
...
Revert "YARN-4183. Enabling generic application history forces every job to get a timeline service delegation token (jeagles)"
This reverts commit c293c58954
.
2015-11-11 10:40:43 -08:00
Jian He
e5b1733e04
YARN-4127. RM fail with noAuth error if switched from failover to non-failover. Contributed by Varun Saxena
2015-10-29 15:42:57 -07:00
Jonathan Eagles
c293c58954
YARN-4183. Enabling generic application history forces every job to get a timeline service delegation token (jeagles)
2015-10-29 16:41:10 -05:00
Arun Suresh
58d1df585c
YARN-4310. FairScheduler: Log skipping reservation messages at DEBUG level (asuresh)
2015-10-29 13:42:09 -07:00
Rohith Sharma K S
656c8f9527
YARN-4130. Duplicate declaration of ApplicationId in RMAppManager#submitApplication method. (Kai Sasaki via rohithsharmaks)
2015-10-29 12:22:44 +05:30
Wangda Tan
56e4f6237a
YARN-3216. Max-AM-Resource-Percentage should respect node labels. (Sunil G via wangda)
2015-10-26 16:44:39 -07:00
Wangda Tan
3cc73773eb
YARN-4285. Display resource usage as percentage of queue and cluster in the RM UI (Varun Vasudev via wangda)
2015-10-26 13:07:39 -07:00
Jason Lowe
33a03af3c3
YARN-4284. condition for AM blacklisting is too narrow. Contributed by Sangjin Lee
2015-10-26 19:53:03 +00:00
Rohith Sharma K S
5acdde4744
YARN-2729. Support script based NodeLabelsProvider Interface in Distributed Node Label Configuration Setup. (Naganarasimha G R via rohithsharmaks)
2015-10-26 15:42:42 +05:30
Arun Suresh
ab8eb8770c
YARN-3738. Add support for recovery of reserved apps running under dynamic queues (subru via asuresh)
2015-10-24 22:53:10 -07:00
Akira Ajisaka
7781fe1b9e
YARN-4294. [JDK8] Fix javadoc errors caused by wrong reference and illegal tag. (aajisaka)
2015-10-24 11:54:42 +09:00
Jason Lowe
d3a34a4f38
YARN-4041. Slow delegation token renewal can severely prolong RM recovery. Contributed by Sunil G
2015-10-23 20:57:01 +00:00
Ming Ma
934d96a334
YARN-2913. Fair scheduler should have ability to set MaxResourceDefault for each queue. (Siqi Li via mingma)
2015-10-23 08:36:33 -07:00
Jonathan Eagles
f8adeb712d
YARN-4009. CORS support for ResourceManager REST API. ( Varun Vasudev via jeagles)
2015-10-23 10:34:08 -05:00
Junping Du
0fce5f9a49
YARN-4243. Add retry on establishing Zookeeper conenction in EmbeddedElectorService#serviceInit. Contributed by Xuan Gong.
2015-10-22 13:41:09 -07:00
Zhihai Xu
960201b79b
YARN-4256. YARN fair scheduler vcores with decimal values. Contributed by Jun Gong
2015-10-22 12:28:03 -07:00
Anubhav Dhoot
2798723a54
YARN-3739. Add reservation system recovery to RM recovery process. Contributed by Subru Krishnan.
2015-10-22 06:51:00 -07:00
Arun Suresh
506d1b1dbc
YARN-3985. Make ReservationSystem persist state using RMStateStore reservation APIs. (adhoot via asuresh)
2015-10-20 16:46:14 -07:00
Arun Suresh
7e2837f830
YARN-4270. Limit application resource reservation on nodes for non-node/rack specific requests (asuresh)
2015-10-19 20:00:38 -07:00
Jian He
f9da5cdb2b
YARN-4170. AM need to be notified with priority in AllocateResponse. Contributed by Sunil G
2015-10-16 15:26:27 -07:00
Wangda Tan
4337b263aa
YARN-4162. CapacityScheduler: Add resource usage by partition and queue capacity by partition to REST API. (Naganarasimha G R via wangda)
2015-10-16 15:06:28 -07:00
Jian He
cf23f2c2b5
YARN-4000. RM crashes with NPE if leaf queue becomes parent queue during restart. Contributed by Varun Saxena
2015-10-15 17:12:46 -07:00
rohithsharmaks
d6c8bad869
YARN-4250. NPE in AppSchedulingInfo#isRequestLabelChanged. (Brahma Reddy Battula via rohithsharmaks)
2015-10-14 16:11:34 +05:30
Jian He
9849c8b386
YARN-4230. RM crashes with NPE when increasing container resource if there is no headroom left. Contributed by Meng Ding
2015-10-12 11:51:33 -07:00
Zhihai Xu
049c6e8dc0
YARN-4201. AMBlacklist does not work for minicluster. Contributed by Jun Gong.
2015-10-12 00:14:25 -07:00
Devaraj K
db93047881
YARN-3964. Support NodeLabelsProvider at Resource Manager side.
...
Contributed by Dian Fu.
2015-10-11 11:21:29 +05:30
Wangda Tan
def374e666
YARN-4140. RM container allocation delayed incase of app submitted to Nodelabel partition. (Bibin A Chundatt via wangda)
2015-10-09 16:38:59 -07:00
Karthik Kambatla
4aa9b3e75c
MAPREDUCE-6302. Incorrect headroom can lead to a deadlock between map and reduce allocations. (kasha)
2015-10-09 07:37:39 -07:00
Jason Lowe
a0bca2b5ad
YARN-261. Ability to fail AM attempts. Contributed by Andrey Klochkov and Rohith Sharma K S
2015-10-09 14:17:38 +00:00
Rohith Sharma K S
8f195387a4
YARN-4235. FairScheduler PrimaryGroup does not handle empty groups returned for a user. (Anubhav Dhoot via rohithsharmaks)
2015-10-09 10:09:26 +05:30
Rohith Sharma K S
3793cbe4c3
YARN-4228. FileSystemRMStateStore use IOUtils#close instead of fs#close. (Bibin A Chundatt via rohithsharmaks)
2015-10-07 10:12:14 +05:30
Rohith Sharma K S
9156fc60c6
YARN-4209. RMStateStore FENCED state doesn’t work due to updateFencedState called by stateMachine.doTransition. (Zhihai Xu via rohithsharmaks)
2015-10-07 09:34:59 +05:30
Wangda Tan
29a582ada0
YARN-4215. RMNodeLabels Manager Need to verify and replace node labels for the only modified Node Label Mappings in the request. (Naganarasimha G R via wangda)
2015-10-06 11:56:04 -07:00
Harsh J
c918f7be5e
HADOOP-12458. Retries is typoed to spell Retires in parts of hadoop-yarn and hadoop-common. Contributed by Neelesh Srinivas Salian.
2015-10-03 18:37:58 +05:30
Xuan
8f08532bde
YARN-1897. CLI and core support for signal container functionality. Contributed by Ming Ma
2015-10-02 18:50:47 -07:00
Karthik Kambatla
a0b5a0a419
YARN-4066. Large number of queues choke fair scheduler. (Johan Gustavsson via kasha)
2015-09-29 07:55:34 -07:00
Anubhav Dhoot
9735afe967
YARN-4180. AMLauncher does not retry on failures when talking to NM. (adhoot)
2015-09-28 16:13:41 -07:00
Jason Lowe
9f53a95ff6
YARN-4141. Runtime Application Priority change should not throw exception for applications at finishing states. Contributed by Sunil G
2015-09-28 22:55:20 +00:00
Anubhav Dhoot
fb2e525c07
YARN-4204. ConcurrentModificationException in FairSchedulerQueueInfo. (adhoot)
2015-09-28 09:05:45 -07:00
Rohith Sharma K S
a9aafad12b
YARN-4044. Running applications information changes such as movequeue is not published to TimeLine server. (Sunil G via rohithsharmaks)
2015-09-24 12:13:22 +05:30
Jian He
b3f6b641dc
YARN-4171. Fix findbugs warnings in YARN-1197 branch. Contributed by Wangda Tan
2015-09-23 13:29:38 -07:00
Jian He
89cab1ba5f
YARN-1651. CapacityScheduler side changes to support container resize. Contributed by Wangda Tan
2015-09-23 13:29:38 -07:00
Tsuyoshi Ozawa
dfd807afab
HADOOP-12428. Fix inconsistency between log-level guards and statements. Contributed by Jagadesh Kiran N and Jackie Chang.
2015-09-22 12:54:29 +09:00
Rohith Sharma K S
c9cb6a5960
YARN-4167. NPE on RMActiveServices#serviceStop when store is null. (Bibin A Chundatt via rohithsharmaks)
2015-09-21 09:59:30 +05:30
Arun Suresh
94dec5a916
YARN-3920. FairScheduler container reservation on a node should be configurable to limit it to large containers (adhoot via asuresh)
2015-09-18 14:02:55 -07:00
Wangda Tan
9bc913a35c
YARN-3212. RMNode State Transition Update with DECOMMISSIONING state. (Junping Du via wangda)
2015-09-18 10:04:17 -07:00
Jian He
6c6e734f0b
YARN-4034. Render cluster Max Priority in scheduler metrics in RM web UI. Contributed by Rohith Sharma K S
2015-09-17 14:55:50 +08:00
Jian He
452079af8b
YARN-4078. Add getPendingResourceRequestForAttempt in YarnScheduler interface. Contributed by Naganarasimha G R
2015-09-16 14:59:20 +08:00
Wangda Tan
ae5308fe1d
YARN-3717. Expose app/am/queue's node-label-expression to RM web UI / CLI / REST-API. (Naganarasimha G R via wangda)
2015-09-15 11:40:50 -07:00
Junping Du
73e3a49eb0
YARN-313. Add Admin API for supporting node resource configuration in command line. (Contributed by Inigo Goiri, Kenji Kikushima and Junping Du)
2015-09-15 07:56:47 -07:00
Jian He
5468baa80a
YARN-3635. Refactored current queue mapping implementation in CapacityScheduler to use a generic PlacementManager framework. Contributed by Wangda Tan
2015-09-15 15:39:20 +08:00
Jian He
e1b1d7e4ae
YARN-4126. RM should not issue delegation tokens in unsecure mode. Contributed by Bibin A Chundatt
2015-09-14 14:09:19 +08:00
Karthik Kambatla
332b520a48
YARN-3697. FairScheduler: ContinuousSchedulingThread can fail to shutdown. (Zhihai Xu via kasha)
2015-09-13 18:07:43 -07:00
Karthik Kambatla
81df7b586a
YARN-2005. Blacklisting support for scheduling AMs. (Anubhav Dhoot via kasha)
2015-09-13 17:03:15 -07:00
Steve Loughran
7269906254
HADOOP-12087. [JDK8] Fix javadoc errors caused by incorrect or illegal tags. (Akira AJISAKA via stevel).
2015-09-13 14:25:26 +01:00
Jian He
6f72f1e600
YARN-2884. Added a proxy service in NM to proxy the the communication between AM and RM. Contributed by Kishore Chaliparambil
2015-09-08 09:35:46 +08:00
Xuan
9b78e6e33d
YARN-4087. Followup fixes after YARN-2019 regarding RM behavior when
...
state-store error occurs. Contributed by Jian He
2015-09-07 17:45:47 -07:00
Wangda Tan
bcc85e3bab
YARN-4024. YARN RM should avoid unnecessary resolving IP when NMs doing heartbeat. (Hong Zhiguo via wangda)
2015-09-04 15:13:53 -07:00
Jason Lowe
6eaca2e363
YARN-4105. Capacity Scheduler headroom for DRF is wrong. Contributed by Chang Li
2015-09-04 15:30:53 +00:00
Varun Vasudev
40d222e862
YARN-4103. RM WebServices missing scheme for appattempts logLinks. Contributed by Jonathan Eagles.
2015-09-04 14:31:51 +05:30
Varun Vasudev
b469ac531a
YARN-3970. Add REST api support for Application Priority. Contributed by Naganarasimha G R.
2015-09-03 16:40:10 +05:30
Jian He
09c64ba1ba
YARN-4101. RM should print alert messages if Zookeeper and Resourcemanager gets connection issue. Contributed by Xuan Gong
2015-09-02 17:45:23 -07:00
Rohith Sharma K S
7d6687fe76
YARN-3893. Both RM in active state when Admin#transitionToActive failure from refeshAll() (Bibin A Chundatt via rohithsharmaks)
2015-09-02 15:22:48 +05:30
Varun Vasudev
bf669b6d9f
YARN-4082. Container shouldn't be killed when node's label updated. Contributed by Wangda Tan.
2015-09-01 14:19:11 +05:30
Jian He
a3fd2ccc86
YARN-4092. Fixed UI redirection to print useful messages when both RMs are in standby mode. Contributed by Xuan Gong
2015-08-31 17:33:24 -07:00
Junping Du
beb65c9465
YARN-1556. NPE getting application report with a null appId. Contributed by Weiwei Yang.
2015-08-28 05:57:34 -07:00
Jian He
a9c8ea71aa
YARN-3250. Support admin cli interface in for Application Priority. Contributed by Rohith Sharma K S
2015-08-27 13:25:53 -07:00
Jian He
57c7ae1aff
YARN-4014. Support user cli interface in for Application Priority. Contributed by Rohith Sharma K S
2015-08-24 20:36:44 -07:00
Rohith Sharma K S
feaf034994
YARN-3896. RMNode transitioned from RUNNING to REBOOTED because its response id has not been reset synchronously. (Jun Gong via rohithsharmaks)
2015-08-24 11:25:07 +05:30
Rohith Sharma K S
22de7c1dca
YARN-3986. getTransferredContainers in AbstractYarnScheduler should be present in YarnScheduler interface
2015-08-21 10:51:11 +05:30
Xuan
22dc5fc209
YARN-4028. AppBlock page key update and diagnostics value null on
...
recovery. Contributed by Bibin A Chundatt
2015-08-18 22:53:03 -07:00
Zhihai Xu
3a76a010b8
YARN-3857: Memory leak in ResourceManager with SIMPLE mode. Contributed by mujunchao.
2015-08-18 10:36:40 -07:00
Jian He
0a030546e2
YARN-3987. Send AM container completed msg to NM once AM finishes. Contributed by sandflee
2015-08-13 16:22:53 -07:00
Jian He
7a445fcfab
YARN-4047. ClientRMService getApplications has high scheduler lock contention. Contributed by Jason Lowe
2015-08-13 16:02:57 -07:00
Jian He
e5003be907
YARN-4026. Refactored ContainerAllocator to accept a list of priorites rather than a single priority. Contributed by Wangda Tan
2015-08-12 15:07:50 -07:00
rohithsharmaks
1c12adb71f
YARN-4023. Publish Application Priority to TimelineServer. (Sunil G via rohithsharmaks)
2015-08-12 14:45:41 +05:30
Xuan
3ae716fa69
YARN-3999. RM hangs on draing events. Contributed by Jian He
2015-08-11 18:25:11 -07:00
Jian He
fa1d84ae27
YARN-3887. Support changing Application priority during runtime. Contributed by Sunil G
2015-08-10 20:51:54 -07:00
Wangda Tan
cf9d3c9256
YARN-3873. PendingApplications in LeafQueue should also use OrderingPolicy. (Sunil G via wangda)
2015-08-10 14:54:55 -07:00
Wangda Tan
4bc42d76e7
YARN-3966. Fix excessive loggings in CapacityScheduler. (Jian He via wangda)
2015-08-07 09:46:57 -07:00
Rohith Sharma K S
b6265d39c5
YARN-3948. Display Application Priority in RM Web UI.(Sunil G via rohithsharmaks)
2015-08-07 10:43:41 +05:30
Carlo Curino
8572a5a14b
YARN-3974. Refactor the reservation system test cases to use parameterized base test. (subru via curino)
2015-08-02 01:55:31 -07:00
Junping Du
cfee02b3bd
YARN-4019. Add JvmPauseMonitor to ResourceManager and NodeManager. Contributed by Robert Kanter.
2015-08-06 06:49:45 -07:00
Arun Suresh
154c9d2e42
YARN-3961. Expose pending, running and reserved containers of a queue in REST api and yarn top (adhoot via asuresh)
2015-08-05 23:14:14 -07:00
Jian He
ba2313d614
YARN-3983. Refactored CapacityScheduleri#FiCaSchedulerApp to easier extend container allocation logic. Contributed by Wangda Tan
2015-08-05 13:47:40 -07:00
Arun Suresh
f271d37735
YARN-3736. Add RMStateStore apis to store and load accepted reservations for failover (adhoot via asuresh)
2015-08-05 12:57:12 -07:00
Xuan
0306d902f5
YARN-3543. ApplicationReport should be able to tell whether the
...
Application is AM managed or not. Contributed by Rohith Sharma K S
2015-08-03 15:46:00 -07:00
Jonathan Eagles
3cd02b9522
YARN-3978. Configurably turn off the saving of container info in Generic AHS (Eric Payne via jeagles)
2015-08-03 10:38:05 -05:00
Jason Lowe
32e490b6c0
YARN-3990. AsyncDispatcher may overloaded with RMAppNodeUpdateEvent when Node is connected/disconnected. Contributed by Bibin A Chundatt
2015-07-31 17:37:24 +00:00
Zhihai Xu
ab80e27703
YARN-433. When RM is catching up with node updates then it should not expire acquired containers. Contributed by Xuan Gong
2015-07-30 21:57:11 -07:00
Wangda Tan
91b42e7d6e
YARN-3971. Skip RMNodeLabelsManager#checkRemoveFromClusterNodeLabelsOfQueue on nodelabel recovery. (Bibin A Chundatt via wangda)
2015-07-30 10:00:31 -07:00
Karthik Kambatla
5205a330b3
YARN-2768. Avoid cloning Resource in FSAppAttempt#updateDemand. (Hong Zhiguo via kasha)
2015-07-29 09:42:32 -07:00
Jian He
3572ebd738
YARN-3846. RM Web UI queue filter is not working for sub queue. Contributed by Mohammad Shahid Khan
2015-07-27 17:12:05 -07:00
ccurino
156f24ead0
YARN-3656. LowCost: A Cost-Based Placement Agent for YARN Reservations. (Jonathan Yaniv and Ishai Menache via curino)
2015-07-25 07:39:47 -07:00
Wangda Tan
a3bd7b4a59
YARN-3973. Recent changes to application priority management break reservation system from YARN-1051 (Carlo Curino via wangda)
2015-07-24 16:44:18 -07:00
Jian He
83fe34ac08
YARN-3026. Move application-specific container allocation logic from LeafQueue to FiCaSchedulerApp. Contributed by Wangda Tan
2015-07-24 14:00:25 -07:00
Karthik Kambatla
d19d187753
YARN-3957. FairScheduler NPE In FairSchedulerQueueInfo causing scheduler page to return 500. (Anubhav Dhoot via kasha)
2015-07-24 11:44:37 -07:00
carlo curino
0fcb4a8cf2
YARN-3969. Allow jobs to be submitted to reservation that is active but does not have any allocations. (subru via curino)
2015-07-23 19:33:59 -07:00
Rohith Sharma K S
e202efaf93
YARN-3845. Scheduler page does not render RGBA color combinations in IE11. (Contributed by Mohammad Shahid Khan)
2015-07-24 12:43:06 +05:30
Wangda Tan
3bba180051
YARN-3941. Proportional Preemption policy should try to avoid sending duplicate PREEMPT_CONTAINER event to scheduler. (Sunil G via wangda)
2015-07-23 10:07:57 -07:00
Junping Du
ee98d6354b
YARN-2019. Retrospect on decision of making RM crashed if any exception throw in ZKRMStateStore. Contributed by Jian He.
2015-07-22 17:52:35 -07:00
Wangda Tan
76ec26de80
YARN-3932. SchedulerApplicationAttempt#getResourceUsageReport and UserInfo should based on total-used-resources. (Bibin A Chundatt via wangda)
2015-07-22 11:54:02 -07:00
Wangda Tan
c39ca541f4
YARN-2003. Support for Application priority : Changes in RM and Capacity Scheduler. (Sunil G via wangda)
2015-07-21 09:57:23 -07:00
Arun Suresh
9b272ccae7
YARN-3535. Scheduler must re-request container resources when RMContainer transitions from ALLOCATED to KILLED (rohithsharma and peng.zhang via asuresh)
2015-07-17 04:31:34 -07:00
Wangda Tan
3540d5fe4b
YARN-3885. ProportionalCapacityPreemptionPolicy doesn't preempt if queue is more than 2 level. (Ajith S via wangda)
2015-07-16 16:13:32 -07:00
Arun Suresh
ac94ba3e18
YARN-3453. Ensure preemption logic in FairScheduler uses DominantResourceCalculator in DRF queues to prevent unnecessary thrashing. (asuresh)
2015-07-14 00:23:55 -07:00
Akira Ajisaka
19295b36d9
YARN-3381. Fix typo InvalidStateTransitonException. Contributed by Brahma Reddy Battula.
2015-07-13 17:52:13 +09:00
Wangda Tan
5ed1fead6b
YARN-3894. RM startup should fail for wrong CS xml NodeLabel capacity configuration. (Bibin A Chundatt via wangda)
2015-07-12 21:52:11 -07:00
Wangda Tan
1df39c1efc
YARN-3849. Too much of preemption activity causing continuos killing of containers across queues. (Sunil G via wangda)
2015-07-11 10:26:46 -07:00
Zhijie Shen
1ea36299a4
YARN-3116. RM notifies NM whether a container is an AM container or normal task container. Contributed by Giovanni Matteo Fumarola.
2015-07-10 18:58:10 -07:00
Ming Ma
08244264c0
YARN-3445. Cache runningApps in RMNode for getting running apps on given NodeId. (Junping Du via mingma)
2015-07-10 08:30:10 -07:00
Xuan
5214876792
YARN-3888. ApplicationMaster link is broken in RM WebUI when appstate is
...
NEW. Contributed by Bibin A Chundatt
2015-07-09 21:37:33 -07:00
carlo curino
0e602fa3a1
YARN-3800. Reduce storage footprint for ReservationAllocation. Contributed by Anubhav Dhoot.
2015-07-09 16:51:59 -07:00
Jian He
c9dd2cada0
YARN-3892. Fixed NPE on RMStateStore#serviceStop when CapacityScheduler#serviceInit fails. Contributed by Bibin A Chundatt
2015-07-07 14:16:21 -07:00
Devaraj K
37d7395773
YARN-3875. FSSchedulerNode#reserveResource() doesn't print Application Id
...
properly in log. Contributed by Bibin A Chundatt.
2015-07-02 10:20:31 +05:30
Wangda Tan
0e4b06690f
YARN-3508. Prevent processing preemption events on the main RM dispatcher. (Varun Saxena via wangda)
2015-07-01 17:32:22 -07:00
Devaraj K
80a68d6056
YARN-3830. AbstractYarnScheduler.createReleaseCache may try to clean a
...
null attempt. Contributed by nijel.
2015-07-01 19:03:44 +05:30
Devaraj K
b543d1a390
YARN-3859. LeafQueue doesn't print user properly for application add.
...
Contributed by Varun Saxena.
2015-06-28 10:04:50 +05:30
Devaraj K
57f1a01eda
YARN-3826. Race condition in ResourceTrackerService leads to wrong
...
diagnostics messages. Contributed by Chengbing Liu.
2015-06-25 16:13:59 +05:30
rohithsharmaks
dd4b387d96
YARN-3790. usedResource from rootQueue metrics may get stale data for FS scheduler after recovering the container (Zhihai Xu via rohithsharmaks)
2015-06-24 23:00:14 +05:30
Jason Lowe
2a20dd9b61
YARN-3809. Failed to launch new attempts because ApplicationMasterLauncher's threads all hang. Contributed by Jun Gong
2015-06-24 16:23:48 +00:00
Xuan
5b5bb8dcdc
YARN-3802. Two RMNodes for the same NodeId are used in RM sometimes
...
after NM is reconnected. Contributed by zhihai xu
2015-06-18 14:37:49 -07:00
Xuan
a826d432f9
YARN-3804. Both RM are on standBy state when kerberos user not in yarn.admin.acl. Contributed by Varun Saxena
2015-06-17 16:23:27 -07:00
Devaraj K
b039e69bb0
YARN-3789. Improve logs for LeafQueue#activateApplications(). Contributed
...
by Bibin A Chundatt.
2015-06-16 14:03:22 +05:30
Xuan
2b2465dfac
YARN-3778. Fix Yarn resourcemanger CLI usage. Contributed by Brahma Reddy Battula
2015-06-08 15:43:03 -07:00
Jian He
960b8f19ca
YARN-2716. Refactor ZKRMStateStore retry code with Apache Curator. Contributed by Karthik Kambatla
2015-06-08 14:50:58 -07:00
Devaraj K
c7ee6c151c
YARN-3780. Should use equals when compare Resource in
...
RMNodeImpl#ReconnectNodeTransition. Contributed by zhihai xu.
2015-06-08 11:54:55 +05:30
Karthik Kambatla
bd69ea408f
YARN-3655. FairScheduler: potential livelock due to maxAMShare limitation and container reservation. (Zhihai Xu via kasha)
2015-06-07 11:37:52 -07:00
Xuan
3e000a919f
YARN-1462. AHS API and other AHS changes to handle tags for completed MR jobs. Contributed by Xuan Gong
2015-06-05 12:48:52 -07:00
Karthik Kambatla
75885852cc
YARN-3259. FairScheduler: Trigger fairShare updates on node events. (Anubhav Dhoot via kasha)
2015-06-05 09:39:41 -07:00
Jian He
1970ca7cbc
YARN-2392. Add more diags about app retry limits on AM failures. Contributed by Steve Loughran
2015-06-04 11:14:09 -07:00
Jian He
6ad4e59cfc
YARN-3764. CapacityScheduler should forbid moving LeafQueue from one parent to another. Contributed by Wangda Tan
2015-06-04 10:52:59 -07:00
Junping Du
d7e7f6aa03
YARN-41. The RM should handle the graceful shutdown of the NM. Contributed by Devaraj K.
2015-06-04 04:59:27 -07:00
Xuan
5766a04428
YARN-3749. We should make a copy of configuration when init
...
MiniYARNCluster with multiple RMs. Contributed by Chun Chen
2015-06-03 17:20:15 -07:00
Zhijie Shen
bc85959edd
Revert "YARN-1462. Made RM write application tags to timeline server and exposed them to users via generic history web UI and REST API. Contributed by Xuan Gong."
...
This reverts commit 4a9ec1a824
.
2015-06-03 14:15:56 -07:00
Karthik Kambatla
edb9cd0f7a
YARN-3762. FairScheduler: CME on FSParentQueue#getQueueUserAclInfo. (kasha)
2015-06-03 13:47:24 -07:00
Zhijie Shen
4a9ec1a824
YARN-1462. Made RM write application tags to timeline server and exposed them to users via generic history web UI and REST API. Contributed by Xuan Gong.
2015-05-30 21:05:36 -07:00
Karthik Kambatla
a8acdd65b3
YARN-3467. Expose allocatedMB, allocatedVCores, and runningContainers metrics on running Applications in RM Web UI. (Anubhav Dhoot via kasha)
2015-05-30 13:36:24 -07:00
Karthik Kambatla
3ae2a62501
YARN-3547. FairScheduler: Apps that have no resource demand should not participate scheduling. (Xianyin Xin via kasha)
2015-05-29 15:17:22 -07:00
Vinod Kumar Vavilapalli
9acd24fec4
Fixed more FilesSystemRMStateStore issues. Contributed by Vinod Kumar Vavilapalli.
2015-05-28 15:25:56 -07:00
Allen Wittenauer
d6e3164d4a
YARN-2355. MAX_APP_ATTEMPTS_ENV may no longer be a useful env var for a container (Darrell Taylor via aw)
2015-05-27 16:40:56 -07:00
Wangda Tan
ec0a852a37
YARN-3647. RMWebServices api's should use updated api from CommonNodeLabelsManager to get NodeLabel object. (Sunil G via wangda)
2015-05-27 12:51:20 -07:00
Wangda Tan
cdbd66be11
YARN-3686. CapacityScheduler should trim default_node_label_expression. (Sunil G via wangda)
2015-05-26 15:58:47 -07:00
Jian He
10732d515f
YARN-3632. Ordering policy should be allowed to reorder an application when demand changes. Contributed by Craig Welch
2015-05-26 12:00:51 -07:00
Tsuyoshi Ozawa
9a3d617b63
YARN-2336. Fair scheduler's REST API returns a missing '[' bracket JSON for deep queue tree. Contributed by Kenji Kikushima and Akira Ajisaka.
2015-05-26 19:07:40 +09:00
Jian He
446d51591e
YARN-3707. RM Web UI queue filter doesn't work. Contributed by Wangda Tan
2015-05-22 15:58:22 -07:00
Karthik Kambatla
4513761869
YARN-3675. FairScheduler: RM quits when node removal races with continuous-scheduling on the same node. (Anubhav Dhoot via kasha)
2015-05-21 13:44:42 -07:00
Tsuyoshi Ozawa
7401e5b5e8
YARN-3677. Fix findbugs warnings in yarn-server-resourcemanager. Contributed by Vinod Kumar Vavilapalli.
2015-05-20 08:59:26 +09:00
Wangda Tan
563eb1ad2a
YARN-3583. Support of NodeLabel object instead of plain String in YarnClient side. (Sunil G via wangda)
2015-05-19 16:54:38 -07:00
Wangda Tan
b37da52a1c
YARN-3565. NodeHeartbeatRequest/RegisterNodeManagerRequest should use NodeLabel object instead of String. (Naganarasimha G R via wangda)
2015-05-19 16:34:17 -07:00
Xuan
b0ad644083
YARN-3526. ApplicationMaster tracking URL is incorrectly redirected on a QJM cluster. Contributed by Weiwei Yang
2015-05-15 22:39:49 -07:00
Jason Lowe
f7e051c431
YARN-2421. RM still allocates containers to an app in the FINISHING state. Contributed by Chang Li
2015-05-15 22:09:30 +00:00
Vinod Kumar Vavilapalli
9a2a9553ee
Fixing HDFS state-store. Contributed by Arun Suresh.
2015-05-14 16:13:51 -07:00
Junping Du
15ccd967ee
YARN-3505. Node's Log Aggregation Report with SUCCEED should not cached in RMApps. Contributed by Xuan Gong.
2015-05-14 10:58:12 -07:00
Wangda Tan
0e85044e26
YARN-3362. Add node label usage in RM CapacityScheduler web UI. (Naganarasimha G R via wangda)
2015-05-13 17:00:36 -07:00
Wangda Tan
7f19e7a254
YARN-3521. Support return structured NodeLabel objects in REST API (Sunil G via wangda)
2015-05-13 13:43:17 -07:00
Wangda Tan
d6f6741296
YARN-3489. RMServerUtils.validateResourceRequests should only obtain queue info once. (Varun Saxena via wangda)
2015-05-11 17:31:15 -07:00
Junping Du
7e543c27fa
YARN-3587. Fix the javadoc of DelegationTokenSecretManager in yarn, etc. projects. Contributed by Gabor Liptak.
2015-05-11 05:27:07 -07:00
Karthik Kambatla
a60f78e98e
YARN-3395. FairScheduler: Trim whitespaces when using username for queuename. (Zhihai Xu via kasha)
2015-05-09 15:41:20 -07:00
Robert Kanter
5658998845
YARN-3473. Fix RM Web UI configuration for some properties (rchiang via rkanter)
2015-05-08 16:23:10 -07:00
Jian He
d0b73287c2
YARN-3604. Fixed ZKRMStateStore#removeApplication to also disable watch. Contributed Zhihai Xu
2015-05-08 15:12:33 -07:00
Vinod Kumar Vavilapalli
f7a74d2491
YARN-3018. Unified the default value for the configuration property yarn.scheduler.capacity.node-locality-delay in code and default xml file. Contributed by Nijel SF.
2015-05-08 14:45:06 -07:00
Wangda Tan
f55d21d894
YARN-3593. Add label-type and Improve "DEFAULT_PARTITION" in Node Labels Page. (Naganarasimha G R via wangda)
2015-05-08 12:04:50 -07:00
Thomas Graves
5d708a4725
YARN-3600. AM container link is broken (Naganarasimha G R via tgraves
2015-05-08 16:35:40 +00:00
Devaraj K
ef3d66d462
YARN-3358. Audit log not present while refreshing Service ACLs.
...
Contributed by Varun Saxena.
2015-05-08 12:14:10 +05:30
Devaraj K
a28cd02ba0
YARN-3592. Fix typos in RMNodeLabelsManager. Contributed by Sunil G.
2015-05-08 10:54:36 +05:30
Jian He
f489a4ec96
YARN-2918. RM should not fail on startup if queue's configured labels do not exist in cluster-node-labels. Contributed by Wangda Tan
2015-05-07 17:35:41 -07:00
Jian He
b88700dcd0
YARN-3584. Fixed attempt diagnostics format shown on the UI. Contributed by nijel
2015-05-07 13:54:07 -07:00
Vinod Kumar Vavilapalli
4c7b9b6abe
YARN-3385. Fixed a race-condition in ResourceManager's ZooKeeper based state-store to avoid crashing on duplicate deletes. Contributed by Zhihai Xu.
2015-05-06 17:51:17 -07:00
Jian He
0d3188fd25
YARN-3301. Fixed the format issue of the new RM attempt web page. Contributed by Xuan Gong
2015-05-06 13:45:00 -07:00
Xuan
b7dd3a4f04
YARN-2123. Progress bars in Web UI always at 100% due to non-US locale.
...
Contributed by Akira AJISAKA
2015-05-05 10:40:16 -07:00
Jason Lowe
9356cf8676
YARN-3552. RM Web UI shows -1 running containers for completed apps. Contributed by Rohith
2015-05-05 13:42:32 +00:00
Jian He
d701acc9c6
YARN-2725. Added test cases of retrying creating znode in ZKRMStateStore. Contributed by Tsuyoshi Ozawa
2015-05-04 16:13:29 -07:00
Gera Shegalov
f8204e241d
YARN-2893. AMLaucher: sporadic job failures due to EOFException in readTokenStorageStream. (Zhihai Xu via gera)
2015-05-01 18:18:55 -07:00
Zhijie Shen
7e8639fda4
YARN-3544. Got back AM logs link on the RM web UI for a completed app. Contributed by Xuan Gong.
2015-04-30 13:05:55 -07:00
tgraves
2e215484bd
YARN-3517. RM web ui for dumping scheduler logs should be for admins only (Varun Vasudev via tgraves)
2015-04-29 21:25:42 +00:00
Karthik Kambatla
8f82970e0c
YARN-3485. FairScheduler headroom calculation doesn't consider maxResources for Fifo and FairShare policies. (kasha)
2015-04-28 21:00:35 -07:00
Wangda Tan
db1b674b50
YARN-2740. Fix NodeLabelsManager to properly handle node label modifications when distributed node label configuration enabled. (Naganarasimha G R via wangda)
2015-04-27 16:24:38 -07:00
Jian He
d497f6ea2b
YARN-2498. Respect labels in preemption policy of capacity scheduler for inter-queue preemption. Contributed by Wangda Tan
2015-04-24 17:03:13 -07:00
Tsuyoshi Ozawa
4a3dabd94f
YARN-3406. Display count of running containers in the RM's Web UI. Contributed by Ryu Kobayashi.
2015-04-25 07:17:11 +09:00
Jian He
d03dcb9635
YARN-3387. Previous AM's container completed status couldn't pass to current AM if AM and RM restarted during the same time. Contributed by Sandflee
2015-04-24 12:13:29 -07:00
Xuan
eee9facbba
YARN-3511. Add errors and warnings page to ATS. Contributed by Varun Vasudev
2015-04-24 09:41:59 -07:00
Vinod Kumar Vavilapalli
f5fe35e297
YARN-3413. Changed Nodelabel attributes (like exclusivity) to be settable only via addToClusterNodeLabels but not changeable at runtime. (Wangda Tan via vinodkv)
2015-04-23 11:19:55 -07:00
Wangda Tan
395205444e
YARN-3319. Implement a FairOrderingPolicy. (Craig Welch via wangda)
2015-04-23 10:47:15 -07:00
tgraves
189a63a719
YARN-3434. Interaction between reservations and userlimit can result in significant ULF violation
2015-04-23 14:39:25 +00:00
Steve Loughran
d9bcf99aef
YARN-2605. [RM HA] Rest api endpoints doing redirect incorrectly. (Xuan Gong via stevel)
2015-04-23 09:22:48 +01:00
Junping Du
fad9d7e85b
New parameter of CLI for decommissioning node gracefully in RMAdmin CLI. Contributed by Devaraj K
2015-04-22 10:07:20 -07:00
Jian He
bdd90110e6
YARN-3494. Expose AM resource limit and usage in CS QueueMetrics. Contributed by Rohith Sharmaks
2015-04-21 20:06:20 -07:00
Wangda Tan
e71d0d87d9
YARN-3410. YARN admin should be able to remove individual application records from RMStateStore. (Rohith Sharmaks via wangda)
2015-04-21 17:51:22 -07:00
Tsuyoshi Ozawa
105afd5477
YARN-3495. Confusing log generated by FairScheduler. Contributed by Brahma Reddy Battula.
2015-04-22 05:47:59 +09:00
Wangda Tan
44872b76fc
YARN-3463. Integrate OrderingPolicy Framework with CapacityScheduler. (Craig Welch via wangda)
2015-04-20 17:12:32 -07:00
Jian He
497c86b485
YARN-3136. Fixed a synchronization problem of AbstractYarnScheduler#getTransferredContainers. Contributed by Sunil G
2015-04-18 12:46:37 -07:00
Wangda Tan
f47a5763ac
YARN-3487. CapacityScheduler scheduler lock obtained unnecessarily when calling getQueue (Jason Lowe via wangda)
2015-04-17 17:22:27 -07:00
Wangda Tan
f65eeb412d
YARN-3493. RM fails to come up with error "Failed to load/recover state" when mem settings are changed. (Jian He via wangda)
2015-04-17 17:11:22 -07:00
Jian He
6779467ab6
YARN-3451. Display attempt start time and elapsed time on the web UI. Contributed by Rohith Sharmaks
2015-04-17 15:55:34 -07:00
Jian He
d573f09fb9
YARN-2696. Queue sorting in CapacityScheduler should consider node label. Contributed by Wangda Tan
2015-04-17 13:36:59 -07:00
Junping Du
1db355a875
YARN-1402. Update related Web UI and CLI with exposing client API to check log aggregation status. Contributed by Xuan Gong.
2015-04-17 13:18:59 -07:00
Jian He
bb6dde68f1
YARN-3021. YARN's delegation-token handling disallows certain trust setups to operate properly over DistCp. Contributed by Yongjun Zhang
2015-04-16 19:43:37 -07:00
Akira Ajisaka
1fa8075794
MAPREDUCE-6300. Task list sort by task id broken. Contributed by Siqi Li.
2015-04-16 10:04:09 -05:00
Jian He
1b89a3e173
YARN-3354. Add node label expression in ContainerTokenIdentifier to support RM recovery. Contributed by Wangda Tan
2015-04-15 13:57:06 -07:00
Tsuyoshi Ozawa
e48cedc663
YARN-3326. Support RESTful API for getLabelsToNodes. Contributed by Naganarasimha G R.
2015-04-15 14:03:55 -05:00
Wangda Tan
5004e75332
YARN-3318. Create Initial OrderingPolicy Framework and FifoOrderingPolicy. (Craig Welch via wangda)
2015-04-15 09:56:32 -07:00
Jian He
0fefda645b
YARN-3361. CapacityScheduler side changes to support non-exclusive node labels. Contributed by Wangda Tan
2015-04-14 11:45:58 -07:00
Jian He
b46ee1e7a3
YARN-3266. RMContext#inactiveNodes should have NodeId as map key. Contributed by Chengbing Liu
2015-04-14 10:54:22 -07:00
Jian He
a1afbc48b5
YARN-3472. Fixed possible leak in DelegationTokenRenewer#allTokens. Contributed by Rohith Sharmaks
2015-04-13 14:07:17 -07:00
Jian He
944a16579f
YARN-3382. Some of UserMetricsInfo metrics are incorrectly set to root queue metrics. Contributed by Rohit Agarwal
2015-04-13 10:53:34 -07:00
Karthik Kambatla
e516706b89
YARN-3469. ZKRMStateStore: Avoid setting watches that are not required. (Jun Hong via kasha)
2015-04-10 11:20:34 -07:00
Jian He
577d755e4b
YARN-3348. Add a 'yarn top' tool to help understand cluster usage. Contributed by Varun Vasudev
2015-04-10 09:57:17 -07:00
Junping Du
92431c9617
YARN-1376. NM need to notify the log aggregation status to RM through Node heartbeat. Contributed by Xuan Gong.
2015-04-10 08:56:18 -07:00
Xuan
afa5d4715a
YARN-3293. Track and display capacity scheduler health metrics in web
...
UI. Contributed by Varun Vasudev
2015-04-09 23:38:04 -07:00
Vinod Kumar Vavilapalli
9c5911294e
YARN-3055. Fixed ResourceManager's DelegationTokenRenewer to not stop token renewal of applications part of a bigger workflow. Contributed by Daryn Sharp.
2015-04-09 13:08:53 -07:00
Wangda Tan
1885141e90
YARN-3466. Fix RM nodes web page to sort by node HTTP-address, #containers and node-label column. (Jason Lowe via wangda)
2015-04-09 10:35:12 -07:00
Xuan
d27e9241e8
YARN-3294. Allow dumping of Capacity Scheduler debug logs via web UI for
...
a fixed time period. Contributed by Varun Vasudev
2015-04-07 09:52:36 -07:00
Xuan
96d72118f5
YARN-3435. AM container to be allocated Appattempt AM container shown as
...
null. Contributed by Bibin A Chundatt
2015-04-05 00:40:57 -07:00
Wangda Tan
bad070fe15
YARN-2901. Add errors and warning metrics page to RM, NM web UI. (Varun Vasudev via wangda)
2015-04-02 17:23:20 -07:00
Sandy Ryza
6a6a59db7f
YARN-3415. Non-AM containers can be counted towards amResourceUsage of a fairscheduler queue (Zhihai Xu via Sandy Ryza)
2015-04-02 13:56:08 -07:00
Zhijie Shen
8366a36ad3
YARN-3430. Made headroom data available on app attempt page of RM WebUI. Contributed by Xuan Gong.
2015-04-01 13:47:54 -07:00
Xuan
4728bdfa15
YARN-3248. Display count of nodes blacklisted by apps in the web UI.
...
Contributed by Varun Vasudev
2015-04-01 04:19:18 -07:00
Tsuyoshi Ozawa
b5a22e9838
YARN-3258. FairScheduler: Need to add more logging to investigate allocations. Contributed by Anubhav Dhoot.
2015-03-31 17:42:44 +09:00
Wangda Tan
2a945d24f7
YARN-2495. Allow admin specify labels from each NM (Distributed configuration for node label). (Naganarasimha G R via wangda)
2015-03-30 12:05:21 -07:00
Wangda Tan
97a7277a2d
YARN-3383. AdminService should use warn instead of info to log exception when operation fails. (Li Lu via wangda)
2015-03-24 10:38:54 -07:00
Karthik Kambatla
972f1f1ab9
YARN-2868. FairScheduler: Metric for latency to allocate first container for an application. (Ray Chiang via kasha)
2015-03-23 14:07:05 -07:00
Karthik Kambatla
2bc097cd14
YARN-3241. FairScheduler handles invalid queue names inconsistently. (Zhihai Xu via kasha)
2015-03-23 13:22:03 -07:00
cnauroth
6ca1f12024
YARN-3336. FileSystem memory leak in DelegationTokenRenewer.
2015-03-23 10:45:50 -07:00
Jian He
e1feb4ea1a
YARN-3345. Add non-exclusive node label API. Contributed by Wangda Tan
2015-03-20 19:04:38 -07:00
Jian He
586348e4cb
YARN-3356. Capacity Scheduler FiCaSchedulerApp should use ResourceUsage to track used-resources-by-label. Contributed by Wangda Tan
2015-03-20 13:54:01 -07:00
Wangda Tan
6bc7710ec7
YARN-3369. Missing NullPointer check in AppSchedulingInfo causes RM to die. (Brahma Reddy Battula via wangda)
2015-03-20 10:42:05 -07:00
Jian He
4e886eb9cb
YARN-3379. Fixed missing data in localityTable and ResourceRequests table in RM WebUI. Contributed by Xuan Gong
2015-03-19 22:27:21 -07:00
Jian He
658097d6da
YARN-3273. Improve scheduler UI to facilitate scheduling analysis and debugging. Contributed Rohith Sharmaks
2015-03-17 21:30:23 -07:00
Tsuyoshi Ozawa
3bc72cc16d
YARN-3205. FileSystemRMStateStore should disable FileSystem Cache to avoid get a Filesystem with an old configuration. Contributed by Zhihai Xu.
2015-03-18 11:53:19 +09:00
Jian He
968425e9f7
YARN-3305. Normalize AM resource request on app submission. Contributed by Rohith Sharmaks
2015-03-17 13:49:59 -07:00
Karthik Kambatla
32b4330456
Revert "YARN-3181. FairScheduler: Fix up outdated findbugs issues. (kasha)"
...
This reverts commit c2b185def8
.
2015-03-17 12:31:15 -07:00
Jian He
487374b7fe
YARN-3243. CapacityScheduler should pass headroom from parent to children to make sure ParentQueue obey its capacity limits. Contributed by Wangda Tan.
2015-03-17 10:24:23 -07:00
Devaraj K
7179f94f9d
YARN-3197. Confusing log generated by CapacityScheduler. Contributed by
...
Varun Saxena.
2015-03-17 15:57:57 +05:30
Tsuyoshi Ozawa
3da9a97cfb
YARN-1453. [JDK8] Fix Javadoc errors caused by incorrect or illegal tags in doc comments. Contributed by Akira AJISAKA, Andrew Purtell, and Allen Wittenauer.
2015-03-16 23:19:05 +09:00
Zhijie Shen
85f6d67fa7
YARN-1884. Added nodeHttpAddress into ContainerReport and fixed the link to NM web page. Contributed by Xuan Gong.
2015-03-11 19:35:19 -07:00
Allen Wittenauer
a5cf985bf5
YARN-2280. Resource manager web service fields are not accessible (Krisztian Horvath via aw)
2015-03-10 16:32:20 -07:00
Jian He
c3003eba6f
YARN-3300. Outstanding_resource_requests table should not be shown in AHS. Contributed by Xuan Gong
2015-03-09 20:46:48 -07:00
Jason Lowe
27e8ea820f
YARN-3275. CapacityScheduler: Preemption happening on non-preemptable queues. Contributed by Eric Payne
2015-03-06 22:37:26 +00:00
Jian He
95bfd087dc
YARN-1809. Synchronize RM and TimeLineServer Web-UIs. Contributed by Zhijie Shen and Xuan Gong
2015-03-05 21:20:09 -08:00
Tsuyoshi Ozawa
1b67209612
YARN-3249. Add a 'kill application' button to Resource Manager's Web UI. Contributed by Ryu Kobayashi.
2015-03-05 19:55:56 +09:00
Karthik Kambatla
8d88691d16
YARN-3242. Asynchrony in ZK-close can lead to ZKRMStateStore watcher receiving events for old client. (Zhihai Xu via kasha)
2015-03-04 19:49:05 -08:00
Karthik Kambatla
22426a1c9f
YARN-3231. FairScheduler: Changing queueMaxRunningApps interferes with pending jobs. (Siqi Li via kasha)
2015-03-04 18:06:58 -08:00
Jian He
b2f1ec312e
YARN-3222. Fixed NPE on RMNodeImpl#ReconnectNodeTransition when a node is reconnected with a different port. Contributed by Rohith Sharmaks
2015-03-03 16:28:28 -08:00
Wangda Tan
e17e5ba9d7
YARN-3272. Surface container locality info in RM web UI (Jian He via wangda)
2015-03-03 11:49:01 -08:00
Tsuyoshi Ozawa
d1c6accb6f
HADOOP-11602. Fix toUpperCase/toLowerCase to use Locale.ENGLISH. (ozawa)
2015-03-03 14:17:52 +09:00
Vinod Kumar Vavilapalli
14dd647c55
YARN-3265. Fixed a deadlock in CapacityScheduler by always passing a queue's available resource-limit from the parent queue. Contributed by Wangda Tan.
2015-03-02 17:52:47 -08:00
Wangda Tan
edcecedc1c
YARN-3262. Surface application outstanding resource requests table in RM web UI. (Jian He via wangda)
2015-02-27 16:13:32 -08:00
Tsuyoshi Ozawa
01a1621930
YARN-2820. Retry in FileSystemRMStateStore when FS's operations fail due to IOException. Contributed by Zhihai Xu.
2015-02-28 00:56:44 +09:00
Konstantin V Shvachko
8ca0d957c4
YARN-3255. RM, NM, JobHistoryServer, and WebAppProxyServer's main() should support generic options. Contributed by Konstantin Shvachko.
2015-02-26 17:12:19 -08:00
Tsuyoshi Ozawa
9cedad11d8
Revert "HADOOP-11602. Fix toUpperCase/toLowerCase to use Locale.ENGLISH. (ozawa)"
...
This reverts commit 946456c6d8
.
Conflicts:
hadoop-common-project/hadoop-common/CHANGES.txt
hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/namenode/QuotaByStorageTypeEntry.java
2015-02-25 00:32:04 +09:00
Wangda Tan
ce5bf927c3
YARN-3230. Clarify application states on the web UI. (Jian He via wangda)
2015-02-20 10:39:28 -08:00
Jason Lowe
a64dd3d24b
YARN-3194. RM should handle NMContainerStatuses sent by NM while registering if NM is Reconnected node. Contributed by Rohith
2015-02-20 15:10:10 +00:00
Jian He
c0d9b93953
YARN-933. Fixed InvalidStateTransitonException at FINAL_SAVING state in RMApp. Contributed by Rohith Sharmaks
2015-02-19 15:42:39 -08:00
Wangda Tan
d49ae725d5
YARN-3076. Add API/Implementation to YarnClient to retrieve label-to-node mapping (Varun Saxena via wangda)
2015-02-19 11:00:57 -08:00
Tsuyoshi Ozawa
946456c6d8
HADOOP-11602. Fix toUpperCase/toLowerCase to use Locale.ENGLISH. (ozawa)
2015-02-19 13:06:53 +09:00
Tsuyoshi Ozawa
b8a14efdf5
YARN-1615. Fix typos in delay scheduler's description. Contributed by Akira Ajisaka.
2015-02-19 10:24:07 +09:00
Jian He
f5da5566d9
YARN-3132. RMNodeLabelsManager should remove node from node-to-label mapping when node becomes deactivated. Contributed by Wangda Tan
2015-02-18 11:51:51 -08:00
Tsuyoshi Ozawa
556386a070
YARN-1299. Improve a log message in AppSchedulingInfo by adding application id. Contributed by Ashutosh Jindal and Devaraj K.
2015-02-17 01:53:31 +09:00
Tsuyoshi Ozawa
3f32357c36
Revert "YARN-1299. Improve a log message in AppSchedulingInfo by adding application id. Contributed by Ashutosh Jindal and devaraj."
...
This reverts commit 9aae81c934
.
2015-02-17 01:52:22 +09:00
Tsuyoshi Ozawa
9aae81c934
YARN-1299. Improve a log message in AppSchedulingInfo by adding application id. Contributed by Ashutosh Jindal and devaraj.
2015-02-17 01:48:22 +09:00
Tsuyoshi Ozawa
110cf6b9a3
YARN-3182. Cleanup switch statement in ApplicationMasterLauncher#handle(). Contributed by Ray Chiang.
2015-02-13 14:21:21 +09:00
Xuan
99f6bd4f7a
YARN-3158. Correct log messages in ResourceTrackerService. Contributed
...
by Varun Saxena
2015-02-12 16:42:15 -08:00
Jian He
18297e0972
YARN-3104. Fixed RM to not generate new AMRM tokens on every heartbeat between rolling and activation. Contributed by Jason Lowe
2015-02-12 16:02:45 -08:00
Jian He
18a594257e
YARN-3124. Fixed CS LeafQueue/ParentQueue to use QueueCapacities to track capacities-by-label. Contributed by Wangda Tan
2015-02-12 14:58:09 -08:00
Karthik Kambatla
c2b185def8
YARN-3181. FairScheduler: Fix up outdated findbugs issues. (kasha)
2015-02-12 13:44:47 -08:00
Xuan
65c69e296e
YARN-3151. On Failover tracking url wrong in application cli for KILLED
...
application. Contributed by Rohith
2015-02-11 21:19:48 -08:00
Junping Du
c541a374d8
YARN-3160. Fix non-atomic operation on nodeUpdateQueue in RMNodeImpl. (Contributed by Chengbing Liu)
2015-02-11 03:44:31 -08:00
Zhijie Shen
d5855c0e46
YARN-2246. Made the proxy tracking URL always be http(s)://proxy addr:port/proxy/<appId> to avoid duplicate sections. Contributed by Devaraj K.
2015-02-10 15:24:01 -08:00
Zhijie Shen
23bf6c7207
YARN-3100. Made YARN authorization pluggable. Contributed by Jian He.
2015-02-09 20:34:56 -08:00
Jian He
0af6a99a3f
YARN-3094. Reset timer for liveness monitors after RM recovery. Contributed by Jun Gong
2015-02-09 13:47:08 -08:00
Karthik Kambatla
7e42088abf
YARN-2990. FairScheduler's delay-scheduling always waits for node-local and rack-local delays, even for off-rack-only requests. (kasha)
2015-02-08 22:48:10 -08:00
Jason Lowe
da2fb2bc46
YARN-3143. RM Apps REST API can return NPE or entries missing id and other fields. Contributed by Jason Lowe
2015-02-06 21:47:32 +00:00
Jian He
c1957fef29
YARN-2694. Ensure only single node label specified in ResourceRequest. Contributed by Wangda Tan
2015-02-06 11:34:20 -08:00
Arun C. Murthy
18b2507eda
YARN-1904. Ensure exceptions thrown in ClientRMService & ApplicationHistoryClientService are uniform when application-attempt is not found. Contributed by Zhijie Shen.
2015-02-05 23:48:55 -08:00
Jian He
4641196fe0
YARN-3145. Fixed ConcurrentModificationException on CapacityScheduler ParentQueue#getQueueUserAclInfo. Contributed by Tsuyoshi OZAWA
2015-02-05 16:12:53 -08:00
Jason Lowe
69c8a7f45b
YARN-1582. Capacity Scheduler: add a maximum-allocation-mb setting per queue. Contributed by Thomas Graves
2015-02-05 19:28:49 +00:00
Sandy Ryza
b6466deac6
YARN-3101. In Fair Scheduler, fix canceling of reservations for exceeding max share (Anubhav Dhoot via Sandy Ryza)
2015-02-05 09:39:28 -08:00
Tsuyoshi Ozawa
26dee1486b
YARN-3058. Fix error message of tokens' activation delay configuration. Contributed by Yi Liu.
2015-02-04 11:41:14 +09:00
Zhijie Shen
e5b56e2af6
YARN-2543. Made resource usage be published to the timeline server too. Contributed by Naganarasimha G R.
2015-02-03 17:34:22 -08:00
Wangda Tan
5bd984691b
YARN-3075. NodeLabelsManager implementation to retrieve label to node mapping (Varun Saxena via wangda)
2015-02-03 12:52:42 -08:00
Jian He
21d80b3dd9
YARN-3098. Created common QueueCapacities class in Capacity Scheduler to track capacities-by-labels of queues. Contributed by Wangda Tan
2015-02-03 11:43:12 -08:00
Jason Lowe
a761bf8726
YARN-3085. Application summary should include the application type. Contributed by Rohith
2015-02-03 14:56:34 +00:00
Jian He
054a947989
YARN-3077. Fixed RM to create zk root path recursively. Contributed by Chun Chen
2015-01-30 17:34:49 -08:00
Jian He
86358221fc
YARN-3099. Capacity Scheduler LeafQueue/ParentQueue should use ResourceUsage to track used-resources-by-label. Contributed by Wangda Tan
2015-01-30 15:15:20 -08:00
Akira Ajisaka
342efa110a
HADOOP-9907. Webapp http://hostname:port/metrics link is not working. (aajisaka)
2015-01-30 02:49:10 +09:00
Wangda Tan
7882bc0f14
YARN-3079. Scheduler should also update maximumAllocation when updateNodeResource. (Zhihai Xu via wangda)
2015-01-28 21:54:38 -08:00
Wangda Tan
18741adf97
YARN-2932. Add entry for preemptable status (enabled/disabled) to scheduler web UI and queue initialize/refresh logging. (Eric Payne via wangda)
2015-01-27 15:36:09 -08:00
Jian He
6f9fe76918
YARN-3092. Created a common ResourceUsage class to track labeled resource usages in Capacity Scheduler. Contributed by Wangda Tan
2015-01-26 15:38:00 -08:00
Tsuyoshi Ozawa
24aa462673
YARN-2800. Remove MemoryNodeLabelsStore and add a way to enable/disable node labels feature. Contributed by Wangda Tan.
2015-01-23 20:37:05 +09:00
Wangda Tan
0a2d3e717d
YARN-2933. Capacity Scheduler preemption policy should only consider capacity without labels temporarily. Contributed by Mayank Bansal
2015-01-19 16:48:50 -08:00
Jian He
9e33116d1d
YARN-2861. Fixed Timeline DT secret manager to not reuse RM's configs. Contributed by Zhijie Shen
2015-01-15 11:07:43 -08:00
Jian He
c53420f583
YARN-2637. Fixed max-am-resource-percent calculation in CapacityScheduler when activating applications. Contributed by Craig Welch
2015-01-13 17:32:07 -08:00
Robert Kanter
51881535e6
YARN-2643. Don't create a new DominantResourceCalculator on every FairScheduler.allocate call. (kasha via rkanter)
2015-01-12 17:51:00 -08:00
Robert Kanter
ae7bf31fe1
YARN-3027. Scheduler should use totalAvailable resource from node instead of availableResource for maxAllocation. (adhoot via rkanter)
2015-01-12 10:47:52 -08:00
Jian He
a260406268
YARN-3014. Replaces labels on a host should update all NM's labels on that host. Contributed by Wangda Tan
2015-01-09 17:49:53 -08:00
Zhijie Shen
dc2eaa26b2
YARN-2996. Improved synchronization and I/O operations of FS- and Mem- RMStateStore. Contributed by Yi Liu.
2015-01-08 09:47:02 -08:00
Jian He
e13a484a2b
YARN-3010. Fixed findbugs warning in AbstractYarnScheduler. Contributed by Yi Liu
2015-01-07 14:12:20 -08:00
Zhijie Shen
60103fca04
YARN-2427. Added the API of moving apps between queues in RM web services. Contributed by Varun Vasudev.
2015-01-06 14:37:44 -08:00
Jian He
dd57c2047b
YARN-2978. Fixed potential NPE while getting queue info. Contributed by Varun Saxena
2015-01-06 14:15:57 -08:00
Karthik Kambatla
0c4b112677
YARN-2881. [YARN-2574] Implement PlanFollower for FairScheduler. (Anubhav Dhoot via kasha)
2015-01-06 04:42:10 +05:30
Zhijie Shen
562a701945
YARN-2958. Made RMStateStore not update the last sequence number when updating the delegation token. Contributed by Varun Saxena.
2015-01-05 13:33:07 -08:00
Tsuyoshi Ozawa
ddc5be48fc
YARN-2922. ConcurrentModificationException in CapacityScheduler's LeafQueue. Contributed by Rohith Sharmaks.
2015-01-05 00:08:31 +09:00
Karthik Kambatla
e7257acd8a
YARN-2998. Abstract out scheduler independent PlanFollower components. (Anubhav Dhoot via kasha)
2014-12-30 19:55:24 -08:00
Jian He
e2351c7ae2
YARN-2987. Fixed ClientRMService#getQueueInfo to check against queue and app ACLs. Contributed by Varun Saxena
2014-12-30 17:15:37 -08:00
Jian He
b7442bf92e
YARN-2493. Added node-labels page on RM web UI. Contributed by Wangda Tan
2014-12-30 16:49:01 -08:00
Jian He
746ad6e989
Revert "YARN-2492(wrong jira number). Added node-labels page on RM web UI. Contributed by Wangda Tan"
...
This reverts commit 5f57b904f5
.
2014-12-30 16:48:49 -08:00
Jian He
5f57b904f5
YARN-2492. Added node-labels page on RM web UI. Contributed by Wangda Tan
2014-12-30 15:38:28 -08:00
Zhijie Shen
241d3b3a50
YARN-2938. Fixed new findbugs warnings in hadoop-yarn-resourcemanager and hadoop-yarn-applicationhistoryservice. Contributed by Varun Saxena.
2014-12-29 09:59:35 -08:00
Jian He
1454efe5d4
YARN-2992. ZKRMStateStore crashes due to session expiry. Contributed by Karthik Kambatla
2014-12-26 17:59:58 -08:00
Junping Du
40ee4bff65
YARN-2993. Several fixes (missing acl check, error log msg ...) and some refinement in AdminService. (Contributed by Yi Liu)
2014-12-26 06:59:25 -08:00
Jian He
4f18018b7a
YARN-2946. Fixed potential deadlock in RMStateStore. Contributed by Rohith Sharmaks
2014-12-23 22:14:29 -08:00
Jian He
0d89859b51
YARN-2340. Fixed NPE when queue is stopped during RM restart. Contributed by Rohith Sharmaks
2014-12-22 21:53:22 -08:00
Jian He
fdf042dfff
YARN-2920. Changed CapacityScheduler to kill containers on nodes where node labels are changed. Contributed by Wangda Tan
2014-12-22 16:51:15 -08:00
Junping Du
a696fbb001
YARN-2939. Fix new findbugs warnings in hadoop-yarn-common. (Li Lu via junping_du)
2014-12-22 03:06:37 -08:00
Karthik Kambatla
24ee9e3431
YARN-2975. FSLeafQueue app lists are accessed without required locks. (kasha)
2014-12-20 12:17:50 -08:00
Jian He
808cba3821
YARN-2952. Fixed incorrect version check in StateStore. Contributed by Rohith Sharmaks
2014-12-19 16:56:30 -08:00
Karthik Kambatla
a22ffc3188
YARN-2738. [YARN-2574] Add FairReservationSystem for FairScheduler. (Anubhav Dhoot via kasha)
2014-12-19 15:37:12 -08:00
Jason Lowe
0402bada19
YARN-2964. RM prematurely cancels tokens for jobs that submit jobs (oozie). Contributed by Jian He
2014-12-18 23:28:18 +00:00
Junping Du
2b4b0e8847
YARN-2972. DelegationTokenRenewer thread pool never expands. Contributed by Jason Lowe
2014-12-17 17:40:00 -08:00
Karthik Kambatla
f2d150ea12
YARN-2964. FSLeafQueue#assignContainer - document the reason for using both write and read locks. (Tsuyoshi Ozawa via kasha)
2014-12-17 12:03:16 -08:00
Devaraj K
bda748ac3a
YARN-2243. Order of arguments for Preconditions.checkNotNull() is wrong in
...
SchedulerApplicationAttempt ctor. Contributed by Devaraj K.
2014-12-12 12:34:43 +05:30
Karthik Kambatla
a2e07a5456
YARN-2910. FSLeafQueue can throw ConcurrentModificationException. (Wilfred Spiegelenburg via kasha)
2014-12-09 14:00:31 -08:00
Jian He
e69af836f3
YARN-2869. CapacityScheduler should trim sub queue names when parse configuration. Contributed by Wangda Tan
2014-12-05 17:33:39 -08:00
Jason Lowe
4b13082199
YARN-2056. Disable preemption at Queue level. Contributed by Eric Payne
2014-12-05 21:06:48 +00:00
Jian He
258623ff8b
YARN-2301. Improved yarn container command. Contributed by Naganarasimha G R
2014-12-04 12:53:18 -08:00
Karthik Kambatla
799353e2c7
YARN-2874. Dead lock in DelegationTokenRenewer which blocks RM to execute any further apps. (Naganarasimha G R via kasha)
2014-12-03 13:44:41 -08:00
Jian He
392c3aaea8
YARN-2894. Fixed a bug regarding application view acl when RM fails over. Contributed by Rohith Sharmaks
2014-12-02 17:16:35 -08:00
Jian He
52bcefca8b
YARN-2136. Changed RMStateStore to ignore store opearations when fenced. Contributed by Varun Saxena
2014-12-02 10:54:48 -08:00
Jian He
a7fba0bc28
YARN-2765. Added leveldb-based implementation for RMStateStore. Contributed by Jason Lowe
2014-12-01 16:38:25 -08:00
Junping Du
c732ed760e
YARN-2907. SchedulerNode#toString should print all resource detail instead of only memory. (Contributed by Rohith)
2014-12-01 05:38:22 -08:00
Jian He
8a7ca13b13
YARN-2906. CapacitySchedulerPage shows HTML tags for a queue's Active Users. Contributed by Jason Lowe
2014-11-25 16:16:56 -08:00
Jian He
5805a81efb
YARN-2404. Removed ApplicationAttemptState and ApplicationState class in RMStateStore. Contributed by Tsuyoshi OZAWA
2014-11-25 12:48:22 -08:00
Zhijie Shen
e37a4ff0c1
YARN-2697. Remove useless RMAuthenticationHandler. Contributed by Haosong Huang.
2014-11-24 14:58:59 -08:00
Sandy Ryza
a128cca305
YARN-2669. FairScheduler: queue names shouldn't allow periods (Wei Yan via Sandy Ryza)
2014-11-21 16:06:41 -08:00
Karthik Kambatla
3114d4731d
YARN-2604. Scheduler should consider max-allocation-* in conjunction with the largest node. (Robert Kanter via kasha)
2014-11-21 10:32:28 -08:00
Karthik Kambatla
a9a0cc3679
YARN-2315. FairScheduler: Set current capacity in addition to capacity. (Zhihai Xu via kasha)
2014-11-19 20:15:40 -08:00
Karthik Kambatla
c90fb84aaa
YARN-2802. ClusterMetrics to include AM launch and register delays. (Zhihai Xu via kasha)
2014-11-19 19:50:12 -08:00
Jian He
9cb8b75ba5
YARN-2865. Fixed RM to always create a new RMContext when transtions from StandBy to Active. Contributed by Rohith Sharmaks
2014-11-19 19:48:52 -08:00
Karthik Kambatla
2fce6d6141
YARN-2690. [YARN-2574] Make ReservationSystem and its dependent classes independent of Scheduler type. (Anubhav Dhoot via kasha)
2014-11-17 16:45:57 -08:00
Jason Lowe
81c9d17af8
YARN-2414. RM web UI: app page will crash if app is failed before any attempt has been created. Contributed by Wangda Tan
2014-11-17 21:15:48 +00:00
Jian He
9b860661a5
YARN-2432. RMStateStore should process the pending events before close. Contributed by Varun Saxena
2014-11-14 17:24:09 -08:00
Sandy Ryza
1a47f890ba
YARN-2811. In Fair Scheduler, reservation fulfillments shouldn't ignore max share (Siqi Li via Sandy Ryza)
2014-11-14 15:18:56 -08:00
Jian He
d005404ef7
YARN-2856. Fixed RMAppImpl to handle ATTEMPT_KILLED event at ACCEPTED state on app recovery. Contributed by Rohith Sharmaks
2014-11-13 15:34:26 -08:00
Vinod Kumar Vavilapalli
3651fe1b08
YARN-2853. Fixed a bug in ResourceManager causing apps to hang when the user kill request races with ApplicationMaster finish. Contributed by Jian He.
2014-11-13 08:13:03 -08:00
Jason Lowe
f8aefa5e9c
YARN-2780. Log aggregated resource allocation in rm-appsummary.log. Contributed by Eric Payne
2014-11-12 17:01:15 +00:00
Vinod Kumar Vavilapalli
e76faebc95
YARN-2834. Fixed ResourceManager to ignore token-renewal failures on recovery consistent with the (somewhat incorrect) behaviour in the non-recovery case. Contributed by Jian He.
2014-11-09 18:56:58 -08:00
Zhijie Shen
9a4e0d343e
YARN-2505. Supported get/add/remove/change labels in RM REST API. Contributed by Craig Welch.
2014-11-07 20:35:46 -08:00
Vinod Kumar Vavilapalli
df36edf751
YARN-2826. Fixed user-groups mappings' refresh bug caused by YARN-2826. Contributed by Wangda Tan.
2014-11-07 19:43:55 -08:00
Vinod Kumar Vavilapalli
2ac1be7dec
YARN-2824. Fixed Capacity Scheduler to not crash when some node-labels are not mapped to queues by making default capacities per label to be zero. Contributed by Wangda Tan.
2014-11-07 10:39:37 -08:00
Vinod Kumar Vavilapalli
a5657182a7
YARN-2823. Fixed ResourceManager app-attempt state machine to inform schedulers about previous finished attempts of a running appliation to avoid expectation mismatch w.r.t transferred containers. Contributed by Jian He.
2014-11-07 09:28:36 -08:00
Vinod Kumar Vavilapalli
a3839a9fbf
YARN-2744. Fixed CapacityScheduler to validate node-labels correctly against queues. Contributed by Wangda Tan.
2014-11-06 17:28:12 -08:00
Jian He
395275af86
YARN-2579. Fixed a deadlock issue when EmbeddedElectorService and FatalEventDispatcher try to transition RM to StandBy at the same time. Contributed by Rohith Sharmaks
2014-11-05 16:59:54 -08:00
Vinod Kumar Vavilapalli
834e931d8e
YARN-2805. Fixed ResourceManager to load HA configs correctly before kerberos login. Contributed by Wangda Tan.
2014-11-05 15:30:33 -08:00
Karthik Kambatla
b2cd269802
YARN-2010. Handle app-recovery failures gracefully. (Jian He and Karthik Kambatla via kasha)
2014-11-04 17:45:24 -08:00
Vinod Kumar Vavilapalli
ec6cbece8e
YARN-2795. Fixed ResourceManager to not crash loading node-label data from HDFS in secure mode. Contributed by Wangda Tan.
2014-11-03 13:44:06 -08:00
Vinod Kumar Vavilapalli
d1828d9443
YARN-2779. Fixed ResourceManager to not require delegation tokens for communicating with Timeline Service. Contributed by Zhijie Shen.
2014-10-30 23:16:19 -07:00
Vinod Kumar Vavilapalli
e0233c16eb
YARN-2698. Moved some node label APIs to be correctly placed in client protocol. Contributed by Wangda Tan.
2014-10-30 22:59:31 -07:00
Karthik Kambatla
179cab81e0
YARN-2712. TestWorkPreservingRMRestart: Augment FS tests with queue and headroom checks. (Tsuyoshi Ozawa via kasha)
2014-10-30 00:29:07 -07:00
Karthik Kambatla
782971ae7a
YARN-2742. FairSchedulerConfiguration should allow extra spaces between value and unit. (Wei Yan via kasha)
2014-10-29 10:24:57 -07:00
Jian He
d5e0a09721
YARN-2503. Added node lablels in web UI. Contributed by Wangda Tan
2014-10-28 17:57:54 -07:00
Vinod Kumar Vavilapalli
a16d022ca4
YARN-2704. Changed ResourceManager to optionally obtain tokens itself for the sake of localization and log-aggregation for long-running services. Contributed by Jian He.
2014-10-27 15:49:47 -07:00