Varun Vasudev
308d63f382
YARN-4307. Display blacklisted nodes for AM container in the RM web UI. Contributed by Naganarasimha G R.
2016-02-04 13:32:54 +05:30
Varun Vasudev
1adb64e09b
YARN-4625. Make ApplicationSubmissionContext and ApplicationSubmissionContextInfo more consistent. Contributed by Xuan Gong.
2016-02-03 16:26:28 +05:30
Wangda Tan
9875325d5c
YARN-4340. Add list API to reservation system. (Sean Po via wangda)
2016-02-02 10:17:33 +08:00
Jason Lowe
ed55950164
YARN-3102. Decommissioned Nodes not listed in Web UI. Contributed by Kuhu Shukla
2016-02-01 23:15:26 +00:00
Rohith Sharma K S
2673cbaf55
YARN-4615. Fix random test failure in TestAbstractYarnScheduler#testResourceRequestRecoveryToTheRightAppAttempt. (Sunil G via rohithsharmaks)
2016-02-01 10:43:56 +05:30
Jason Lowe
772ea7b41b
YARN-4428. Redirect RM page to AHS page when AHS turned on and RM page is not available. Contributed by Chang Li
2016-01-29 21:48:54 +00:00
Jian He
f4a57d4a53
YARN-4617. LeafQueue#pendingOrderingPolicy should always use fixed ordering policy instead of using same as active applications ordering policy. Contributed by Rohith Sharma K S
2016-01-29 12:22:23 -08:00
Devaraj K
a277bdc9ed
YARN-4411. RMAppAttemptImpl#createApplicationAttemptReport throws IllegalArgumentException. Contributed by Bibin A Chundatt and yarntime.
2016-01-29 13:51:37 +05:30
Jian He
7f46636495
YARN-4519. Potential deadlock of CapacityScheduler between decrease container and assign containers. Contributed by Meng Ding
2016-01-28 14:51:00 -08:00
Rohith Sharma K S
ef343be82b
YARN-4633. Fix random test failure in TestRMRestart#testRMRestartAfterPreemption. (Bibin A Chundatt via rohithsharmaks)
2016-01-28 21:53:45 +05:30
Karthik Kambatla
fb238d7e5d
YARN-4462. FairScheduler: Disallow preemption from a queue. (Tao Jie via kasha)
2016-01-27 12:29:06 -08:00
Rohith Sharma K S
c01bee0108
YARN-4573. Fix test failure in TestRMAppTransitions#testAppRunningKill and testAppKilledKilled. (Takashi Ohnishi via rohithsharmaks)
2016-01-27 08:23:02 +05:30
rohithsharmaks
10dc2c0493
YARN-4613. Fix test failure in TestClientRMService#testGetClusterNodes. (Takashi Ohnishi via rohithsharmaks)
2016-01-24 23:36:15 +05:30
rohithsharmaks
99829eb221
YARN-4614. Fix random failure in TestApplicationPriority#testApplicationPriorityAllocationWithChangeInPriority. (Sunil G via rohithsharmaks)
2016-01-23 07:56:57 +05:30
rohithsharmaks
d6258b33a7
YARN-4497. RM might fail to restart when recovering apps whose attempts are missing. (Jun Gong via rohithsharmaks)
2016-01-22 20:27:38 +05:30
Akira Ajisaka
8f58f742ae
YARN-4605. Spelling mistake in the help message of "yarn applicationattempt" command. Contributed by Weiwei Yang.
2016-01-22 19:43:06 +09:00
Rohith Sharma K S
e30668106d
YARN-4584. RM startup failure when AM attempts greater than max-attempts. (Bibin A Chundatt via rohithsharmaks)
2016-01-22 10:14:46 +05:30
Jason Lowe
468a53b22f
YARN-4610. Reservations continue looking for one app causes other apps to starve. Contributed by Jason Lowe
2016-01-21 18:31:29 +00:00
Wangda Tan
5ff5f67332
YARN-4557. Fix improper Queues sorting in PartitionedQueueComparator when accessible-node-labels=*. (Naganarasimha G R via wangda)
2016-01-21 11:21:06 +08:00
Xuan
890a2ebd1a
YARN-4559. Make leader elector and zk store share the same curator client. Contributed by Jian He
2016-01-20 14:48:10 -08:00
Jian He
edc43a9097
YARN-4565. Fix a bug that leads to AM resource limit not honored when sizeBasedWeight enabled for FairOrderingPolicy. Contributed by Wangda Tan
2016-01-18 21:04:36 -08:00
Wangda Tan
a44ce3f14f
YARN-4502. Fix two AM containers get allocated when AM restart. (Vinod Kumar Vavilapalli via wangda)
2016-01-19 09:30:04 +08:00
Wangda Tan
150f5ae034
Revert "YARN-4502. Fix two AM containers get allocated when AM restart. (Vinod Kumar Vavilapalli via wangda)"
This reverts commit 3fe5728563.
Conflicts:
hadoop-yarn-project/CHANGES.txt
2016-01-19 09:27:36 +08:00
Karthik Kambatla
d40859fab1
YARN-4526. Make SystemClock singleton so AppSchedulingInfo could use it. (kasha)
2016-01-18 10:58:14 +01:00
Wangda Tan
3fe5728563
YARN-4502. Fix two AM containers get allocated when AM restart. (Vinod Kumar Vavilapalli via wangda)
(cherry picked from commit 805a9ed85e)
2016-01-18 17:06:05 +08:00
Wangda Tan
adf260a728
Revert "YARN-4502. Fix two AM containers get allocated when AM restart. (Vinod Kumar Vavilapalli via wangda)"
This reverts commit 805a9ed85e.
2016-01-18 16:50:45 +08:00
Wangda Tan
b08ecf5c75
YARN-4304. AM max resource configuration per partition to be displayed/updated correctly in UI and in various partition related metrics. (Sunil G via wangda)
2016-01-18 11:11:32 +08:00
Wangda Tan
805a9ed85e
YARN-4502. Fix two AM containers get allocated when AM restart. (Vinod Kumar Vavilapalli via wangda)
2016-01-18 11:04:25 +08:00
Wangda Tan
9523648d57
YARN-4538. QueueMetrics pending cores and memory metrics wrong. (Bibin A Chundatt via wangda)
2016-01-18 10:57:14 +08:00
rohithsharmaks
f7736f464f
YARN-4389. Allow application to enable or disable am blacklisting. (Sunil G via rohithsharmaks)
2016-01-15 21:38:26 +05:30
Karthik Kambatla
9d04f26d4c
YARN-3446. FairScheduler headroom calculation should exclude nodes in the blacklist. (Zhihai Xu via kasha)
2016-01-14 08:33:23 -08:00
Wangda Tan
c0537bcd2c
YARN-4571. Make app id/name available to the yarn authorizer provider for better auditing. (Jian He via wangda)
2016-01-13 13:18:31 +08:00
Wangda Tan
9e792da014
YARN-4582. Label-related invalid resource request exception should be able to properly handled by application. (Bibin A Chundatt via wangda)
2016-01-12 12:53:31 +08:00
Jian He
b8942be888
YARN-4537. Pull out priority comparison from fifocomparator and use compound comparator for FifoOrdering policy. Contributed by Rohith Sharma K S
2016-01-11 16:44:28 -08:00
Jian He
109e528ef5
YARN-4479. Change CS LeafQueue pendingOrderingPolicy to honor recovered apps. Contributed by Rohith Sharma K S
2016-01-08 15:51:10 -08:00
Xuan
89022f8d4b
YARN-4438. Implement RM leader election with curator. Contributed by Jian He
2016-01-07 14:33:06 -08:00
Junping Du
c1462a67ff
YARN-4546. ResourceManager crash due to scheduling opportunity overflow. Contributed by Jason Lowe.
2016-01-06 05:49:24 -08:00
Wangda Tan
8310b2e9ff
YARN-4522. Queue acl can be checked at app submission. (Jian He via wangda)
2015-12-30 15:30:12 -08:00
Jian He
5273413411
YARN-3480. Remove attempts that are beyond max-attempt limit from state store. Contributed by Jun Gong
2015-12-29 15:58:39 -08:00
Wangda Tan
561abb9fee
YARN-4315. NaN in Queue percentage for cluster apps page. (Bibin A Chundatt via wangda)
2015-12-29 13:28:00 -08:00
Jian He
d0a22bae9b
YARN-4417. Make RM and Timeline-server REST APIs more consistent. Contributed by Wangda Tan
2015-12-28 15:52:45 -08:00
Karthik Kambatla
0af492b4bd
YARN-4156. TestAMRestart#testAMBlacklistPreventsRestartOnSameNode assumes CapacityScheduler. (Anubhav Dhoot via kasha)
2015-12-23 17:52:36 -08:00
Arun Suresh
e88422df45
YARN-4477. FairScheduler: Handle condition which can result in an infinite loop in attemptScheduling. (Tao Jie via asuresh)
2015-12-21 22:41:09 -08:00
Wangda Tan
bc038b382c
YARN-4454. NM to nodelabel mapping going wrong after RM restart. (Bibin A Chundatt via wangda)
2015-12-21 11:30:13 -08:00
Jian He
85c2466048
YARN-4164. Changed updateApplicationPriority API to return the updated application priority. Contributed by Rohith Sharma K S
2015-12-18 14:13:48 -08:00
Junping Du
1de56b0448
YARN-3226. UI changes for decommissioning node. Contributed by Sunil G.
2015-12-17 15:20:17 -08:00
Wangda Tan
7faa406f27
YARN-4225. Add preemption status to yarn queue -status for capacity scheduler. (Eric Payne via wangda)
2015-12-16 13:19:40 -08:00
Wangda Tan
79c41b1d83
YARN-4293. ResourceUtilization should be a part of yarn node CLI. (Sunil G via wangda)
2015-12-16 13:18:19 -08:00
Junping Du
50bd067e1d
YARN-4452. NPE when submit Unmanaged application. Contributed by Naganarasimha G R.
2015-12-16 10:57:39 -08:00
Zhihai Xu
2aaed10327
YARN-4440. FSAppAttempt#getAllowedLocalityLevelByTime should init the lastScheduler time. Contributed by Lin Yiqun
2015-12-15 00:17:21 -08:00
Wangda Tan
07b0fb996a
YARN-4418. AM Resource Limit per partition can be updated to ResourceUsage as well. (Sunil G via wangda)
2015-12-14 11:24:30 -08:00
Wangda Tan
6cb0af3c39
YARN-3946. Update exact reason as to why a submitted app is in ACCEPTED state to app's diagnostic message. (Naganarasimha G R via wangda)
2015-12-14 10:52:46 -08:00
=
c25a635459
YARN-4248. REST API for submit/update/delete Reservations. (curino)
2015-12-07 13:33:28 -08:00
Xuan
4546c7582b
YARN-4392. ApplicationCreatedEvent event time resets after RM restart/failover. Contributed by Naganarasimha G R and Xuan Gong
2015-12-07 12:24:55 -08:00
Arun Suresh
742632e346
YARN-4358. Reservation System: Improve relationship between SharingPolicy and ReservationAgent. (Carlo Curino via asuresh)
2015-12-05 21:26:16 -08:00
Jian He
755dda8dd8
YARN-4405. Support node label store in non-appendable file system. Contributed by Wangda Tan
2015-12-03 17:45:31 -08:00
Wangda Tan
a2c3bfc8c1
YARN-4292. ResourceUtilization should be a part of NodeInfo REST API. (Sunil G via wangda)
2015-12-03 14:28:32 -08:00
Jian He
9f77ccad73
YARN-3840. Resource Manager web ui issue when sorting application by id (with application having id > 9999). Contributed by Mohammad Shahid Khan and Varun Saxena
2015-12-03 12:48:50 -08:00
Karthik Kambatla
52948bb20b
YARN-3980. Plumb resource-utilization info in node heartbeat through to the scheduler. (Inigo Goiri via kasha)
2015-11-24 13:47:17 +05:30
Jian He
8676a118a1
YARN-4349. Support CallerContext in YARN. Contributed by Wangda Tan
2015-11-23 17:19:48 -08:00
Jason Lowe
d36b6e045f
YARN-4344. NMs reconnecting with changed capabilities can lead to wrong cluster resource calculations. Contributed by Varun Vasudev
2015-11-23 20:30:26 +00:00
Arun Suresh
da1016365a
YARN-3454. Add efficient merge operation to RLESparseResourceAllocation (Carlo Curino via asuresh)
2015-11-21 09:59:41 -08:00
Wangda Tan
2346fa3141
YARN-3769. Consider user limit when calculating total pending resource for preemption policy in Capacity Scheduler. (Eric Payne via wangda)
2015-11-20 15:55:50 -08:00
Arun Suresh
6a61928fb7
YARN-4184. Remove update reservation state api from state store as its not used by ReservationSystem (Sean Po via asuresh)
2015-11-17 15:50:34 -08:00
Jian He
fcd7888029
Revert "YARN-3840. Resource Manager web ui issue when sorting application by id (with application having id > 9999) Contributed by Mohammad Shahid Khan"
This reverts commit 8fbea531d7.
Conflicts:
hadoop-yarn-project/CHANGES.txt
2015-11-16 20:18:44 -08:00
Wangda Tan
7f55a18071
YARN-4347. Resource manager fails with Null pointer exception. (Jian He via wangda)
2015-11-12 11:23:40 -08:00
Wangda Tan
796638d9bc
YARN-4287. Capacity Scheduler: Rack Locality improvement (Nathan Roberts via wangda)
2015-11-12 11:09:37 -08:00
Vinod Kumar Vavilapalli (I am also known as @tshooter.)
6351d3fa63
YARN-4183. Reverting the patch to fix behaviour change.
Revert "YARN-4183. Enabling generic application history forces every job to get a timeline service delegation token (jeagles)"
This reverts commit c293c58954.
2015-11-11 10:40:43 -08:00
Jian He
8fbea531d7
YARN-3840. Resource Manager web ui issue when sorting application by id (with application having id > 9999) Contributed by Mohammad Shahid Khan
2015-11-09 10:43:45 -08:00
Jian He
e5b1733e04
YARN-4127. RM fail with noAuth error if switched from failover to non-failover. Contributed by Varun Saxena
2015-10-29 15:42:57 -07:00
Jonathan Eagles
c293c58954
YARN-4183. Enabling generic application history forces every job to get a timeline service delegation token (jeagles)
2015-10-29 16:41:10 -05:00
Wangda Tan
56e4f6237a
YARN-3216. Max-AM-Resource-Percentage should respect node labels. (Sunil G via wangda)
2015-10-26 16:44:39 -07:00
Wangda Tan
6f606214e7
YARN-4169. Fix racing condition of TestNodeStatusUpdaterForLabels. (Naganarasimha G R via wangda)
2015-10-26 16:36:34 -07:00
Wangda Tan
3cc73773eb
YARN-4285. Display resource usage as percentage of queue and cluster in the RM UI (Varun Vasudev via wangda)
2015-10-26 13:07:39 -07:00
Jason Lowe
33a03af3c3
YARN-4284. condition for AM blacklisting is too narrow. Contributed by Sangjin Lee
2015-10-26 19:53:03 +00:00
Arun Suresh
ab8eb8770c
YARN-3738. Add support for recovery of reserved apps running under dynamic queues (subru via asuresh)
2015-10-24 22:53:10 -07:00
Jason Lowe
d3a34a4f38
YARN-4041. Slow delegation token renewal can severely prolong RM recovery. Contributed by Sunil G
2015-10-23 20:57:01 +00:00
Ming Ma
934d96a334
YARN-2913. Fair scheduler should have ability to set MaxResourceDefault for each queue. (Siqi Li via mingma)
2015-10-23 08:36:33 -07:00
Zhihai Xu
960201b79b
YARN-4256. YARN fair scheduler vcores with decimal values. Contributed by Jun Gong
2015-10-22 12:28:03 -07:00
Anubhav Dhoot
2798723a54
YARN-3739. Add reservation system recovery to RM recovery process. Contributed by Subru Krishnan.
2015-10-22 06:51:00 -07:00
Arun Suresh
506d1b1dbc
YARN-3985. Make ReservationSystem persist state using RMStateStore reservation APIs. (adhoot via asuresh)
2015-10-20 16:46:14 -07:00
Arun Suresh
7e2837f830
YARN-4270. Limit application resource reservation on nodes for non-node/rack specific requests (asuresh)
2015-10-19 20:00:38 -07:00
Jian He
f9da5cdb2b
YARN-4170. AM need to be notified with priority in AllocateResponse. Contributed by Sunil G
2015-10-16 15:26:27 -07:00
Wangda Tan
4337b263aa
YARN-4162. CapacityScheduler: Add resource usage by partition and queue capacity by partition to REST API. (Naganarasimha G R via wangda)
2015-10-16 15:06:28 -07:00
Jian He
cf23f2c2b5
YARN-4000. RM crashes with NPE if leaf queue becomes parent queue during restart. Contributed by Varun Saxena
2015-10-15 17:12:46 -07:00
Jian He
9849c8b386
YARN-4230. RM crashes with NPE when increasing container resource if there is no headroom left. Contributed by Meng Ding
2015-10-12 11:51:33 -07:00
Zhihai Xu
049c6e8dc0
YARN-4201. AMBlacklist does not work for minicluster. Contributed by Jun Gong.
2015-10-12 00:14:25 -07:00
Devaraj K
db93047881
YARN-3964. Support NodeLabelsProvider at Resource Manager side. Contributed by Dian Fu.
2015-10-11 11:21:29 +05:30
Wangda Tan
def374e666
YARN-4140. RM container allocation delayed incase of app submitted to Nodelabel partition. (Bibin A Chundatt via wangda)
2015-10-09 16:38:59 -07:00
Jason Lowe
a0bca2b5ad
YARN-261. Ability to fail AM attempts. Contributed by Andrey Klochkov and Rohith Sharma K S
2015-10-09 14:17:38 +00:00
Rohith Sharma K S
8f195387a4
YARN-4235. FairScheduler PrimaryGroup does not handle empty groups returned for a user. (Anubhav Dhoot via rohithsharmaks)
2015-10-09 10:09:26 +05:30
Rohith Sharma K S
9156fc60c6
YARN-4209. RMStateStore FENCED state doesn’t work due to updateFencedState called by stateMachine.doTransition. (Zhihai Xu via rohithsharmaks)
2015-10-07 09:34:59 +05:30
Wangda Tan
29a582ada0
YARN-4215. RMNodeLabels Manager Need to verify and replace node labels for the only modified Node Label Mappings in the request. (Naganarasimha G R via wangda)
2015-10-06 11:56:04 -07:00
Xuan
8f08532bde
YARN-1897. CLI and core support for signal container functionality. Contributed by Ming Ma
2015-10-02 18:50:47 -07:00
Anubhav Dhoot
9735afe967
YARN-4180. AMLauncher does not retry on failures when talking to NM. (adhoot)
2015-09-28 16:13:41 -07:00
Jason Lowe
9f53a95ff6
YARN-4141. Runtime Application Priority change should not throw exception for applications at finishing states. Contributed by Sunil G
2015-09-28 22:55:20 +00:00
Anubhav Dhoot
fb2e525c07
YARN-4204. ConcurrentModificationException in FairSchedulerQueueInfo. (adhoot)
2015-09-28 09:05:45 -07:00
Rohith Sharma K S
a9aafad12b
YARN-4044. Running applications information changes such as movequeue is not published to TimeLine server. (Sunil G via rohithsharmaks)
2015-09-24 12:13:22 +05:30
Jian He
89cab1ba5f
YARN-1651. CapacityScheduler side changes to support container resize. Contributed by Wangda Tan
2015-09-23 13:29:38 -07:00
Jian He
5f5a968d65
YARN-3867. ContainerImpl changes to support container resizing. Contributed by Meng Ding
2015-09-23 13:29:37 -07:00
Jian He
83a18add10
YARN-1449. AM-NM protocol changes to support container resizing. Contributed by Meng Ding & Wangda Tan
2015-09-23 13:29:36 -07:00
Arun Suresh
94dec5a916
YARN-3920. FairScheduler container reservation on a node should be configurable to limit it to large containers (adhoot via asuresh)
2015-09-18 14:02:55 -07:00
Wangda Tan
9bc913a35c
YARN-3212. RMNode State Transition Update with DECOMMISSIONING state. (Junping Du via wangda)
2015-09-18 10:04:17 -07:00
Rohith Sharma K S
723c31d45b
YARN-4135. Improve the assertion message in MockRM while failing after waiting for the state.(Nijel S F via rohithsharmaks)
2015-09-18 08:44:10 +05:30
Jian He
6c6e734f0b
YARN-4034. Render cluster Max Priority in scheduler metrics in RM web UI. Contributed by Rohith Sharma K S
2015-09-17 14:55:50 +08:00
Jian He
452079af8b
YARN-4078. Add getPendingResourceRequestForAttempt in YarnScheduler interface. Contributed by Naganarasimha G R
2015-09-16 14:59:20 +08:00
Wangda Tan
ae5308fe1d
YARN-3717. Expose app/am/queue's node-label-expression to RM web UI / CLI / REST-API. (Naganarasimha G R via wangda)
2015-09-15 11:40:50 -07:00
Junping Du
73e3a49eb0
YARN-313. Add Admin API for supporting node resource configuration in command line. (Contributed by Inigo Goiri, Kenji Kikushima and Junping Du)
2015-09-15 07:56:47 -07:00
Jian He
5468baa80a
YARN-3635. Refactored current queue mapping implementation in CapacityScheduler to use a generic PlacementManager framework. Contributed by Wangda Tan
2015-09-15 15:39:20 +08:00
Jian He
e1b1d7e4ae
YARN-4126. RM should not issue delegation tokens in unsecure mode. Contributed by Bibin A Chundatt
2015-09-14 14:09:19 +08:00
Karthik Kambatla
332b520a48
YARN-3697. FairScheduler: ContinuousSchedulingThread can fail to shutdown. (Zhihai Xu via kasha)
2015-09-13 18:07:43 -07:00
Karthik Kambatla
81df7b586a
YARN-2005. Blacklisting support for scheduling AMs. (Anubhav Dhoot via kasha)
2015-09-13 17:03:15 -07:00
Robert Kanter
ea4bb2749f
YARN-4145. Make RMHATestBase abstract so its not run when running all tests under that namespace (adhoot via rkanter)
2015-09-11 11:46:10 -07:00
Wangda Tan
bcc85e3bab
YARN-4024. YARN RM should avoid unnecessary resolving IP when NMs doing heartbeat. (Hong Zhiguo via wangda)
2015-09-04 15:13:53 -07:00
Jason Lowe
6eaca2e363
YARN-4105. Capacity Scheduler headroom for DRF is wrong. Contributed by Chang Li
2015-09-04 15:30:53 +00:00
Varun Vasudev
40d222e862
YARN-4103. RM WebServices missing scheme for appattempts logLinks. Contributed by Jonathan Eagles.
2015-09-04 14:31:51 +05:30
Varun Vasudev
b469ac531a
YARN-3970. Add REST api support for Application Priority. Contributed by Naganarasimha G R.
2015-09-03 16:40:10 +05:30
Jian He
09c64ba1ba
YARN-4101. RM should print alert messages if Zookeeper and Resourcemanager gets connection issue. Contributed by Xuan Gong
2015-09-02 17:45:23 -07:00
Rohith Sharma K S
7d6687fe76
YARN-3893. Both RM in active state when Admin#transitionToActive failure from refeshAll() (Bibin A Chundatt via rohithsharmaks)
2015-09-02 15:22:48 +05:30
Varun Vasudev
bf669b6d9f
YARN-4082. Container shouldn't be killed when node's label updated. Contributed by Wangda Tan.
2015-09-01 14:19:11 +05:30
Junping Du
beb65c9465
YARN-1556. NPE getting application report with a null appId. Contributed by Weiwei Yang.
2015-08-28 05:57:34 -07:00
Jian He
a9c8ea71aa
YARN-3250. Support admin cli interface in for Application Priority. Contributed by Rohith Sharma K S
2015-08-27 13:25:53 -07:00
Jian He
57c7ae1aff
YARN-4014. Support user cli interface in for Application Priority. Contributed by Rohith Sharma K S
2015-08-24 20:36:44 -07:00
Rohith Sharma K S
feaf034994
YARN-3896. RMNode transitioned from RUNNING to REBOOTED because its response id has not been reset synchronously. (Jun Gong via rohithsharmaks)
2015-08-24 11:25:07 +05:30
Xuan
37e1c3d82a
YARN-221. NM should provide a way for AM to tell it not to aggregate logs. Contributed by Ming Ma
2015-08-22 16:25:24 -07:00
Rohith Sharma K S
22de7c1dca
YARN-3986. getTransferredContainers in AbstractYarnScheduler should be present in YarnScheduler interface
2015-08-21 10:51:11 +05:30
Zhihai Xu
3a76a010b8
YARN-3857: Memory leak in ResourceManager with SIMPLE mode. Contributed by mujunchao.
2015-08-18 10:36:40 -07:00
Jian He
e5003be907
YARN-4026. Refactored ContainerAllocator to accept a list of priorities rather than a single priority. Contributed by Wangda Tan
2015-08-12 15:07:50 -07:00
rohithsharmaks
1c12adb71f
YARN-4023. Publish Application Priority to TimelineServer. (Sunil G via rohithsharmaks)
2015-08-12 14:45:41 +05:30
Xuan
3ae716fa69
YARN-3999. RM hangs on draining events. Contributed by Jian He
2015-08-11 18:25:11 -07:00
Jian He
fa1d84ae27
YARN-3887. Support changing Application priority during runtime. Contributed by Sunil G
2015-08-10 20:51:54 -07:00
Wangda Tan
cf9d3c9256
YARN-3873. PendingApplications in LeafQueue should also use OrderingPolicy. (Sunil G via wangda)
2015-08-10 14:54:55 -07:00
Rohith Sharma K S
b6265d39c5
YARN-3948. Display Application Priority in RM Web UI.(Sunil G via rohithsharmaks)
2015-08-07 10:43:41 +05:30
Carlo Curino
8572a5a14b
YARN-3974. Refactor the reservation system test cases to use parameterized base test. (subru via curino)
2015-08-02 01:55:31 -07:00
Arun Suresh
154c9d2e42
YARN-3961. Expose pending, running and reserved containers of a queue in REST api and yarn top (adhoot via asuresh)
2015-08-05 23:14:14 -07:00
rohithsharmaks
df9e7280db
YARN-3992. TestApplicationPriority.testApplicationPriorityAllocation fails intermittently. (Contributed by Sunil G)
2015-08-06 10:43:37 +05:30
Jian He
ba2313d614
YARN-3983. Refactored CapacityScheduleri#FiCaSchedulerApp to easier extend container allocation logic. Contributed by Wangda Tan
2015-08-05 13:47:40 -07:00
Arun Suresh
f271d37735
YARN-3736. Add RMStateStore apis to store and load accepted reservations for failover (adhoot via asuresh)
2015-08-05 12:57:12 -07:00
Xuan
0306d902f5
YARN-3543. ApplicationReport should be able to tell whether the Application is AM managed or not. Contributed by Rohith Sharma K S
2015-08-03 15:46:00 -07:00
Jonathan Eagles
3cd02b9522
YARN-3978. Configurably turn off the saving of container info in Generic AHS (Eric Payne via jeagles)
2015-08-03 10:38:05 -05:00
Jason Lowe
32e490b6c0
YARN-3990. AsyncDispatcher may overloaded with RMAppNodeUpdateEvent when Node is connected/disconnected. Contributed by Bibin A Chundatt
2015-07-31 17:37:24 +00:00
Zhihai Xu
ab80e27703
YARN-433. When RM is catching up with node updates then it should not expire acquired containers. Contributed by Xuan Gong
2015-07-30 21:57:11 -07:00
Wangda Tan
91b42e7d6e
YARN-3971. Skip RMNodeLabelsManager#checkRemoveFromClusterNodeLabelsOfQueue on nodelabel recovery. (Bibin A Chundatt via wangda)
2015-07-30 10:00:31 -07:00
ccurino
156f24ead0
YARN-3656. LowCost: A Cost-Based Placement Agent for YARN Reservations. (Jonathan Yaniv and Ishai Menache via curino)
2015-07-25 07:39:47 -07:00
Jian He
83fe34ac08
YARN-3026. Move application-specific container allocation logic from LeafQueue to FiCaSchedulerApp. Contributed by Wangda Tan
2015-07-24 14:00:25 -07:00
Karthik Kambatla
d19d187753
YARN-3957. FairScheduler NPE In FairSchedulerQueueInfo causing scheduler page to return 500. (Anubhav Dhoot via kasha)
2015-07-24 11:44:37 -07:00
carlo curino
0fcb4a8cf2
YARN-3969. Allow jobs to be submitted to reservation that is active but does not have any allocations. (subru via curino)
2015-07-23 19:33:59 -07:00
Robert Kanter
1d3026e7b3
YARN-3900. Protobuf layout of yarn_security_token causes errors in other protos that include it (adhoot via rkanter)
2015-07-23 14:46:54 -07:00
Wangda Tan
3bba180051
YARN-3941. Proportional Preemption policy should try to avoid sending duplicate PREEMPT_CONTAINER event to scheduler. (Sunil G via wangda)
2015-07-23 10:07:57 -07:00
Wangda Tan
76ec26de80
YARN-3932. SchedulerApplicationAttempt#getResourceUsageReport and UserInfo should based on total-used-resources. (Bibin A Chundatt via wangda)
2015-07-22 11:54:02 -07:00
Wangda Tan
c39ca541f4
YARN-2003. Support for Application priority : Changes in RM and Capacity Scheduler. (Sunil G via wangda)
2015-07-21 09:57:23 -07:00
Arun Suresh
9b272ccae7
YARN-3535. Scheduler must re-request container resources when RMContainer transitions from ALLOCATED to KILLED (rohithsharma and peng.zhang via asuresh)
2015-07-17 04:31:34 -07:00
Wangda Tan
3540d5fe4b
YARN-3885. ProportionalCapacityPreemptionPolicy doesn't preempt if queue is more than 2 level. (Ajith S via wangda)
2015-07-16 16:13:32 -07:00
Arun Suresh
ac94ba3e18
YARN-3453. Ensure preemption logic in FairScheduler uses DominantResourceCalculator in DRF queues to prevent unnecessary thrashing. (asuresh)
2015-07-14 00:23:55 -07:00
Wangda Tan
5ed1fead6b
YARN-3894. RM startup should fail for wrong CS xml NodeLabel capacity configuration. (Bibin A Chundatt via wangda)
2015-07-12 21:52:11 -07:00
Wangda Tan
1df39c1efc
YARN-3849. Too much of preemption activity causing continuous killing of containers across queues. (Sunil G via wangda)
2015-07-11 10:26:46 -07:00
Zhijie Shen
1ea36299a4
YARN-3116. RM notifies NM whether a container is an AM container or normal task container. Contributed by Giovanni Matteo Fumarola.
2015-07-10 18:58:10 -07:00
Ming Ma
08244264c0
YARN-3445. Cache runningApps in RMNode for getting running apps on given NodeId. (Junping Du via mingma)
2015-07-10 08:30:10 -07:00
carlo curino
0e602fa3a1
YARN-3800. Reduce storage footprint for ReservationAllocation. Contributed by Anubhav Dhoot.
2015-07-09 16:51:59 -07:00
Wangda Tan
0e4b06690f
YARN-3508. Prevent processing preemption events on the main RM dispatcher. (Varun Saxena via wangda)
2015-07-01 17:32:22 -07:00
Devaraj K
80a68d6056
YARN-3830. AbstractYarnScheduler.createReleaseCache may try to clean a null attempt. Contributed by nijel.
2015-07-01 19:03:44 +05:30
Xuan
fe6c1bd73a
YARN-2871. TestRMRestart#testRMRestartGetApplicationList sometime fails in trunk. Contributed by zhihai xu
2015-06-26 19:43:59 -07:00
Xuan
5b5bb8dcdc
YARN-3802. Two RMNodes for the same NodeId are used in RM sometimes after NM is reconnected. Contributed by zhihai xu
2015-06-18 14:37:49 -07:00
Xuan
a826d432f9
YARN-3804. Both RM are on standBy state when kerberos user not in yarn.admin.acl. Contributed by Varun Saxena
2015-06-17 16:23:27 -07:00
Devaraj K
d8dcfa98e3
YARN-3794. TestRMEmbeddedElector fails because of ambiguous LOG reference. Contributed by Chengbing Liu.
2015-06-12 13:42:49 +05:30
Xuan
5583f88bf7
YARN-3785. Support for Resource as an argument during submitApp call in MockRM test class. Contributed by Sunil G
2015-06-10 21:40:48 -07:00
Jian He
960b8f19ca
YARN-2716. Refactor ZKRMStateStore retry code with Apache Curator. Contributed by Karthik Kambatla
2015-06-08 14:50:58 -07:00
Karthik Kambatla
bd69ea408f
YARN-3655. FairScheduler: potential livelock due to maxAMShare limitation and container reservation. (Zhihai Xu via kasha)
2015-06-07 11:37:52 -07:00
Xuan
3e000a919f
YARN-1462. AHS API and other AHS changes to handle tags for completed MR jobs. Contributed by Xuan Gong
2015-06-05 12:48:52 -07:00
Karthik Kambatla
75885852cc
YARN-3259. FairScheduler: Trigger fairShare updates on node events. (Anubhav Dhoot via kasha)
2015-06-05 09:39:41 -07:00
Jian He
6ad4e59cfc
YARN-3764. CapacityScheduler should forbid moving LeafQueue from one parent to another. Contributed by Wangda Tan
2015-06-04 10:52:59 -07:00
Wangda Tan
ebd797c48f
YARN-3733. Fix DominantRC#compare() does not work as expected if cluster resource is empty. (Rohith Sharmaks via wangda)
2015-06-04 10:22:57 -07:00
Junping Du
d7e7f6aa03
YARN-41. The RM should handle the graceful shutdown of the NM. Contributed by Devaraj K.
2015-06-04 04:59:27 -07:00
Xuan
5766a04428
YARN-3749. We should make a copy of configuration when init MiniYARNCluster with multiple RMs. Contributed by Chun Chen
2015-06-03 17:20:15 -07:00
Zhijie Shen
bc85959edd
Revert "YARN-1462. Made RM write application tags to timeline server and exposed them to users via generic history web UI and REST API. Contributed by Xuan Gong."
This reverts commit 4a9ec1a824.
2015-06-03 14:15:56 -07:00
Zhijie Shen
4a9ec1a824
YARN-1462. Made RM write application tags to timeline server and exposed them to users via generic history web UI and REST API. Contributed by Xuan Gong.
2015-05-30 21:05:36 -07:00
Vinod Kumar Vavilapalli
9acd24fec4
Fixed more FilesSystemRMStateStore issues. Contributed by Vinod Kumar Vavilapalli.
2015-05-28 15:25:56 -07:00
Allen Wittenauer
d6e3164d4a
YARN-2355. MAX_APP_ATTEMPTS_ENV may no longer be a useful env var for a container (Darrell Taylor via aw)
2015-05-27 16:40:56 -07:00
Wangda Tan
ec0a852a37
YARN-3647. RMWebServices api's should use updated api from CommonNodeLabelsManager to get NodeLabel object. (Sunil G via wangda)
2015-05-27 12:51:20 -07:00
Wangda Tan
cdbd66be11
YARN-3686. CapacityScheduler should trim default_node_label_expression. (Sunil G via wangda)
2015-05-26 15:58:47 -07:00
Jian He
10732d515f
YARN-3632. Ordering policy should be allowed to reorder an application when demand changes. Contributed by Craig Welch
2015-05-26 12:00:51 -07:00
Tsuyoshi Ozawa
9a3d617b63
YARN-2336. Fair scheduler's REST API returns a missing '[' bracket JSON for deep queue tree. Contributed by Kenji Kikushima and Akira Ajisaka.
2015-05-26 19:07:40 +09:00
Karthik Kambatla
4513761869
YARN-3675. FairScheduler: RM quits when node removal races with continuous-scheduling on the same node. (Anubhav Dhoot via kasha)
2015-05-21 13:44:42 -07:00
Jian He
8966d42179
YARN-3609. Load node labels from storage inside RM serviceStart. Contributed by Wangda Tan
2015-05-20 16:30:07 -07:00
Wangda Tan
563eb1ad2a
YARN-3583. Support of NodeLabel object instead of plain String in YarnClient side. (Sunil G via wangda)
2015-05-19 16:54:38 -07:00
Wangda Tan
b37da52a1c
YARN-3565. NodeHeartbeatRequest/RegisterNodeManagerRequest should use NodeLabel object instead of String. (Naganarasimha G R via wangda)
2015-05-19 16:34:17 -07:00
Jason Lowe
f7e051c431
YARN-2421. RM still allocates containers to an app in the FINISHING state. Contributed by Chang Li
2015-05-15 22:09:30 +00:00
Vinod Kumar Vavilapalli
9a2a9553ee
Fixing HDFS state-store. Contributed by Arun Suresh.
2015-05-14 16:13:51 -07:00
Junping Du
15ccd967ee
YARN-3505. Node's Log Aggregation Report with SUCCEED should not cached in RMApps. Contributed by Xuan Gong.
2015-05-14 10:58:12 -07:00
Wangda Tan
0e85044e26
YARN-3362. Add node label usage in RM CapacityScheduler web UI. (Naganarasimha G R via wangda)
2015-05-13 17:00:36 -07:00
Wangda Tan
7f19e7a254
YARN-3521. Support return structured NodeLabel objects in REST API (Sunil G via wangda)
2015-05-13 13:43:17 -07:00
Wangda Tan
341a476812
YARN-2921. Fix MockRM/MockAM#waitForState sleep too long. (Tsuyoshi Ozawa via wangda)
2015-05-13 13:06:07 -07:00
Karthik Kambatla
a60f78e98e
YARN-3395. FairScheduler: Trim whitespaces when using username for queuename. (Zhihai Xu via kasha)
2015-05-09 15:41:20 -07:00
Karthik Kambatla
70fb37cd79
YARN-1287. Consolidate MockClocks. (Sebastian Wong and Anubhav Dhoot via kasha)
2015-05-09 14:34:54 -07:00
Karthik Kambatla
2fb44c8aaf
YARN-3271. FairScheduler: Move tests related to max-runnable-apps from TestFairScheduler to TestAppRunnability. (nijel via kasha)
2015-05-08 16:39:10 -07:00
Jian He
f489a4ec96
YARN-2918. RM should not fail on startup if queue's configured labels do not exist in cluster-node-labels. Contributed by Wangda Tan
2015-05-07 17:35:41 -07:00
Akira Ajisaka
918af8efff
YARN-3577. Misspelling of threshold in log4j.properties for tests. Contributed by Brahma Reddy Battula.
2015-05-07 13:33:03 +09:00
Vinod Kumar Vavilapalli
4c7b9b6abe
YARN-3385. Fixed a race-condition in ResourceManager's ZooKeeper based state-store to avoid crashing on duplicate deletes. Contributed by Zhihai Xu.
2015-05-06 17:51:17 -07:00
Junping Du
31b627b2a8
YARN-3580. [JDK8] TestClientRMService.testGetLabelsToNodes fails. Contributed by Robert Kanter.
2015-05-06 16:51:05 -07:00
Jian He
e4c3b52c89
YARN-3343. Increased TestCapacitySchedulerNodeLabelUpdate#testNodeUpdate timeout. Contributed by Rohith Sharmaks
2015-05-05 11:33:47 -07:00
Jian He
d701acc9c6
YARN-2725. Added test cases of retrying creating znode in ZKRMStateStore. Contributed by Tsuyoshi Ozawa
2015-05-04 16:13:29 -07:00
Gera Shegalov
f8204e241d
YARN-2893. AMLaucher: sporadic job failures due to EOFException in readTokenStorageStream. (Zhihai Xu via gera)
2015-05-01 18:18:55 -07:00
Wangda Tan
e2e8f77118
YARN-3564. Fix TestContainerAllocation.testAMContainerAllocationWhenDNSUnavailable fails randomly. (Jian He via wangda)
2015-04-30 11:03:19 -07:00
Jian He
4c1af156ae
YARN-3533. Test: Fix launchAM in MockRM to wait for attempt to be scheduled. Contributed by Anubhav Dhoot
2015-04-29 14:50:01 -07:00
tgraves
2e215484bd
YARN-3517. RM web ui for dumping scheduler logs should be for admins only (Varun Vasudev via tgraves)
2015-04-29 21:25:42 +00:00
Karthik Kambatla
8f82970e0c
YARN-3485. FairScheduler headroom calculation doesn't consider maxResources for Fifo and FairShare policies. (kasha)
2015-04-28 21:00:35 -07:00
Wangda Tan
db1b674b50
YARN-2740. Fix NodeLabelsManager to properly handle node label modifications when distributed node label configuration enabled. (Naganarasimha G R via wangda)
2015-04-27 16:24:38 -07:00
Jian He
d497f6ea2b
YARN-2498. Respect labels in preemption policy of capacity scheduler for inter-queue preemption. Contributed by Wangda Tan
2015-04-24 17:03:13 -07:00
Jian He
d03dcb9635
YARN-3387. Previous AM's container completed status couldn't pass to current AM if AM and RM restarted during the same time. Contributed by Sandflee
2015-04-24 12:13:29 -07:00
Vinod Kumar Vavilapalli
f5fe35e297
YARN-3413. Changed Nodelabel attributes (like exclusivity) to be settable only via addToClusterNodeLabels but not changeable at runtime. (Wangda Tan via vinodkv)
2015-04-23 11:19:55 -07:00
Wangda Tan
395205444e
YARN-3319. Implement a FairOrderingPolicy. (Craig Welch via wangda)
2015-04-23 10:47:15 -07:00
tgraves
189a63a719
YARN-3434. Interaction between reservations and userlimit can result in significant ULF violation
2015-04-23 14:39:25 +00:00
Junping Du
fad9d7e85b
New parameter of CLI for decommissioning node gracefully in RMAdmin CLI. Contributed by Devaraj K
2015-04-22 10:07:20 -07:00
Jian He
bdd90110e6
YARN-3494. Expose AM resource limit and usage in CS QueueMetrics. Contributed by Rohith Sharmaks
2015-04-21 20:06:20 -07:00
Wangda Tan
e71d0d87d9
YARN-3410. YARN admin should be able to remove individual application records from RMStateStore. (Rohith Sharmaks via wangda)
2015-04-21 17:51:22 -07:00
Wangda Tan
44872b76fc
YARN-3463. Integrate OrderingPolicy Framework with CapacityScheduler. (Craig Welch via wangda)
2015-04-20 17:12:32 -07:00
Wangda Tan
f65eeb412d
YARN-3493. RM fails to come up with error "Failed to load/recover state" when mem settings are changed. (Jian He via wangda)
2015-04-17 17:11:22 -07:00
Jian He
d573f09fb9
YARN-2696. Queue sorting in CapacityScheduler should consider node label. Contributed by Wangda Tan
2015-04-17 13:36:59 -07:00
Junping Du
1db355a875
YARN-1402. Update related Web UI and CLI with exposing client API to check log aggregation status. Contributed by Xuan Gong.
2015-04-17 13:18:59 -07:00
Jian He
bb6dde68f1
YARN-3021. YARN's delegation-token handling disallows certain trust setups to operate properly over DistCp. Contributed by Yongjun Zhang
2015-04-16 19:43:37 -07:00
Jian He
1b89a3e173
YARN-3354. Add node label expression in ContainerTokenIdentifier to support RM recovery. Contributed by Wangda Tan
2015-04-15 13:57:06 -07:00
Tsuyoshi Ozawa
e48cedc663
YARN-3326. Support RESTful API for getLabelsToNodes. Contributed by Naganarasimha G R.
2015-04-15 14:03:55 -05:00
Wangda Tan
5004e75332
YARN-3318. Create Initial OrderingPolicy Framework and FifoOrderingPolicy. (Craig Welch via wangda)
2015-04-15 09:56:32 -07:00
Jian He
0fefda645b
YARN-3361. CapacityScheduler side changes to support non-exclusive node labels. Contributed by Wangda Tan
2015-04-14 11:45:58 -07:00
Jian He
b46ee1e7a3
YARN-3266. RMContext#inactiveNodes should have NodeId as map key. Contributed by Chengbing Liu
2015-04-14 10:54:22 -07:00
Jian He
a1afbc48b5
YARN-3472. Fixed possible leak in DelegationTokenRenewer#allTokens. Contributed by Rohith Sharmaks
2015-04-13 14:07:17 -07:00
Junping Du
92431c9617
YARN-1376. NM need to notify the log aggregation status to RM through Node heartbeat. Contributed by Xuan Gong.
2015-04-10 08:56:18 -07:00
Xuan
afa5d4715a
YARN-3293. Track and display capacity scheduler health metrics in web UI. Contributed by Varun Vasudev
2015-04-09 23:38:04 -07:00
Vinod Kumar Vavilapalli
9c5911294e
YARN-3055. Fixed ResourceManager's DelegationTokenRenewer to not stop token renewal of applications part of a bigger workflow. Contributed by Daryn Sharp.
2015-04-09 13:08:53 -07:00
Robert Kanter
99b08a748e
YARN-2429. TestAMRMTokens.testTokenExpiry fails Intermittently with error message:Invalid AMRMToken (zxu via rkanter)
2015-04-06 14:11:20 -07:00
Tsuyoshi Ozawa
53959e69f7
TestFairScheduler.testContinuousScheduling fails Intermittently. Contributed by Zhihai Xu.
2015-04-06 20:19:13 +09:00
Sandy Ryza
6a6a59db7f
YARN-3415. Non-AM containers can be counted towards amResourceUsage of a fairscheduler queue (Zhihai Xu via Sandy Ryza)
2015-04-02 13:56:08 -07:00
Xuan
4728bdfa15
YARN-3248. Display count of nodes blacklisted by apps in the web UI. Contributed by Varun Vasudev
2015-04-01 04:19:18 -07:00
Karthik Kambatla
79f7f2aabf
YARN-3412. RM tests should use MockRM where possible. (kasha)
2015-03-31 09:14:15 -07:00
Wangda Tan
2a945d24f7
YARN-2495. Allow admin specify labels from each NM (Distributed configuration for node label). (Naganarasimha G R via wangda)
2015-03-30 12:05:21 -07:00
Karthik Kambatla
2bc097cd14
YARN-3241. FairScheduler handles invalid queue names inconsistently. (Zhihai Xu via kasha)
2015-03-23 13:22:03 -07:00
cnauroth
6ca1f12024
YARN-3336. FileSystem memory leak in DelegationTokenRenewer.
2015-03-23 10:45:50 -07:00
Jian He
e1feb4ea1a
YARN-3345. Add non-exclusive node label API. Contributed by Wangda Tan
2015-03-20 19:04:38 -07:00
Jian He
586348e4cb
YARN-3356. Capacity Scheduler FiCaSchedulerApp should use ResourceUsage to track used-resources-by-label. Contributed by Wangda Tan
2015-03-20 13:54:01 -07:00
Devaraj K
93d0f4acc8
YARN-3357. Move TestFifoScheduler to FIFO package. Contributed by Rohith Sharmaks.
2015-03-19 12:16:52 +05:30
Jian He
658097d6da
YARN-3273. Improve scheduler UI to facilitate scheduling analysis and debugging. Contributed by Rohith Sharmaks
2015-03-17 21:30:23 -07:00
Tsuyoshi Ozawa
3bc72cc16d
YARN-3205. FileSystemRMStateStore should disable FileSystem Cache to avoid get a Filesystem with an old configuration. Contributed by Zhihai Xu.
2015-03-18 11:53:19 +09:00
Jian He
968425e9f7
YARN-3305. Normalize AM resource request on app submission. Contributed by Rohith Sharmaks
2015-03-17 13:49:59 -07:00
Jian He
487374b7fe
YARN-3243. CapacityScheduler should pass headroom from parent to children to make sure ParentQueue obey its capacity limits. Contributed by Wangda Tan.
2015-03-17 10:24:23 -07:00
Tsuyoshi Ozawa
7522a643fa
YARN-3349. Treat all exceptions as failure in TestFSRMStateStore#testFSRMStateStoreClientRetry. Contributed by Zhihai Xu.
2015-03-17 08:09:55 +09:00
Vinod Kumar Vavilapalli
863079bb87
YARN-3154. Added additional APIs in LogAggregationContext to avoid aggregating running logs of application when rolling is enabled. Contributed by Xuan Gong.
2015-03-12 13:32:29 -07:00
Zhijie Shen
85f6d67fa7
YARN-1884. Added nodeHttpAddress into ContainerReport and fixed the link to NM web page. Contributed by Xuan Gong.
2015-03-11 19:35:19 -07:00
Jason Lowe
27e8ea820f
YARN-3275. CapacityScheduler: Preemption happening on non-preemptable queues. Contributed by Eric Payne
2015-03-06 22:37:26 +00:00
Jian He
95bfd087dc
YARN-1809. Synchronize RM and TimeLineServer Web-UIs. Contributed by Zhijie Shen and Xuan Gong
2015-03-05 21:20:09 -08:00
Karthik Kambatla
8d88691d16
YARN-3242. Asynchrony in ZK-close can lead to ZKRMStateStore watcher receiving events for old client. (Zhihai Xu via kasha)
2015-03-04 19:49:05 -08:00
Karthik Kambatla
22426a1c9f
YARN-3231. FairScheduler: Changing queueMaxRunningApps interferes with pending jobs. (Siqi Li via kasha)
2015-03-04 18:06:58 -08:00
Jian He
b2f1ec312e
YARN-3222. Fixed NPE on RMNodeImpl#ReconnectNodeTransition when a node is reconnected with a different port. Contributed by Rohith Sharmaks
2015-03-03 16:28:28 -08:00
Wangda Tan
e17e5ba9d7
YARN-3272. Surface container locality info in RM web UI (Jian He via wangda)
2015-03-03 11:49:01 -08:00
Vinod Kumar Vavilapalli
14dd647c55
YARN-3265. Fixed a deadlock in CapacityScheduler by always passing a queue's available resource-limit from the parent queue. Contributed by Wangda Tan.
2015-03-02 17:52:47 -08:00
Wangda Tan
edcecedc1c
YARN-3262. Surface application outstanding resource requests table in RM web UI. (Jian He via wangda)
2015-02-27 16:13:32 -08:00
Tsuyoshi Ozawa
01a1621930
YARN-2820. Retry in FileSystemRMStateStore when FS's operations fail due to IOException. Contributed by Zhihai Xu.
2015-02-28 00:56:44 +09:00
Devaraj K
0d4296f0e0
YARN-3256. TestClientToAMTokens#testClientTokenRace is not running against all Schedulers even when using ParameterizedSchedulerTestBase. Contributed by Anubhav Dhoot.
2015-02-26 15:45:41 +05:30
Tsuyoshi Ozawa
6cbd9f1113
YARN-3247. TestQueueMappings should use CapacityScheduler explicitly. Contributed by Zhihai Xu.
2015-02-25 10:38:11 +09:00
Xuan
fe7a302473
YARN-2797. TestWorkPreservingRMRestart should use ParametrizedSchedulerTestBase. Contributed by Karthik Kambatla
2015-02-21 19:17:29 -08:00
Jason Lowe
a64dd3d24b
YARN-3194. RM should handle NMContainerStatuses sent by NM while registering if NM is Reconnected node. Contributed by Rohith
2015-02-20 15:10:10 +00:00
Jian He
c0d9b93953
YARN-933. Fixed InvalidStateTransitonException at FINAL_SAVING state in RMApp. Contributed by Rohith Sharmaks
2015-02-19 15:42:39 -08:00
Wangda Tan
d49ae725d5
YARN-3076. Add API/Implementation to YarnClient to retrieve label-to-node mapping (Varun Saxena via wangda)
2015-02-19 11:00:57 -08:00
Jian He
1c03376300
YARN-1514. Utility to benchmark ZKRMStateStore#loadState for RM HA. Contributed by Tsuyoshi OZAWA
2015-02-18 16:06:55 -08:00
Jian He
f5da5566d9
YARN-3132. RMNodeLabelsManager should remove node from node-to-label mapping when node becomes deactivated. Contributed by Wangda Tan
2015-02-18 11:51:51 -08:00
Jian He
18297e0972
YARN-3104. Fixed RM to not generate new AMRM tokens on every heartbeat between rolling and activation. Contributed by Jason Lowe
2015-02-12 16:02:45 -08:00
Jian He
18a594257e
YARN-3124. Fixed CS LeafQueue/ParentQueue to use QueueCapacities to track capacities-by-label. Contributed by Wangda Tan
2015-02-12 14:58:09 -08:00
Xuan
65c69e296e
YARN-3151. On Failover tracking url wrong in application cli for KILLED application. Contributed by Rohith
2015-02-11 21:19:48 -08:00
Zhijie Shen
d5855c0e46
YARN-2246. Made the proxy tracking URL always be http(s)://proxy addr:port/proxy/<appId> to avoid duplicate sections. Contributed by Devaraj K.
2015-02-10 15:24:01 -08:00
Zhijie Shen
23bf6c7207
YARN-3100. Made YARN authorization pluggable. Contributed by Jian He.
2015-02-09 20:34:56 -08:00
Jian He
0af6a99a3f
YARN-3094. Reset timer for liveness monitors after RM recovery. Contributed by Jun Gong
2015-02-09 13:47:08 -08:00
Karthik Kambatla
7e42088abf
YARN-2990. FairScheduler's delay-scheduling always waits for node-local and rack-local delays, even for off-rack-only requests. (kasha)
2015-02-08 22:48:10 -08:00
Jason Lowe
da2fb2bc46
YARN-3143. RM Apps REST API can return NPE or entries missing id and other fields. Contributed by Jason Lowe
2015-02-06 21:47:32 +00:00
Jian He
c1957fef29
YARN-2694. Ensure only single node label specified in ResourceRequest. Contributed by Wangda Tan
2015-02-06 11:34:20 -08:00
Jason Lowe
69c8a7f45b
YARN-1582. Capacity Scheduler: add a maximum-allocation-mb setting per queue. Contributed by Thomas Graves
2015-02-05 19:28:49 +00:00
Sandy Ryza
b6466deac6
YARN-3101. In Fair Scheduler, fix canceling of reservations for exceeding max share (Anubhav Dhoot via Sandy Ryza)
2015-02-05 09:39:28 -08:00
Zhijie Shen
e5b56e2af6
YARN-2543. Made resource usage be published to the timeline server too. Contributed by Naganarasimha G R.
2015-02-03 17:34:22 -08:00
Wangda Tan
5bd984691b
YARN-3075. NodeLabelsManager implementation to retrieve label to node mapping (Varun Saxena via wangda)
2015-02-03 12:52:42 -08:00
Jian He
21d80b3dd9
YARN-3098. Created common QueueCapacities class in Capacity Scheduler to track capacities-by-labels of queues. Contributed by Wangda Tan
2015-02-03 11:43:12 -08:00
Jason Lowe
a761bf8726
YARN-3085. Application summary should include the application type. Contributed by Rohith
2015-02-03 14:56:34 +00:00
Jian He
054a947989
YARN-3077. Fixed RM to create zk root path recursively. Contributed by Chun Chen
2015-01-30 17:34:49 -08:00
Jian He
86358221fc
YARN-3099. Capacity Scheduler LeafQueue/ParentQueue should use ResourceUsage to track used-resources-by-label. Contributed by Wangda Tan
2015-01-30 15:15:20 -08:00
Wangda Tan
7882bc0f14
YARN-3079. Scheduler should also update maximumAllocation when updateNodeResource. (Zhihai Xu via wangda)
2015-01-28 21:54:38 -08:00
Wangda Tan
18741adf97
YARN-2932. Add entry for preemptable status (enabled/disabled) to scheduler web UI and queue initialize/refresh logging. (Eric Payne via wangda)
2015-01-27 15:36:09 -08:00
Jian He
6f9fe76918
YARN-3092. Created a common ResourceUsage class to track labeled resource usages in Capacity Scheduler. Contributed by Wangda Tan
2015-01-26 15:38:00 -08:00
Tsuyoshi Ozawa
24aa462673
YARN-2800. Remove MemoryNodeLabelsStore and add a way to enable/disable node labels feature. Contributed by Wangda Tan.
2015-01-23 20:37:05 +09:00
Wangda Tan
0a2d3e717d
YARN-2933. Capacity Scheduler preemption policy should only consider capacity without labels temporarily. Contributed by Mayank Bansal
2015-01-19 16:48:50 -08:00
Junping Du
5d1cca34fa
YARN-3064. TestRMRestart/TestContainerResourceUsage/TestNodeManagerResync failure with allocation timeout. (Contributed by Jian He)
2015-01-16 00:08:36 -08:00
Jian He
c53420f583
YARN-2637. Fixed max-am-resource-percent calculation in CapacityScheduler when activating applications. Contributed by Craig Welch
2015-01-13 17:32:07 -08:00
Robert Kanter
ae7bf31fe1
YARN-3027. Scheduler should use totalAvailable resource from node instead of availableResource for maxAllocation. (adhoot via rkanter)
2015-01-12 10:47:52 -08:00
Zhijie Shen
60103fca04
YARN-2427. Added the API of moving apps between queues in RM web services. Contributed by Varun Vasudev.
2015-01-06 14:37:44 -08:00
Karthik Kambatla
0c4b112677
YARN-2881. [YARN-2574] Implement PlanFollower for FairScheduler. (Anubhav Dhoot via kasha)
2015-01-06 04:42:10 +05:30
Zhijie Shen
562a701945
YARN-2958. Made RMStateStore not update the last sequence number when updating the delegation token. Contributed by Varun Saxena.
2015-01-05 13:33:07 -08:00
Tsuyoshi Ozawa
ddc5be48fc
YARN-2922. ConcurrentModificationException in CapacityScheduler's LeafQueue. Contributed by Rohith Sharmaks.
2015-01-05 00:08:31 +09:00
Karthik Kambatla
e7257acd8a
YARN-2998. Abstract out scheduler independent PlanFollower components. (Anubhav Dhoot via kasha)
2014-12-30 19:55:24 -08:00
Jian He
e2351c7ae2
YARN-2987. Fixed ClientRMService#getQueueInfo to check against queue and app ACLs. Contributed by Varun Saxena
2014-12-30 17:15:37 -08:00
Jian He
b7442bf92e
YARN-2493. Added node-labels page on RM web UI. Contributed by Wangda Tan
2014-12-30 16:49:01 -08:00
Jian He
746ad6e989
Revert "YARN-2492(wrong jira number). Added node-labels page on RM web UI. Contributed by Wangda Tan"
This reverts commit 5f57b904f5.
2014-12-30 16:48:49 -08:00
Jian He
5f57b904f5
YARN-2492. Added node-labels page on RM web UI. Contributed by Wangda Tan
2014-12-30 15:38:28 -08:00
Jian He
4f18018b7a
YARN-2946. Fixed potential deadlock in RMStateStore. Contributed by Rohith Sharmaks
2014-12-23 22:14:29 -08:00
Jian He
149512a837
YARN-2837. Support TimeLine server to recover delegation token when restarting. Contributed by Zhijie Shen
2014-12-23 18:25:37 -08:00
Jian He
0d89859b51
YARN-2340. Fixed NPE when queue is stopped during RM restart. Contributed by Rohith Sharmaks
2014-12-22 21:53:22 -08:00
Jian He
fdf042dfff
YARN-2920. Changed CapacityScheduler to kill containers on nodes where node labels are changed. Contributed by Wangda Tan
2014-12-22 16:51:15 -08:00
Karthik Kambatla
24ee9e3431
YARN-2975. FSLeafQueue app lists are accessed without required locks. (kasha)
2014-12-20 12:17:50 -08:00
Jian He
808cba3821
YARN-2952. Fixed incorrect version check in StateStore. Contributed by Rohith Sharmaks
2014-12-19 16:56:30 -08:00
Karthik Kambatla
a22ffc3188
YARN-2738. [YARN-2574] Add FairReservationSystem for FairScheduler. (Anubhav Dhoot via kasha)
2014-12-19 15:37:12 -08:00
Jason Lowe
0402bada19
YARN-2964. RM prematurely cancels tokens for jobs that submit jobs (oozie). Contributed by Jian He
2014-12-18 23:28:18 +00:00
Steve Loughran
3681de2039
YARN-2912 Jersey Tests failing with port in use. (varun saxena via stevel)
2014-12-12 17:11:07 +00:00
Jian He
2ed90a57fd
YARN-2930. Fixed TestRMRestart#testRMRestartRecoveringNodeLabelManager intermittent failure. Contributed by Wangda Tan
2014-12-09 16:48:04 -08:00
Karthik Kambatla
a2e07a5456
YARN-2910. FSLeafQueue can throw ConcurrentModificationException. (Wilfred Spiegelenburg via kasha)
2014-12-09 14:00:31 -08:00
Jian He
e69af836f3
YARN-2869. CapacityScheduler should trim sub queue names when parse configuration. Contributed by Wangda Tan
2014-12-05 17:33:39 -08:00
Jason Lowe
4b13082199
YARN-2056. Disable preemption at Queue level. Contributed by Eric Payne
2014-12-05 21:06:48 +00:00
Jian He
258623ff8b
YARN-2301. Improved yarn container command. Contributed by Naganarasimha G R
2014-12-04 12:53:18 -08:00
Jian He
73fbb3c66b
YARN-2880. Added a test to make sure node labels will be recovered if RM restart is enabled. Contributed by Rohith Sharmaks
2014-12-03 17:14:52 -08:00
Jian He
392c3aaea8
YARN-2894. Fixed a bug regarding application view acl when RM fails over. Contributed by Rohith Sharmaks
2014-12-02 17:16:35 -08:00
Jian He
52bcefca8b
YARN-2136. Changed RMStateStore to ignore store operations when fenced. Contributed by Varun Saxena
2014-12-02 10:54:48 -08:00
Jian He
a7fba0bc28
YARN-2765. Added leveldb-based implementation for RMStateStore. Contributed by Jason Lowe
2014-12-01 16:38:25 -08:00
Junping Du
c732ed760e
YARN-2907. SchedulerNode#toString should print all resource detail instead of only memory. (Contributed by Rohith)
2014-12-01 05:38:22 -08:00
Jian He
5805a81efb
YARN-2404. Removed ApplicationAttemptState and ApplicationState class in RMStateStore. Contributed by Tsuyoshi OZAWA
2014-11-25 12:48:22 -08:00
Sandy Ryza
a128cca305
YARN-2669. FairScheduler: queue names shouldn't allow periods (Wei Yan via Sandy Ryza)
2014-11-21 16:06:41 -08:00
Karthik Kambatla
3114d4731d
YARN-2604. Scheduler should consider max-allocation-* in conjunction with the largest node. (Robert Kanter via kasha)
2014-11-21 10:32:28 -08:00
Karthik Kambatla
a9a0cc3679
YARN-2315. FairScheduler: Set current capacity in addition to capacity. (Zhihai Xu via kasha)
2014-11-19 20:15:40 -08:00
Karthik Kambatla
c90fb84aaa
YARN-2802. ClusterMetrics to include AM launch and register delays. (Zhihai Xu via kasha)
2014-11-19 19:50:12 -08:00
Jian He
9cb8b75ba5
YARN-2865. Fixed RM to always create a new RMContext when transitions from StandBy to Active. Contributed by Rohith Sharmaks
2014-11-19 19:48:52 -08:00
Karthik Kambatla
2fce6d6141
YARN-2690. [YARN-2574] Make ReservationSystem and its dependent classes independent of Scheduler type. (Anubhav Dhoot via kasha)
2014-11-17 16:45:57 -08:00
Jason Lowe
81c9d17af8
YARN-2414. RM web UI: app page will crash if app is failed before any attempt has been created. Contributed by Wangda Tan
2014-11-17 21:15:48 +00:00
Sandy Ryza
1a47f890ba
YARN-2811. In Fair Scheduler, reservation fulfillments shouldn't ignore max share (Siqi Li via Sandy Ryza)
2014-11-14 15:18:56 -08:00
Jian He
d005404ef7
YARN-2856. Fixed RMAppImpl to handle ATTEMPT_KILLED event at ACCEPTED state on app recovery. Contributed by Rohith Sharmaks
2014-11-13 15:34:26 -08:00
Vinod Kumar Vavilapalli
3651fe1b08
YARN-2853. Fixed a bug in ResourceManager causing apps to hang when the user kill request races with ApplicationMaster finish. Contributed by Jian He.
2014-11-13 08:13:03 -08:00
Jason Lowe
f8aefa5e9c
YARN-2780. Log aggregated resource allocation in rm-appsummary.log. Contributed by Eric Payne
2014-11-12 17:01:15 +00:00
Vinod Kumar Vavilapalli
e76faebc95
YARN-2834. Fixed ResourceManager to ignore token-renewal failures on recovery consistent with the (somewhat incorrect) behaviour in the non-recovery case. Contributed by Jian He.
2014-11-09 18:56:58 -08:00
Arun C. Murthy
43cd07b408
YARN-2830. Add backwards compatible ContainerId.newInstance constructor. Contributed by Jonathan Eagles.
2014-11-09 14:57:37 -08:00
Zhijie Shen
9a4e0d343e
YARN-2505. Supported get/add/remove/change labels in RM REST API. Contributed by Craig Welch.
2014-11-07 20:35:46 -08:00
Vinod Kumar Vavilapalli
4cfd5bc7c1
YARN-2753. Fixed a bunch of bugs in the NodeLabelsManager classes. Contributed by Zhihai xu.
2014-11-07 14:15:53 -08:00
Vinod Kumar Vavilapalli
2ac1be7dec
YARN-2824. Fixed Capacity Scheduler to not crash when some node-labels are not mapped to queues by making default capacities per label to be zero. Contributed by Wangda Tan.
2014-11-07 10:39:37 -08:00
Xuan
1e97f2f094
YARN-2810. TestRMProxyUsersConf fails on Windows VMs. Contributed by Varun Vasudev
2014-11-07 09:44:43 -08:00
Vinod Kumar Vavilapalli
a5657182a7
YARN-2823. Fixed ResourceManager app-attempt state machine to inform schedulers about previous finished attempts of a running application to avoid expectation mismatch w.r.t transferred containers. Contributed by Jian He.
2014-11-07 09:28:36 -08:00
Vinod Kumar Vavilapalli
a3839a9fbf
YARN-2744. Fixed CapacityScheduler to validate node-labels correctly against queues. Contributed by Wangda Tan.
2014-11-06 17:28:12 -08:00
Jian He
395275af86
YARN-2579. Fixed a deadlock issue when EmbeddedElectorService and FatalEventDispatcher try to transition RM to StandBy at the same time. Contributed by Rohith Sharmaks
2014-11-05 16:59:54 -08:00
Zhijie Shen
b4c951ab83
YARN-2767. Added a test case to verify that http static user cannot kill or submit apps in the secure mode. Contributed by Varun Vasudev.
2014-11-05 10:57:38 -08:00
Karthik Kambatla
b2cd269802
YARN-2010. Handle app-recovery failures gracefully. (Jian He and Karthik Kambatla via kasha)
2014-11-04 17:45:24 -08:00
Vinod Kumar Vavilapalli
ec6cbece8e
YARN-2795. Fixed ResourceManager to not crash loading node-label data from HDFS in secure mode. Contributed by Wangda Tan.
2014-11-03 13:44:06 -08:00
Zhijie Shen
27715ec63b
YARN-2785. Fixed intermittent TestContainerResourceUsage failure. Contributed by Varun Vasudev.
2014-11-02 15:20:40 -08:00
Vinod Kumar Vavilapalli
e0233c16eb
YARN-2698. Moved some node label APIs to be correctly placed in client protocol. Contributed by Wangda Tan.
2014-10-30 22:59:31 -07:00
Karthik Kambatla
179cab81e0
YARN-2712. TestWorkPreservingRMRestart: Augment FS tests with queue and headroom checks. (Tsuyoshi Ozawa via kasha)
2014-10-30 00:29:07 -07:00
Karthik Kambatla
782971ae7a
YARN-2742. FairSchedulerConfiguration should allow extra spaces between value and unit. (Wei Yan via kasha)
2014-10-29 10:24:57 -07:00
Jian He
d5e0a09721
YARN-2503. Added node labels in web UI. Contributed by Wangda Tan
2014-10-28 17:57:54 -07:00
Vinod Kumar Vavilapalli
a16d022ca4
YARN-2704. Changed ResourceManager to optionally obtain tokens itself for the sake of localization and log-aggregation for long-running services. Contributed by Jian He.
2014-10-27 15:49:47 -07:00
Vinod Kumar Vavilapalli
0186645505
YARN-2743. Fixed a bug in ResourceManager that was causing RMDelegationToken identifiers to be tampered and thus causing app submission failures in secure mode. Contributed by Jian He.
2014-10-26 11:14:34 -07:00
Jian He
5864dd99a4
YARN-1915. Fixed a race condition that client could use the ClientToAMToken to contact with AM before AM actually receives the ClientToAMTokenMasterKey. Contributed by Jason Lowe
2014-10-24 22:47:56 -07:00
Zhijie Shen
0f3b6900be
YARN-2209. Replaced AM resync/shutdown command with corresponding exceptions and made related MR changes. Contributed by Jian He.
2014-10-23 21:56:03 -07:00
Vinod Kumar Vavilapalli
c0e034336c
YARN-2715. Fixed ResourceManager to respect common configurations for proxy users/groups beyond just the YARN level config. Contributed by Zhijie Shen.
2014-10-21 20:09:40 -07:00
Vinod Kumar Vavilapalli
39063cd36f
YARN-2676. Enhanced Timeline auth-filter to support proxy users. Contributed by Zhijie Shen.
2014-10-17 22:02:50 -07:00
Vinod Kumar Vavilapalli
e9c66e8fd2
YARN-2705. Fixed bugs in ResourceManager node-label manager that were causing test-failures: added a dummy in-memory labels-manager. Contributed by Wangda Tan.
2014-10-17 18:26:12 -07:00
Jian He
c3de2412eb
YARN-1879. Marked Idempotent/AtMostOnce annotations to ApplicationMasterProtocol for RM fail over. Contributed by Tsuyoshi OZAWA
2014-10-17 16:35:27 -07:00
Jian He
a6aa6e42ca
YARN-2588. Standby RM fails to transitionToActive if previous transitionToActive failed with ZK exception. Contributed by Rohith Sharmaks
2014-10-17 10:54:24 -07:00
Vinod Kumar Vavilapalli
abae63caf9
YARN-2699. Fixed a bug in CommonNodeLabelsManager that caused tests to fail when using ephemeral ports on NodeIDs. Contributed by Wangda Tan.
2014-10-17 08:58:08 -07:00
Vinod Kumar Vavilapalli
b3056c266a
YARN-2685. Fixed a bug in CommonNodeLabelsManager that caused wrong resource tracking per label when a host runs multiple node-managers. Contributed by Wangda Tan.
2014-10-15 18:47:26 -07:00
Vinod Kumar Vavilapalli
f2ea555ac6
YARN-2496. Enhanced Capacity Scheduler to have basic support for allocating resources based on node-labels. Contributed by Wangda Tan.
YARN-2500. Enhanced ResourceManager to support schedulers allocating resources based on node-labels. Contributed by Wangda Tan.
2014-10-15 18:33:06 -07:00
Jian He
0af1a2b5bc
YARN-2312. Deprecated old ContainerId#getId API and updated MapReduce to use ContainerId#getContainerId instead. Contributed by Tsuyoshi OZAWA
2014-10-15 15:22:07 -07:00
Zhijie Shen
1220bb72d4
YARN-2656. Made RM web services authentication filter support proxy user. Contributed by Varun Vasudev and Zhijie Shen.
2014-10-14 21:50:46 -07:00
Zhijie Shen
cdce88376a
HADOOP-11181. Generalized o.a.h.s.t.d.DelegationTokenManager to handle all sub-classes of AbstractDelegationTokenIdentifier. Contributed by Zhijie Shen.
2014-10-14 11:35:38 -07:00
Karthik Kambatla
da709a2eac
YARN-2641. Decommission nodes on -refreshNodes instead of next NM-RM heartbeat. (Zhihai Xu via kasha)
2014-10-13 16:23:04 -07:00
Jian He
f9680d9a16
YARN-2308. Changed CapacityScheduler to explicitly throw exception if the queue to which the apps were submitted is changed across RM restart. Contributed by Craig Welch & Chang Li
2014-10-13 14:09:04 -07:00
Zhijie Shen
4aed2d8e91
YARN-2651. Spun off LogRollingInterval from LogAggregationContext. Contributed by Xuan Gong.
2014-10-13 10:54:09 -07:00
Vinod Kumar Vavilapalli
db7f165319
YARN-2494. Added NodeLabels Manager internal API and implementation. Contributed by Wangda Tan.
2014-10-10 11:44:21 -07:00
Jian He
e16e25ab1b
YARN-2649. Fixed TestAMRMRPCNodeUpdates test failure. Contributed by Ming Ma
2014-10-08 10:58:51 -07:00
Jian He
30d56fdbb4
YARN-1857. CapacityScheduler headroom doesn't account for other AM's running. Contributed by Chen He and Craig Welch
2014-10-07 13:45:04 -07:00
Jian He
519e5a7dd2
YARN-2644. Fixed CapacityScheduler to return up-to-date headroom when AM allocates. Contributed by Craig Welch
2014-10-06 15:48:46 -07:00
Jian He
ea26cc0b4a
YARN-2615. Changed ClientToAMTokenIdentifier/RM(Timeline)DelegationTokenIdentifier to use protobuf as payload. Contributed by Junping Du
2014-10-06 10:47:43 -07:00
subru
a2986234be
YARN-2611. Fixing jenkins findbugs warning and TestRMWebServicesCapacitySched for branch YARN-1051. Contributed by Subru Krishnan and Carlo Curino.
(cherry picked from commit c47464aba407d1dafe10be23fe454f0489cc4367)
2014-10-03 15:43:23 -07:00
subru
5e10a13bb4
YARN-2576. Making test patch pass in branch. Contributed by Subru Krishnan and Carlo Curino.
(cherry picked from commit 90ac0be86b898aefec5471db4027554c8e1b310c)
2014-10-03 15:43:13 -07:00
subru
6261f7cc69
YARN-2080. Integrating reservation system with ResourceManager and client-RM protocol. Contributed by Subru Krishnan and Carlo Curino.
(cherry picked from commit 8baeaead8532898163f1006276b731a237b1a559)
Conflicts:
hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/main/java/org/apache/hadoop/yarn/server/resourcemanager/rmapp/RMAppImpl.java
2014-10-03 15:42:43 -07:00
subru
3418c56bcf
YARN-1712. Plan follower that synchronizes the current state of reservation subsystem with the scheduler. Contributed by Subru Krishnan and Carlo Curino.
(cherry picked from commit 169085319b8b76641f8b9f6840a3fef06d221e2b)
2014-10-03 15:42:10 -07:00
carlo curino
b6df0dddcd
YARN-1711. Policy to enforce instantaneous and over-time quotas on user reservation. Contributed by Carlo Curino and Subru Krishnan.
(cherry picked from commit c4918cb4cb5a267a8cfd6eace28fcfe7ad6174e8)
2014-10-03 15:42:03 -07:00
carlo curino
f66ffcf832
YARN-1710. Logic to find allocations within a Plan that satisfy user ReservationRequest(s). Contributed by Carlo Curino and Subru Krishnan.
(cherry picked from commit aef7928899b37262773f3dc117157bb746bf8918)
2014-10-03 15:41:57 -07:00
subru
cf4b34282a
YARN-1709. In-memory data structures used to track resources over time to enable reservations.
(cherry picked from commit 0d8b2cd88b958b1e602fd4ea4078ef8d4742a7c3)
2014-10-03 15:41:51 -07:00
carlo curino
1c6950354f
YARN-2475. Logic for responding to capacity drops for the ReservationSystem. Contributed by Carlo Curino and Subru Krishnan.
(cherry picked from commit f83a07f266f2c5e6eead554d8a331ed7e75e10d5)
2014-10-03 15:41:21 -07:00
carlo curino
eb3e40b833
YARN-1707. Introduce APIs to add/remove/resize queues in the CapacityScheduler. Contributed by Carlo Curino and Subru Krishnan
(cherry picked from commit aac47fda7fecda9fc18ade34d633eca895865a70)
Conflicts:
hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/main/java/org/apache/hadoop/yarn/server/resourcemanager/scheduler/capacity/CapacityScheduler.java
2014-10-03 15:41:02 -07:00
Karthik Kambatla
80d11eb68e
YARN-2635. TestRM, TestRMRestart, TestClientToAMTokens should run with both CS and FS. (Wei Yan and kasha via kasha)
2014-10-03 11:49:49 -07:00
Jian He
054f285526
YARN-2628. Capacity scheduler with DominantResourceCalculator carries out reservation even though slots are free. Contributed by Varun Vasudev
2014-10-02 15:13:33 -07:00
Karthik Kambatla
5e0b49da9c
YARN-2254. TestRMWebServicesAppsModification should run against both CS and FS. (Zhihai Xu via kasha)
2014-10-02 10:15:04 -07:00
Zhijie Shen
52bbe0f11b
YARN-2630. Prevented previous AM container status from being acquired by the current restarted AM. Contributed by Jian He.
2014-10-01 15:38:11 -07:00
Jian He
bbff96be48
YARN-2602. Fixed possible NPE in ApplicationHistoryManagerOnTimelineStore. Contributed by Zhijie Shen
2014-09-30 16:44:17 -07:00
Jason Lowe
9c22065109
YARN-1769. CapacityScheduler: Improve reservations. Contributed by Thomas Graves
2014-09-29 14:12:18 +00:00
Jian He
5391919b09
YARN-668. Changed NMTokenIdentifier/AMRMTokenIdentifier/ContainerTokenIdentifier to use protobuf object as the payload. Contributed by Junping Du.
2014-09-26 17:48:41 -07:00
Jason Lowe
8269bfa613
YARN-2523. ResourceManager UI showing negative value for "Decommissioned Nodes" field. Contributed by Rohith
2014-09-25 22:37:05 +00:00
Zhijie Shen
72b0881ca6
YARN-2546. Made REST API for application creation/submission use numeric and boolean types instead of the string of them. Contributed by Varun Vasudev.
2014-09-24 17:57:32 -07:00
Zhijie Shen
c86674a3a4
YARN-2581. Passed LogAggregationContext to NM via ContainerTokenIdentifier. Contributed by Xuan Gong.
2014-09-24 17:50:26 -07:00
Karthik Kambatla
f5578207d2
YARN-2252. Intermittent failure of TestFairScheduler.testContinuousScheduling. (Ratandeep Ratti and kasha via kasha)
2014-09-23 00:03:16 -07:00
Karthik Kambatla
568d3dc2bb
YARN-1959. Fix headroom calculation in FairScheduler. (Anubhav Dhoot via kasha)
2014-09-22 23:49:39 -07:00
Karthik Kambatla
43efdd30b5
YARN-2539. FairScheduler: Set the default value for maxAMShare to 0.5. (Wei Yan via kasha)
2014-09-22 16:09:52 -07:00
Jian He
0a641496c7
YARN-1372. Ensure all completed containers are reported to the AMs across RM restart. Contributed by Anubhav Dhoot
2014-09-22 10:30:53 -07:00
Karthik Kambatla
9721e2c1fe
YARN-2453. TestProportionalCapacityPreemptionPolicy fails with FairScheduler. (Zhihai Xu via kasha)
2014-09-21 23:13:45 -07:00
Karthik Kambatla
c50fc92502
YARN-2452. TestRMApplicationHistoryWriter fails with FairScheduler. (Zhihai Xu via kasha)
2014-09-21 13:15:04 -07:00
Jian He
444acf8ea7
YARN-2565. Fixed RM to not use FileSystemApplicationHistoryStore unless explicitly set. Contributed by Zhijie Shen
2014-09-19 11:26:29 -07:00
Zhijie Shen
6fe5c6b746
YARN-2568. Fixed the potential test failures due to race conditions when RM work-preserving recovery is enabled. Contributed by Jian He.
2014-09-18 21:56:56 -07:00
Jason Lowe
a337f0e354
YARN-2561. MR job client cannot reconnect to AM after NM restart. Contributed by Junping Du
2014-09-18 21:34:40 +00:00
Jason Lowe
9ea7b6c063
YARN-2363. Submitted applications occasionally lack a tracking URL. Contributed by Jason Lowe
2014-09-18 20:13:16 +00:00
Vinod Kumar Vavilapalli
485c96e3cb
YARN-2001. Added a time threshold for RM to wait before starting container allocations after restart/failover. Contributed by Jian He.
2014-09-18 11:03:12 -07:00
Jian He
ee21b13cbd
YARN-2559. Fixed NPE in SystemMetricsPublisher when retrieving FinalApplicationStatus. Contributed by Zhijie Shen
2014-09-17 21:44:15 -07:00
junping_du
90a0c03f0a
YARN-1250. Generic history service should support application-acls. (Contributed by Zhijie Shen)
2014-09-16 18:20:49 -07:00
Vinod Kumar Vavilapalli
14e2639fd0
YARN-611. Added an API to let apps specify an interval beyond which AM failures should be ignored towards counting max-attempts. Contributed by Xuan Gong.
2014-09-13 18:04:05 -07:00
XuanGong
e65ae575a0
YARN-2456. Possible livelock in CapacityScheduler when RM is recovering apps. Contributed by Jian He
2014-09-12 15:21:46 -07:00
Jian He
3122daa802
YARN-2229. Changed the integer field of ContainerId to be long type. Contributed by Tsuyoshi OZAWA
2014-09-12 10:33:33 -07:00
junping_du
6b8b1608e6
YARN-2033. Merging generic-history into the Timeline Store (Contributed by Zhijie Shen)
2014-09-12 10:04:51 +08:00
Karthik Kambatla
c11ada5ea6
YARN-2534. FairScheduler: Potential integer overflow calculating totalMaxShare. (Zhihai Xu via kasha)
2014-09-11 12:06:06 -07:00
Jian He
83be3ad444
YARN-415. Capture aggregate memory allocation at the app-level for chargeback. Contributed by Eric Payne & Andrey Klochkov
2014-09-10 18:20:54 -07:00
Jian He
cbfe26370b
YARN-2158. Fixed TestRMWebServicesAppsModification#testSingleAppKill test failure. Contributed by Varun Vasudev
2014-09-10 12:47:34 -07:00
XUAN
47bdfa044a
YARN-2459. RM crashes if App gets rejected for any reason and HA is enabled. Contributed by Jian He
2014-09-10 11:44:41 -07:00
Vinod Kumar Vavilapalli
b67d5ba784
YARN-2448. Changed ApplicationMasterProtocol to expose RM-recognized resource types to the AMs. Contributed by Varun Vasudev.
2014-09-10 10:15:47 -07:00
Karthik Kambatla
3072c83b38
YARN-1458. FairScheduler: Zero weight can lead to livelock. (Zhihai Xu via kasha)
2014-09-10 08:26:14 -07:00
Karthik Kambatla
1dcaba9a7a
YARN-2394. FairScheduler: Configure fairSharePreemptionThreshold per queue. (Wei Yan via kasha)
2014-09-03 10:27:36 -07:00
Karthik Kambatla
0f34e6f387
YARN-2395. FairScheduler: Preemption timeout should be configurable per queue. (Wei Yan via kasha)
2014-08-30 01:17:13 -07:00
Jian He
5c14bc426b
YARN-1506. Changed RMNode/SchedulerNode to update resource with event notification. Contributed by Junping Du
2014-08-29 23:05:51 -07:00
Jian He
c686aa3533
YARN-2447. RM web service app submission doesn't pass secrets correctly. Contributed by Varun Vasudev
2014-08-29 11:40:47 -07:00
Karthik Kambatla
fa80ca49bd
YARN-2405. NPE in FairSchedulerAppsBlock. (Tsuyoshi Ozawa via kasha)
2014-08-28 23:21:37 -07:00
Karthik Kambatla
d16bfd1d0f
YARN-1326. RM should log using RMStore at startup time. (Tsuyoshi Ozawa via kasha)
2014-08-27 01:43:58 -07:00
Karthik Kambatla
0097b15e21
YARN-2393. FairScheduler: Add the notion of steady fair share. (Wei Yan via kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1619845 13f79535-47bb-0310-9956-ffa450edef68
2014-08-22 15:44:47 +00:00
Jason Darrell Lowe
4236c6600e
YARN-2434. RM should not recover containers from previously failed attempt when AM restart is not enabled. Contributed by Jian He
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1619614 13f79535-47bb-0310-9956-ffa450edef68
2014-08-21 22:41:34 +00:00
Zhijie Shen
f6a778c372
YARN-2249. Avoided AM release requests being lost on work preserving RM restart. Contributed by Jian He.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1618972 13f79535-47bb-0310-9956-ffa450edef68
2014-08-19 20:33:49 +00:00
Jian He
375c221960
YARN-2409. RM ActiveToStandBy transition missing stopping previous rmDispatcher. Contributed by Rohith
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1618915 13f79535-47bb-0310-9956-ffa450edef68
2014-08-19 17:49:39 +00:00
Jian He
519c4be95a
YARN-2411. Support simple user and group mappings to queues. Contributed by Ram Venkatesh
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1618542 13f79535-47bb-0310-9956-ffa450edef68
2014-08-18 06:08:45 +00:00
Jian He
c3084d6c16
YARN-2389. Added functionality for schedulers to kill all applications in a queue. Contributed by Subramaniam Venkatraman Krishnan
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1618294 13f79535-47bb-0310-9956-ffa450edef68
2014-08-15 23:53:57 +00:00
Jian He
7360cec692
YARN-2378. Added support for moving applications across queues in CapacityScheduler. Contributed by Subramaniam Venkatraman Krishnan
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1618106 13f79535-47bb-0310-9956-ffa450edef68
2014-08-15 06:00:31 +00:00
Zhijie Shen
a9023c2736
YARN-2397. Avoided loading two authentication filters for RM and TS web interfaces. Contributed by Varun Vasudev.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1618054 13f79535-47bb-0310-9956-ffa450edef68
2014-08-14 21:17:20 +00:00
Karthik Kambatla
5197f8c3c5
YARN-1370. Fair scheduler to re-populate container allocation state. (Anubhav Dhoot via kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1617645 13f79535-47bb-0310-9956-ffa450edef68
2014-08-13 01:38:59 +00:00
Karthik Kambatla
4239695588
YARN-2399. Delete old versions of files. FairScheduler: Merge AppSchedulable and FSSchedulerApp into FSAppAttempt. (kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1617619 13f79535-47bb-0310-9956-ffa450edef68
2014-08-12 22:51:57 +00:00
Karthik Kambatla
486e718fc1
YARN-2399. FairScheduler: Merge AppSchedulable and FSSchedulerApp into FSAppAttempt. (kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1617600 13f79535-47bb-0310-9956-ffa450edef68
2014-08-12 21:43:27 +00:00
Junping Du
c2febdcbaa
YARN-1337. Recover containers upon nodemanager restart. (Contributed by Jason Lowe)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1617448 13f79535-47bb-0310-9956-ffa450edef68
2014-08-12 10:56:13 +00:00
Jian He
c4dc685343
YARN-2138. Cleaned up notifyDone* APIs in RMStateStore. Contributed by Varun Saxena
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1617341 13f79535-47bb-0310-9956-ffa450edef68
2014-08-11 18:24:24 +00:00
Xuan Gong
946be75704
YARN-2400: Addendum fix for TestAMRestart failure. Contributed by Jian He
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1617333 13f79535-47bb-0310-9956-ffa450edef68
2014-08-11 17:42:53 +00:00
Xuan Gong
743f7f30da
YARN-2400. Fixed TestAMRestart fails intermittently. Contributed by Jian He.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1617028 13f79535-47bb-0310-9956-ffa450edef68
2014-08-09 23:31:11 +00:00
Karthik Kambatla
a7643f4de7
YARN-2026. Fair scheduler: Consider only active queues for computing fairshare. (Ashwin Shankar via kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1616915 13f79535-47bb-0310-9956-ffa450edef68
2014-08-09 02:10:00 +00:00
Xuan Gong
eeb4acd955
YARN-2212: ApplicationMaster needs to find a way to update the AMRMToken periodically. Contributed by Xuan Gong
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1616892 13f79535-47bb-0310-9956-ffa450edef68
2014-08-08 21:38:24 +00:00
Karthik Kambatla
14864e9c7c
YARN-2352. FairScheduler: Collect metrics on duration of critical methods that affect performance. (kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1616769 13f79535-47bb-0310-9956-ffa450edef68
2014-08-08 14:17:54 +00:00
Jian He
8437df8ba9
YARN-2008. Fixed CapacityScheduler to calculate headroom based on max available capacity instead of configured max capacity. Contributed by Craig Welch
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1616580 13f79535-47bb-0310-9956-ffa450edef68
2014-08-07 20:00:04 +00:00
Karthik Kambatla
8feddc4c84
YARN-2359. Application hangs when it fails to launch AM container. (Zhihai Xu via kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1616375 13f79535-47bb-0310-9956-ffa450edef68
2014-08-07 00:06:17 +00:00
Junping Du
b8f151231b
YARN-1354. Recover applications upon nodemanager restart. (Contributed by Jason Lowe)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1615550 13f79535-47bb-0310-9956-ffa450edef68
2014-08-04 13:25:37 +00:00
Xuan Gong
e52f67e389
YARN-1994. Expose YARN/MR endpoints on multiple interfaces. Contributed by Craig Welch, Milan Potocnik, and Arpit Agarwal
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1614981 13f79535-47bb-0310-9956-ffa450edef68
2014-07-31 20:06:02 +00:00
Zhijie Shen
1d6e178144
YARN-2347. Consolidated RMStateVersion and NMDBSchemaVersion into Version in yarn-server-common. Contributed by Junping Du.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1614838 13f79535-47bb-0310-9956-ffa450edef68
2014-07-31 09:27:43 +00:00
Karthik Kambatla
c0b49ff107
YARN-2328. FairScheduler: Verify update and continuous scheduling threads are stopped when the scheduler is stopped. (kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1614432 13f79535-47bb-0310-9956-ffa450edef68
2014-07-29 17:41:52 +00:00
Zhijie Shen
d6532d3a77
YARN-2247. Made RM web services authenticate users via kerberos and delegation token. Contributed by Varun Vasudev.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1613821 13f79535-47bb-0310-9956-ffa450edef68
2014-07-27 17:55:06 +00:00
Jian He
d4fec34933
YARN-2211. Persist AMRMToken master key in RMStateStore for RM recovery. Contributed by Xuan Gong
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1613515 13f79535-47bb-0310-9956-ffa450edef68
2014-07-25 20:42:37 +00:00
Karthik Kambatla
1e553858f9
YARN-2214. FairScheduler: preemptContainerPreCheck() in FSParentQueue delays convergence towards fairness. (Ashwin Shankar via kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1613459 13f79535-47bb-0310-9956-ffa450edef68
2014-07-25 16:13:07 +00:00
Jason Darrell Lowe
28fca92521
YARN-2147. client lacks delegation token exception details when application submit fails. Contributed by Chen He
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1612950 13f79535-47bb-0310-9956-ffa450edef68
2014-07-23 21:40:57 +00:00
Sanford Ryza
c88402f36d
YARN-2313. Livelock can occur in FairScheduler when there are lots of running apps (Tsuyoshi Ozawa via Sandy Ryza)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1612769 13f79535-47bb-0310-9956-ffa450edef68
2014-07-23 05:00:52 +00:00
Karthik Kambatla
ff77582991
YARN-2273. NPE in ContinuousScheduling thread when we lose a node. (Wei Yan via kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1612720 13f79535-47bb-0310-9956-ffa450edef68
2014-07-22 22:44:38 +00:00
Zhijie Shen
eac0701c96
YARN-2319. Made the MiniKdc instance start/close before/after the class of TestRMWebServicesDelegationTokens. Contributed by Wenwu Peng.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1612588 13f79535-47bb-0310-9956-ffa450edef68
2014-07-22 15:15:29 +00:00
Junping Du
afb9394c91
YARN-2242. Addendum patch. Improve exception information on AM launch crashes. (Contributed by Li Lu)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1612565 13f79535-47bb-0310-9956-ffa450edef68
2014-07-22 13:07:23 +00:00
Karthik Kambatla
8871d8ed9f
YARN-2244. FairScheduler missing handling of containers for unknown application attempts. (Anubhav Dhoot via kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1611840 13f79535-47bb-0310-9956-ffa450edef68
2014-07-19 00:12:05 +00:00
Xuan Gong
f1b831ccfb
YARN-2208. AMRMTokenManager need to have a way to roll over AMRMToken. Contributed by Xuan Gong
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1611820 13f79535-47bb-0310-9956-ffa450edef68
2014-07-18 21:46:29 +00:00
Jian He
3c193811ca
YARN-2219. Addendum patch for YARN-2219
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1611240 13f79535-47bb-0310-9956-ffa450edef68
2014-07-17 03:28:39 +00:00
Vinod Kumar Vavilapalli
bda23181bf
YARN-2219. Changed ResourceManager to avoid AMs and NMs getting exceptions after RM recovery but before scheduler learns about apps and app-attempts. Contributed by Jian He.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1611222 13f79535-47bb-0310-9956-ffa450edef68
2014-07-17 00:14:56 +00:00
Vinod Kumar Vavilapalli
030580387a
YARN-2233. Implemented ResourceManager web-services to create, renew and cancel delegation tokens. Contributed by Varun Vasudev.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1610876 13f79535-47bb-0310-9956-ffa450edef68
2014-07-15 23:00:17 +00:00
Mayank Bansal
43589a8df7
YARN-1408 Preemption caused Invalid State Event: ACQUIRED at KILLED and caused a task timeout for 30 mins. (Sunil G via mayank)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1610860 13f79535-47bb-0310-9956-ffa450edef68
2014-07-15 21:48:58 +00:00
Vinod Kumar Vavilapalli
c6cc6a6a8e
YARN-2260. Fixed ResourceManager's RMNode to correctly remember containers when nodes resync during work-preserving RM restart. Contributed by Jian He.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1610557 13f79535-47bb-0310-9956-ffa450edef68
2014-07-14 23:32:03 +00:00
Jian He
c9fb040c87
YARN-2181. Added preemption info to logs and RM web UI. Contributed by Wangda Tan
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1609561 13f79535-47bb-0310-9956-ffa450edef68
2014-07-10 20:03:35 +00:00
Karthik Kambatla
8fbca62a90
YARN-2131. Add a way to format the RMStateStore. (Robert Kanter via kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1609278 13f79535-47bb-0310-9956-ffa450edef68
2014-07-09 19:58:43 +00:00
Zhijie Shen
12c4197b35
YARN-2158. Improved assertion messages of TestRMWebServicesAppsModification. Contributed by Varun Vasudev.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1608667 13f79535-47bb-0310-9956-ffa450edef68
2014-07-08 05:50:04 +00:00
Sanford Ryza
5644f529f3
YARN-2250. FairScheduler.findLowestCommonAncestorQueue returns null when queues not identical (Krisztian Horvath via Sandy Ryza)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1607872 13f79535-47bb-0310-9956-ffa450edef68
2014-07-04 15:16:43 +00:00
Junping Du
5cb489f9d3
YARN-2242. Improve exception information on AM launch crashes. (Contributed by Li Lu)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1607655 13f79535-47bb-0310-9956-ffa450edef68
2014-07-03 14:15:19 +00:00
Vinod Kumar Vavilapalli
45b191e38c
YARN-2232. Fixed ResourceManager to allow DelegationToken owners to be able to cancel their own tokens in secure mode. Contributed by Varun Vasudev.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1607484 13f79535-47bb-0310-9956-ffa450edef68
2014-07-02 21:36:42 +00:00
Mayank Bansal
03a25d2cc1
YARN-2022 Preempting an Application Master container can be kept as least priority when multiple applications are marked for preemption by ProportionalCapacityPreemptionPolicy (Sunil G via mayank)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1607227 13f79535-47bb-0310-9956-ffa450edef68
2014-07-02 01:54:47 +00:00
Vinod Kumar Vavilapalli
075ff276ca
YARN-1713. Added get-new-app and submit-app functionality to RM web services. Contributed by Varun Vasudev.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1607216 13f79535-47bb-0310-9956-ffa450edef68
2014-07-02 00:23:07 +00:00
Xuan Gong
e5ae7c55d1
TestRMApplicationHistoryWriter sometimes fails in trunk. Contributed by Zhijie Shen
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1606835 13f79535-47bb-0310-9956-ffa450edef68
2014-06-30 16:51:22 +00:00
Jian He
b0c51504c4
YARN-2052. Embedded an epoch number in container id to ensure the uniqueness of container id after RM restarts. Contributed by Tsuyoshi OZAWA
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1606557 13f79535-47bb-0310-9956-ffa450edef68
2014-06-29 18:24:03 +00:00
Jian He
b717d44b52
YARN-614. Changed ResourceManager to not count disk failure, node loss and RM restart towards app failures. Contributed by Xuan Gong
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1606407 13f79535-47bb-0310-9956-ffa450edef68
2014-06-28 23:37:46 +00:00
Zhijie Shen
55a0aa0bad
YARN-2201. Made TestRMWebServicesAppsModification be independent of the changes on yarn-default.xml. Contributed by Varun Vasudev.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1606285 13f79535-47bb-0310-9956-ffa450edef68
2014-06-28 03:30:44 +00:00
Karthik Kambatla
f911f5495b
YARN-2204. Addendum patch. TestAMRestart#testAMRestartWithExistingContainers assumes CapacityScheduler. (Robert Kanter via kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1606168 13f79535-47bb-0310-9956-ffa450edef68
2014-06-27 18:09:41 +00:00
Vinod Kumar Vavilapalli
9571db19eb
YARN-2171. Improved CapacityScheduling to not lock on nodemanager-count when AMs heartbeat in. Contributed by Jason Lowe.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1605616 13f79535-47bb-0310-9956-ffa450edef68
2014-06-25 21:56:42 +00:00
Karthik Kambatla
1a3a7e0c1a
YARN-2204. TestAMRestart#testAMRestartWithExistingContainers assumes CapacityScheduler. (Robert Kanter via kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1605548 13f79535-47bb-0310-9956-ffa450edef68
2014-06-25 18:50:53 +00:00
Jian He
c3f1c30e65
YARN-1365. Changed ApplicationMasterService to allow an app to re-register after RM restart. Contributed by Anubhav Dhoot
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1605263 13f79535-47bb-0310-9956-ffa450edef68
2014-06-25 04:42:39 +00:00
Vinod Kumar Vavilapalli
e285b98f0f
YARN-2152. Added missing information into ContainerTokenIdentifier so that NodeManagers can report the same to RM when RM restarts. Contributed by Jian He.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1605205 13f79535-47bb-0310-9956-ffa450edef68
2014-06-24 21:43:22 +00:00
Thomas Graves
1f9a0fd927
YARN-2072. RM/NM UIs and webservices are missing vcore information. (Nathan Roberts via tgraves)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1605162 13f79535-47bb-0310-9956-ffa450edef68
2014-06-24 19:34:34 +00:00
Karthik Kambatla
c0991d11eb
YARN-2109. Fix TestRM to work with both schedulers. (Anubhav Dhoot via kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1605142 13f79535-47bb-0310-9956-ffa450edef68
2014-06-24 17:30:53 +00:00
Karthik Kambatla
db4d277117
YARN-2192. TestRMHA fails when run with a mix of Schedulers. (Anubhav Dhoot via kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1605138 13f79535-47bb-0310-9956-ffa450edef68
2014-06-24 17:05:35 +00:00
Sanford Ryza
29c102cad0
YARN-2111. In FairScheduler.attemptScheduling, we don't count containers as assigned if they have 0 memory but non-zero cores (Sandy Ryza)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1605113 13f79535-47bb-0310-9956-ffa450edef68
2014-06-24 15:40:39 +00:00
Vinod Kumar Vavilapalli
d16470025a
YARN-2074. Changed ResourceManager to not count AM preemptions towards app failures. Contributed by Jian He.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1605106 13f79535-47bb-0310-9956-ffa450edef68
2014-06-24 15:15:12 +00:00
Jian He
59b5e9fa15
YARN-2191. Added a new test to ensure NM will clean up completed applications in the case of RM restart. Contributed by Wangda Tan
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1604949 13f79535-47bb-0310-9956-ffa450edef68
2014-06-23 22:52:38 +00:00
Karthik Kambatla
6fcbf9b848
YARN-2187. FairScheduler: Disable max-AM-share check by default. (Robert Kanter via kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1604321 13f79535-47bb-0310-9956-ffa450edef68
2014-06-21 07:30:07 +00:00
Jian He
95897ca14b
YARN-1885. Fixed a bug that RM may not send application-clean-up signal to NMs where the completed applications previously ran in case of RM restart. Contributed by Wangda Tan
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1603028 13f79535-47bb-0310-9956-ffa450edef68
2014-06-16 23:56:12 +00:00
Vinod Kumar Vavilapalli
dc7dd1fa19
YARN-1702. Added kill app functionality to RM web services. Contributed by Varun Vasudev.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1602298 13f79535-47bb-0310-9956-ffa450edef68
2014-06-12 21:31:52 +00:00
Karthik Kambatla
4bc91b44c9
YARN-2155. FairScheduler: Incorrect threshold check for preemption. (Wei Yan via kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1602295 13f79535-47bb-0310-9956-ffa450edef68
2014-06-12 21:23:32 +00:00
Jian He
710a8693e5
YARN-2124. Fixed NPE in ProportionalCapacityPreemptionPolicy. Contributed by Wangda Tan
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1601964 13f79535-47bb-0310-9956-ffa450edef68
2014-06-11 17:30:18 +00:00
Karthik Kambatla
5de6f72054
YARN-1424. RMAppAttemptImpl should return the DummyApplicationResourceUsageReport for all invalid accesses. (Ray Chiang via kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1601742 13f79535-47bb-0310-9956-ffa450edef68
2014-06-10 19:03:06 +00:00
Jian He
c94f2cec3a
Augmented RMStateStore with state machine. Contributed by Binglin Chang.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1601491 13f79535-47bb-0310-9956-ffa450edef68
2014-06-09 19:44:31 +00:00
Vinod Kumar Vavilapalli
424fd9494f
YARN-1368. Added core functionality of recovering container state into schedulers after ResourceManager Restart so as to preserve running work in the cluster. Contributed by Jian He.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1601303 13f79535-47bb-0310-9956-ffa450edef68
2014-06-09 03:09:21 +00:00
Karthik Kambatla
85d4c787e0
YARN-2128. FairScheduler: Incorrect calculation of amResource usage. (Wei Yan via kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1601050 13f79535-47bb-0310-9956-ffa450edef68
2014-06-07 01:21:33 +00:00
Junping Du
0ceb742549
YARN-1977. Add tests on getApplicationRequest with filtering start time range. (Contributed by Junping Du)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1600644 13f79535-47bb-0310-9956-ffa450edef68
2014-06-05 13:15:44 +00:00
Sanford Ryza
16caa3fd18
YARN-1913. With Fair Scheduler, cluster can logjam when all resources are consumed by AMs (Wei Yan via Sandy Ryza)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1599400 13f79535-47bb-0310-9956-ffa450edef68
2014-06-03 00:56:48 +00:00
Karthik Kambatla
0aad2d56df
YARN-1550. NPE in FairSchedulerAppsBlock#render. (Anubhav Dhoot via kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1599345 13f79535-47bb-0310-9956-ffa450edef68
2014-06-02 20:22:52 +00:00
Karthik Kambatla
a4ba451802
YARN-1474. Make schedulers services. (Tsuyoshi Ozawa via kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1598908 13f79535-47bb-0310-9956-ffa450edef68
2014-05-31 19:33:09 +00:00
Vinod Kumar Vavilapalli
23c325ad47
YARN-2115. Replaced RegisterNodeManagerRequest's ContainerStatus with a new NMContainerStatus which has more information that is needed for work-preserving RM-restart. Contributed by Jian He.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1598790 13f79535-47bb-0310-9956-ffa450edef68
2014-05-31 00:20:50 +00:00
Karthik Kambatla
49a3a0cd0c
YARN-2054. Better defaults for YARN ZK configs for retries and retry-interval when HA is enabled. (kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1598630 13f79535-47bb-0310-9956-ffa450edef68
2014-05-30 15:24:49 +00:00
Arpit Agarwal
4a4868e523
HADOOP-10448. Support pluggable mechanism to specify proxy user settings (Contributed by Benoy Antony)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1598396 13f79535-47bb-0310-9956-ffa450edef68
2014-05-29 20:52:01 +00:00
Sanford Ryza
342da5b4d3
YARN-596. Use scheduling policies throughout the queue hierarchy to decide which containers to preempt (Wei Yan via Sandy Ryza)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1598197 13f79535-47bb-0310-9956-ffa450edef68
2014-05-29 04:01:24 +00:00
Sanford Ryza
edfbc8ad4a
YARN-2105. Fix TestFairScheduler after YARN-2012. (Ashwin Shankar via Sandy Ryza)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1597902 13f79535-47bb-0310-9956-ffa450edef68
2014-05-27 23:46:22 +00:00
Karthik Kambatla
7dd378c274
YARN-2096. Race in TestRMRestart#testQueueMetricsOnRMRestart. (Anubhav Dhoot via kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1597223 13f79535-47bb-0310-9956-ffa450edef68
2014-05-23 23:51:00 +00:00
Sanford Ryza
a00b2d4f37
YARN-2073. Fair Scheduler: Add a utilization threshold to prevent preempting resources when cluster is free (Karthik Kambatla via Sandy Ryza)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1597209 13f79535-47bb-0310-9956-ffa450edef68
2014-05-23 22:52:46 +00:00
Sanford Ryza
6c56612af5
YARN-2012. Fair Scheduler: allow default queue placement rule to take an arbitrary queue (Ashwin Shankar via Sandy Ryza)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1597204 13f79535-47bb-0310-9956-ffa450edef68
2014-05-23 22:38:52 +00:00