Wangda Tan
620325e816
YARN-4837. User facing aspects of 'AM blacklisting' feature need fixing. (vinodkv via wangda)
2016-06-07 15:06:42 -07:00
Arun Suresh
3a154f75ed
YARN-4525. Fix bug in RLESparseResourceAllocation.getRangeOverlapping(). (Ishai Menache and Carlo Curino via asuresh)
2016-06-06 21:18:32 -07:00
Arun Suresh
7a9b7372a1
YARN-5185. StageAllocaterGreedyRLE: Fix NPE in corner case. (Carlo Curino via asuresh)
2016-06-06 21:06:52 -07:00
Ming Ma
4a1cedc010
MAPREDUCE-5044. Have AM trigger jstack on task attempts that timeout before killing them. (Eric Payne and Gera Shegalov via mingma)
2016-06-06 14:30:51 -07:00
Arun Suresh
db54670e83
YARN-5165. Fix NoOvercommitPolicy to take advantage of RLE representation of plan. (Carlo Curino via asuresh)
2016-06-03 14:49:32 -07:00
Vinod Kumar Vavilapalli
f10ebc67f5
YARN-5098. Fixed ResourceManager's DelegationTokenRenewer to replace expiring system-tokens if RM stops and only restarts after a long time. Contributed by Jian He.
2016-06-03 13:00:07 -07:00
Jian He
097baaaeba
YARN-1815. Work preserving recovery of Unmanaged AMs. Contributed by Subru Krishnan
2016-06-03 10:49:30 -07:00
Varun Vasudev
42f90ab885
YARN-4844. Add getMemorySize/getVirtualCoresSize to o.a.h.y.api.records.Resource. Contributed by Wangda Tan.
2016-05-29 21:24:16 +05:30
Karthik Kambatla
04ded558b0
YARN-5035. FairScheduler: Adjust maxAssign dynamically when assignMultiple is turned on. (kasha)
2016-05-26 14:41:07 -07:00
Karthik Kambatla
4f513a4a8e
YARN-4866. FairScheduler: AMs can consume all vcores leading to a livelock when using FAIR policy. (Yufei Gu via kasha)
2016-05-25 22:13:27 -07:00
Carlo Curino
013532a95e
YARN-4957. Add getNewReservation in ApplicationClientProtocol (Sean Po via curino)
2016-05-25 16:55:49 -07:00
Karthik Kambatla
f979d779e1
YARN-4878. Expose scheduling policy and max running apps over JMX for Yarn queues. (Yufei Gu via kasha)
2016-05-24 10:54:11 -07:00
Naganarasimha
b4078bd17b
YARN-3971. Skip RMNodeLabelsManager#checkRemoveFromClusterNodeLabelsOfQueue on nodelabel recovery. (addendum patch). Contributed by Bibin A chundatt
2016-05-24 08:06:53 +08:00
Karthik Kambatla
6d043aa4cf
YARN-4979. FSAppAttempt demand calculation considers demands at multiple locality levels different. (Zhihai Xu via kasha)
2016-05-23 14:29:28 -07:00
Jason Lowe
ac954486c5
YARN-5055. max apps per user can be larger than max per queue. Contributed by Eric Badger
2016-05-23 15:54:42 +00:00
Junping Du
22fcd819f0
YARN-5076. YARN web interfaces lack XFS protection. Contributed by Jonathan Maron.
(cherry picked from commit 2703ec68712279494d67b0d76b7ac10e7a1628be)
2016-05-19 14:15:21 -07:00
Jian He
feb90ffcca
YARN-4002. Make ResourceTrackerService#nodeHeartbeat more concurrent. Contributed by Rohith Sharma K S & Zhiguo Hong
2016-05-19 13:01:36 -07:00
Arun Suresh
1597630681
YARN-5110. Fix OpportunisticContainerAllocator to insert complete HostAddress in issued ContainerTokenIds. (Konstantinos Karanasos via asuresh)
2016-05-18 18:46:00 -07:00
Jian He
fa3bc3405d
YARN-4832. NM side resource value should get updated if change applied in RM side. Contributed by Junping Du
2016-05-17 12:52:19 -07:00
Arun Suresh
ccc93e7812
YARN-5075. Fix findbugs warnings in hadoop-yarn-common module. (asuresh)
2016-05-16 23:22:01 -07:00
Arun Suresh
f45bc5a83e
YARN-4738. Notify the RM about the status of OPPORTUNISTIC containers (Konstantinos Karanasos via asuresh)
2016-05-15 17:54:34 -07:00
Arun Suresh
f0ac18d001
YARN-2888. Corrective mechanisms for rebalancing NM container queues. (asuresh)
2016-05-13 13:38:36 -07:00
Jason Lowe
013000fbc2
YARN-5053. More informative diagnostics when applications killed by a user. Contributed by Eric Badger
2016-05-12 20:28:36 +00:00
Rohith Sharma K S
b7ac85259c
YARN-5068. Expose scheduler queue to application master. (Harish Jaiprakash via rohithsharmaks)
2016-05-12 15:17:49 +05:30
Karthik Kambatla
4b4e4c6ba8
YARN-4995. FairScheduler: Display per-queue demand on the scheduler page. (xupeng via kasha)
2016-05-11 17:36:21 -07:00
Junping Du
39f2bac38b
YARN-5029. RM needs to send update event with YarnApplicationState as Running to ATS/AHS. Contributed by Xuan Gong.
2016-05-11 09:28:35 -07:00
Naganarasimha
2750fb900f
YARN-4926. Change nodelabel rest API invalid response status to 400. Contributed by Bibin A Chundatt
2016-05-08 22:49:25 +05:30
Yongjun Zhang
47c41e7ac7
YARN-5048. DelegationTokenRenewer#skipTokenRenewal may throw NPE (Jian He via Yongjun Zhang)
2016-05-06 21:50:09 -07:00
Jason Lowe
b2ed6ae731
YARN-4747. AHS error 500 due to NPE when container start event is missing. Contributed by Varun Saxena
2016-05-06 22:59:39 +00:00
Wangda Tan
23248f63aa
getApplicationReport call may raise NPE for removed queues. (Jian He via wangda)
2016-05-06 15:30:45 -07:00
Jian He
bb62e05925
YARN-4390. Do surgical preemption based on reserved container in CapacityScheduler. Contributed by Wangda Tan
2016-05-05 12:56:21 -07:00
Jason Lowe
d0da13229c
YARN-4311. Removing nodes from include and exclude lists will not remove them from decommissioned nodes list. Contributed by Kuhu Shukla
2016-05-05 14:07:54 +00:00
Jason Lowe
ed54f5f1ff
YARN-5003. Add container resource to RM audit log. Contributed by Nathan Roberts
2016-05-03 20:03:41 +00:00
Jian He
dd80042c42
YARN-5008. LeveldbRMStateStore database can grow substantially leading to long recovery times. Contributed by Jason Lowe
2016-04-28 21:27:25 -07:00
Jian He
4beff01354
YARN-4983. JVM and UGI metrics disappear after RM transitioned to standby mode
2016-04-26 21:00:17 -07:00
Arun Suresh
341888a0aa
YARN-4412. Create ClusterMonitor to compute ordered list of preferred NMs for OPPORTUNISTIC containers. (asuresh)
2016-04-26 20:12:12 -07:00
Karthik Kambatla
4b1dcbbe0c
YARN-1297. FairScheduler: Move some logs to debug and check if debug logging is enabled
2016-04-26 05:10:09 -07:00
Arun Suresh
c282a08f38
YARN-2885. Create AMRMProxy request interceptor and ContainerAllocator to distribute OPPORTUNISTIC containers to appropriate Nodes (asuresh)
(cherry picked from commit 2bf025278a318b0452fdc9ece4427b4c42124e39)
2016-04-24 22:38:33 -07:00
Wangda Tan
7cb3a3da96
YARN-4846. Fix random failures for TestCapacitySchedulerPreemption#testPreemptionPolicyShouldRespectAlreadyMarkedKillableContainers. (Bibin A Chundatt via wangda)
2016-04-22 11:40:32 -07:00
Karthik Kambatla
170c4fd4cd
YARN-4784. FairScheduler: defaultQueueSchedulingPolicy should not accept FIFO. (Yufei Gu via kasha)
2016-04-20 23:58:12 -07:00
Wangda Tan
fdc46bfb37
YARN-4934. Reserved Resource for QueueMetrics needs to be handled correctly in few cases. (Sunil G via wangda)
2016-04-16 22:47:41 -07:00
Jason Lowe
69f3d428d5
YARN-4940. yarn node -list -all failed if RM start with decommissioned node. Contributed by sandflee
2016-04-15 20:36:45 +00:00
Jason Lowe
2a5da97f81
Revert "YARN-4311. Removing nodes from include and exclude lists will not remove them from decommissioned nodes list. Contributed by Kuhu Shukla"
This reverts commit 1cbcd4a491.
2016-04-11 15:51:01 +00:00
Akira Ajisaka
1ff27f9d12
YARN-4630. Remove useless boxing/unboxing code. Contributed by Kousuke Saruta.
2016-04-11 14:55:03 +09:00
Karthik Kambatla
ff95fd547b
YARN-4927. TestRMHA#testTransitionedToActiveRefreshFail fails with FairScheduler. (Bibin A Chundatt via kasha)
2016-04-09 10:31:02 -07:00
Wangda Tan
ec06957941
YARN-3215. Respect labels in CapacityScheduler when computing headroom. (Naganarasimha G R via wangda)
2016-04-08 15:33:04 -07:00
Jian He
9cb0c963d2
YARN-4740. AM may not receive the container complete msg when it restarts. Contributed by Jun Gong
2016-04-08 11:20:35 -07:00
Jian He
93bacda08b
YARN-4769. Add support for CSRF header in the dump capacity scheduler logs and kill app buttons in RM web UI. Contributed by Varun Vasudev
2016-04-06 16:13:47 -07:00
Wangda Tan
21eb428448
YARN-4699. Scheduler UI and REST o/p is not in sync when -replaceLabelsOnNode is used to change label of a node. (Sunil G via wangda)
2016-04-05 16:24:11 -07:00
Jason Lowe
1cbcd4a491
YARN-4311. Removing nodes from include and exclude lists will not remove them from decommissioned nodes list. Contributed by Kuhu Shukla
2016-04-05 13:40:19 +00:00
Rohith Sharma K S
776b549e2a
YARN-4609. RM Nodes list page takes too much time to load. Contributed by Bibin A Chundatt
2016-04-05 14:47:25 +05:30
naganarasimha
5092c94195
YARN-4746. yarn web services should convert parse failures of appId, appAttemptId and containerId to 400. Contributed by Bibin A Chundatt
2016-04-04 16:25:03 +05:30
Rohith Sharma K S
1e6f92977d
YARN-4607. Pagination support for AppAttempt page TotalOutstandingResource Requests table. Contributed by Bibin A Chundatt
2016-04-04 08:09:29 +05:30
Wangda Tan
12b11e2e68
YARN-4634. Scheduler UI/Metrics need to consider cases like non-queue label mappings. (Sunil G via wangda)
2016-03-31 14:35:18 -07:00
Jian He
60e4116bf1
YARN-4822. Refactor existing Preemption Policy of CS for easier adding new approach to select preemption candidates. Contributed by Wangda Tan
2016-03-30 12:43:52 -07:00
Wangda Tan
fc055a3cbe
YARN-4865. Track Reserved resources in ResourceUsage and QueueCapacities. (Sunil G via wangda)
2016-03-29 17:07:55 -07:00
Jian He
524bc3c33a
YARN-998. Keep NM resource updated through dynamic resource config for RM/NM restart. Contributed by Junping Du
2016-03-28 11:12:33 -07:00
Arun Suresh
00bebb7e58
YARN-4823. Refactor the nested reservation id field in listReservation to simple string field. (subru via asuresh)
2016-03-25 15:54:38 -07:00
Arun Suresh
d82e797b65
YARN-4825. Remove redundant code in ClientRMService::listReservations. (subru via asuresh)
2016-03-24 09:59:55 -07:00
Junping Du
19b645c938
YARN-4820. ResourceManager web redirects in HA mode drops query parameters. Contributed by Varun Vasudev.
2016-03-23 19:34:30 -07:00
Junping Du
ca8106d2dd
YARN-4785. inconsistent value type of the type field for LeafQueueInfo in response of RM REST API.
2016-03-17 09:04:41 -07:00
Wangda Tan
ae14e5d07f
YARN-4108. CapacityScheduler: Improve preemption to only kill containers that would satisfy the incoming request. (Wangda Tan)
(cherry picked from commit 7e8c9beb41)
2016-03-16 17:02:33 -07:00
Wangda Tan
fa7a43529d
Revert "CapacityScheduler: Improve preemption to only kill containers that would satisfy the incoming request. (Wangda Tan)"
This reverts commit 7e8c9beb41.
2016-03-16 17:02:10 -07:00
Wangda Tan
7e8c9beb41
CapacityScheduler: Improve preemption to only kill containers that would satisfy the incoming request. (Wangda Tan)
2016-03-16 16:59:59 -07:00
Karthik Kambatla
3ef5500783
YARN-4560. Make scheduler error checking message more user friendly. (Ray Chiang via kasha)
2016-03-15 23:45:01 -07:00
Karthik Kambatla
20d389ce61
YARN-4719. Add a helper library to maintain node state and allows common queries. (kasha)
2016-03-14 14:19:05 -07:00
Wangda Tan
0233d4e0ee
YARN-4465. SchedulerUtils#validateRequest for Label check should happen only when nodelabel enabled. (Bibin A Chundatt via wangda)
2016-03-08 14:27:03 -08:00
Jian He
3c33158d1c
YARN-4764. Application submission fails when submitted queue is not available in scheduler xml. Contributed by Bibin A Chundatt
2016-03-08 13:07:57 -08:00
Varun Vasudev
e51a8c1056
YARN-4737. Add CSRF filter support in YARN. Contributed by Jonathan Maron.
2016-03-07 15:26:44 +05:30
Zhihai Xu
e1ccc9622b
YARN-4761. NMs reconnecting with changed capabilities can lead to wrong cluster resource calculations on fair scheduler. Contributed by Sangjin Lee
2016-03-06 19:46:09 -08:00
Rohith Sharma K S
19ee185907
YARN-4763. RMApps Page crashes with NPE. (Bibin A Chundatt via rohithsharmaks)
2016-03-05 13:02:57 +05:30
Jian He
5c465df904
YARN-4671. There is no need to acquire CS lock when completing a container. Contributed by Meng Ding
2016-03-01 13:14:12 -08:00
Karthik Kambatla
f9692770a5
YARN-4718. Rename variables in SchedulerNode to reduce ambiguity post YARN-1011. (Inigo Goiri via kasha)
2016-02-28 09:35:59 -08:00
Jason Lowe
6b0f813e89
YARN-4723. NodesListManager$UnknownNodeId ClassCastException. Contributed by Kuhu Shukla
2016-02-26 20:24:50 +00:00
Karthik Kambatla
c684f2b007
YARN-4729. SchedulerApplicationAttempt#getTotalRequiredResources can throw an NPE. (kasha)
2016-02-24 18:33:57 -08:00
Sangjin Lee
553b591ba0
YARN-4722. AsyncDispatcher logs redundant event queue sizes (Jason Lowe via sjlee)
2016-02-24 09:29:41 -08:00
Junping Du
9ed17f181d
YARN-3223. Resource update during NM graceful decommission. Contributed by Brook Zhou.
2016-02-23 03:30:26 -08:00
Junping Du
3fab88540f
YARN-4386. refreshNodesGracefully() should send recommission event to active RMNodes only. Contributed by Kuhu Shukla.
2016-02-22 07:04:19 -08:00
Sangjin Lee
7de70680fe
YARN-4690. Skip object allocation in FSAppAttempt#getResourceUsage when possible (Ming Ma via sjlee)
2016-02-17 20:55:21 -08:00
Karthik Kambatla
2ab4c476ed
YARN-4689. FairScheduler: Cleanup preemptContainer to be more readable. (Kai Sasaki via kasha)
2016-02-17 18:16:15 -08:00
Arun Suresh
23f937e3b7
YARN-2575. Create separate ACLs for Reservation create/update/delete/list ops (Sean Po via asuresh)
2016-02-11 10:47:43 -08:00
Jian He
d16b17b4d2
YARN-4138. Roll back container resource allocation after resource increase token expires. Contributed by Meng Ding
2016-02-11 10:06:27 +08:00
=
b706cbc1bc
YARN-4420. Add REST API for List Reservations (Sean Po via curino)
2016-02-10 10:19:26 -08:00
Arun Suresh
5cf5c41a89
YARN-4360. Improve GreedyReservationAgent to support "early" allocations, and performance improvements (curino via asuresh)
2016-02-10 09:11:15 -08:00
Devaraj K
565af873d5
YARN-4667. RM Admin CLI for refreshNodesResources throws NPE when nothing is configured. Contributed by Naganarasimha G R.
2016-02-08 15:01:54 +05:30
Varun Vasudev
308d63f382
YARN-4307. Display blacklisted nodes for AM container in the RM web UI. Contributed by Naganarasimha G R.
2016-02-04 13:32:54 +05:30
Varun Vasudev
1adb64e09b
YARN-4625. Make ApplicationSubmissionContext and ApplicationSubmissionContextInfo more consistent. Contributed by Xuan Gong.
2016-02-03 16:26:28 +05:30
Wangda Tan
9875325d5c
YARN-4340. Add list API to reservation system. (Sean Po via wangda)
2016-02-02 10:17:33 +08:00
Jason Lowe
ed55950164
YARN-3102. Decommissioned Nodes not listed in Web UI. Contributed by Kuhu Shukla
2016-02-01 23:15:26 +00:00
Jason Lowe
772ea7b41b
YARN-4428. Redirect RM page to AHS page when AHS turned on and RM page is not available. Contributed by Chang Li
2016-01-29 21:48:54 +00:00
Jian He
f4a57d4a53
YARN-4617. LeafQueue#pendingOrderingPolicy should always use fixed ordering policy instead of using same as active applications ordering policy. Contributed by Rohith Sharma K S
2016-01-29 12:22:23 -08:00
Devaraj K
a277bdc9ed
YARN-4411. RMAppAttemptImpl#createApplicationAttemptReport throws IllegalArgumentException. Contributed by Bibin A Chundatt and yarntime.
2016-01-29 13:51:37 +05:30
Jian He
7f46636495
YARN-4519. Potential deadlock of CapacityScheduler between decrease container and assign containers. Contributed by Meng Ding
2016-01-28 14:51:00 -08:00
Karthik Kambatla
fb238d7e5d
YARN-4462. FairScheduler: Disallow preemption from a queue. (Tao Jie via kasha)
2016-01-27 12:29:06 -08:00
rohithsharmaks
d6258b33a7
YARN-4497. RM might fail to restart when recovering apps whose attempts are missing. (Jun Gong via rohithsharmaks)
2016-01-22 20:27:38 +05:30
Akira Ajisaka
8f58f742ae
YARN-4605. Spelling mistake in the help message of "yarn applicationattempt" command. Contributed by Weiwei Yang.
2016-01-22 19:43:06 +09:00
Rohith Sharma K S
e30668106d
YARN-4584. RM startup failure when AM attempts greater than max-attempts. (Bibin A Chundatt via rohithsharmaks)
2016-01-22 10:14:46 +05:30
Jason Lowe
468a53b22f
YARN-4610. Reservations continue looking for one app causes other apps to starve. Contributed by Jason Lowe
2016-01-21 18:31:29 +00:00
Karthik Kambatla
4992398aee
YARN-4603. FairScheduler should mention user requested queuename in error message when failed in queue ACL check. (Tao Jie via kasha)
2016-01-21 17:40:59 +01:00
Wangda Tan
5ff5f67332
YARN-4557. Fix improper Queues sorting in PartitionedQueueComparator when accessible-node-labels=*. (Naganarasimha G R via wangda)
2016-01-21 11:21:06 +08:00
Xuan
890a2ebd1a
YARN-4559. Make leader elector and zk store share the same curator client. Contributed by Jian He
2016-01-20 14:48:10 -08:00
Jian He
edc43a9097
YARN-4565. Fix a bug that leads to AM resource limit not being honored when sizeBasedWeight is enabled for FairOrderingPolicy. Contributed by Wangda Tan
2016-01-18 21:04:36 -08:00
Wangda Tan
a44ce3f14f
YARN-4502. Fix two AM containers get allocated when AM restart. (Vinod Kumar Vavilapalli via wangda)
2016-01-19 09:30:04 +08:00
Wangda Tan
150f5ae034
Revert "YARN-4502. Fix two AM containers get allocated when AM restart. (Vinod Kumar Vavilapalli via wangda)"
This reverts commit 3fe5728563.
Conflicts:
hadoop-yarn-project/CHANGES.txt
2016-01-19 09:27:36 +08:00
Jian He
f385851141
YARN-4596. SystemMetricPublisher should not swallow error messages from TimelineClient#putEntities. Contributed by Li Lu
2016-01-18 16:58:39 -08:00
Karthik Kambatla
d40859fab1
YARN-4526. Make SystemClock singleton so AppSchedulingInfo could use it. (kasha)
2016-01-18 10:58:14 +01:00
Wangda Tan
3fe5728563
YARN-4502. Fix two AM containers get allocated when AM restart. (Vinod Kumar Vavilapalli via wangda)
(cherry picked from commit 805a9ed85e)
2016-01-18 17:06:05 +08:00
Wangda Tan
adf260a728
Revert "YARN-4502. Fix two AM containers get allocated when AM restart. (Vinod Kumar Vavilapalli via wangda)"
This reverts commit 805a9ed85e.
2016-01-18 16:50:45 +08:00
Wangda Tan
b08ecf5c75
YARN-4304. AM max resource configuration per partition to be displayed/updated correctly in UI and in various partition related metrics. (Sunil G via wangda)
2016-01-18 11:11:32 +08:00
Wangda Tan
805a9ed85e
YARN-4502. Fix two AM containers get allocated when AM restart. (Vinod Kumar Vavilapalli via wangda)
2016-01-18 11:04:25 +08:00
Wangda Tan
9523648d57
YARN-4538. QueueMetrics pending cores and memory metrics wrong. (Bibin A Chundatt via wangda)
2016-01-18 10:57:14 +08:00
rohithsharmaks
f7736f464f
YARN-4389. Allow application to enable or disable am blacklisting. (Sunil G via rohithsharmaks)
2016-01-15 21:38:26 +05:30
Karthik Kambatla
9d04f26d4c
YARN-3446. FairScheduler headroom calculation should exclude nodes in the blacklist. (Zhihai Xu via kasha)
2016-01-14 08:33:23 -08:00
Karthik Kambatla
321072ba81
YARN-4551. Address the duplication between StatusUpdateWhenHealthy and StatusUpdateWhenUnhealthy transitions. (Sunil G via kasha)
2016-01-13 12:09:34 -08:00
Wangda Tan
c0537bcd2c
YARN-4571. Make app id/name available to the yarn authorizer provider for better auditing. (Jian He via wangda)
2016-01-13 13:18:31 +08:00
Akira Ajisaka
da1e3e3c57
YARN-4567. javadoc failing on java 8. Contributed by Steve Loughran. This closes #67.
2016-01-12 15:12:17 +09:00
Wangda Tan
9e792da014
YARN-4582. Label-related invalid resource request exception should be able to properly handled by application. (Bibin A Chundatt via wangda)
2016-01-12 12:53:31 +08:00
Jian He
5fab4ec31c
Missing file for YARN-4580
2016-01-11 17:00:44 -08:00
Jian He
b8942be888
YARN-4537. Pull out priority comparison from fifocomparator and use compound comparator for FifoOrdering policy. Contributed by Rohith Sharma K S
2016-01-11 16:44:28 -08:00
Jian He
109e528ef5
YARN-4479. Change CS LeafQueue pendingOrderingPolicy to honor recovered apps. Contributed by Rohith Sharma K S
2016-01-08 15:51:10 -08:00
Xuan
89022f8d4b
YARN-4438. Implement RM leader election with curator. Contributed by Jian He
2016-01-07 14:33:06 -08:00
Junping Du
c1462a67ff
YARN-4546. ResourceManager crash due to scheduling opportunity overflow. Contributed by Jason Lowe.
2016-01-06 05:49:24 -08:00
rohithsharmaks
6da6d87872
YARN-4535. Fix checkstyle error in CapacityScheduler.java (Naganarasimha G R via rohithsharmaks)
2016-01-05 12:09:57 +05:30
Wangda Tan
4e4b3a8465
YARN-4524. Cleanup AppSchedulingInfo. (Karthik Kambatla via wangda)
(cherry picked from commit 05fa852d7567b7590d6b53bbf925f8f424736514)
2015-12-30 15:39:34 -08:00
Wangda Tan
8310b2e9ff
YARN-4522. Queue acl can be checked at app submission. (Jian He via wangda)
2015-12-30 15:30:12 -08:00
Junping Du
223ce323bb
YARN-1382. Remove unusableRMNodesConcurrentSet (never used) in NodeListManager to get rid of memory leak. Contributed by Rohith Sharma K S.
2015-12-30 07:52:07 -08:00
Jian He
5273413411
YARN-3480. Remove attempts that are beyond max-attempt limit from state store. Contributed by Jun Gong
2015-12-29 15:58:39 -08:00
Wangda Tan
561abb9fee
YARN-4315. NaN in Queue percentage for cluster apps page. (Bibin A Chundatt via wangda)
2015-12-29 13:28:00 -08:00
Jian He
d0a22bae9b
YARN-4417. Make RM and Timeline-server REST APIs more consistent. Contributed by Wangda Tan
2015-12-28 15:52:45 -08:00
rohithsharmaks
8c180a13c8
YARN-4109. Exception on RM scheduler page loading with labels. (Mohammad Shahid Khan via rohithsharmaks)
2015-12-23 09:12:32 +05:30
Arun Suresh
e88422df45
YARN-4477. FairScheduler: Handle condition which can result in an infinite loop in attemptScheduling. (Tao Jie via asuresh)
2015-12-21 22:41:09 -08:00
Jian He
85c2466048
YARN-4164. Changed updateApplicationPriority API to return the updated application priority. Contributed by Rohith Sharma K S
2015-12-18 14:13:48 -08:00
Junping Du
1de56b0448
YARN-3226. UI changes for decommissioning node. Contributed by Sunil G.
2015-12-17 15:20:17 -08:00
Jason Lowe
91828fef6b
YARN-4461. Redundant nodeLocalityDelay log in LeafQueue. Contributed by Eric Payne
2015-12-16 23:22:31 +00:00
Wangda Tan
9b856d9787
YARN-4416. Deadlock due to synchronised get Methods in AbstractCSQueue. (Naganarasimha G R via wangda)
2015-12-16 13:22:37 -08:00
Wangda Tan
7faa406f27
YARN-4225. Add preemption status to yarn queue -status for capacity scheduler. (Eric Payne via wangda)
2015-12-16 13:19:40 -08:00
Wangda Tan
79c41b1d83
YARN-4293. ResourceUtilization should be a part of yarn node CLI. (Sunil G via wangda)
2015-12-16 13:18:19 -08:00
Junping Du
50bd067e1d
YARN-4452. NPE when submit Unmanaged application. Contributed by Naganarasimha G R.
2015-12-16 10:57:39 -08:00
Zhihai Xu
2aaed10327
YARN-4440. FSAppAttempt#getAllowedLocalityLevelByTime should init the lastScheduler time. Contributed by Lin Yiqun
2015-12-15 00:17:21 -08:00
Jian He
1cb3299b48
YARN-4403. (AM/NM/Container)LivelinessMonitor should use monotonic time when calculating period. Contributed by Junping Du
2015-12-14 13:51:23 -08:00
Wangda Tan
07b0fb996a
YARN-4418. AM Resource Limit per partition can be updated to ResourceUsage as well. (Sunil G via wangda)
2015-12-14 11:24:30 -08:00
Wangda Tan
6cb0af3c39
YARN-3946. Update exact reason as to why a submitted app is in ACCEPTED state to app's diagnostic message. (Naganarasimha G R via wangda)
2015-12-14 10:52:46 -08:00
Arun Suresh
7fb212e5e6
YARN-4358 addendum patch to fix javadoc error
2015-12-12 22:22:55 -08:00
rohithsharmaks
a5e2e1ecb0
YARN-4421. Remove dead code in RmAppImpl.RMAppRecoveredTransition. (Daniel Templeton via rohithsharmaks)
2015-12-09 11:31:51 +05:30
Wangda Tan
7e4715186d
YARN-4424. Fix deadlock in RMAppImpl. (Jian he via wangda)
2015-12-08 14:25:16 -08:00
=
c25a635459
YARN-4248. REST API for submit/update/delete Reservations. (curino)
2015-12-07 13:33:28 -08:00
Jonathan Eagles
4ff973f96a
YARN-4422. Generic AHS sometimes doesn't show started, node, or logs on App page (Eric Payne via jeagles)
2015-12-07 15:04:48 -06:00
Xuan
4546c7582b
YARN-4392. ApplicationCreatedEvent event time resets after RM restart/failover. Contributed by Naganarasimha G R and Xuan Gong
2015-12-07 12:24:55 -08:00
Steve Loughran
65f395226b
HADOOP-12321. Make JvmPauseMonitor an AbstractService. (Sunil G via Stevel) [includes HDFS-8947 MAPREDUCE-6462 and YARN-4072]
2015-12-06 17:43:35 +00:00
Arun Suresh
742632e346
YARN-4358. Reservation System: Improve relationship between SharingPolicy and ReservationAgent. (Carlo Curino via asuresh)
2015-12-05 21:26:16 -08:00
Wangda Tan
a2c3bfc8c1
YARN-4292. ResourceUtilization should be a part of NodeInfo REST API. (Sunil G via wangda)
2015-12-03 14:28:32 -08:00
Jian He
6b9a5beb2b
YARN-4398. Remove unnecessary synchronization in RMStateStore. Contributed by Ning Ding
2015-12-02 11:07:18 -08:00
Tsuyoshi Ozawa
28dfe721b8
YARN-4387. Fix typo in FairScheduler log message. Contributed by Xin Wang.
2015-11-24 19:24:01 +09:00
Karthik Kambatla
52948bb20b
YARN-3980. Plumb resource-utilization info in node heartbeat through to the scheduler. (Inigo Goiri via kasha)
2015-11-24 13:47:17 +05:30
Jian He
8676a118a1
YARN-4349. Support CallerContext in YARN. Contributed by Wangda Tan
2015-11-23 17:19:48 -08:00
Jason Lowe
d36b6e045f
YARN-4344. NMs reconnecting with changed capabilities can lead to wrong cluster resource calculations. Contributed by Varun Vasudev
2015-11-23 20:30:26 +00:00
Arun Suresh
da1016365a
YARN-3454. Add efficient merge operation to RLESparseResourceAllocation (Carlo Curino via asuresh)
2015-11-21 09:59:41 -08:00
Wangda Tan
2346fa3141
YARN-3769. Consider user limit when calculating total pending resource for preemption policy in Capacity Scheduler. (Eric Payne via wangda)
2015-11-20 15:55:50 -08:00
Jason Lowe
060cdcbe5d
YARN-4374. RM capacity scheduler UI rounds user limit factor. Contributed by Chang Li
2015-11-20 23:12:29 +00:00
Arun Suresh
6a61928fb7
YARN-4184. Remove update reservation state api from state store as its not used by ReservationSystem (Sean Po via asuresh)
2015-11-17 15:50:34 -08:00
Wangda Tan
7f55a18071
YARN-4347. Resource manager fails with Null pointer exception. (Jian He via wangda)
2015-11-12 11:23:40 -08:00
Wangda Tan
796638d9bc
YARN-4287. Capacity Scheduler: Rack Locality improvement (Nathan Roberts via wangda)
2015-11-12 11:09:37 -08:00
Vinod Kumar Vavilapalli (I am also known as @tshooter.)
6351d3fa63
YARN-4183. Reverting the patch to fix behaviour change.
Revert "YARN-4183. Enabling generic application history forces every job to get a timeline service delegation token (jeagles)"
This reverts commit c293c58954.
2015-11-11 10:40:43 -08:00
Jian He
e5b1733e04
YARN-4127. RM fail with noAuth error if switched from failover to non-failover. Contributed by Varun Saxena
2015-10-29 15:42:57 -07:00
Jonathan Eagles
c293c58954
YARN-4183. Enabling generic application history forces every job to get a timeline service delegation token (jeagles)
2015-10-29 16:41:10 -05:00
Arun Suresh
58d1df585c
YARN-4310. FairScheduler: Log skipping reservation messages at DEBUG level (asuresh)
2015-10-29 13:42:09 -07:00
Rohith Sharma K S
656c8f9527
YARN-4130. Duplicate declaration of ApplicationId in RMAppManager#submitApplication method. (Kai Sasaki via rohithsharmaks)
2015-10-29 12:22:44 +05:30
Wangda Tan
56e4f6237a
YARN-3216. Max-AM-Resource-Percentage should respect node labels. (Sunil G via wangda)
2015-10-26 16:44:39 -07:00
Wangda Tan
3cc73773eb
YARN-4285. Display resource usage as percentage of queue and cluster in the RM UI (Varun Vasudev via wangda)
2015-10-26 13:07:39 -07:00
Jason Lowe
33a03af3c3
YARN-4284. condition for AM blacklisting is too narrow. Contributed by Sangjin Lee
2015-10-26 19:53:03 +00:00
Rohith Sharma K S
5acdde4744
YARN-2729. Support script based NodeLabelsProvider Interface in Distributed Node Label Configuration Setup. (Naganarasimha G R via rohithsharmaks)
2015-10-26 15:42:42 +05:30
Arun Suresh
ab8eb8770c
YARN-3738. Add support for recovery of reserved apps running under dynamic queues (subru via asuresh)
2015-10-24 22:53:10 -07:00
Akira Ajisaka
7781fe1b9e
YARN-4294. [JDK8] Fix javadoc errors caused by wrong reference and illegal tag. (aajisaka)
2015-10-24 11:54:42 +09:00
Jason Lowe
d3a34a4f38
YARN-4041. Slow delegation token renewal can severely prolong RM recovery. Contributed by Sunil G
2015-10-23 20:57:01 +00:00
Ming Ma
934d96a334
YARN-2913. Fair scheduler should have ability to set MaxResourceDefault for each queue. (Siqi Li via mingma)
2015-10-23 08:36:33 -07:00
Jonathan Eagles
f8adeb712d
YARN-4009. CORS support for ResourceManager REST API. (Varun Vasudev via jeagles)
2015-10-23 10:34:08 -05:00
Junping Du
0fce5f9a49
YARN-4243. Add retry on establishing Zookeeper connection in EmbeddedElectorService#serviceInit. Contributed by Xuan Gong.
2015-10-22 13:41:09 -07:00
Zhihai Xu
960201b79b
YARN-4256. YARN fair scheduler vcores with decimal values. Contributed by Jun Gong
2015-10-22 12:28:03 -07:00
Anubhav Dhoot
2798723a54
YARN-3739. Add reservation system recovery to RM recovery process. Contributed by Subru Krishnan.
2015-10-22 06:51:00 -07:00
Arun Suresh
506d1b1dbc
YARN-3985. Make ReservationSystem persist state using RMStateStore reservation APIs. (adhoot via asuresh)
2015-10-20 16:46:14 -07:00
Arun Suresh
7e2837f830
YARN-4270. Limit application resource reservation on nodes for non-node/rack specific requests (asuresh)
2015-10-19 20:00:38 -07:00
Jian He
f9da5cdb2b
YARN-4170. AM need to be notified with priority in AllocateResponse. Contributed by Sunil G
2015-10-16 15:26:27 -07:00
Wangda Tan
4337b263aa
YARN-4162. CapacityScheduler: Add resource usage by partition and queue capacity by partition to REST API. (Naganarasimha G R via wangda)
2015-10-16 15:06:28 -07:00
Jian He
cf23f2c2b5
YARN-4000. RM crashes with NPE if leaf queue becomes parent queue during restart. Contributed by Varun Saxena
2015-10-15 17:12:46 -07:00
rohithsharmaks
d6c8bad869
YARN-4250. NPE in AppSchedulingInfo#isRequestLabelChanged. (Brahma Reddy Battula via rohithsharmaks)
2015-10-14 16:11:34 +05:30
Jian He
9849c8b386
YARN-4230. RM crashes with NPE when increasing container resource if there is no headroom left. Contributed by Meng Ding
2015-10-12 11:51:33 -07:00
Zhihai Xu
049c6e8dc0
YARN-4201. AMBlacklist does not work for minicluster. Contributed by Jun Gong.
2015-10-12 00:14:25 -07:00
Devaraj K
db93047881
YARN-3964. Support NodeLabelsProvider at Resource Manager side. Contributed by Dian Fu.
2015-10-11 11:21:29 +05:30
Wangda Tan
def374e666
YARN-4140. RM container allocation delayed in case of app submitted to Nodelabel partition. (Bibin A Chundatt via wangda)
2015-10-09 16:38:59 -07:00
Karthik Kambatla
4aa9b3e75c
MAPREDUCE-6302. Incorrect headroom can lead to a deadlock between map and reduce allocations. (kasha)
2015-10-09 07:37:39 -07:00
Jason Lowe
a0bca2b5ad
YARN-261. Ability to fail AM attempts. Contributed by Andrey Klochkov and Rohith Sharma K S
2015-10-09 14:17:38 +00:00
Rohith Sharma K S
8f195387a4
YARN-4235. FairScheduler PrimaryGroup does not handle empty groups returned for a user. (Anubhav Dhoot via rohithsharmaks)
2015-10-09 10:09:26 +05:30
Rohith Sharma K S
3793cbe4c3
YARN-4228. FileSystemRMStateStore use IOUtils#close instead of fs#close. (Bibin A Chundatt via rohithsharmaks)
2015-10-07 10:12:14 +05:30
Rohith Sharma K S
9156fc60c6
YARN-4209. RMStateStore FENCED state doesn’t work due to updateFencedState called by stateMachine.doTransition. (Zhihai Xu via rohithsharmaks)
2015-10-07 09:34:59 +05:30
Wangda Tan
29a582ada0
YARN-4215. RMNodeLabels Manager Need to verify and replace node labels for the only modified Node Label Mappings in the request. (Naganarasimha G R via wangda)
2015-10-06 11:56:04 -07:00
Harsh J
c918f7be5e
HADOOP-12458. Retries is typoed to spell Retires in parts of hadoop-yarn and hadoop-common. Contributed by Neelesh Srinivas Salian.
2015-10-03 18:37:58 +05:30
Xuan
8f08532bde
YARN-1897. CLI and core support for signal container functionality. Contributed by Ming Ma
2015-10-02 18:50:47 -07:00
Karthik Kambatla
a0b5a0a419
YARN-4066. Large number of queues choke fair scheduler. (Johan Gustavsson via kasha)
2015-09-29 07:55:34 -07:00
Anubhav Dhoot
9735afe967
YARN-4180. AMLauncher does not retry on failures when talking to NM. (adhoot)
2015-09-28 16:13:41 -07:00
Jason Lowe
9f53a95ff6
YARN-4141. Runtime Application Priority change should not throw exception for applications at finishing states. Contributed by Sunil G
2015-09-28 22:55:20 +00:00
Anubhav Dhoot
fb2e525c07
YARN-4204. ConcurrentModificationException in FairSchedulerQueueInfo. (adhoot)
2015-09-28 09:05:45 -07:00
Rohith Sharma K S
a9aafad12b
YARN-4044. Running applications information changes such as movequeue is not published to TimeLine server. (Sunil G via rohithsharmaks)
2015-09-24 12:13:22 +05:30
Jian He
b3f6b641dc
YARN-4171. Fix findbugs warnings in YARN-1197 branch. Contributed by Wangda Tan
2015-09-23 13:29:38 -07:00
Jian He
89cab1ba5f
YARN-1651. CapacityScheduler side changes to support container resize. Contributed by Wangda Tan
2015-09-23 13:29:38 -07:00
Tsuyoshi Ozawa
dfd807afab
HADOOP-12428. Fix inconsistency between log-level guards and statements. Contributed by Jagadesh Kiran N and Jackie Chang.
2015-09-22 12:54:29 +09:00
Rohith Sharma K S
c9cb6a5960
YARN-4167. NPE on RMActiveServices#serviceStop when store is null. (Bibin A Chundatt via rohithsharmaks)
2015-09-21 09:59:30 +05:30
Arun Suresh
94dec5a916
YARN-3920. FairScheduler container reservation on a node should be configurable to limit it to large containers (adhoot via asuresh)
2015-09-18 14:02:55 -07:00
Wangda Tan
9bc913a35c
YARN-3212. RMNode State Transition Update with DECOMMISSIONING state. (Junping Du via wangda)
2015-09-18 10:04:17 -07:00
Jian He
6c6e734f0b
YARN-4034. Render cluster Max Priority in scheduler metrics in RM web UI. Contributed by Rohith Sharma K S
2015-09-17 14:55:50 +08:00
Jian He
452079af8b
YARN-4078. Add getPendingResourceRequestForAttempt in YarnScheduler interface. Contributed by Naganarasimha G R
2015-09-16 14:59:20 +08:00
Wangda Tan
ae5308fe1d
YARN-3717. Expose app/am/queue's node-label-expression to RM web UI / CLI / REST-API. (Naganarasimha G R via wangda)
2015-09-15 11:40:50 -07:00
Junping Du
73e3a49eb0
YARN-313. Add Admin API for supporting node resource configuration in command line. (Contributed by Inigo Goiri, Kenji Kikushima and Junping Du)
2015-09-15 07:56:47 -07:00
Jian He
5468baa80a
YARN-3635. Refactored current queue mapping implementation in CapacityScheduler to use a generic PlacementManager framework. Contributed by Wangda Tan
2015-09-15 15:39:20 +08:00
Jian He
e1b1d7e4ae
YARN-4126. RM should not issue delegation tokens in unsecure mode. Contributed by Bibin A Chundatt
2015-09-14 14:09:19 +08:00
Karthik Kambatla
332b520a48
YARN-3697. FairScheduler: ContinuousSchedulingThread can fail to shutdown. (Zhihai Xu via kasha)
2015-09-13 18:07:43 -07:00
Karthik Kambatla
81df7b586a
YARN-2005. Blacklisting support for scheduling AMs. (Anubhav Dhoot via kasha)
2015-09-13 17:03:15 -07:00
Steve Loughran
7269906254
HADOOP-12087. [JDK8] Fix javadoc errors caused by incorrect or illegal tags. (Akira AJISAKA via stevel).
2015-09-13 14:25:26 +01:00
Jian He
6f72f1e600
YARN-2884. Added a proxy service in NM to proxy the the communication between AM and RM. Contributed by Kishore Chaliparambil
2015-09-08 09:35:46 +08:00
Xuan
9b78e6e33d
YARN-4087. Followup fixes after YARN-2019 regarding RM behavior when state-store error occurs. Contributed by Jian He
2015-09-07 17:45:47 -07:00
Wangda Tan
bcc85e3bab
YARN-4024. YARN RM should avoid unnecessary resolving IP when NMs doing heartbeat. (Hong Zhiguo via wangda)
2015-09-04 15:13:53 -07:00
Jason Lowe
6eaca2e363
YARN-4105. Capacity Scheduler headroom for DRF is wrong. Contributed by Chang Li
2015-09-04 15:30:53 +00:00
Varun Vasudev
40d222e862
YARN-4103. RM WebServices missing scheme for appattempts logLinks. Contributed by Jonathan Eagles.
2015-09-04 14:31:51 +05:30
Varun Vasudev
b469ac531a
YARN-3970. Add REST api support for Application Priority. Contributed by Naganarasimha G R.
2015-09-03 16:40:10 +05:30
Jian He
09c64ba1ba
YARN-4101. RM should print alert messages if Zookeeper and Resourcemanager gets connection issue. Contributed by Xuan Gong
2015-09-02 17:45:23 -07:00
Rohith Sharma K S
7d6687fe76
YARN-3893. Both RM in active state when Admin#transitionToActive failure from refeshAll() (Bibin A Chundatt via rohithsharmaks)
2015-09-02 15:22:48 +05:30
Varun Vasudev
bf669b6d9f
YARN-4082. Container shouldn't be killed when node's label updated. Contributed by Wangda Tan.
2015-09-01 14:19:11 +05:30
Jian He
a3fd2ccc86
YARN-4092. Fixed UI redirection to print useful messages when both RMs are in standby mode. Contributed by Xuan Gong
2015-08-31 17:33:24 -07:00
Junping Du
beb65c9465
YARN-1556. NPE getting application report with a null appId. Contributed by Weiwei Yang.
2015-08-28 05:57:34 -07:00
Jian He
a9c8ea71aa
YARN-3250. Support admin cli interface in for Application Priority. Contributed by Rohith Sharma K S
2015-08-27 13:25:53 -07:00
Jian He
57c7ae1aff
YARN-4014. Support user cli interface in for Application Priority. Contributed by Rohith Sharma K S
2015-08-24 20:36:44 -07:00
Rohith Sharma K S
feaf034994
YARN-3896. RMNode transitioned from RUNNING to REBOOTED because its response id has not been reset synchronously. (Jun Gong via rohithsharmaks)
2015-08-24 11:25:07 +05:30
Rohith Sharma K S
22de7c1dca
YARN-3986. getTransferredContainers in AbstractYarnScheduler should be present in YarnScheduler interface
2015-08-21 10:51:11 +05:30
Xuan
22dc5fc209
YARN-4028. AppBlock page key update and diagnostics value null on recovery. Contributed by Bibin A Chundatt
2015-08-18 22:53:03 -07:00
Zhihai Xu
3a76a010b8
YARN-3857: Memory leak in ResourceManager with SIMPLE mode. Contributed by mujunchao.
2015-08-18 10:36:40 -07:00
Jian He
0a030546e2
YARN-3987. Send AM container completed msg to NM once AM finishes. Contributed by sandflee
2015-08-13 16:22:53 -07:00
Jian He
7a445fcfab
YARN-4047. ClientRMService getApplications has high scheduler lock contention. Contributed by Jason Lowe
2015-08-13 16:02:57 -07:00
Jian He
e5003be907
YARN-4026. Refactored ContainerAllocator to accept a list of priorites rather than a single priority. Contributed by Wangda Tan
2015-08-12 15:07:50 -07:00
rohithsharmaks
1c12adb71f
YARN-4023. Publish Application Priority to TimelineServer. (Sunil G via rohithsharmaks)
2015-08-12 14:45:41 +05:30
Xuan
3ae716fa69
YARN-3999. RM hangs on draing events. Contributed by Jian He
2015-08-11 18:25:11 -07:00
Jian He
fa1d84ae27
YARN-3887. Support changing Application priority during runtime. Contributed by Sunil G
2015-08-10 20:51:54 -07:00
Wangda Tan
cf9d3c9256
YARN-3873. PendingApplications in LeafQueue should also use OrderingPolicy. (Sunil G via wangda)
2015-08-10 14:54:55 -07:00
Wangda Tan
4bc42d76e7
YARN-3966. Fix excessive loggings in CapacityScheduler. (Jian He via wangda)
2015-08-07 09:46:57 -07:00
Rohith Sharma K S
b6265d39c5
YARN-3948. Display Application Priority in RM Web UI.(Sunil G via rohithsharmaks)
2015-08-07 10:43:41 +05:30
Carlo Curino
8572a5a14b
YARN-3974. Refactor the reservation system test cases to use parameterized base test. (subru via curino)
2015-08-02 01:55:31 -07:00
Junping Du
cfee02b3bd
YARN-4019. Add JvmPauseMonitor to ResourceManager and NodeManager. Contributed by Robert Kanter.
2015-08-06 06:49:45 -07:00
Arun Suresh
154c9d2e42
YARN-3961. Expose pending, running and reserved containers of a queue in REST api and yarn top (adhoot via asuresh)
2015-08-05 23:14:14 -07:00
Jian He
ba2313d614
YARN-3983. Refactored CapacityScheduleri#FiCaSchedulerApp to easier extend container allocation logic. Contributed by Wangda Tan
2015-08-05 13:47:40 -07:00
Arun Suresh
f271d37735
YARN-3736. Add RMStateStore apis to store and load accepted reservations for failover (adhoot via asuresh)
2015-08-05 12:57:12 -07:00
Xuan
0306d902f5
YARN-3543. ApplicationReport should be able to tell whether the Application is AM managed or not. Contributed by Rohith Sharma K S
2015-08-03 15:46:00 -07:00
Jonathan Eagles
3cd02b9522
YARN-3978. Configurably turn off the saving of container info in Generic AHS (Eric Payne via jeagles)
2015-08-03 10:38:05 -05:00
Jason Lowe
32e490b6c0
YARN-3990. AsyncDispatcher may overloaded with RMAppNodeUpdateEvent when Node is connected/disconnected. Contributed by Bibin A Chundatt
2015-07-31 17:37:24 +00:00
Zhihai Xu
ab80e27703
YARN-433. When RM is catching up with node updates then it should not expire acquired containers. Contributed by Xuan Gong
2015-07-30 21:57:11 -07:00
Wangda Tan
91b42e7d6e
YARN-3971. Skip RMNodeLabelsManager#checkRemoveFromClusterNodeLabelsOfQueue on nodelabel recovery. (Bibin A Chundatt via wangda)
2015-07-30 10:00:31 -07:00
Karthik Kambatla
5205a330b3
YARN-2768. Avoid cloning Resource in FSAppAttempt#updateDemand. (Hong Zhiguo via kasha)
2015-07-29 09:42:32 -07:00
Jian He
3572ebd738
YARN-3846. RM Web UI queue filter is not working for sub queue. Contributed by Mohammad Shahid Khan
2015-07-27 17:12:05 -07:00
ccurino
156f24ead0
YARN-3656. LowCost: A Cost-Based Placement Agent for YARN Reservations. (Jonathan Yaniv and Ishai Menache via curino)
2015-07-25 07:39:47 -07:00
Wangda Tan
a3bd7b4a59
YARN-3973. Recent changes to application priority management break reservation system from YARN-1051 (Carlo Curino via wangda)
2015-07-24 16:44:18 -07:00
Jian He
83fe34ac08
YARN-3026. Move application-specific container allocation logic from LeafQueue to FiCaSchedulerApp. Contributed by Wangda Tan
2015-07-24 14:00:25 -07:00
Karthik Kambatla
d19d187753
YARN-3957. FairScheduler NPE In FairSchedulerQueueInfo causing scheduler page to return 500. (Anubhav Dhoot via kasha)
2015-07-24 11:44:37 -07:00
carlo curino
0fcb4a8cf2
YARN-3969. Allow jobs to be submitted to reservation that is active but does not have any allocations. (subru via curino)
2015-07-23 19:33:59 -07:00
Rohith Sharma K S
e202efaf93
YARN-3845. Scheduler page does not render RGBA color combinations in IE11. (Contributed by Mohammad Shahid Khan)
2015-07-24 12:43:06 +05:30
Wangda Tan
3bba180051
YARN-3941. Proportional Preemption policy should try to avoid sending duplicate PREEMPT_CONTAINER event to scheduler. (Sunil G via wangda)
2015-07-23 10:07:57 -07:00
Junping Du
ee98d6354b
YARN-2019. Retrospect on decision of making RM crashed if any exception throw in ZKRMStateStore. Contributed by Jian He.
2015-07-22 17:52:35 -07:00
Wangda Tan
76ec26de80
YARN-3932. SchedulerApplicationAttempt#getResourceUsageReport and UserInfo should based on total-used-resources. (Bibin A Chundatt via wangda)
2015-07-22 11:54:02 -07:00
Wangda Tan
c39ca541f4
YARN-2003. Support for Application priority : Changes in RM and Capacity Scheduler. (Sunil G via wangda)
2015-07-21 09:57:23 -07:00
Arun Suresh
9b272ccae7
YARN-3535. Scheduler must re-request container resources when RMContainer transitions from ALLOCATED to KILLED (rohithsharma and peng.zhang via asuresh)
2015-07-17 04:31:34 -07:00
Wangda Tan
3540d5fe4b
YARN-3885. ProportionalCapacityPreemptionPolicy doesn't preempt if queue is more than 2 level. (Ajith S via wangda)
2015-07-16 16:13:32 -07:00
Arun Suresh
ac94ba3e18
YARN-3453. Ensure preemption logic in FairScheduler uses DominantResourceCalculator in DRF queues to prevent unnecessary thrashing. (asuresh)
2015-07-14 00:23:55 -07:00
Akira Ajisaka
19295b36d9
YARN-3381. Fix typo InvalidStateTransitonException. Contributed by Brahma Reddy Battula.
2015-07-13 17:52:13 +09:00
Wangda Tan
5ed1fead6b
YARN-3894. RM startup should fail for wrong CS xml NodeLabel capacity configuration. (Bibin A Chundatt via wangda)
2015-07-12 21:52:11 -07:00
Wangda Tan
1df39c1efc
YARN-3849. Too much of preemption activity causing continuos killing of containers across queues. (Sunil G via wangda)
2015-07-11 10:26:46 -07:00
Zhijie Shen
1ea36299a4
YARN-3116. RM notifies NM whether a container is an AM container or normal task container. Contributed by Giovanni Matteo Fumarola.
2015-07-10 18:58:10 -07:00
Ming Ma
08244264c0
YARN-3445. Cache runningApps in RMNode for getting running apps on given NodeId. (Junping Du via mingma)
2015-07-10 08:30:10 -07:00
Xuan
5214876792
YARN-3888. ApplicationMaster link is broken in RM WebUI when appstate is NEW. Contributed by Bibin A Chundatt
2015-07-09 21:37:33 -07:00
carlo curino
0e602fa3a1
YARN-3800. Reduce storage footprint for ReservationAllocation. Contributed by Anubhav Dhoot.
2015-07-09 16:51:59 -07:00
Jian He
c9dd2cada0
YARN-3892. Fixed NPE on RMStateStore#serviceStop when CapacityScheduler#serviceInit fails. Contributed by Bibin A Chundatt
2015-07-07 14:16:21 -07:00
Devaraj K
37d7395773
YARN-3875. FSSchedulerNode#reserveResource() doesn't print Application Id properly in log. Contributed by Bibin A Chundatt.
2015-07-02 10:20:31 +05:30
Wangda Tan
0e4b06690f
YARN-3508. Prevent processing preemption events on the main RM dispatcher. (Varun Saxena via wangda)
2015-07-01 17:32:22 -07:00
Devaraj K
80a68d6056
YARN-3830. AbstractYarnScheduler.createReleaseCache may try to clean a null attempt. Contributed by nijel.
2015-07-01 19:03:44 +05:30
Devaraj K
b543d1a390
YARN-3859. LeafQueue doesn't print user properly for application add. Contributed by Varun Saxena.
2015-06-28 10:04:50 +05:30
Devaraj K
57f1a01eda
YARN-3826. Race condition in ResourceTrackerService leads to wrong diagnostics messages. Contributed by Chengbing Liu.
2015-06-25 16:13:59 +05:30
rohithsharmaks
dd4b387d96
YARN-3790. usedResource from rootQueue metrics may get stale data for FS scheduler after recovering the container (Zhihai Xu via rohithsharmaks)
2015-06-24 23:00:14 +05:30
Jason Lowe
2a20dd9b61
YARN-3809. Failed to launch new attempts because ApplicationMasterLauncher's threads all hang. Contributed by Jun Gong
2015-06-24 16:23:48 +00:00
Xuan
5b5bb8dcdc
YARN-3802. Two RMNodes for the same NodeId are used in RM sometimes after NM is reconnected. Contributed by zhihai xu
2015-06-18 14:37:49 -07:00
Xuan
a826d432f9
YARN-3804. Both RM are on standBy state when kerberos user not in yarn.admin.acl. Contributed by Varun Saxena
2015-06-17 16:23:27 -07:00
Devaraj K
b039e69bb0
YARN-3789. Improve logs for LeafQueue#activateApplications(). Contributed by Bibin A Chundatt.
2015-06-16 14:03:22 +05:30
Xuan
2b2465dfac
YARN-3778. Fix Yarn resourcemanger CLI usage. Contributed by Brahma Reddy Battula
2015-06-08 15:43:03 -07:00
Jian He
960b8f19ca
YARN-2716. Refactor ZKRMStateStore retry code with Apache Curator. Contributed by Karthik Kambatla
2015-06-08 14:50:58 -07:00
Devaraj K
c7ee6c151c
YARN-3780. Should use equals when compare Resource in RMNodeImpl#ReconnectNodeTransition. Contributed by zhihai xu.
2015-06-08 11:54:55 +05:30
Karthik Kambatla
bd69ea408f
YARN-3655. FairScheduler: potential livelock due to maxAMShare limitation and container reservation. (Zhihai Xu via kasha)
2015-06-07 11:37:52 -07:00
Xuan
3e000a919f
YARN-1462. AHS API and other AHS changes to handle tags for completed MR jobs. Contributed by Xuan Gong
2015-06-05 12:48:52 -07:00
Karthik Kambatla
75885852cc
YARN-3259. FairScheduler: Trigger fairShare updates on node events. (Anubhav Dhoot via kasha)
2015-06-05 09:39:41 -07:00
Jian He
1970ca7cbc
YARN-2392. Add more diags about app retry limits on AM failures. Contributed by Steve Loughran
2015-06-04 11:14:09 -07:00
Jian He
6ad4e59cfc
YARN-3764. CapacityScheduler should forbid moving LeafQueue from one parent to another. Contributed by Wangda Tan
2015-06-04 10:52:59 -07:00
Junping Du
d7e7f6aa03
YARN-41. The RM should handle the graceful shutdown of the NM. Contributed by Devaraj K.
2015-06-04 04:59:27 -07:00
Xuan
5766a04428
YARN-3749. We should make a copy of configuration when init MiniYARNCluster with multiple RMs. Contributed by Chun Chen
2015-06-03 17:20:15 -07:00
Zhijie Shen
bc85959edd
Revert "YARN-1462. Made RM write application tags to timeline server and exposed them to users via generic history web UI and REST API. Contributed by Xuan Gong."
This reverts commit 4a9ec1a824.
2015-06-03 14:15:56 -07:00
Karthik Kambatla
edb9cd0f7a
YARN-3762. FairScheduler: CME on FSParentQueue#getQueueUserAclInfo. (kasha)
2015-06-03 13:47:24 -07:00
Zhijie Shen
4a9ec1a824
YARN-1462. Made RM write application tags to timeline server and exposed them to users via generic history web UI and REST API. Contributed by Xuan Gong.
2015-05-30 21:05:36 -07:00
Karthik Kambatla
a8acdd65b3
YARN-3467. Expose allocatedMB, allocatedVCores, and runningContainers metrics on running Applications in RM Web UI. (Anubhav Dhoot via kasha)
2015-05-30 13:36:24 -07:00
Karthik Kambatla
3ae2a62501
YARN-3547. FairScheduler: Apps that have no resource demand should not participate scheduling. (Xianyin Xin via kasha)
2015-05-29 15:17:22 -07:00
Vinod Kumar Vavilapalli
9acd24fec4
Fixed more FilesSystemRMStateStore issues. Contributed by Vinod Kumar Vavilapalli.
2015-05-28 15:25:56 -07:00
Allen Wittenauer
d6e3164d4a
YARN-2355. MAX_APP_ATTEMPTS_ENV may no longer be a useful env var for a container (Darrell Taylor via aw)
2015-05-27 16:40:56 -07:00
Wangda Tan
ec0a852a37
YARN-3647. RMWebServices api's should use updated api from CommonNodeLabelsManager to get NodeLabel object. (Sunil G via wangda)
2015-05-27 12:51:20 -07:00
Wangda Tan
cdbd66be11
YARN-3686. CapacityScheduler should trim default_node_label_expression. (Sunil G via wangda)
2015-05-26 15:58:47 -07:00
Jian He
10732d515f
YARN-3632. Ordering policy should be allowed to reorder an application when demand changes. Contributed by Craig Welch
2015-05-26 12:00:51 -07:00
Tsuyoshi Ozawa
9a3d617b63
YARN-2336. Fair scheduler's REST API returns a missing '[' bracket JSON for deep queue tree. Contributed by Kenji Kikushima and Akira Ajisaka.
2015-05-26 19:07:40 +09:00
Jian He
446d51591e
YARN-3707. RM Web UI queue filter doesn't work. Contributed by Wangda Tan
2015-05-22 15:58:22 -07:00
Karthik Kambatla
4513761869
YARN-3675. FairScheduler: RM quits when node removal races with continuous-scheduling on the same node. (Anubhav Dhoot via kasha)
2015-05-21 13:44:42 -07:00
Tsuyoshi Ozawa
7401e5b5e8
YARN-3677. Fix findbugs warnings in yarn-server-resourcemanager. Contributed by Vinod Kumar Vavilapalli.
2015-05-20 08:59:26 +09:00
Wangda Tan
563eb1ad2a
YARN-3583. Support of NodeLabel object instead of plain String in YarnClient side. (Sunil G via wangda)
2015-05-19 16:54:38 -07:00
Wangda Tan
b37da52a1c
YARN-3565. NodeHeartbeatRequest/RegisterNodeManagerRequest should use NodeLabel object instead of String. (Naganarasimha G R via wangda)
2015-05-19 16:34:17 -07:00
Xuan
b0ad644083
YARN-3526. ApplicationMaster tracking URL is incorrectly redirected on a QJM cluster. Contributed by Weiwei Yang
2015-05-15 22:39:49 -07:00
Jason Lowe
f7e051c431
YARN-2421. RM still allocates containers to an app in the FINISHING state. Contributed by Chang Li
2015-05-15 22:09:30 +00:00
Vinod Kumar Vavilapalli
9a2a9553ee
Fixing HDFS state-store. Contributed by Arun Suresh.
2015-05-14 16:13:51 -07:00
Junping Du
15ccd967ee
YARN-3505. Node's Log Aggregation Report with SUCCEED should not cached in RMApps. Contributed by Xuan Gong.
2015-05-14 10:58:12 -07:00
Wangda Tan
0e85044e26
YARN-3362. Add node label usage in RM CapacityScheduler web UI. (Naganarasimha G R via wangda)
2015-05-13 17:00:36 -07:00
Wangda Tan
7f19e7a254
YARN-3521. Support return structured NodeLabel objects in REST API (Sunil G via wangda)
2015-05-13 13:43:17 -07:00
Wangda Tan
d6f6741296
YARN-3489. RMServerUtils.validateResourceRequests should only obtain queue info once. (Varun Saxena via wangda)
2015-05-11 17:31:15 -07:00
Junping Du
7e543c27fa
YARN-3587. Fix the javadoc of DelegationTokenSecretManager in yarn, etc. projects. Contributed by Gabor Liptak.
2015-05-11 05:27:07 -07:00
Karthik Kambatla
a60f78e98e
YARN-3395. FairScheduler: Trim whitespaces when using username for queuename. (Zhihai Xu via kasha)
2015-05-09 15:41:20 -07:00
Robert Kanter
5658998845
YARN-3473. Fix RM Web UI configuration for some properties (rchiang via rkanter)
2015-05-08 16:23:10 -07:00
Jian He
d0b73287c2
YARN-3604. Fixed ZKRMStateStore#removeApplication to also disable watch. Contributed Zhihai Xu
2015-05-08 15:12:33 -07:00
Vinod Kumar Vavilapalli
f7a74d2491
YARN-3018. Unified the default value for the configuration property yarn.scheduler.capacity.node-locality-delay in code and default xml file. Contributed by Nijel SF.
2015-05-08 14:45:06 -07:00
Wangda Tan
f55d21d894
YARN-3593. Add label-type and Improve "DEFAULT_PARTITION" in Node Labels Page. (Naganarasimha G R via wangda)
2015-05-08 12:04:50 -07:00
Thomas Graves
5d708a4725
YARN-3600. AM container link is broken (Naganarasimha G R via tgraves
2015-05-08 16:35:40 +00:00
Devaraj K
ef3d66d462
YARN-3358. Audit log not present while refreshing Service ACLs. Contributed by Varun Saxena.
2015-05-08 12:14:10 +05:30
Devaraj K
a28cd02ba0
YARN-3592. Fix typos in RMNodeLabelsManager. Contributed by Sunil G.
2015-05-08 10:54:36 +05:30
Jian He
f489a4ec96
YARN-2918. RM should not fail on startup if queue's configured labels do not exist in cluster-node-labels. Contributed by Wangda Tan
2015-05-07 17:35:41 -07:00
Jian He
b88700dcd0
YARN-3584. Fixed attempt diagnostics format shown on the UI. Contributed by nijel
2015-05-07 13:54:07 -07:00
Vinod Kumar Vavilapalli
4c7b9b6abe
YARN-3385. Fixed a race-condition in ResourceManager's ZooKeeper based state-store to avoid crashing on duplicate deletes. Contributed by Zhihai Xu.
2015-05-06 17:51:17 -07:00
Jian He
0d3188fd25
YARN-3301. Fixed the format issue of the new RM attempt web page. Contributed by Xuan Gong
2015-05-06 13:45:00 -07:00
Xuan
b7dd3a4f04
YARN-2123. Progress bars in Web UI always at 100% due to non-US locale. Contributed by Akira AJISAKA
2015-05-05 10:40:16 -07:00
Jason Lowe
9356cf8676
YARN-3552. RM Web UI shows -1 running containers for completed apps. Contributed by Rohith
2015-05-05 13:42:32 +00:00
Jian He
d701acc9c6
YARN-2725. Added test cases of retrying creating znode in ZKRMStateStore. Contributed by Tsuyoshi Ozawa
2015-05-04 16:13:29 -07:00
Gera Shegalov
f8204e241d
YARN-2893. AMLaucher: sporadic job failures due to EOFException in readTokenStorageStream. (Zhihai Xu via gera)
2015-05-01 18:18:55 -07:00
Zhijie Shen
7e8639fda4
YARN-3544. Got back AM logs link on the RM web UI for a completed app. Contributed by Xuan Gong.
2015-04-30 13:05:55 -07:00
tgraves
2e215484bd
YARN-3517. RM web ui for dumping scheduler logs should be for admins only (Varun Vasudev via tgraves)
2015-04-29 21:25:42 +00:00
Karthik Kambatla
8f82970e0c
YARN-3485. FairScheduler headroom calculation doesn't consider maxResources for Fifo and FairShare policies. (kasha)
2015-04-28 21:00:35 -07:00
Wangda Tan
db1b674b50
YARN-2740. Fix NodeLabelsManager to properly handle node label modifications when distributed node label configuration enabled. (Naganarasimha G R via wangda)
2015-04-27 16:24:38 -07:00
Jian He
d497f6ea2b
YARN-2498. Respect labels in preemption policy of capacity scheduler for inter-queue preemption. Contributed by Wangda Tan
2015-04-24 17:03:13 -07:00
Tsuyoshi Ozawa
4a3dabd94f
YARN-3406. Display count of running containers in the RM's Web UI. Contributed by Ryu Kobayashi.
2015-04-25 07:17:11 +09:00
Jian He
d03dcb9635
YARN-3387. Previous AM's container completed status couldn't pass to current AM if AM and RM restarted during the same time. Contributed by Sandflee
2015-04-24 12:13:29 -07:00
Xuan
eee9facbba
YARN-3511. Add errors and warnings page to ATS. Contributed by Varun Vasudev
2015-04-24 09:41:59 -07:00
Vinod Kumar Vavilapalli
f5fe35e297
YARN-3413. Changed Nodelabel attributes (like exclusivity) to be settable only via addToClusterNodeLabels but not changeable at runtime. (Wangda Tan via vinodkv)
2015-04-23 11:19:55 -07:00
Wangda Tan
395205444e
YARN-3319. Implement a FairOrderingPolicy. (Craig Welch via wangda)
2015-04-23 10:47:15 -07:00
tgraves
189a63a719
YARN-3434. Interaction between reservations and userlimit can result in significant ULF violation
2015-04-23 14:39:25 +00:00
Steve Loughran
d9bcf99aef
YARN-2605. [RM HA] Rest api endpoints doing redirect incorrectly. (Xuan Gong via stevel)
2015-04-23 09:22:48 +01:00
Junping Du
fad9d7e85b
New parameter of CLI for decommissioning node gracefully in RMAdmin CLI. Contributed by Devaraj K
2015-04-22 10:07:20 -07:00
Jian He
bdd90110e6
YARN-3494. Expose AM resource limit and usage in CS QueueMetrics. Contributed by Rohith Sharmaks
2015-04-21 20:06:20 -07:00
Wangda Tan
e71d0d87d9
YARN-3410. YARN admin should be able to remove individual application records from RMStateStore. (Rohith Sharmaks via wangda)
2015-04-21 17:51:22 -07:00
Tsuyoshi Ozawa
105afd5477
YARN-3495. Confusing log generated by FairScheduler. Contributed by Brahma Reddy Battula.
2015-04-22 05:47:59 +09:00
Wangda Tan
44872b76fc
YARN-3463. Integrate OrderingPolicy Framework with CapacityScheduler. (Craig Welch via wangda)
2015-04-20 17:12:32 -07:00
Jian He
497c86b485
YARN-3136. Fixed a synchronization problem of AbstractYarnScheduler#getTransferredContainers. Contributed by Sunil G
2015-04-18 12:46:37 -07:00
Wangda Tan
f47a5763ac
YARN-3487. CapacityScheduler scheduler lock obtained unnecessarily when calling getQueue (Jason Lowe via wangda)
2015-04-17 17:22:27 -07:00
Wangda Tan
f65eeb412d
YARN-3493. RM fails to come up with error "Failed to load/recover state" when mem settings are changed. (Jian He via wangda)
2015-04-17 17:11:22 -07:00
Jian He
6779467ab6
YARN-3451. Display attempt start time and elapsed time on the web UI. Contributed by Rohith Sharmaks
2015-04-17 15:55:34 -07:00
Jian He
d573f09fb9
YARN-2696. Queue sorting in CapacityScheduler should consider node label. Contributed by Wangda Tan
2015-04-17 13:36:59 -07:00
Junping Du
1db355a875
YARN-1402. Update related Web UI and CLI with exposing client API to check log aggregation status. Contributed by Xuan Gong.
2015-04-17 13:18:59 -07:00
Jian He
bb6dde68f1
YARN-3021. YARN's delegation-token handling disallows certain trust setups to operate properly over DistCp. Contributed by Yongjun Zhang
2015-04-16 19:43:37 -07:00
Akira Ajisaka
1fa8075794
MAPREDUCE-6300. Task list sort by task id broken. Contributed by Siqi Li.
2015-04-16 10:04:09 -05:00
Jian He
1b89a3e173
YARN-3354. Add node label expression in ContainerTokenIdentifier to support RM recovery. Contributed by Wangda Tan
2015-04-15 13:57:06 -07:00
Tsuyoshi Ozawa
e48cedc663
YARN-3326. Support RESTful API for getLabelsToNodes. Contributed by Naganarasimha G R.
2015-04-15 14:03:55 -05:00
Wangda Tan
5004e75332
YARN-3318. Create Initial OrderingPolicy Framework and FifoOrderingPolicy. (Craig Welch via wangda)
2015-04-15 09:56:32 -07:00
Jian He
0fefda645b
YARN-3361. CapacityScheduler side changes to support non-exclusive node labels. Contributed by Wangda Tan
2015-04-14 11:45:58 -07:00
Jian He
b46ee1e7a3
YARN-3266. RMContext#inactiveNodes should have NodeId as map key. Contributed by Chengbing Liu
2015-04-14 10:54:22 -07:00
Jian He
a1afbc48b5
YARN-3472. Fixed possible leak in DelegationTokenRenewer#allTokens. Contributed by Rohith Sharmaks
2015-04-13 14:07:17 -07:00
Jian He
944a16579f
YARN-3382. Some of UserMetricsInfo metrics are incorrectly set to root queue metrics. Contributed by Rohit Agarwal
2015-04-13 10:53:34 -07:00
Karthik Kambatla
e516706b89
YARN-3469. ZKRMStateStore: Avoid setting watches that are not required. (Jun Hong via kasha)
2015-04-10 11:20:34 -07:00
Jian He
577d755e4b
YARN-3348. Add a 'yarn top' tool to help understand cluster usage. Contributed by Varun Vasudev
2015-04-10 09:57:17 -07:00
Junping Du
92431c9617
YARN-1376. NM need to notify the log aggregation status to RM through Node heartbeat. Contributed by Xuan Gong.
2015-04-10 08:56:18 -07:00
Xuan
afa5d4715a
YARN-3293. Track and display capacity scheduler health metrics in web UI. Contributed by Varun Vasudev
2015-04-09 23:38:04 -07:00
Vinod Kumar Vavilapalli
9c5911294e
YARN-3055. Fixed ResourceManager's DelegationTokenRenewer to not stop token renewal of applications part of a bigger workflow. Contributed by Daryn Sharp.
2015-04-09 13:08:53 -07:00
Wangda Tan
1885141e90
YARN-3466. Fix RM nodes web page to sort by node HTTP-address, #containers and node-label column. (Jason Lowe via wangda)
2015-04-09 10:35:12 -07:00
Xuan
d27e9241e8
YARN-3294. Allow dumping of Capacity Scheduler debug logs via web UI for a fixed time period. Contributed by Varun Vasudev
2015-04-07 09:52:36 -07:00
Xuan
96d72118f5
YARN-3435. AM container to be allocated Appattempt AM container shown as null. Contributed by Bibin A Chundatt
2015-04-05 00:40:57 -07:00
Wangda Tan
bad070fe15
YARN-2901. Add errors and warning metrics page to RM, NM web UI. (Varun Vasudev via wangda)
2015-04-02 17:23:20 -07:00
Sandy Ryza
6a6a59db7f
YARN-3415. Non-AM containers can be counted towards amResourceUsage of a fairscheduler queue (Zhihai Xu via Sandy Ryza)
2015-04-02 13:56:08 -07:00
Zhijie Shen
8366a36ad3
YARN-3430. Made headroom data available on app attempt page of RM WebUI. Contributed by Xuan Gong.
2015-04-01 13:47:54 -07:00
Xuan
4728bdfa15
YARN-3248. Display count of nodes blacklisted by apps in the web UI. Contributed by Varun Vasudev
2015-04-01 04:19:18 -07:00
Tsuyoshi Ozawa
b5a22e9838
YARN-3258. FairScheduler: Need to add more logging to investigate allocations. Contributed by Anubhav Dhoot.
2015-03-31 17:42:44 +09:00
Wangda Tan
2a945d24f7
YARN-2495. Allow admin specify labels from each NM (Distributed configuration for node label). (Naganarasimha G R via wangda)
2015-03-30 12:05:21 -07:00
Wangda Tan
97a7277a2d
YARN-3383. AdminService should use warn instead of info to log exception when operation fails. (Li Lu via wangda)
2015-03-24 10:38:54 -07:00
Karthik Kambatla
972f1f1ab9
YARN-2868. FairScheduler: Metric for latency to allocate first container for an application. (Ray Chiang via kasha)
2015-03-23 14:07:05 -07:00
Karthik Kambatla
2bc097cd14
YARN-3241. FairScheduler handles invalid queue names inconsistently. (Zhihai Xu via kasha)
2015-03-23 13:22:03 -07:00
cnauroth
6ca1f12024
YARN-3336. FileSystem memory leak in DelegationTokenRenewer.
2015-03-23 10:45:50 -07:00
Jian He
e1feb4ea1a
YARN-3345. Add non-exclusive node label API. Contributed by Wangda Tan
2015-03-20 19:04:38 -07:00
Jian He
586348e4cb
YARN-3356. Capacity Scheduler FiCaSchedulerApp should use ResourceUsage to track used-resources-by-label. Contributed by Wangda Tan
2015-03-20 13:54:01 -07:00
Wangda Tan
6bc7710ec7
YARN-3369. Missing NullPointer check in AppSchedulingInfo causes RM to die. (Brahma Reddy Battula via wangda)
2015-03-20 10:42:05 -07:00
Jian He
4e886eb9cb
YARN-3379. Fixed missing data in localityTable and ResourceRequests table in RM WebUI. Contributed by Xuan Gong
2015-03-19 22:27:21 -07:00
Jian He
658097d6da
YARN-3273. Improve scheduler UI to facilitate scheduling analysis and debugging. Contributed Rohith Sharmaks
2015-03-17 21:30:23 -07:00
Tsuyoshi Ozawa
3bc72cc16d
YARN-3205. FileSystemRMStateStore should disable FileSystem Cache to avoid get a Filesystem with an old configuration. Contributed by Zhihai Xu.
2015-03-18 11:53:19 +09:00
Jian He
968425e9f7
YARN-3305. Normalize AM resource request on app submission. Contributed by Rohith Sharmaks
2015-03-17 13:49:59 -07:00
Karthik Kambatla
32b4330456
Revert "YARN-3181. FairScheduler: Fix up outdated findbugs issues. (kasha)"
This reverts commit c2b185def8.
2015-03-17 12:31:15 -07:00
Jian He
487374b7fe
YARN-3243. CapacityScheduler should pass headroom from parent to children to make sure ParentQueue obey its capacity limits. Contributed by Wangda Tan.
2015-03-17 10:24:23 -07:00
Devaraj K
7179f94f9d
YARN-3197. Confusing log generated by CapacityScheduler. Contributed by Varun Saxena.
2015-03-17 15:57:57 +05:30
Tsuyoshi Ozawa
3da9a97cfb
YARN-1453. [JDK8] Fix Javadoc errors caused by incorrect or illegal tags in doc comments. Contributed by Akira AJISAKA, Andrew Purtell, and Allen Wittenauer.
2015-03-16 23:19:05 +09:00
Zhijie Shen
85f6d67fa7
YARN-1884. Added nodeHttpAddress into ContainerReport and fixed the link to NM web page. Contributed by Xuan Gong.
2015-03-11 19:35:19 -07:00
Allen Wittenauer
a5cf985bf5
YARN-2280. Resource manager web service fields are not accessible (Krisztian Horvath via aw)
2015-03-10 16:32:20 -07:00
Jian He
c3003eba6f
YARN-3300. Outstanding_resource_requests table should not be shown in AHS. Contributed by Xuan Gong
2015-03-09 20:46:48 -07:00
Jason Lowe
27e8ea820f
YARN-3275. CapacityScheduler: Preemption happening on non-preemptable queues. Contributed by Eric Payne
2015-03-06 22:37:26 +00:00
Jian He
95bfd087dc
YARN-1809. Synchronize RM and TimeLineServer Web-UIs. Contributed by Zhijie Shen and Xuan Gong
2015-03-05 21:20:09 -08:00
Tsuyoshi Ozawa
1b67209612
YARN-3249. Add a 'kill application' button to Resource Manager's Web UI. Contributed by Ryu Kobayashi.
2015-03-05 19:55:56 +09:00
Karthik Kambatla
8d88691d16
YARN-3242. Asynchrony in ZK-close can lead to ZKRMStateStore watcher receiving events for old client. (Zhihai Xu via kasha)
2015-03-04 19:49:05 -08:00
Karthik Kambatla
22426a1c9f
YARN-3231. FairScheduler: Changing queueMaxRunningApps interferes with pending jobs. (Siqi Li via kasha)
2015-03-04 18:06:58 -08:00
Jian He
b2f1ec312e
YARN-3222. Fixed NPE on RMNodeImpl#ReconnectNodeTransition when a node is reconnected with a different port. Contributed by Rohith Sharmaks
2015-03-03 16:28:28 -08:00
Wangda Tan
e17e5ba9d7
YARN-3272. Surface container locality info in RM web UI (Jian He via wangda)
2015-03-03 11:49:01 -08:00
Tsuyoshi Ozawa
d1c6accb6f
HADOOP-11602. Fix toUpperCase/toLowerCase to use Locale.ENGLISH. (ozawa)
2015-03-03 14:17:52 +09:00
Vinod Kumar Vavilapalli
14dd647c55
YARN-3265. Fixed a deadlock in CapacityScheduler by always passing a queue's available resource-limit from the parent queue. Contributed by Wangda Tan.
2015-03-02 17:52:47 -08:00
Wangda Tan
edcecedc1c
YARN-3262. Surface application outstanding resource requests table in RM web UI. (Jian He via wangda)
2015-02-27 16:13:32 -08:00
Tsuyoshi Ozawa
01a1621930
YARN-2820. Retry in FileSystemRMStateStore when FS's operations fail due to IOException. Contributed by Zhihai Xu.
2015-02-28 00:56:44 +09:00
Konstantin V Shvachko
8ca0d957c4
YARN-3255. RM, NM, JobHistoryServer, and WebAppProxyServer's main() should support generic options. Contributed by Konstantin Shvachko.
2015-02-26 17:12:19 -08:00
Tsuyoshi Ozawa
9cedad11d8
Revert "HADOOP-11602. Fix toUpperCase/toLowerCase to use Locale.ENGLISH. (ozawa)"
This reverts commit 946456c6d8.
Conflicts:
hadoop-common-project/hadoop-common/CHANGES.txt
hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/namenode/QuotaByStorageTypeEntry.java
2015-02-25 00:32:04 +09:00
Wangda Tan
ce5bf927c3
YARN-3230. Clarify application states on the web UI. (Jian He via wangda)
2015-02-20 10:39:28 -08:00
Jason Lowe
a64dd3d24b
YARN-3194. RM should handle NMContainerStatuses sent by NM while registering if NM is Reconnected node. Contributed by Rohith
2015-02-20 15:10:10 +00:00
Jian He
c0d9b93953
YARN-933. Fixed InvalidStateTransitonException at FINAL_SAVING state in RMApp. Contributed by Rohith Sharmaks
2015-02-19 15:42:39 -08:00
Wangda Tan
d49ae725d5
YARN-3076. Add API/Implementation to YarnClient to retrieve label-to-node mapping (Varun Saxena via wangda)
2015-02-19 11:00:57 -08:00
Tsuyoshi Ozawa
946456c6d8
HADOOP-11602. Fix toUpperCase/toLowerCase to use Locale.ENGLISH. (ozawa)
2015-02-19 13:06:53 +09:00
Tsuyoshi Ozawa
b8a14efdf5
YARN-1615. Fix typos in delay scheduler's description. Contributed by Akira Ajisaka.
2015-02-19 10:24:07 +09:00
Jian He
f5da5566d9
YARN-3132. RMNodeLabelsManager should remove node from node-to-label mapping when node becomes deactivated. Contributed by Wangda Tan
2015-02-18 11:51:51 -08:00
Tsuyoshi Ozawa
556386a070
YARN-1299. Improve a log message in AppSchedulingInfo by adding application id. Contributed by Ashutosh Jindal and Devaraj K.
2015-02-17 01:53:31 +09:00
Tsuyoshi Ozawa
3f32357c36
Revert "YARN-1299. Improve a log message in AppSchedulingInfo by adding application id. Contributed by Ashutosh Jindal and devaraj."
This reverts commit 9aae81c934.
2015-02-17 01:52:22 +09:00
Tsuyoshi Ozawa
9aae81c934
YARN-1299. Improve a log message in AppSchedulingInfo by adding application id. Contributed by Ashutosh Jindal and devaraj.
2015-02-17 01:48:22 +09:00
Tsuyoshi Ozawa
110cf6b9a3
YARN-3182. Cleanup switch statement in ApplicationMasterLauncher#handle(). Contributed by Ray Chiang.
2015-02-13 14:21:21 +09:00
Xuan
99f6bd4f7a
YARN-3158. Correct log messages in ResourceTrackerService. Contributed by Varun Saxena
2015-02-12 16:42:15 -08:00
Jian He
18297e0972
YARN-3104. Fixed RM to not generate new AMRM tokens on every heartbeat between rolling and activation. Contributed by Jason Lowe
2015-02-12 16:02:45 -08:00
Jian He
18a594257e
YARN-3124. Fixed CS LeafQueue/ParentQueue to use QueueCapacities to track capacities-by-label. Contributed by Wangda Tan
2015-02-12 14:58:09 -08:00
Karthik Kambatla
c2b185def8
YARN-3181. FairScheduler: Fix up outdated findbugs issues. (kasha)
2015-02-12 13:44:47 -08:00
Xuan
65c69e296e
YARN-3151. On Failover tracking url wrong in application cli for KILLED application. Contributed by Rohith
2015-02-11 21:19:48 -08:00
Junping Du
c541a374d8
YARN-3160. Fix non-atomic operation on nodeUpdateQueue in RMNodeImpl. (Contributed by Chengbing Liu)
2015-02-11 03:44:31 -08:00
Zhijie Shen
d5855c0e46
YARN-2246. Made the proxy tracking URL always be http(s)://proxy addr:port/proxy/<appId> to avoid duplicate sections. Contributed by Devaraj K.
2015-02-10 15:24:01 -08:00
Zhijie Shen
23bf6c7207
YARN-3100. Made YARN authorization pluggable. Contributed by Jian He.
2015-02-09 20:34:56 -08:00
Jian He
0af6a99a3f
YARN-3094. Reset timer for liveness monitors after RM recovery. Contributed by Jun Gong
2015-02-09 13:47:08 -08:00
Karthik Kambatla
7e42088abf
YARN-2990. FairScheduler's delay-scheduling always waits for node-local and rack-local delays, even for off-rack-only requests. (kasha)
2015-02-08 22:48:10 -08:00
Jason Lowe
da2fb2bc46
YARN-3143. RM Apps REST API can return NPE or entries missing id and other fields. Contributed by Jason Lowe
2015-02-06 21:47:32 +00:00
Jian He
c1957fef29
YARN-2694. Ensure only single node label specified in ResourceRequest. Contributed by Wangda Tan
2015-02-06 11:34:20 -08:00
Arun C. Murthy
18b2507eda
YARN-1904. Ensure exceptions thrown in ClientRMService & ApplicationHistoryClientService are uniform when application-attempt is not found. Contributed by Zhijie Shen.
2015-02-05 23:48:55 -08:00
Jian He
4641196fe0
YARN-3145. Fixed ConcurrentModificationException on CapacityScheduler ParentQueue#getQueueUserAclInfo. Contributed by Tsuyoshi OZAWA
2015-02-05 16:12:53 -08:00
Jason Lowe
69c8a7f45b
YARN-1582. Capacity Scheduler: add a maximum-allocation-mb setting per queue. Contributed by Thomas Graves
2015-02-05 19:28:49 +00:00
Sandy Ryza
b6466deac6
YARN-3101. In Fair Scheduler, fix canceling of reservations for exceeding max share (Anubhav Dhoot via Sandy Ryza)
2015-02-05 09:39:28 -08:00
Tsuyoshi Ozawa
26dee1486b
YARN-3058. Fix error message of tokens' activation delay configuration. Contributed by Yi Liu.
2015-02-04 11:41:14 +09:00
Zhijie Shen
e5b56e2af6
YARN-2543. Made resource usage be published to the timeline server too. Contributed by Naganarasimha G R.
2015-02-03 17:34:22 -08:00
Wangda Tan
5bd984691b
YARN-3075. NodeLabelsManager implementation to retrieve label to node mapping (Varun Saxena via wangda)
2015-02-03 12:52:42 -08:00
Jian He
21d80b3dd9
YARN-3098. Created common QueueCapacities class in Capacity Scheduler to track capacities-by-labels of queues. Contributed by Wangda Tan
2015-02-03 11:43:12 -08:00
Jason Lowe
a761bf8726
YARN-3085. Application summary should include the application type. Contributed by Rohith
2015-02-03 14:56:34 +00:00
Jian He
054a947989
YARN-3077. Fixed RM to create zk root path recursively. Contributed by Chun Chen
2015-01-30 17:34:49 -08:00
Jian He
86358221fc
YARN-3099. Capacity Scheduler LeafQueue/ParentQueue should use ResourceUsage to track used-resources-by-label. Contributed by Wangda Tan
2015-01-30 15:15:20 -08:00
Akira Ajisaka
342efa110a
HADOOP-9907. Webapp http://hostname:port/metrics link is not working. (aajisaka)
2015-01-30 02:49:10 +09:00
Wangda Tan
7882bc0f14
YARN-3079. Scheduler should also update maximumAllocation when updateNodeResource. (Zhihai Xu via wangda)
2015-01-28 21:54:38 -08:00
Wangda Tan
18741adf97
YARN-2932. Add entry for preemptable status (enabled/disabled) to scheduler web UI and queue initialize/refresh logging. (Eric Payne via wangda)
2015-01-27 15:36:09 -08:00
Jian He
6f9fe76918
YARN-3092. Created a common ResourceUsage class to track labeled resource usages in Capacity Scheduler. Contributed by Wangda Tan
2015-01-26 15:38:00 -08:00
Tsuyoshi Ozawa
24aa462673
YARN-2800. Remove MemoryNodeLabelsStore and add a way to enable/disable node labels feature. Contributed by Wangda Tan.
2015-01-23 20:37:05 +09:00
Wangda Tan
0a2d3e717d
YARN-2933. Capacity Scheduler preemption policy should only consider capacity without labels temporarily. Contributed by Mayank Bansal
2015-01-19 16:48:50 -08:00
Jian He
9e33116d1d
YARN-2861. Fixed Timeline DT secret manager to not reuse RM's configs. Contributed by Zhijie Shen
2015-01-15 11:07:43 -08:00
Jian He
c53420f583
YARN-2637. Fixed max-am-resource-percent calculation in CapacityScheduler when activating applications. Contributed by Craig Welch
2015-01-13 17:32:07 -08:00
Robert Kanter
51881535e6
YARN-2643. Don't create a new DominantResourceCalculator on every FairScheduler.allocate call. (kasha via rkanter)
2015-01-12 17:51:00 -08:00
Robert Kanter
ae7bf31fe1
YARN-3027. Scheduler should use totalAvailable resource from node instead of availableResource for maxAllocation. (adhoot via rkanter)
2015-01-12 10:47:52 -08:00
Jian He
a260406268
YARN-3014. Replaces labels on a host should update all NM's labels on that host. Contributed by Wangda Tan
2015-01-09 17:49:53 -08:00
Zhijie Shen
dc2eaa26b2
YARN-2996. Improved synchronization and I/O operations of FS- and Mem- RMStateStore. Contributed by Yi Liu.
2015-01-08 09:47:02 -08:00
Jian He
e13a484a2b
YARN-3010. Fixed findbugs warning in AbstractYarnScheduler. Contributed by Yi Liu
2015-01-07 14:12:20 -08:00
Zhijie Shen
60103fca04
YARN-2427. Added the API of moving apps between queues in RM web services. Contributed by Varun Vasudev.
2015-01-06 14:37:44 -08:00
Jian He
dd57c2047b
YARN-2978. Fixed potential NPE while getting queue info. Contributed by Varun Saxena
2015-01-06 14:15:57 -08:00
Karthik Kambatla
0c4b112677
YARN-2881. [YARN-2574] Implement PlanFollower for FairScheduler. (Anubhav Dhoot via kasha)
2015-01-06 04:42:10 +05:30
Zhijie Shen
562a701945
YARN-2958. Made RMStateStore not update the last sequence number when updating the delegation token. Contributed by Varun Saxena.
2015-01-05 13:33:07 -08:00
Tsuyoshi Ozawa
ddc5be48fc
YARN-2922. ConcurrentModificationException in CapacityScheduler's LeafQueue. Contributed by Rohith Sharmaks.
2015-01-05 00:08:31 +09:00
Karthik Kambatla
e7257acd8a
YARN-2998. Abstract out scheduler independent PlanFollower components. (Anubhav Dhoot via kasha)
2014-12-30 19:55:24 -08:00
Jian He
e2351c7ae2
YARN-2987. Fixed ClientRMService#getQueueInfo to check against queue and app ACLs. Contributed by Varun Saxena
2014-12-30 17:15:37 -08:00
Jian He
b7442bf92e
YARN-2493. Added node-labels page on RM web UI. Contributed by Wangda Tan
2014-12-30 16:49:01 -08:00
Jian He
746ad6e989
Revert "YARN-2492(wrong jira number). Added node-labels page on RM web UI. Contributed by Wangda Tan"
This reverts commit 5f57b904f5.
2014-12-30 16:48:49 -08:00
Jian He
5f57b904f5
YARN-2492. Added node-labels page on RM web UI. Contributed by Wangda Tan
2014-12-30 15:38:28 -08:00
Zhijie Shen
241d3b3a50
YARN-2938. Fixed new findbugs warnings in hadoop-yarn-resourcemanager and hadoop-yarn-applicationhistoryservice. Contributed by Varun Saxena.
2014-12-29 09:59:35 -08:00
Jian He
1454efe5d4
YARN-2992. ZKRMStateStore crashes due to session expiry. Contributed by Karthik Kambatla
2014-12-26 17:59:58 -08:00
Junping Du
40ee4bff65
YARN-2993. Several fixes (missing acl check, error log msg ...) and some refinement in AdminService. (Contributed by Yi Liu)
2014-12-26 06:59:25 -08:00
Jian He
4f18018b7a
YARN-2946. Fixed potential deadlock in RMStateStore. Contributed by Rohith Sharmaks
2014-12-23 22:14:29 -08:00
Jian He
0d89859b51
YARN-2340. Fixed NPE when queue is stopped during RM restart. Contributed by Rohith Sharmaks
2014-12-22 21:53:22 -08:00
Jian He
fdf042dfff
YARN-2920. Changed CapacityScheduler to kill containers on nodes where node labels are changed. Contributed by Wangda Tan
2014-12-22 16:51:15 -08:00
Junping Du
a696fbb001
YARN-2939. Fix new findbugs warnings in hadoop-yarn-common. (Li Lu via junping_du)
2014-12-22 03:06:37 -08:00
Karthik Kambatla
24ee9e3431
YARN-2975. FSLeafQueue app lists are accessed without required locks. (kasha)
2014-12-20 12:17:50 -08:00
Jian He
808cba3821
YARN-2952. Fixed incorrect version check in StateStore. Contributed by Rohith Sharmaks
2014-12-19 16:56:30 -08:00
Karthik Kambatla
a22ffc3188
YARN-2738. [YARN-2574] Add FairReservationSystem for FairScheduler. (Anubhav Dhoot via kasha)
2014-12-19 15:37:12 -08:00
Jason Lowe
0402bada19
YARN-2964. RM prematurely cancels tokens for jobs that submit jobs (oozie). Contributed by Jian He
2014-12-18 23:28:18 +00:00
Junping Du
2b4b0e8847
YARN-2972. DelegationTokenRenewer thread pool never expands. Contributed by Jason Lowe
2014-12-17 17:40:00 -08:00
Karthik Kambatla
f2d150ea12
YARN-2964. FSLeafQueue#assignContainer - document the reason for using both write and read locks. (Tsuyoshi Ozawa via kasha)
2014-12-17 12:03:16 -08:00
Devaraj K
bda748ac3a
YARN-2243. Order of arguments for Preconditions.checkNotNull() is wrong in SchedulerApplicationAttempt ctor. Contributed by Devaraj K.
2014-12-12 12:34:43 +05:30
Karthik Kambatla
a2e07a5456
YARN-2910. FSLeafQueue can throw ConcurrentModificationException. (Wilfred Spiegelenburg via kasha)
2014-12-09 14:00:31 -08:00
Jian He
e69af836f3
YARN-2869. CapacityScheduler should trim sub queue names when parse configuration. Contributed by Wangda Tan
2014-12-05 17:33:39 -08:00
Jason Lowe
4b13082199
YARN-2056. Disable preemption at Queue level. Contributed by Eric Payne
2014-12-05 21:06:48 +00:00
Jian He
258623ff8b
YARN-2301. Improved yarn container command. Contributed by Naganarasimha G R
2014-12-04 12:53:18 -08:00
Karthik Kambatla
799353e2c7
YARN-2874. Deadlock in DelegationTokenRenewer which blocks RM to execute any further apps. (Naganarasimha G R via kasha)
2014-12-03 13:44:41 -08:00
Jian He
392c3aaea8
YARN-2894. Fixed a bug regarding application view acl when RM fails over. Contributed by Rohith Sharmaks
2014-12-02 17:16:35 -08:00
Jian He
52bcefca8b
YARN-2136. Changed RMStateStore to ignore store operations when fenced. Contributed by Varun Saxena
2014-12-02 10:54:48 -08:00
Jian He
a7fba0bc28
YARN-2765. Added leveldb-based implementation for RMStateStore. Contributed by Jason Lowe
2014-12-01 16:38:25 -08:00
Junping Du
c732ed760e
YARN-2907. SchedulerNode#toString should print all resource detail instead of only memory. (Contributed by Rohith)
2014-12-01 05:38:22 -08:00
Jian He
8a7ca13b13
YARN-2906. CapacitySchedulerPage shows HTML tags for a queue's Active Users. Contributed by Jason Lowe
2014-11-25 16:16:56 -08:00
Jian He
5805a81efb
YARN-2404. Removed ApplicationAttemptState and ApplicationState class in RMStateStore. Contributed by Tsuyoshi OZAWA
2014-11-25 12:48:22 -08:00
Zhijie Shen
e37a4ff0c1
YARN-2697. Remove useless RMAuthenticationHandler. Contributed by Haosong Huang.
2014-11-24 14:58:59 -08:00
Sandy Ryza
a128cca305
YARN-2669. FairScheduler: queue names shouldn't allow periods (Wei Yan via Sandy Ryza)
2014-11-21 16:06:41 -08:00
Karthik Kambatla
3114d4731d
YARN-2604. Scheduler should consider max-allocation-* in conjunction with the largest node. (Robert Kanter via kasha)
2014-11-21 10:32:28 -08:00
Karthik Kambatla
a9a0cc3679
YARN-2315. FairScheduler: Set current capacity in addition to capacity. (Zhihai Xu via kasha)
2014-11-19 20:15:40 -08:00
Karthik Kambatla
c90fb84aaa
YARN-2802. ClusterMetrics to include AM launch and register delays. (Zhihai Xu via kasha)
2014-11-19 19:50:12 -08:00
Jian He
9cb8b75ba5
YARN-2865. Fixed RM to always create a new RMContext when transitions from StandBy to Active. Contributed by Rohith Sharmaks
2014-11-19 19:48:52 -08:00
Karthik Kambatla
2fce6d6141
YARN-2690. [YARN-2574] Make ReservationSystem and its dependent classes independent of Scheduler type. (Anubhav Dhoot via kasha)
2014-11-17 16:45:57 -08:00
Jason Lowe
81c9d17af8
YARN-2414. RM web UI: app page will crash if app is failed before any attempt has been created. Contributed by Wangda Tan
2014-11-17 21:15:48 +00:00
Jian He
9b860661a5
YARN-2432. RMStateStore should process the pending events before close. Contributed by Varun Saxena
2014-11-14 17:24:09 -08:00
Sandy Ryza
1a47f890ba
YARN-2811. In Fair Scheduler, reservation fulfillments shouldn't ignore max share (Siqi Li via Sandy Ryza)
2014-11-14 15:18:56 -08:00
Jian He
d005404ef7
YARN-2856. Fixed RMAppImpl to handle ATTEMPT_KILLED event at ACCEPTED state on app recovery. Contributed by Rohith Sharmaks
2014-11-13 15:34:26 -08:00
Vinod Kumar Vavilapalli
3651fe1b08
YARN-2853. Fixed a bug in ResourceManager causing apps to hang when the user kill request races with ApplicationMaster finish. Contributed by Jian He.
2014-11-13 08:13:03 -08:00
Jason Lowe
f8aefa5e9c
YARN-2780. Log aggregated resource allocation in rm-appsummary.log. Contributed by Eric Payne
2014-11-12 17:01:15 +00:00
Vinod Kumar Vavilapalli
e76faebc95
YARN-2834. Fixed ResourceManager to ignore token-renewal failures on recovery consistent with the (somewhat incorrect) behaviour in the non-recovery case. Contributed by Jian He.
2014-11-09 18:56:58 -08:00
Zhijie Shen
9a4e0d343e
YARN-2505. Supported get/add/remove/change labels in RM REST API. Contributed by Craig Welch.
2014-11-07 20:35:46 -08:00
Vinod Kumar Vavilapalli
df36edf751
YARN-2826. Fixed user-groups mappings' refresh bug caused by YARN-2826. Contributed by Wangda Tan.
2014-11-07 19:43:55 -08:00
Vinod Kumar Vavilapalli
2ac1be7dec
YARN-2824. Fixed Capacity Scheduler to not crash when some node-labels are not mapped to queues by making default capacities per label to be zero. Contributed by Wangda Tan.
2014-11-07 10:39:37 -08:00
Vinod Kumar Vavilapalli
a5657182a7
YARN-2823. Fixed ResourceManager app-attempt state machine to inform schedulers about previous finished attempts of a running application to avoid expectation mismatch w.r.t transferred containers. Contributed by Jian He.
2014-11-07 09:28:36 -08:00
Vinod Kumar Vavilapalli
a3839a9fbf
YARN-2744. Fixed CapacityScheduler to validate node-labels correctly against queues. Contributed by Wangda Tan.
2014-11-06 17:28:12 -08:00
Jian He
395275af86
YARN-2579. Fixed a deadlock issue when EmbeddedElectorService and FatalEventDispatcher try to transition RM to StandBy at the same time. Contributed by Rohith Sharmaks
2014-11-05 16:59:54 -08:00
Vinod Kumar Vavilapalli
834e931d8e
YARN-2805. Fixed ResourceManager to load HA configs correctly before kerberos login. Contributed by Wangda Tan.
2014-11-05 15:30:33 -08:00
Karthik Kambatla
b2cd269802
YARN-2010. Handle app-recovery failures gracefully. (Jian He and Karthik Kambatla via kasha)
2014-11-04 17:45:24 -08:00
Vinod Kumar Vavilapalli
ec6cbece8e
YARN-2795. Fixed ResourceManager to not crash loading node-label data from HDFS in secure mode. Contributed by Wangda Tan.
2014-11-03 13:44:06 -08:00
Vinod Kumar Vavilapalli
d1828d9443
YARN-2779. Fixed ResourceManager to not require delegation tokens for communicating with Timeline Service. Contributed by Zhijie Shen.
2014-10-30 23:16:19 -07:00
Vinod Kumar Vavilapalli
e0233c16eb
YARN-2698. Moved some node label APIs to be correctly placed in client protocol. Contributed by Wangda Tan.
2014-10-30 22:59:31 -07:00
Karthik Kambatla
179cab81e0
YARN-2712. TestWorkPreservingRMRestart: Augment FS tests with queue and headroom checks. (Tsuyoshi Ozawa via kasha)
2014-10-30 00:29:07 -07:00
Karthik Kambatla
782971ae7a
YARN-2742. FairSchedulerConfiguration should allow extra spaces between value and unit. (Wei Yan via kasha)
2014-10-29 10:24:57 -07:00
Jian He
d5e0a09721
YARN-2503. Added node labels in web UI. Contributed by Wangda Tan
2014-10-28 17:57:54 -07:00
Vinod Kumar Vavilapalli
a16d022ca4
YARN-2704. Changed ResourceManager to optionally obtain tokens itself for the sake of localization and log-aggregation for long-running services. Contributed by Jian He.
2014-10-27 15:49:47 -07:00
Xuan
ce1a4419a6
YARN-2726. CapacityScheduler should explicitly log when an accessible label has no capacity. Contributed by Wangda Tan
2014-10-27 10:26:21 -07:00
Vinod Kumar Vavilapalli
0186645505
YARN-2743. Fixed a bug in ResourceManager that was causing RMDelegationToken identifiers to be tampered and thus causing app submission failures in secure mode. Contributed by Jian He.
2014-10-26 11:14:34 -07:00
Zhijie Shen
0f3b6900be
YARN-2209. Replaced AM resync/shutdown command with corresponding exceptions and made related MR changes. Contributed by Jian He.
2014-10-23 21:56:03 -07:00
Zhijie Shen
7e3b5e6f5c
YARN-2721. Suppress NodeExist exception thrown by ZKRMStateStore when it retries creating znode. Contributed by Jian He.
2014-10-21 21:49:17 -07:00
Vinod Kumar Vavilapalli
c0e034336c
YARN-2715. Fixed ResourceManager to respect common configurations for proxy users/groups beyond just the YARN level config. Contributed by Zhijie Shen.
2014-10-21 20:09:40 -07:00
Vinod Kumar Vavilapalli
8256766498
YARN-2504. Enhanced RM Admin CLI to support management of node-labels. Contributed by Wangda Tan.
2014-10-18 12:07:40 -07:00
Vinod Kumar Vavilapalli
e9c66e8fd2
YARN-2705. Fixed bugs in ResourceManager node-label manager that were causing test-failures: added a dummy in-memory labels-manager. Contributed by Wangda Tan.
2014-10-17 18:26:12 -07:00
Jian He
c3de2412eb
YARN-1879. Marked Idempotent/AtMostOnce annotations to ApplicationMasterProtocol for RM fail over. Contributed by Tsuyoshi OZAWA
2014-10-17 16:35:27 -07:00
Jian He
a6aa6e42ca
YARN-2588. Standby RM fails to transitionToActive if previous transitionToActive failed with ZK exception. Contributed by Rohith Sharmaks
2014-10-17 10:54:24 -07:00
Vinod Kumar Vavilapalli
abae63caf9
YARN-2699. Fixed a bug in CommonNodeLabelsManager that caused tests to fail when using ephemeral ports on NodeIDs. Contributed by Wangda Tan.
2014-10-17 08:58:08 -07:00
Vinod Kumar Vavilapalli
f2ea555ac6
YARN-2496. Enhanced Capacity Scheduler to have basic support for allocating resources based on node-labels. Contributed by Wangda Tan.
YARN-2500. Enhanced ResourceManager to support schedulers allocating resources based on node-labels. Contributed by Wangda Tan.
2014-10-15 18:33:06 -07:00
Zhijie Shen
1220bb72d4
YARN-2656. Made RM web services authentication filter support proxy user. Contributed by Varun Vasudev and Zhijie Shen.
2014-10-14 21:50:46 -07:00
Karthik Kambatla
da709a2eac
YARN-2641. Decommission nodes on -refreshNodes instead of next NM-RM heartbeat. (Zhihai Xu via kasha)
2014-10-13 16:23:04 -07:00
Jian He
f9680d9a16
YARN-2308. Changed CapacityScheduler to explicitly throw exception if the queue to which the apps were submitted is changed across RM restart. Contributed by Craig Welch & Chang Li
2014-10-13 14:09:04 -07:00
Vinod Kumar Vavilapalli
db7f165319
YARN-2494. Added NodeLabels Manager internal API and implementation. Contributed by Wangda Tan.
2014-10-10 11:44:21 -07:00
Jian He
30d56fdbb4
YARN-1857. CapacityScheduler headroom doesn't account for other AM's running. Contributed by Chen He and Craig Welch
2014-10-07 13:45:04 -07:00
Jian He
519e5a7dd2
YARN-2644. Fixed CapacityScheduler to return up-to-date headroom when AM allocates. Contributed by Craig Welch
2014-10-06 15:48:46 -07:00
Jian He
ea26cc0b4a
YARN-2615. Changed ClientToAMTokenIdentifier/RM(Timeline)DelegationTokenIdentifier to use protobuf as payload. Contributed by Junping Du
2014-10-06 10:47:43 -07:00
Chris Douglas
c8212bacb1
YARN-1051. Add a system for creating reservations of cluster capacity.
Contributed by Subru Krishnan and Carlo Curino.
2014-10-03 17:09:02 -07:00
subru
a2986234be
YARN-2611. Fixing jenkins findbugs warning and TestRMWebServicesCapacitySched for branch YARN-1051. Contributed by Subru Krishnan and Carlo Curino.
(cherry picked from commit c47464aba407d1dafe10be23fe454f0489cc4367)
2014-10-03 15:43:23 -07:00
subru
5e10a13bb4
YARN-2576. Making test patch pass in branch. Contributed by Subru Krishnan and Carlo Curino.
(cherry picked from commit 90ac0be86b898aefec5471db4027554c8e1b310c)
2014-10-03 15:43:13 -07:00
subru
6261f7cc69
YARN-2080. Integrating reservation system with ResourceManager and client-RM protocol. Contributed by Subru Krishnan and Carlo Curino.
(cherry picked from commit 8baeaead8532898163f1006276b731a237b1a559)
Conflicts:
hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/main/java/org/apache/hadoop/yarn/server/resourcemanager/rmapp/RMAppImpl.java
2014-10-03 15:42:43 -07:00
subru
3418c56bcf
YARN-1712. Plan follower that synchronizes the current state of reservation subsystem with the scheduler. Contributed by Subru Krishnan and Carlo Curino.
(cherry picked from commit 169085319b8b76641f8b9f6840a3fef06d221e2b)
2014-10-03 15:42:10 -07:00
carlo curino
b6df0dddcd
YARN-1711. Policy to enforce instantaneous and over-time quotas on user reservation. Contributed by Carlo Curino and Subru Krishnan.
(cherry picked from commit c4918cb4cb5a267a8cfd6eace28fcfe7ad6174e8)
2014-10-03 15:42:03 -07:00
carlo curino
f66ffcf832
YARN-1710. Logic to find allocations within a Plan that satisfy user ReservationRequest(s). Contributed by Carlo Curino and Subru Krishnan.
(cherry picked from commit aef7928899b37262773f3dc117157bb746bf8918)
2014-10-03 15:41:57 -07:00
subru
cf4b34282a
YARN-1709. In-memory data structures used to track resources over time to enable reservations.
(cherry picked from commit 0d8b2cd88b958b1e602fd4ea4078ef8d4742a7c3)
2014-10-03 15:41:51 -07:00
carlo curino
1c6950354f
YARN-2475. Logic for responding to capacity drops for the ReservationSystem. Contributed by Carlo Curino and Subru Krishnan.
(cherry picked from commit f83a07f266f2c5e6eead554d8a331ed7e75e10d5)
2014-10-03 15:41:21 -07:00
carlo curino
eb3e40b833
YARN-1707. Introduce APIs to add/remove/resize queues in the CapacityScheduler. Contributed by Carlo Curino and Subru Krishnan
(cherry picked from commit aac47fda7fecda9fc18ade34d633eca895865a70)
Conflicts:
hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/main/java/org/apache/hadoop/yarn/server/resourcemanager/scheduler/capacity/CapacityScheduler.java
2014-10-03 15:41:02 -07:00
Jian He
054f285526
YARN-2628. Capacity scheduler with DominantResourceCalculator carries out reservation even though slots are free. Contributed by Varun Vasudev
2014-10-02 15:13:33 -07:00
Zhijie Shen
52bbe0f11b
YARN-2630. Prevented previous AM container status from being acquired by the current restarted AM. Contributed by Jian He.
2014-10-01 15:38:11 -07:00
Jian He
bbff96be48
YARN-2602. Fixed possible NPE in ApplicationHistoryManagerOnTimelineStore. Contributed by Zhijie Shen
2014-09-30 16:44:17 -07:00
Karthik Kambatla
14d60dadc2
YARN-2594. Potential deadlock in RM when querying ApplicationResourceUsageReport. (Wangda Tan via kasha)
2014-09-30 16:12:27 -07:00
Zhijie Shen
9582a50176
YARN-2627. Added the info logs of attemptFailuresValidityInterval and number of previous failed attempts. Contributed by Xuan Gong.
2014-09-30 12:57:45 -07:00
Jason Lowe
9c22065109
YARN-1769. CapacityScheduler: Improve reservations. Contributed by Thomas Graves
2014-09-29 14:12:18 +00:00
Karthik Kambatla
f4357240a6
YARN-2608. FairScheduler: Potential deadlocks in loading alloc files and clock access. (Wei Yan via kasha)
2014-09-25 17:42:51 -07:00
Jason Lowe
8269bfa613
YARN-2523. ResourceManager UI showing negative value for "Decommissioned Nodes" field. Contributed by Rohith
2014-09-25 22:37:05 +00:00
Zhijie Shen
72b0881ca6
YARN-2546. Made REST API for application creation/submission use numeric and boolean types instead of the string of them. Contributed by Varun Vasudev.
2014-09-24 17:57:32 -07:00
Zhijie Shen
c86674a3a4
YARN-2581. Passed LogAggregationContext to NM via ContainerTokenIdentifier. Contributed by Xuan Gong.
2014-09-24 17:50:26 -07:00
Karthik Kambatla
39c87344e1
YARN-2596. TestWorkPreservingRMRestart fails with FairScheduler. (kasha)
2014-09-24 15:23:31 -07:00
Karthik Kambatla
568d3dc2bb
YARN-1959. Fix headroom calculation in FairScheduler. (Anubhav Dhoot via kasha)
2014-09-22 23:49:39 -07:00
Karthik Kambatla
43efdd30b5
YARN-2539. FairScheduler: Set the default value for maxAMShare to 0.5. (Wei Yan via kasha)
2014-09-22 16:09:52 -07:00
Karthik Kambatla
23e17ce0ff
YARN-2540. FairScheduler: Queue filters not working on scheduler page in RM UI. (Ashwin Shankar via kasha)
2014-09-22 15:40:47 -07:00
Jian He
0a641496c7
YARN-1372. Ensure all completed containers are reported to the AMs across RM restart. Contributed by Anubhav Dhoot
2014-09-22 10:30:53 -07:00
Karthik Kambatla
c50fc92502
YARN-2452. TestRMApplicationHistoryWriter fails with FairScheduler. (Zhihai Xu via kasha)
2014-09-21 13:15:04 -07:00
Jian He
444acf8ea7
YARN-2565. Fixed RM to not use FileSystemApplicationHistoryStore unless explicitly set. Contributed by Zhijie Shen
2014-09-19 11:26:29 -07:00
Jason Lowe
a337f0e354
YARN-2561. MR job client cannot reconnect to AM after NM restart. Contributed by Junping Du
2014-09-18 21:34:40 +00:00
Jason Lowe
9ea7b6c063
YARN-2363. Submitted applications occasionally lack a tracking URL. Contributed by Jason Lowe
2014-09-18 20:13:16 +00:00
Vinod Kumar Vavilapalli
485c96e3cb
YARN-2001. Added a time threshold for RM to wait before starting container allocations after restart/failover. Contributed by Jian He.
2014-09-18 11:03:12 -07:00
Jian He
ee21b13cbd
YARN-2559. Fixed NPE in SystemMetricsPublisher when retrieving FinalApplicationStatus. Contributed by Zhijie Shen
2014-09-17 21:44:15 -07:00
junping_du
90a0c03f0a
YARN-1250. Generic history service should support application-acls. (Contributed by Zhijie Shen)
2014-09-16 18:20:49 -07:00
Vinod Kumar Vavilapalli
14e2639fd0
YARN-611. Added an API to let apps specify an interval beyond which AM failures should be ignored towards counting max-attempts. Contributed by Xuan Gong.
2014-09-13 18:04:05 -07:00
XuanGong
e65ae575a0
YARN-2456. Possible livelock in CapacityScheduler when RM is recovering apps. Contributed by Jian He
2014-09-12 15:21:46 -07:00
Jian He
3122daa802
YARN-2229. Changed the integer field of ContainerId to be long type. Contributed by Tsuyoshi OZAWA
2014-09-12 10:33:33 -07:00
Jason Lowe
78b048393a
YARN-2484. FileSystemRMStateStore#readFile/writeFile should close FSData(In|Out)putStream in final block. Contributed by Tsuyoshi OZAWA
2014-09-12 14:09:14 +00:00
Zhijie Shen
469ea3dcef
YARN-2538. Added logs when RM sends rolled-over AMRMToken to AM. Contributed by Xuan Gong.
2014-09-11 22:19:00 -07:00
junping_du
6b8b1608e6
YARN-2033. Merging generic-history into the Timeline Store (Contributed by Zhijie Shen)
2014-09-12 10:04:51 +08:00
Karthik Kambatla
c11ada5ea6
YARN-2534. FairScheduler: Potential integer overflow calculating totalMaxShare. (Zhihai Xu via kasha)
2014-09-11 12:06:06 -07:00
Jian He
83be3ad444
YARN-415. Capture aggregate memory allocation at the app-level for chargeback. Contributed by Eric Payne & Andrey Klochkov
2014-09-10 18:20:54 -07:00
XUAN
47bdfa044a
YARN-2459. RM crashes if App gets rejected for any reason and HA is enabled. Contributed by Jian He
2014-09-10 11:44:41 -07:00
Vinod Kumar Vavilapalli
b67d5ba784
YARN-2448. Changed ApplicationMasterProtocol to expose RM-recognized resource types to the AMs. Contributed by Varun Vasudev.
2014-09-10 10:15:47 -07:00
Karthik Kambatla
3072c83b38
YARN-1458. FairScheduler: Zero weight can lead to livelock. (Zhihai Xu via kasha)
2014-09-10 08:26:14 -07:00
Karthik Kambatla
1dcaba9a7a
YARN-2394. FairScheduler: Configure fairSharePreemptionThreshold per queue. (Wei Yan via kasha)
2014-09-03 10:27:36 -07:00
Karthik Kambatla
0f34e6f387
YARN-2395. FairScheduler: Preemption timeout should be configurable per queue. (Wei Yan via kasha)
2014-08-30 01:17:13 -07:00
Jian He
5c14bc426b
YARN-1506. Changed RMNode/SchedulerNode to update resource with event notification. Contributed by Junping Du
2014-08-29 23:05:51 -07:00
Karthik Kambatla
270a271f53
YARN-2360. Fair Scheduler: Display dynamic fair share for queues on the scheduler page. (Ashwin Shankar and Wei Yan via kasha)
2014-08-29 17:15:38 -07:00
Jian He
c686aa3533
YARN-2447. RM web service app submission doesn't pass secrets correctly. Contributed by Varun Vasudev
2014-08-29 11:40:47 -07:00
Hitesh Shah
3de66011c2
YARN-2450. Fix typos in log messages. Contributed by Ray Chiang.
2014-08-29 11:16:36 -07:00
Karthik Kambatla
fa80ca49bd
YARN-2405. NPE in FairSchedulerAppsBlock. (Tsuyoshi Ozawa via kasha)
2014-08-28 23:21:37 -07:00
Jian
7b3e27ab73
YARN-2406. Move RM recovery related proto to yarn_server_resourcemanager_recovery.proto. Contributed by Tsuyoshi OZAWA
2014-08-28 21:47:26 -07:00
Karthik Kambatla
d16bfd1d0f
YARN-1326. RM should log using RMStore at startup time. (Tsuyoshi Ozawa via kasha)
2014-08-27 01:43:58 -07:00
Karthik Kambatla
0097b15e21
YARN-2393. FairScheduler: Add the notion of steady fair share. (Wei Yan via kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1619845 13f79535-47bb-0310-9956-ffa450edef68
2014-08-22 15:44:47 +00:00
Jason Darrell Lowe
4236c6600e
YARN-2434. RM should not recover containers from previously failed attempt when AM restart is not enabled. Contributed by Jian He
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1619614 13f79535-47bb-0310-9956-ffa450edef68
2014-08-21 22:41:34 +00:00
Karthik Kambatla
e1dd210fa5
YARN-1919. Potential NPE in EmbeddedElectorService#stop. (Tsuyoshi Ozawa via kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1619251 13f79535-47bb-0310-9956-ffa450edef68
2014-08-20 22:10:52 +00:00
Zhijie Shen
f6a778c372
YARN-2249. Avoided AM release requests being lost on work preserving RM restart. Contributed by Jian He.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1618972 13f79535-47bb-0310-9956-ffa450edef68
2014-08-19 20:33:49 +00:00
Jian He
375c221960
YARN-2409. RM ActiveToStandBy transition missing stopping previous rmDispatcher. Contributed by Rohith
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1618915 13f79535-47bb-0310-9956-ffa450edef68
2014-08-19 17:49:39 +00:00
Jian He
519c4be95a
YARN-2411. Support simple user and group mappings to queues. Contributed by Ram Venkatesh
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1618542 13f79535-47bb-0310-9956-ffa450edef68
2014-08-18 06:08:45 +00:00
Jian He
c3084d6c16
YARN-2389. Added functionality for schedulers to kill all applications in a queue. Contributed by Subramaniam Venkatraman Krishnan
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1618294 13f79535-47bb-0310-9956-ffa450edef68
2014-08-15 23:53:57 +00:00
Jian He
7360cec692
YARN-2378. Added support for moving applications across queues in CapacityScheduler. Contributed by Subramaniam Venkatraman Krishnan
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1618106 13f79535-47bb-0310-9956-ffa450edef68
2014-08-15 06:00:31 +00:00
Zhijie Shen
a9023c2736
YARN-2397. Avoided loading two authentication filters for RM and TS web interfaces. Contributed by Varun Vasudev.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1618054 13f79535-47bb-0310-9956-ffa450edef68
2014-08-14 21:17:20 +00:00
Karthik Kambatla
5197f8c3c5
YARN-1370. Fair scheduler to re-populate container allocation state. (Anubhav Dhoot via kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1617645 13f79535-47bb-0310-9956-ffa450edef68
2014-08-13 01:38:59 +00:00
Karthik Kambatla
4239695588
YARN-2399. Delete old versions of files. FairScheduler: Merge AppSchedulable and FSSchedulerApp into FSAppAttempt. (kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1617619 13f79535-47bb-0310-9956-ffa450edef68
2014-08-12 22:51:57 +00:00
Karthik Kambatla
486e718fc1
YARN-2399. FairScheduler: Merge AppSchedulable and FSSchedulerApp into FSAppAttempt. (kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1617600 13f79535-47bb-0310-9956-ffa450edef68
2014-08-12 21:43:27 +00:00
Junping Du
c2febdcbaa
YARN-1337. Recover containers upon nodemanager restart. (Contributed by Jason Lowe)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1617448 13f79535-47bb-0310-9956-ffa450edef68
2014-08-12 10:56:13 +00:00
Jian He
c4dc685343
YARN-2138. Cleaned up notifyDone* APIs in RMStateStore. Contributed by Varun Saxena
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1617341 13f79535-47bb-0310-9956-ffa450edef68
2014-08-11 18:24:24 +00:00
Karthik Kambatla
da7b508ffc
YARN-2361. RMAppAttempt state machine entries for KILLED state has duplicate event entries. (Zhihai Xu via kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1617190 13f79535-47bb-0310-9956-ffa450edef68
2014-08-11 01:42:26 +00:00
Karthik Kambatla
bdd3e2ce49
YARN-2337. ResourceManager sets ClientRMService in RMContext multiple times. (Zhihai Xu via kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1617183 13f79535-47bb-0310-9956-ffa450edef68
2014-08-11 00:13:27 +00:00
Karthik Kambatla
a7643f4de7
YARN-2026. Fair scheduler: Consider only active queues for computing fairshare. (Ashwin Shankar via kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1616915 13f79535-47bb-0310-9956-ffa450edef68
2014-08-09 02:10:00 +00:00
Xuan Gong
eeb4acd955
YARN-2212: ApplicationMaster needs to find a way to update the AMRMToken periodically. Contributed by Xuan Gong
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1616892 13f79535-47bb-0310-9956-ffa450edef68
2014-08-08 21:38:24 +00:00
Karthik Kambatla
6d39525f6b
YARN-2352. Add missing file. FairScheduler: Collect metrics on duration of critical methods that affect performance. (kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1616784 13f79535-47bb-0310-9956-ffa450edef68
2014-08-08 14:38:18 +00:00
Karthik Kambatla
14864e9c7c
YARN-2352. FairScheduler: Collect metrics on duration of critical methods that affect performance. (kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1616769 13f79535-47bb-0310-9956-ffa450edef68
2014-08-08 14:17:54 +00:00
Jian He
8437df8ba9
YARN-2008. Fixed CapacityScheduler to calculate headroom based on max available capacity instead of configured max capacity. Contributed by Craig Welch
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1616580 13f79535-47bb-0310-9956-ffa450edef68
2014-08-07 20:00:04 +00:00
Karthik Kambatla
8feddc4c84
YARN-2359. Application hangs when it fails to launch AM container. (Zhihai Xu via kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1616375 13f79535-47bb-0310-9956-ffa450edef68
2014-08-07 00:06:17 +00:00
Junping Du
b8f151231b
YARN-1354. Recover applications upon nodemanager restart. (Contributed by Jason Lowe)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1615550 13f79535-47bb-0310-9956-ffa450edef68
2014-08-04 13:25:37 +00:00
Junping Du
04ae560370
YARN-2370. Fix comment in o.a.h.y.server.resourcemanager.scheduler.AppSchedulingInfo (Contributed by Wenwu Peng)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1615469 13f79535-47bb-0310-9956-ffa450edef68
2014-08-04 01:15:46 +00:00
Xuan Gong
e52f67e389
YARN-1994. Expose YARN/MR endpoints on multiple interfaces. Contributed by Craig Welch, Milan Potocnik, and Arpit Agarwal
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1614981 13f79535-47bb-0310-9956-ffa450edef68
2014-07-31 20:06:02 +00:00
Zhijie Shen
1d6e178144
YARN-2347. Consolidated RMStateVersion and NMDBSchemaVersion into Version in yarn-server-common. Contributed by Junping Du.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1614838 13f79535-47bb-0310-9956-ffa450edef68
2014-07-31 09:27:43 +00:00
Karthik Kambatla
c0b49ff107
YARN-2328. FairScheduler: Verify update and continuous scheduling threads are stopped when the scheduler is stopped. (kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1614432 13f79535-47bb-0310-9956-ffa450edef68
2014-07-29 17:41:52 +00:00
Zhijie Shen
d6532d3a77
YARN-2247. Made RM web services authenticate users via kerberos and delegation token. Contributed by Varun Vasudev.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1613821 13f79535-47bb-0310-9956-ffa450edef68
2014-07-27 17:55:06 +00:00
Jian He
d4fec34933
YARN-2211. Persist AMRMToken master key in RMStateStore for RM recovery. Contributed by Xuan Gong
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1613515 13f79535-47bb-0310-9956-ffa450edef68
2014-07-25 20:42:37 +00:00
Karthik Kambatla
1e553858f9
YARN-2214. FairScheduler: preemptContainerPreCheck() in FSParentQueue delays convergence towards fairness. (Ashwin Shankar via kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1613459 13f79535-47bb-0310-9956-ffa450edef68
2014-07-25 16:13:07 +00:00
Jason Darrell Lowe
28fca92521
YARN-2147. client lacks delegation token exception details when application submit fails. Contributed by Chen He
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1612950 13f79535-47bb-0310-9956-ffa450edef68
2014-07-23 21:40:57 +00:00
Sanford Ryza
c88402f36d
YARN-2313. Livelock can occur in FairScheduler when there are lots of running apps (Tsuyoshi Ozawa via Sandy Ryza)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1612769 13f79535-47bb-0310-9956-ffa450edef68
2014-07-23 05:00:52 +00:00
Karthik Kambatla
ff77582991
YARN-2273. NPE in ContinuousScheduling thread when we lose a node. (Wei Yan via kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1612720 13f79535-47bb-0310-9956-ffa450edef68
2014-07-22 22:44:38 +00:00
Junping Du
afb9394c91
YARN-2242. Addendum patch. Improve exception information on AM launch crashes. (Contributed by Li Lu)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1612565 13f79535-47bb-0310-9956-ffa450edef68
2014-07-22 13:07:23 +00:00
Karthik Kambatla
266a4ff6a5
YARN-2131. Addendum. Add a way to format the RMStateStore. (Robert Kanter via kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1612443 13f79535-47bb-0310-9956-ffa450edef68
2014-07-22 01:30:36 +00:00
Sanford Ryza
7c71a3b876
YARN-2323. FairShareComparator creates too many Resource objects (Hong Zhiguo via Sandy Ryza)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1612187 13f79535-47bb-0310-9956-ffa450edef68
2014-07-21 05:00:27 +00:00
Karthik Kambatla
8871d8ed9f
YARN-2244. FairScheduler missing handling of containers for unknown application attempts. (Anubhav Dhoot via kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1611840 13f79535-47bb-0310-9956-ffa450edef68
2014-07-19 00:12:05 +00:00
Xuan Gong
f1b831ccfb
YARN-2208. AMRMTokenManager need to have a way to roll over AMRMToken. Contributed by Xuan Gong
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1611820 13f79535-47bb-0310-9956-ffa450edef68
2014-07-18 21:46:29 +00:00
Vinod Kumar Vavilapalli
bda23181bf
YARN-2219. Changed ResourceManager to avoid AMs and NMs getting exceptions after RM recovery but before scheduler learns about apps and app-attempts. Contributed by Jian He.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1611222 13f79535-47bb-0310-9956-ffa450edef68
2014-07-17 00:14:56 +00:00
Vinod Kumar Vavilapalli
030580387a
YARN-2233. Implemented ResourceManager web-services to create, renew and cancel delegation tokens. Contributed by Varun Vasudev.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1610876 13f79535-47bb-0310-9956-ffa450edef68
2014-07-15 23:00:17 +00:00
Mayank Bansal
43589a8df7
YARN-1408. Preemption caused Invalid State Event: ACQUIRED at KILLED and caused a task timeout for 30mins. (Sunil G via mayank)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1610860 13f79535-47bb-0310-9956-ffa450edef68
2014-07-15 21:48:58 +00:00
Vinod Kumar Vavilapalli
c6cc6a6a8e
YARN-2260. Fixed ResourceManager's RMNode to correctly remember containers when nodes resync during work-preserving RM restart. Contributed by Jian He.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1610557 13f79535-47bb-0310-9956-ffa450edef68
2014-07-14 23:32:03 +00:00
Karthik Kambatla
4d211d5fde
YARN-2274. FairScheduler: Add debug information about cluster capacity, availability and reservations. (kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1609942 13f79535-47bb-0310-9956-ffa450edef68
2014-07-12 15:10:19 +00:00
Jian He
c9fb040c87
YARN-2181. Added preemption info to logs and RM web UI. Contributed by Wangda Tan
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1609561 13f79535-47bb-0310-9956-ffa450edef68
2014-07-10 20:03:35 +00:00
Karthik Kambatla
8fbca62a90
YARN-2131. Add a way to format the RMStateStore. (Robert Kanter via kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1609278 13f79535-47bb-0310-9956-ffa450edef68
2014-07-09 19:58:43 +00:00
Sanford Ryza
5644f529f3
YARN-2250. FairScheduler.findLowestCommonAncestorQueue returns null when queues not identical (Krisztian Horvath via Sandy Ryza)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1607872 13f79535-47bb-0310-9956-ffa450edef68
2014-07-04 15:16:43 +00:00
Junping Du
5cb489f9d3
YARN-2242. Improve exception information on AM launch crashes. (Contributed by Li Lu)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1607655 13f79535-47bb-0310-9956-ffa450edef68
2014-07-03 14:15:19 +00:00
Vinod Kumar Vavilapalli
45b191e38c
YARN-2232. Fixed ResourceManager to allow DelegationToken owners to be able to cancel their own tokens in secure mode. Contributed by Varun Vasudev.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1607484 13f79535-47bb-0310-9956-ffa450edef68
2014-07-02 21:36:42 +00:00
Karthik Kambatla
447c1c233b
YARN-2241. ZKRMStateStore: On startup, show nicer messages if znodes already exist. (Robert Kanter via kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1607473 13f79535-47bb-0310-9956-ffa450edef68
2014-07-02 20:33:26 +00:00
Mayank Bansal
03a25d2cc1
YARN-2022. Preempting an Application Master container can be kept as least priority when multiple applications are marked for preemption by ProportionalCapacityPreemptionPolicy (Sunil G via mayank)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1607227 13f79535-47bb-0310-9956-ffa450edef68
2014-07-02 01:54:47 +00:00
Vinod Kumar Vavilapalli
075ff276ca
YARN-1713. Added get-new-app and submit-app functionality to RM web services. Contributed by Varun Vasudev.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1607216 13f79535-47bb-0310-9956-ffa450edef68
2014-07-02 00:23:07 +00:00
Jian He
b0c51504c4
YARN-2052. Embedded an epoch number in container id to ensure the uniqueness of container id after RM restarts. Contributed by Tsuyoshi OZAWA
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1606557 13f79535-47bb-0310-9956-ffa450edef68
2014-06-29 18:24:03 +00:00
Jian He
b717d44b52
YARN-614. Changed ResourceManager to not count disk failure, node loss and RM restart towards app failures. Contributed by Xuan Gong
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1606407 13f79535-47bb-0310-9956-ffa450edef68
2014-06-28 23:37:46 +00:00
Jason Darrell Lowe
73927ba695
YARN-2104. Scheduler queue filter failed to work because index of queue column changed. Contributed by Wangda Tan
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1606265 13f79535-47bb-0310-9956-ffa450edef68
2014-06-27 23:11:12 +00:00
Ravi Prakash
f084170a0a
YARN-2163. WebUI: Order of AppId in apps table should be consistent with ApplicationId.compareTo()
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1605964 13f79535-47bb-0310-9956-ffa450edef68
2014-06-27 06:43:46 +00:00
Vinod Kumar Vavilapalli
9571db19eb
YARN-2171. Improved CapacityScheduling to not lock on nodemanager-count when AMs heartbeat in. Contributed by Jason Lowe.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1605616 13f79535-47bb-0310-9956-ffa450edef68
2014-06-25 21:56:42 +00:00
Jian He
c3f1c30e65
YARN-1365. Changed ApplicationMasterService to allow an app to re-register after RM restart. Contributed by Anubhav Dhoot
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1605263 13f79535-47bb-0310-9956-ffa450edef68
2014-06-25 04:42:39 +00:00
Vinod Kumar Vavilapalli
e285b98f0f
YARN-2152. Added missing information into ContainerTokenIdentifier so that NodeManagers can report the same to RM when RM restarts. Contributed by Jian He.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1605205 13f79535-47bb-0310-9956-ffa450edef68
2014-06-24 21:43:22 +00:00
Thomas Graves
1f9a0fd927
YARN-2072. RM/NM UIs and webservices are missing vcore information. (Nathan Roberts via tgraves)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1605162 13f79535-47bb-0310-9956-ffa450edef68
2014-06-24 19:34:34 +00:00
Sanford Ryza
29c102cad0
YARN-2111. In FairScheduler.attemptScheduling, we don't count containers as assigned if they have 0 memory but non-zero cores (Sandy Ryza)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1605113 13f79535-47bb-0310-9956-ffa450edef68
2014-06-24 15:40:39 +00:00
Vinod Kumar Vavilapalli
d16470025a
YARN-2074. Changed ResourceManager to not count AM preemptions towards app failures. Contributed by Jian He.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1605106 13f79535-47bb-0310-9956-ffa450edef68
2014-06-24 15:15:12 +00:00
Karthik Kambatla
6fcbf9b848
YARN-2187. FairScheduler: Disable max-AM-share check by default. (Robert Kanter via kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1604321 13f79535-47bb-0310-9956-ffa450edef68
2014-06-21 07:30:07 +00:00
Jian He
95897ca14b
YARN-1885. Fixed a bug that RM may not send application-clean-up signal to NMs where the completed applications previously ran in case of RM restart. Contributed by Wangda Tan
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1603028 13f79535-47bb-0310-9956-ffa450edef68
2014-06-16 23:56:12 +00:00
Karthik Kambatla
082c2ab8b8
YARN-2159. Better logging in SchedulerNode#allocateContainer. (Ray Chiang via kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1603003 13f79535-47bb-0310-9956-ffa450edef68
2014-06-16 20:45:38 +00:00
Vinod Kumar Vavilapalli
dc7dd1fa19
YARN-1702. Added kill app functionality to RM web services. Contributed by Varun Vasudev.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1602298 13f79535-47bb-0310-9956-ffa450edef68
2014-06-12 21:31:52 +00:00
Karthik Kambatla
4bc91b44c9
YARN-2155. FairScheduler: Incorrect threshold check for preemption. (Wei Yan via kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1602295 13f79535-47bb-0310-9956-ffa450edef68
2014-06-12 21:23:32 +00:00
Jian He
c6e29a9f06
YARN-2125. Changed ProportionalCapacityPreemptionPolicy to log CSV in debug level. Contributed by Wangda Tan
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1601980 13f79535-47bb-0310-9956-ffa450edef68
2014-06-11 18:31:28 +00:00
Jian He
710a8693e5
YARN-2124. Fixed NPE in ProportionalCapacityPreemptionPolicy. Contributed by Wangda Tan
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1601964 13f79535-47bb-0310-9956-ffa450edef68
2014-06-11 17:30:18 +00:00
Karthik Kambatla
5de6f72054
YARN-1424. RMAppAttemptImpl should return the DummyApplicationResourceUsageReport for all invalid accesses. (Ray Chiang via kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1601742 13f79535-47bb-0310-9956-ffa450edef68
2014-06-10 19:03:06 +00:00
Jian He
c94f2cec3a
Augmented RMStateStore with state machine. Contributed by Binglin Chang.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1601491 13f79535-47bb-0310-9956-ffa450edef68
2014-06-09 19:44:31 +00:00
Vinod Kumar Vavilapalli
424fd9494f
YARN-1368. Added core functionality of recovering container state into schedulers after ResourceManager Restart so as to preserve running work in the cluster. Contributed by Jian He.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1601303 13f79535-47bb-0310-9956-ffa450edef68
2014-06-09 03:09:21 +00:00
Karthik Kambatla
9958072d53
YARN-2132. ZKRMStateStore.ZKAction#runWithRetries doesn't log the exception it encounters. (Vamsee Yarlagadda via kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1601065 13f79535-47bb-0310-9956-ffa450edef68
2014-06-07 06:12:59 +00:00
Karthik Kambatla
85d4c787e0
YARN-2128. FairScheduler: Incorrect calculation of amResource usage. (Wei Yan via kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1601050 13f79535-47bb-0310-9956-ffa450edef68
2014-06-07 01:21:33 +00:00
Karthik Kambatla
969bef3c77
YARN-2122. In AllocationFileLoaderService, the reloadThread should be created in init() and started in start(). (Robert Kanter via kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1601045 13f79535-47bb-0310-9956-ffa450edef68
2014-06-07 00:49:20 +00:00
Karthik Kambatla
d1d6bb70f4
YARN-2061. Revisit logging levels in ZKRMStateStore. (Ray Chiang via kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1600498 13f79535-47bb-0310-9956-ffa450edef68
2014-06-04 21:11:09 +00:00
Sanford Ryza
16caa3fd18
YARN-1913. With Fair Scheduler, cluster can logjam when all resources are consumed by AMs (Wei Yan via Sandy Ryza)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1599400 13f79535-47bb-0310-9956-ffa450edef68
2014-06-03 00:56:48 +00:00
Karthik Kambatla
0aad2d56df
YARN-1550. NPE in FairSchedulerAppsBlock#render. (Anubhav Dhoot via kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1599345 13f79535-47bb-0310-9956-ffa450edef68
2014-06-02 20:22:52 +00:00
Karthik Kambatla
a4ba451802
YARN-1474. Make schedulers services. (Tsuyoshi Ozawa via kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1598908 13f79535-47bb-0310-9956-ffa450edef68
2014-05-31 19:33:09 +00:00
Vinod Kumar Vavilapalli
23c325ad47
YARN-2115. Replaced RegisterNodeManagerRequest's ContainerStatus with a new NMContainerStatus which has more information that is needed for work-preserving RM-restart. Contributed by Jian He.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1598790 13f79535-47bb-0310-9956-ffa450edef68
2014-05-31 00:20:50 +00:00
Karthik Kambatla
49a3a0cd0c
YARN-2054. Better defaults for YARN ZK configs for retries and retry-interval when HA is enabled. (kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1598630 13f79535-47bb-0310-9956-ffa450edef68
2014-05-30 15:24:49 +00:00
Sanford Ryza
342da5b4d3
YARN-596. Use scheduling policies throughout the queue hierarchy to decide which containers to preempt (Wei Yan via Sandy Ryza)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1598197 13f79535-47bb-0310-9956-ffa450edef68
2014-05-29 04:01:24 +00:00
Sanford Ryza
edfbc8ad4a
YARN-2105. Fix TestFairScheduler after YARN-2012. (Ashwin Shankar via Sandy Ryza)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1597902 13f79535-47bb-0310-9956-ffa450edef68
2014-05-27 23:46:22 +00:00
Sanford Ryza
a00b2d4f37
YARN-2073. Fair Scheduler: Add a utilization threshold to prevent preempting resources when cluster is free (Karthik Kambatla via Sandy Ryza)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1597209 13f79535-47bb-0310-9956-ffa450edef68
2014-05-23 22:52:46 +00:00
Sanford Ryza
6c56612af5
YARN-2012. Fair Scheduler: allow default queue placement rule to take an arbitrary queue (Ashwin Shankar via Sandy Ryza)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1597204 13f79535-47bb-0310-9956-ffa450edef68
2014-05-23 22:38:52 +00:00
Karthik Kambatla
619ec833fc
YARN-2089. FairScheduler: QueuePlacementPolicy and QueuePlacementRule are missing audience annotations. (Zhihai Xu via kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1596765 13f79535-47bb-0310-9956-ffa450edef68
2014-05-22 06:13:24 +00:00
Vinod Kumar Vavilapalli
82f3454f5a
YARN-2017. Merged some of the common scheduler code. Contributed by Jian He.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1596753 13f79535-47bb-0310-9956-ffa450edef68
2014-05-22 05:32:26 +00:00
Jian He
0f9147c857
YARN-2053. Fixed a bug in AMS to not add null NMToken into NMTokens list from previous attempts for work-preserving AM restart. Contributed by Wangda Tan
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1595116 13f79535-47bb-0310-9956-ffa450edef68
2014-05-16 06:22:22 +00:00
Sanford Ryza
561c014e77
YARN-2042. String shouldn't be compared using == in QueuePlacementRule#NestedUserQueue#getQueueForApp (Chen He via Sandy Ryza)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1594482 13f79535-47bb-0310-9956-ffa450edef68
2014-05-14 06:55:04 +00:00
Sanford Ryza
84dfae2f8a
YARN-1986. In Fifo Scheduler, node heartbeat in between creating app and attempt causes NPE (Hong Zhiguo via Sandy Ryza)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1594476 13f79535-47bb-0310-9956-ffa450edef68
2014-05-14 06:41:20 +00:00
Christopher Douglas
45b42676f9
YARN-1957. Consider the max capacity of the queue when computing the ideal capacity for preemption. Contributed by Carlo Curino
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1594414 13f79535-47bb-0310-9956-ffa450edef68
2014-05-13 23:15:27 +00:00
Jonathan Turner Eagles
1c48142807
YARN-1981. Nodemanager version is not updated when a node reconnects (Jason Lowe via jeagles)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1594358 13f79535-47bb-0310-9956-ffa450edef68
2014-05-13 20:03:58 +00:00
Vinod Kumar Vavilapalli
2f87e77cb7
YARN-1861. Fixed a bug in RM to reset leader-election on fencing that was causing both RMs to be stuck in standby mode when automatic failover is enabled. Contributed by Karthik Kambatla and Xuan Gong.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1594356 13f79535-47bb-0310-9956-ffa450edef68
2014-05-13 20:00:44 +00:00
Jian He
41344a4a69
YARN-1975. Fix yarn application CLI to print the scheme of the tracking url of failed/killed applications. Contributed by Junping Du
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1593874 13f79535-47bb-0310-9956-ffa450edef68
2014-05-12 00:43:35 +00:00
Sanford Ryza
1c5902fba2
YARN-1864. Add missing file FSQueueType.java
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1593191 13f79535-47bb-0310-9956-ffa450edef68
2014-05-08 07:22:42 +00:00
Sanford Ryza
cfc97a4e88
YARN-1864. Fair Scheduler Dynamic Hierarchical User Queues (Ashwin Shankar via Sandy Ryza)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1593190 13f79535-47bb-0310-9956-ffa450edef68
2014-05-08 07:21:11 +00:00
Vinod Kumar Vavilapalli
7a241aee90
YARN-1929. Fixed a deadlock in ResourceManager that occurs when failover happens right at the time of shutdown. Contributed by Karthik Kambatla.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1591071 13f79535-47bb-0310-9956-ffa450edef68
2014-04-29 19:49:44 +00:00
Jing Zhao
6d4c7df434
HADOOP-10535. Make the retry numbers in ActiveStandbyElector configurable. Contributed by Jing Zhao.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1589905 13f79535-47bb-0310-9956-ffa450edef68
2014-04-24 23:03:30 +00:00
Jason Darrell Lowe
c02ea74551
YARN-1975. Used resources shows escaped html in CapacityScheduler and FairScheduler page. Contributed by Mit Desai
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1589859 13f79535-47bb-0310-9956-ffa450edef68
2014-04-24 20:02:44 +00:00
Junping Du
bd43d2481e
YARN-1947. TestRMDelegationTokens#testRMDTMasterKeyStateOnRollingMasterKey is failing intermittently. (Jian He via junping_du)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1588365 13f79535-47bb-0310-9956-ffa450edef68
2014-04-17 20:27:37 +00:00
Jian He
38d5ca2e47
YARN-1934. Fixed a potential NPE in ZKRMStateStore caused by handling Disconnected event from ZK. Contributed by Karthik Kambatla.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1587776 13f79535-47bb-0310-9956-ffa450edef68
2014-04-16 02:02:18 +00:00
Zhijie Shen
44b6261bfa
YARN-1892. Improved some logs in the scheduler. Contributed by Jian He.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1587717 13f79535-47bb-0310-9956-ffa450edef68
2014-04-15 20:37:44 +00:00
Vinod Kumar Vavilapalli
c6b70f4760
YARN-1933. Fixed test issues with TestAMRestart and TestNodeHealthService. Contributed by Jian He.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1587104 13f79535-47bb-0310-9956-ffa450edef68
2014-04-13 21:51:38 +00:00
Sanford Ryza
9274626a11
YARN-1923. Make Fair Scheduler resource ratio calculations terminate faster (Anubhav Dhoot via Sandy Ryza)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1586796 13f79535-47bb-0310-9956-ffa450edef68
2014-04-11 23:24:40 +00:00
Zhijie Shen
dc84800184
YARN-1924. Made ZKRMStateStore updateApplication(Attempt)StateInternal work when Application(Attempt) state hasn't been stored before. Contributed by Jian He.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1586547 13f79535-47bb-0310-9956-ffa450edef68
2014-04-11 03:36:36 +00:00
Arun Murthy
05da90ee2f
YARN-1898. Addendum patch to ensure /jmx and /metrics are re-directed to Active RM.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1584954 13f79535-47bb-0310-9956-ffa450edef68
2014-04-05 02:51:08 +00:00
Zhijie Shen
b0180afc9c
YARN-1898. Made Standby RM links conf, stacks, logLevel, metrics, jmx, logs and static not be redirected to Active RM. Contributed by Xuan Gong.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1583833 13f79535-47bb-0310-9956-ffa450edef68
2014-04-02 00:01:50 +00:00
Sanford Ryza
7bd62b8da0
YARN-1889. In Fair Scheduler, avoid creating objects on each call to AppSchedulable comparator (Hong Zhiguo via Sandy Ryza)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1583491 13f79535-47bb-0310-9956-ffa450edef68
2014-03-31 23:40:45 +00:00
Jian He
71c2b159ab
YARN-1521. Mark Idempotent/AtMostOnce annotations to the APIs in ApplicationClientProtocol, ResourceManagerAdministrationProtocol and ResourceTrackerProtocol so that they work in HA scenario. Contributed by Xuan Gong
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1581678 13f79535-47bb-0310-9956-ffa450edef68
2014-03-26 03:43:41 +00:00
Vinod Kumar Vavilapalli
8a9ae9e3ec
YARN-1867. Fixed a bug in ResourceManager that was causing invalid ACL checks in the web-services after fail-over. Contributed by Vinod Kumar Vavilapalli.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1581662 13f79535-47bb-0310-9956-ffa450edef68
2014-03-26 03:03:17 +00:00
Vinod Kumar Vavilapalli
d14eff7d38
YARN-1866. Fixed an issue with renewal of RM-delegation tokens on restart or fail-over. Contributed by Jian He.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1581448 13f79535-47bb-0310-9956-ffa450edef68
2014-03-25 18:21:55 +00:00
Jian He
0febaa1b3d
YARN-1852. Fixed RMAppAttempt to not resend AttemptFailed/AttemptKilled events to already recovered Failed/Killed RMApps. Contributed by Rohith Sharmaks
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1580997 13f79535-47bb-0310-9956-ffa450edef68
2014-03-24 19:04:57 +00:00
Zhijie Shen
ef9634bf79
YARN-1577. Made UnmanagedAMLauncher do launchAM after the attempt reaches the LAUNCHED state. Contributed by Jian He.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1580164 13f79535-47bb-0310-9956-ffa450edef68
2014-03-22 04:35:46 +00:00
Jian He
ce56616037
YARN-1776. Fixed DelegationToken renewal to survive RM failover. Contributed by Zhijie Shen
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1580154 13f79535-47bb-0310-9956-ffa450edef68
2014-03-22 04:12:57 +00:00
Karthik Kambatla
cbb3914c89
YARN-1536. Cleanup: Get rid of ResourceManager#get*SecretManager() methods and use the RMContext methods instead. (Anubhav Dhoot via kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1580128 13f79535-47bb-0310-9956-ffa450edef68
2014-03-22 02:35:52 +00:00
Jian He
f67218809c
YARN-1849. Fixed NPE in ResourceTrackerService#registerNodeManager for UAM. Contributed by Karthik Kambatla
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1580077 13f79535-47bb-0310-9956-ffa450edef68
2014-03-21 22:53:49 +00:00
Vinod Kumar Vavilapalli
4224e61340
YARN-1811. Fixed AMFilters in YARN to correctly accept requests from either web-app proxy or the RMs when HA is enabled. Contributed by Robert Kanter.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1579877 13f79535-47bb-0310-9956-ffa450edef68
2014-03-21 04:20:46 +00:00
Vinod Kumar Vavilapalli
764af69aa0
YARN-1640. Fixed manual failover of ResourceManagers to work correctly in secure clusters. Contributed by Xuan Gong.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1579510 13f79535-47bb-0310-9956-ffa450edef68
2014-03-20 02:40:21 +00:00
Karthik Kambatla
bfafc1d0ce
YARN-1705. Reset cluster-metrics on transition to standby. (Rohith via kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1579014 13f79535-47bb-0310-9956-ffa450edef68
2014-03-18 19:00:46 +00:00
Vinod Kumar Vavilapalli
57cdf8626a
YARN-1512. Enhanced CapacityScheduler to be able to decouple scheduling from node-heartbeats. Contributed by Arun C Murthy.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1578722 13f79535-47bb-0310-9956-ffa450edef68
2014-03-18 02:53:20 +00:00
Vinod Kumar Vavilapalli
92317e3459
YARN-1839. Fixed handling of NMTokens in ResourceManager such that containers launched by AMs running on the same machine as the AM are correctly propagated. Contributed by Jian He.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1578631 13f79535-47bb-0310-9956-ffa450edef68
2014-03-17 22:22:15 +00:00
Vinod Kumar Vavilapalli
81a456e638
YARN-1591. Fixed AsyncDispatcher to handle interrupts on shutdown in a sane manner and thus fix failure of TestResourceTrackerService. Contributed by Tsuyoshi Ozawa.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1578628 13f79535-47bb-0310-9956-ffa450edef68
2014-03-17 22:17:43 +00:00
Vinod Kumar Vavilapalli
8314674947
YARN-1685. Fixed a few bugs related to handling of containers' log-URLs on ResourceManager and history-service. Contributed by Zhijie Shen.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1578602 13f79535-47bb-0310-9956-ffa450edef68
2014-03-17 21:36:21 +00:00
Vinod Kumar Vavilapalli
d9cdcb9474
YARN-1658. Modified web-app framework to let standby RMs redirect web-service calls to the active RM. Contributed by Cindy Li.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1577408 13f79535-47bb-0310-9956-ffa450edef68
2014-03-14 02:39:59 +00:00
Zhijie Shen
6af044bb75
YARN-1389. Made ApplicationClientProtocol and ApplicationHistoryProtocol expose analogous getApplication(s)/Attempt(s)/Container(s) APIs. Contributed by Mayank Bansal.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1577052 13f79535-47bb-0310-9956-ffa450edef68
2014-03-13 07:42:29 +00:00
Jason Darrell Lowe
135700a611
YARN-1789. ApplicationSummary does not escape newlines in the app name. Contributed by Tsuyoshi OZAWA
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1576960 13f79535-47bb-0310-9956-ffa450edef68
2014-03-12 22:25:01 +00:00
Vinod Kumar Vavilapalli
197217c95d
YARN-1816. Fixed ResourceManager to make RMApp correctly handle ATTEMPT_FINISHED event at ACCEPTED state that can happen after RM restarts. Contributed by Jian He.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1576911 13f79535-47bb-0310-9956-ffa450edef68
2014-03-12 21:18:55 +00:00
Vinod Kumar Vavilapalli
4de17c6052
YARN-1812. Fixed ResourceManager to synchronously renew tokens after recovery and thus recover app itself synchronously and avoid races with resyncing NodeManagers. Contributed by Jian He.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1576843 13f79535-47bb-0310-9956-ffa450edef68
2014-03-12 18:25:33 +00:00
Arun Murthy
4ce0e4bf2e
YARN-1444. Fix CapacityScheduler to deal with cases where applications specify host/rack requests without off-switch request. Contributed by Wangda Tan.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1576751 13f79535-47bb-0310-9956-ffa450edef68
2014-03-12 14:36:27 +00:00
Karthik Kambatla
477ed62b3f
YARN-1821. NPE on registerNodeManager if the request has containers for UnmanagedAMs (kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1576525 13f79535-47bb-0310-9956-ffa450edef68
2014-03-11 22:18:54 +00:00
Vinod Kumar Vavilapalli
e4190b2a97
YARN-1788. Fixed a bug in ResourceManager to set the apps-completed and apps-killed metrics correctly for killed applications. Contributed by Varun Vasudev.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1576072 13f79535-47bb-0310-9956-ffa450edef68
2014-03-10 20:36:34 +00:00
Jian He
7da07461ff
YARN-1793. Fixed ClientRMService#forceKillApplication not killing unmanaged application. Contributed by Karthik Kambatla
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1576023 13f79535-47bb-0310-9956-ffa450edef68
2014-03-10 17:58:55 +00:00
Sanford Ryza
e2530de4ea
YARN-1790. Fair Scheduler UI not showing apps table (bc Wong via Sandy Ryza)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1575487 13f79535-47bb-0310-9956-ffa450edef68
2014-03-08 05:12:45 +00:00
Karthik Kambatla
b06cc16f7d
YARN-1774. FS: Submitting to non-leaf queue throws NPE. (Anubhav Dhoot and Karthik Kambatla via kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1575415 13f79535-47bb-0310-9956-ffa450edef68
2014-03-07 21:34:19 +00:00
Karthik Kambatla
9b15c5b11a
YARN-1525. Web UI should redirect to active RM when HA is enabled. (Cindy Li via kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1575166 13f79535-47bb-0310-9956-ffa450edef68
2014-03-07 04:34:16 +00:00
Jian He
8cd23c5b48
YARN-1752. Fixed ApplicationMasterService to reject unregister request if AM did not register before. Contributed by Rohith Sharma.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1574623 13f79535-47bb-0310-9956-ffa450edef68
2014-03-05 19:04:58 +00:00
Karthik Kambatla
2e7c512270
YARN-1785. FairScheduler treats app lookup failures as ERRORs. (bc Wong via kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1574604 13f79535-47bb-0310-9956-ffa450edef68
2014-03-05 18:22:25 +00:00
Vinod Kumar Vavilapalli
82e8c8bd71
YARN-1766. Fixed a bug in ResourceManager to use configuration loaded from the configuration-provider when booting up. Contributed by Xuan Gong.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1574252 13f79535-47bb-0310-9956-ffa450edef68
2014-03-04 23:35:20 +00:00
Vinod Kumar Vavilapalli
88245b6a41
YARN-986. Changed client side to be able to figure out the right RM Delegation token for the right ResourceManager when HA is enabled. Contributed by Karthik Kambatla.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1574190 13f79535-47bb-0310-9956-ffa450edef68
2014-03-04 20:39:06 +00:00
Vinod Kumar Vavilapalli
a60434368c
YARN-1758. Fixed ResourceManager to not mandate the presence of site specific configuration files and thus fix failures in downstream tests. Contributed by Xuan Gong.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1573695 13f79535-47bb-0310-9956-ffa450edef68
2014-03-03 19:41:52 +00:00
Karthik Kambatla
1ba26e3e68
YARN-1528. Allow setting auth for ZK connections. (kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1573014 13f79535-47bb-0310-9956-ffa450edef68
2014-02-28 17:34:08 +00:00
Zhijie Shen
faddccc216
YARN-1301. Added the INFO level log of the non-empty blacklist additions and removals inside ApplicationMasterService. Contributed by Tsuyoshi Ozawa.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1572400 13f79535-47bb-0310-9956-ffa450edef68
2014-02-27 02:07:16 +00:00
Vinod Kumar Vavilapalli
0fbc7fe816
YARN-1588. Enhanced RM and the scheduling protocol to also send NMTokens of transferred containers from previous app-attempts to new AMs after YARN-1490. Contributed by Jian He.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1572230 13f79535-47bb-0310-9956-ffa450edef68
2014-02-26 20:20:07 +00:00
Junping Du
c7142e7761
YARN-1561. Fix a generic type warning in FairScheduler. (Chen He via junping_du)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1571924 13f79535-47bb-0310-9956-ffa450edef68
2014-02-26 05:14:51 +00:00
Vinod Kumar Vavilapalli
ad70f26b1f
YARN-1734. Fixed ResourceManager to update the configurations when it transits from standby to active mode so as to assimilate any changes that happened while it was in standby mode. Contributed by Xuan Gong.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1571539 13f79535-47bb-0310-9956-ffa450edef68
2014-02-25 02:07:19 +00:00
Sanford Ryza
1ab2f5a916
YARN-1678. Fair scheduler gabs incessantly about reservations (Sandy Ryza)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1571468 13f79535-47bb-0310-9956-ffa450edef68
2014-02-24 22:35:27 +00:00
Vinod Kumar Vavilapalli
e167e585e9
YARN-1749. Updated application-history related configs to reflect the latest reality and to be consistently named. Contributed by Zhijie Shen.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1570948 13f79535-47bb-0310-9956-ffa450edef68
2014-02-23 02:04:58 +00:00
Zhijie Shen
da20095284
YARN-1071. Enabled ResourceManager to recover cluster metrics numDecommissionedNMs after restarting. Contributed by Jian He.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1570469 13f79535-47bb-0310-9956-ffa450edef68
2014-02-21 06:46:24 +00:00
Vinod Kumar Vavilapalli
772ead791c
YARN-1398. Fixed a deadlock in ResourceManager between users requesting queue-acls and completing containers. Contributed by Vinod Kumar Vavilapalli.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1570415 13f79535-47bb-0310-9956-ffa450edef68
2014-02-21 02:20:20 +00:00
Karthik Kambatla
12a8aa1510
YARN-1736. FS: AppSchedulable.assignContainer's priority argument is redundant. (Naren Koneru via kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1570336 13f79535-47bb-0310-9956-ffa450edef68
2014-02-20 20:09:54 +00:00
Vinod Kumar Vavilapalli
5fd5c9900c
YARN-713. Fixed ResourceManager to not crash while building tokens when DNS issues happen intermittently. Contributed by Jian He.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1569979 13f79535-47bb-0310-9956-ffa450edef68
2014-02-19 23:39:13 +00:00
Sanford Ryza
0e3e91a06d
YARN-1718. Fix a couple isTerminals in Fair Scheduler queue placement rules (Sandy Ryza)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1569928 13f79535-47bb-0310-9956-ffa450edef68
2014-02-19 21:36:51 +00:00
Vinod Kumar Vavilapalli
9da9f7d4d8
YARN-1666. Modified RM HA handling of include/exclude node-lists to be available across RM failover by making use of a remote configuration-provider. Contributed by Xuan Gong.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1569856 13f79535-47bb-0310-9956-ffa450edef68
2014-02-19 17:52:57 +00:00
Jonathan Turner Eagles
996acc834e
YARN-1479. Invalid NaN values in Hadoop REST API JSON response (Chen He via jeagles)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1569853 13f79535-47bb-0310-9956-ffa450edef68
2014-02-19 17:49:21 +00:00
Jian He
3a7e7b3671
YARN-1428. Fixed RM to write the final state of RMApp/RMAppAttempt to the application history store in the transition to the final state. (Contributed by Zhijie Shen)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1569585 13f79535-47bb-0310-9956-ffa450edef68
2014-02-19 01:19:07 +00:00
Vinod Kumar Vavilapalli
6b03174aae
YARN-1590. Fixed ResourceManager, web-app proxy and MR JobHistoryServer to expand _HOST properly in their Kerberos principals. Contributed by Mohammad Kamrul Islam.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1569537 13f79535-47bb-0310-9956-ffa450edef68
2014-02-18 21:50:37 +00:00
Sanford Ryza
bbbe808a51
YARN-1724. Race condition in Fair Scheduler when continuous scheduling is turned on (Sandy Ryza)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1569447 13f79535-47bb-0310-9956-ffa450edef68
2014-02-18 18:00:56 +00:00
Sanford Ryza
e60f5b6c40
YARN-1721. When moving app between queues in Fair Scheduler, grab lock on FSSchedulerApp (Sandy Ryza)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1569443 13f79535-47bb-0310-9956-ffa450edef68
2014-02-18 17:51:40 +00:00
Vinod Kumar Vavilapalli
990cffdcfa
YARN-1553. Modified YARN and MR to stop using HttpConfig.isSecure() and instead rely on the http policy framework. And also fix some bugs related to https handling in YARN web-apps. Contributed by Haohui Mai.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1568501 13f79535-47bb-0310-9956-ffa450edef68
2014-02-14 20:01:02 +00:00
Vinod Kumar Vavilapalli
d0a5e43de7
YARN-1417. Modified RM to generate container-tokens not at creation time, but at allocation time so as to prevent RM from shelling out containers with expired tokens. Contributed by Omkar Vinit Joshi and Jian He.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1568060 13f79535-47bb-0310-9956-ffa450edef68
2014-02-13 22:02:11 +00:00
Vinod Kumar Vavilapalli
2b916db60b
YARN-1676. Modified RM HA handling of user-to-group mappings to be available across RM failover by making use of a remote configuration-provider. Contributed by Xuan Gong.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1568041 13f79535-47bb-0310-9956-ffa450edef68
2014-02-13 21:23:33 +00:00
Jian He
dab635980d
YARN-1345. Remove FINAL_SAVING state from YarnApplicationAttemptState. Contributed by Zhijie Shen
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1567820 13f79535-47bb-0310-9956-ffa450edef68
2014-02-13 00:59:19 +00:00
Sanford Ryza
bc939f7ae6
YARN-1692. ConcurrentModificationException in fair scheduler AppSchedulable (Sangjin Lee via Sandy Ryza)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1567788 13f79535-47bb-0310-9956-ffa450edef68
2014-02-12 23:04:42 +00:00
Karthik Kambatla
41ec3cce76
YARN-1641. ZK store should attempt a write periodically to ensure it is still Active. (kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1567628 13f79535-47bb-0310-9956-ffa450edef68
2014-02-12 14:09:13 +00:00
Vinod Kumar Vavilapalli
1fa6ab249b
YARN-1459. Changed ResourceManager to depend its service initialization on the configuration-provider mechanism during startup too. Contributed by Xuan Gong.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1566791 13f79535-47bb-0310-9956-ffa450edef68
2014-02-10 22:50:15 +00:00
Zhijie Shen
943b2190d7
YARN-1689. Made RMAppAttempt get killed when RMApp is at ACCEPTED. Contributed by Vinod Kumar Vavilapalli.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1565497 13f79535-47bb-0310-9956-ffa450edef68
2014-02-07 00:18:46 +00:00
Sanford Ryza
e6eccf20de
YARN-1499. Fair Scheduler changes for moving apps between queues (Sandy Ryza)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1564856 13f79535-47bb-0310-9956-ffa450edef68
2014-02-05 18:09:07 +00:00
Zhijie Shen
ebe0c17a95
YARN-1461. Added tags for YARN applications and changed RM to handle them. Contributed by Karthik Kambatla.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1564633 13f79535-47bb-0310-9956-ffa450edef68
2014-02-05 04:55:00 +00:00
Vinod Kumar Vavilapalli
b812af964d
YARN-1669. Modified RM HA handling of protocol level service-ACLS to be available across RM failover by making use of a remote configuration-provider. Contributed by Xuan Gong.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1564549 13f79535-47bb-0310-9956-ffa450edef68
2014-02-04 22:46:52 +00:00
Vinod Kumar Vavilapalli
24fa232707
YARN-1667. Modified RM HA handling of super users (with proxying ability) to be available across RM failover by making use of a remote configuration-provider. Contributed by Xuan Gong.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1564100 13f79535-47bb-0310-9956-ffa450edef68
2014-02-03 22:10:56 +00:00
Vinod Kumar Vavilapalli
eff1e809f2
YARN-1668. Modified RM HA handling of admin-acls to be available across RM failover by making use of a remote configuration-provider. Contributed by Xuan Gong.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1564043 13f79535-47bb-0310-9956-ffa450edef68
2014-02-03 20:16:28 +00:00
Sanford Ryza
cb5e0787a6
YARN-1498 addendum to fix findbugs warning
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1564018 13f79535-47bb-0310-9956-ffa450edef68
2014-02-03 19:30:55 +00:00
Vinod Kumar Vavilapalli
ca72e11158
YARN-1611. Introduced the concept of a configuration provider which can be used by ResourceManager to read configuration locally or from remote systems so as to help RM failover. Contributed by Xuan Gong.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1564002 13f79535-47bb-0310-9956-ffa450edef68
2014-02-03 19:07:55 +00:00
Sanford Ryza
1079c37128
YARN-1504. RM changes for moving apps between queues (Sandy Ryza)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1563371 13f79535-47bb-0310-9956-ffa450edef68
2014-02-01 04:03:36 +00:00
Sanford Ryza
981679e6d7
YARN-1498. Common scheduler changes for moving apps between queues (Sandy Ryza)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1563021 13f79535-47bb-0310-9956-ffa450edef68
2014-01-31 00:14:47 +00:00
Sanford Ryza
1cac66ce68
YARN-1617. Remove ancient comment and surround LOG.debug in AppSchedulingInfo.allocate (Sandy Ryza)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1563004 13f79535-47bb-0310-9956-ffa450edef68
2014-01-30 22:40:22 +00:00
Karthik Kambatla
f1cef4cd29
YARN-1618. Fix invalid RMApp transition from NEW to FINAL_SAVING (kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1562529 13f79535-47bb-0310-9956-ffa450edef68
2014-01-29 18:02:40 +00:00
Sanford Ryza
7f2b01a742
YARN-1629. IndexOutOfBoundsException in MaxRunningAppsEnforcer (Sandy Ryza)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1561996 13f79535-47bb-0310-9956-ffa450edef68
2014-01-28 08:34:55 +00:00
Vinod Kumar Vavilapalli
cbee889711
YARN-321. Merging YARN-321 branch to trunk.
...
svn merge ../branches/YARN-321
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1561452 13f79535-47bb-0310-9956-ffa450edef68
2014-01-26 04:51:10 +00:00
Karthik Kambatla
482993ac53
YARN-1573. ZK store should use a private password for root-node-acls. (kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1560594 13f79535-47bb-0310-9956-ffa450edef68
2014-01-23 06:22:05 +00:00
Sanford Ryza
163348701c
YARN-1623. Include queue name in RegisterApplicationMasterResponse (Sandy Ryza)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1560545 13f79535-47bb-0310-9956-ffa450edef68
2014-01-22 22:25:03 +00:00
Sanford Ryza
1a7e158844
YARN-1624. QueuePlacementPolicy format is not easily readable via a JAXB parser (Aditya Acharya via Sandy Ryza)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1560470 13f79535-47bb-0310-9956-ffa450edef68
2014-01-22 18:51:23 +00:00
Karthik Kambatla
e0ce554b86
YARN-1616. RMFatalEventDispatcher should log the cause of the event (kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1559902 13f79535-47bb-0310-9956-ffa450edef68
2014-01-21 02:01:51 +00:00
Sanford Ryza
67cbde3008
YARN-1496. Protocol additions to allow moving apps between queues (Sandy Ryza)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1559050 13f79535-47bb-0310-9956-ffa450edef68
2014-01-17 08:56:22 +00:00
Sanford Ryza
7274b5ff93
YARN-1351. Invalid string format in Fair Scheduler log warn message (Konstantin Weitz via Sandy Ryza)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1558597 13f79535-47bb-0310-9956-ffa450edef68
2014-01-15 23:09:32 +00:00
Junping Du
4d413350d7
YARN-1603. Remove two *.orig files which were unexpectedly committed. (Zhijie Shen via junping_du)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1558340 13f79535-47bb-0310-9956-ffa450edef68
2014-01-15 11:05:15 +00:00
Karthik Kambatla
d13c2eeca6
YARN-1598. HA-related rmadmin commands don't work on a secure cluster (kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1558251 13f79535-47bb-0310-9956-ffa450edef68
2014-01-15 00:57:28 +00:00
Sanford Ryza
f4fd050dee
YARN-1567. In Fair Scheduler, allow empty queues to change between leaf and parent on allocation file reload (Sandy Ryza)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1558228 13f79535-47bb-0310-9956-ffa450edef68
2014-01-14 22:33:56 +00:00
Vinod Kumar Vavilapalli
25bc68d15e
YARN-1041. Added the ApplicationMasterProtocol API for applications to use the ability in ResourceManager to optionally not kill containers when the ApplicationMaster exits. Contributed by Jian He.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1557318 13f79535-47bb-0310-9956-ffa450edef68
2014-01-11 07:07:17 +00:00
Jian He
f677175f35
YARN-1166. Fixed app-specific and attempt-specific QueueMetrics to be triggered by app events and attempt events, respectively. Contributed by Zhijie Shen
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1557296 13f79535-47bb-0310-9956-ffa450edef68
2014-01-11 01:15:49 +00:00
Karthik Kambatla
6608b75829
YARN-1574. RMDispatcher should be reset on transition to standby. (Xuan Gong via kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1557248 13f79535-47bb-0310-9956-ffa450edef68
2014-01-10 20:15:00 +00:00
Karthik Kambatla
a0ecb98a77
YARN-1033. Expose RM active/standby state to Web UI and REST API (kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1557181 13f79535-47bb-0310-9956-ffa450edef68
2014-01-10 16:35:52 +00:00
Vinod Kumar Vavilapalli
1393581bce
YARN-1490. Introduced the ability to make ResourceManager optionally not kill all containers when an ApplicationMaster exits. Contributed by Jian He.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1557143 13f79535-47bb-0310-9956-ffa450edef68
2014-01-10 15:04:39 +00:00
Karthik Kambatla
49ec397340
YARN-1568. Rename clusterid to clusterId in ActiveRMInfoProto (kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1556435 13f79535-47bb-0310-9956-ffa450edef68
2014-01-08 03:30:49 +00:00
Vinod Kumar Vavilapalli
4931545f76
YARN-1482. Modified WebApplicationProxy to make it work across ResourceManager fail-over. Contributed by Xuan Gong.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1556380 13f79535-47bb-0310-9956-ffa450edef68
2014-01-07 22:16:30 +00:00
Vinod Kumar Vavilapalli
c3cc855d27
YARN-1029. Added embedded leader election in the ResourceManager. Contributed by Karthik Kambatla.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1556103 13f79535-47bb-0310-9956-ffa450edef68
2014-01-07 01:56:11 +00:00
Vinod Kumar Vavilapalli
a45017a5f1
YARN-1493. Changed ResourceManager and Scheduler interfacing to recognize app-attempts separately from apps. Contributed by Jian He.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1554896 13f79535-47bb-0310-9956-ffa450edef68
2014-01-02 20:19:45 +00:00
Vinod Kumar Vavilapalli
bb2e2fee60
YARN-1549. Fixed a bug in ResourceManager's ApplicationMasterService that was causing unmanaged AMs to not finish correctly. Contributed by haosdent.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1554886 13f79535-47bb-0310-9956-ffa450edef68
2014-01-02 19:54:07 +00:00
Karthik Kambatla
e7120079bd
YARN-1481. Reverting addendum patch
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1553994 13f79535-47bb-0310-9956-ffa450edef68
2013-12-29 05:43:31 +00:00
Vinod Kumar Vavilapalli
8f0bf54d34
YARN-1541. Changed ResourceManager to invalidate ApplicationMaster host/port information once an AM crashes. Contributed by Jian He.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1553772 13f79535-47bb-0310-9956-ffa450edef68
2013-12-28 01:09:07 +00:00
Karthik Kambatla
defeef6fe4
YARN-1481. Addendum patch to fix synchronization in AdminService
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1553738 13f79535-47bb-0310-9956-ffa450edef68
2013-12-27 21:10:14 +00:00
Karthik Kambatla
1c9ac9adeb
YARN-1523. Use StandbyException instead of RMNotYetReadyException (kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1553616 13f79535-47bb-0310-9956-ffa450edef68
2013-12-27 03:28:23 +00:00
Karthik Kambatla
d841a0f474
YARN-1172. Convert SecretManagers in RM to services (Tsuyoshi OZAWA via kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1553431 13f79535-47bb-0310-9956-ffa450edef68
2013-12-26 00:21:56 +00:00
Vinod Kumar Vavilapalli
93907baa0b
YARN-1307. Redesign znode structure for Zookeeper based RM state-store for better organization and scalability. Contributed by Tsuyoshi OZAWA.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1552209 13f79535-47bb-0310-9956-ffa450edef68
2013-12-19 02:33:05 +00:00
Vinod Kumar Vavilapalli
b774d7b3de
YARN-1446. Changed client API to retry killing application till RM acknowledges so as to account for RM crashes/failover. Contributed by Jian He.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1551444 13f79535-47bb-0310-9956-ffa450edef68
2013-12-17 02:16:20 +00:00
Vinod Kumar Vavilapalli
49ad07af97
YARN-312. Introduced ResourceManagerAdministrationProtocol changes to support changing resources on node. Contributed by Junping Du.
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1551403 13f79535-47bb-0310-9956-ffa450edef68
2013-12-16 23:36:16 +00:00
Jian He
28db28d491
YARN-1311. Fixed app specific scheduler-events' names to be app-attempt based. Contributed by Vinod Kumar Vavilapalli
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1550613 13f79535-47bb-0310-9956-ffa450edef68
2013-12-13 01:44:16 +00:00