Jian He
8676a118a1
YARN-4349. Support CallerContext in YARN. Contributed by Wangda Tan
2015-11-23 17:19:48 -08:00
Jason Lowe
d36b6e045f
YARN-4344. NMs reconnecting with changed capabilities can lead to wrong cluster resource calculations. Contributed by Varun Vasudev
2015-11-23 20:30:26 +00:00
Arun Suresh
da1016365a
YARN-3454. Add efficient merge operation to RLESparseResourceAllocation (Carlo Curino via asuresh)
2015-11-21 09:59:41 -08:00
Wangda Tan
2346fa3141
YARN-3769. Consider user limit when calculating total pending resource for preemption policy in Capacity Scheduler. (Eric Payne via wangda)
2015-11-20 15:55:50 -08:00
Arun Suresh
6a61928fb7
YARN-4184. Remove update reservation state api from state store as its not used by ReservationSystem (Sean Po via asuresh)
2015-11-17 15:50:34 -08:00
Jian He
fcd7888029
Revert "YARN-3840. Resource Manager web ui issue when sorting application by id (with application having id > 9999) Contributed by Mohammad Shahid Khan"
This reverts commit 8fbea531d7.
Conflicts:
hadoop-yarn-project/CHANGES.txt
2015-11-16 20:18:44 -08:00
Wangda Tan
7f55a18071
YARN-4347. Resource manager fails with Null pointer exception. (Jian He via wangda)
2015-11-12 11:23:40 -08:00
Wangda Tan
796638d9bc
YARN-4287. Capacity Scheduler: Rack Locality improvement (Nathan Roberts via wangda)
2015-11-12 11:09:37 -08:00
Vinod Kumar Vavilapalli (I am also known as @tshooter.)
6351d3fa63
YARN-4183. Reverting the patch to fix behaviour change.
Revert "YARN-4183. Enabling generic application history forces every job to get a timeline service delegation token (jeagles)"
This reverts commit c293c58954.
2015-11-11 10:40:43 -08:00
Jian He
8fbea531d7
YARN-3840. Resource Manager web ui issue when sorting application by id (with application having id > 9999) Contributed by Mohammad Shahid Khan
2015-11-09 10:43:45 -08:00
Jian He
e5b1733e04
YARN-4127. RM fail with noAuth error if switched from failover to non-failover. Contributed by Varun Saxena
2015-10-29 15:42:57 -07:00
Jonathan Eagles
c293c58954
YARN-4183. Enabling generic application history forces every job to get a timeline service delegation token (jeagles)
2015-10-29 16:41:10 -05:00
Wangda Tan
56e4f6237a
YARN-3216. Max-AM-Resource-Percentage should respect node labels. (Sunil G via wangda)
2015-10-26 16:44:39 -07:00
Wangda Tan
6f606214e7
YARN-4169. Fix racing condition of TestNodeStatusUpdaterForLabels. (Naganarasimha G R via wangda)
2015-10-26 16:36:34 -07:00
Wangda Tan
3cc73773eb
YARN-4285. Display resource usage as percentage of queue and cluster in the RM UI (Varun Vasudev via wangda)
2015-10-26 13:07:39 -07:00
Jason Lowe
33a03af3c3
YARN-4284. condition for AM blacklisting is too narrow. Contributed by Sangjin Lee
2015-10-26 19:53:03 +00:00
Arun Suresh
ab8eb8770c
YARN-3738. Add support for recovery of reserved apps running under dynamic queues (subru via asuresh)
2015-10-24 22:53:10 -07:00
Jason Lowe
d3a34a4f38
YARN-4041. Slow delegation token renewal can severely prolong RM recovery. Contributed by Sunil G
2015-10-23 20:57:01 +00:00
Ming Ma
934d96a334
YARN-2913. Fair scheduler should have ability to set MaxResourceDefault for each queue. (Siqi Li via mingma)
2015-10-23 08:36:33 -07:00
Zhihai Xu
960201b79b
YARN-4256. YARN fair scheduler vcores with decimal values. Contributed by Jun Gong
2015-10-22 12:28:03 -07:00
Anubhav Dhoot
2798723a54
YARN-3739. Add reservation system recovery to RM recovery process. Contributed by Subru Krishnan.
2015-10-22 06:51:00 -07:00
Arun Suresh
506d1b1dbc
YARN-3985. Make ReservationSystem persist state using RMStateStore reservation APIs. (adhoot via asuresh)
2015-10-20 16:46:14 -07:00
Arun Suresh
7e2837f830
YARN-4270. Limit application resource reservation on nodes for non-node/rack specific requests (asuresh)
2015-10-19 20:00:38 -07:00
Jian He
f9da5cdb2b
YARN-4170. AM need to be notified with priority in AllocateResponse. Contributed by Sunil G
2015-10-16 15:26:27 -07:00
Wangda Tan
4337b263aa
YARN-4162. CapacityScheduler: Add resource usage by partition and queue capacity by partition to REST API. (Naganarasimha G R via wangda)
2015-10-16 15:06:28 -07:00
Jian He
cf23f2c2b5
YARN-4000. RM crashes with NPE if leaf queue becomes parent queue during restart. Contributed by Varun Saxena
2015-10-15 17:12:46 -07:00
Jian He
9849c8b386
YARN-4230. RM crashes with NPE when increasing container resource if there is no headroom left. Contributed by Meng Ding
2015-10-12 11:51:33 -07:00
Zhihai Xu
049c6e8dc0
YARN-4201. AMBlacklist does not work for minicluster. Contributed by Jun Gong.
2015-10-12 00:14:25 -07:00
Devaraj K
db93047881
YARN-3964. Support NodeLabelsProvider at Resource Manager side. Contributed by Dian Fu.
2015-10-11 11:21:29 +05:30
Wangda Tan
def374e666
YARN-4140. RM container allocation delayed incase of app submitted to Nodelabel partition. (Bibin A Chundatt via wangda)
2015-10-09 16:38:59 -07:00
Jason Lowe
a0bca2b5ad
YARN-261. Ability to fail AM attempts. Contributed by Andrey Klochkov and Rohith Sharma K S
2015-10-09 14:17:38 +00:00
Rohith Sharma K S
8f195387a4
YARN-4235. FairScheduler PrimaryGroup does not handle empty groups returned for a user. (Anubhav Dhoot via rohithsharmaks)
2015-10-09 10:09:26 +05:30
Rohith Sharma K S
9156fc60c6
YARN-4209. RMStateStore FENCED state doesn’t work due to updateFencedState called by stateMachine.doTransition. (Zhihai Xu via rohithsharmaks)
2015-10-07 09:34:59 +05:30
Wangda Tan
29a582ada0
YARN-4215. RMNodeLabels Manager Need to verify and replace node labels for the only modified Node Label Mappings in the request. (Naganarasimha G R via wangda)
2015-10-06 11:56:04 -07:00
Xuan
8f08532bde
YARN-1897. CLI and core support for signal container functionality. Contributed by Ming Ma
2015-10-02 18:50:47 -07:00
Anubhav Dhoot
9735afe967
YARN-4180. AMLauncher does not retry on failures when talking to NM. (adhoot)
2015-09-28 16:13:41 -07:00
Jason Lowe
9f53a95ff6
YARN-4141. Runtime Application Priority change should not throw exception for applications at finishing states. Contributed by Sunil G
2015-09-28 22:55:20 +00:00
Anubhav Dhoot
fb2e525c07
YARN-4204. ConcurrentModificationException in FairSchedulerQueueInfo. (adhoot)
2015-09-28 09:05:45 -07:00
Rohith Sharma K S
a9aafad12b
YARN-4044. Running applications information changes such as movequeue is not published to TimeLine server. (Sunil G via rohithsharmaks)
2015-09-24 12:13:22 +05:30
Jian He
89cab1ba5f
YARN-1651. CapacityScheduler side changes to support container resize. Contributed by Wangda Tan
2015-09-23 13:29:38 -07:00
Jian He
5f5a968d65
YARN-3867. ContainerImpl changes to support container resizing. Contributed by Meng Ding
2015-09-23 13:29:37 -07:00
Jian He
83a18add10
YARN-1449. AM-NM protocol changes to support container resizing. Contributed by Meng Ding & Wangda Tan)
2015-09-23 13:29:36 -07:00
Arun Suresh
94dec5a916
YARN-3920. FairScheduler container reservation on a node should be configurable to limit it to large containers (adhoot via asuresh)
2015-09-18 14:02:55 -07:00
Wangda Tan
9bc913a35c
YARN-3212. RMNode State Transition Update with DECOMMISSIONING state. (Junping Du via wangda)
2015-09-18 10:04:17 -07:00
Rohith Sharma K S
723c31d45b
YARN-4135. Improve the assertion message in MockRM while failing after waiting for the state.(Nijel S F via rohithsharmaks)
2015-09-18 08:44:10 +05:30
Jian He
6c6e734f0b
YARN-4034. Render cluster Max Priority in scheduler metrics in RM web UI. Contributed by Rohith Sharma K S
2015-09-17 14:55:50 +08:00
Jian He
452079af8b
YARN-4078. Add getPendingResourceRequestForAttempt in YarnScheduler interface. Contributed by Naganarasimha G R
2015-09-16 14:59:20 +08:00
Wangda Tan
ae5308fe1d
YARN-3717. Expose app/am/queue's node-label-expression to RM web UI / CLI / REST-API. (Naganarasimha G R via wangda)
2015-09-15 11:40:50 -07:00
Junping Du
73e3a49eb0
YARN-313. Add Admin API for supporting node resource configuration in command line. (Contributed by Inigo Goiri, Kenji Kikushima and Junping Du)
2015-09-15 07:56:47 -07:00
Jian He
5468baa80a
YARN-3635. Refactored current queue mapping implementation in CapacityScheduler to use a generic PlacementManager framework. Contributed by Wangda Tan
2015-09-15 15:39:20 +08:00
Jian He
e1b1d7e4ae
YARN-4126. RM should not issue delegation tokens in unsecure mode. Contributed by Bibin A Chundatt
2015-09-14 14:09:19 +08:00
Karthik Kambatla
332b520a48
YARN-3697. FairScheduler: ContinuousSchedulingThread can fail to shutdown. (Zhihai Xu via kasha)
2015-09-13 18:07:43 -07:00
Karthik Kambatla
81df7b586a
YARN-2005. Blacklisting support for scheduling AMs. (Anubhav Dhoot via kasha)
2015-09-13 17:03:15 -07:00
Robert Kanter
ea4bb2749f
YARN-4145. Make RMHATestBase abstract so its not run when running all tests under that namespace (adhoot via rkanter)
2015-09-11 11:46:10 -07:00
Wangda Tan
bcc85e3bab
YARN-4024. YARN RM should avoid unnecessary resolving IP when NMs doing heartbeat. (Hong Zhiguo via wangda)
2015-09-04 15:13:53 -07:00
Jason Lowe
6eaca2e363
YARN-4105. Capacity Scheduler headroom for DRF is wrong. Contributed by Chang Li
2015-09-04 15:30:53 +00:00
Varun Vasudev
40d222e862
YARN-4103. RM WebServices missing scheme for appattempts logLinks. Contributed by Jonathan Eagles.
2015-09-04 14:31:51 +05:30
Varun Vasudev
b469ac531a
YARN-3970. Add REST api support for Application Priority. Contributed by Naganarasimha G R.
2015-09-03 16:40:10 +05:30
Jian He
09c64ba1ba
YARN-4101. RM should print alert messages if Zookeeper and Resourcemanager gets connection issue. Contributed by Xuan Gong
2015-09-02 17:45:23 -07:00
Rohith Sharma K S
7d6687fe76
YARN-3893. Both RM in active state when Admin#transitionToActive failure from refeshAll() (Bibin A Chundatt via rohithsharmaks)
2015-09-02 15:22:48 +05:30
Varun Vasudev
bf669b6d9f
YARN-4082. Container shouldn't be killed when node's label updated. Contributed by Wangda Tan.
2015-09-01 14:19:11 +05:30
Junping Du
beb65c9465
YARN-1556. NPE getting application report with a null appId. Contributed by Weiwei Yang.
2015-08-28 05:57:34 -07:00
Jian He
a9c8ea71aa
YARN-3250. Support admin cli interface in for Application Priority. Contributed by Rohith Sharma K S
2015-08-27 13:25:53 -07:00
Jian He
57c7ae1aff
YARN-4014. Support user cli interface in for Application Priority. Contributed by Rohith Sharma K S
2015-08-24 20:36:44 -07:00
Rohith Sharma K S
feaf034994
YARN-3896. RMNode transitioned from RUNNING to REBOOTED because its response id has not been reset synchronously. (Jun Gong via rohithsharmaks)
2015-08-24 11:25:07 +05:30
Xuan
37e1c3d82a
YARN-221. NM should provide a way for AM to tell it not to aggregate logs. Contributed by Ming Ma
2015-08-22 16:25:24 -07:00
Rohith Sharma K S
22de7c1dca
YARN-3986. getTransferredContainers in AbstractYarnScheduler should be present in YarnScheduler interface
2015-08-21 10:51:11 +05:30
Zhihai Xu
3a76a010b8
YARN-3857: Memory leak in ResourceManager with SIMPLE mode. Contributed by mujunchao.
2015-08-18 10:36:40 -07:00
Jian He
e5003be907
YARN-4026. Refactored ContainerAllocator to accept a list of priorites rather than a single priority. Contributed by Wangda Tan
2015-08-12 15:07:50 -07:00
rohithsharmaks
1c12adb71f
YARN-4023. Publish Application Priority to TimelineServer. (Sunil G via rohithsharmaks)
2015-08-12 14:45:41 +05:30
Xuan
3ae716fa69
YARN-3999. RM hangs on draing events. Contributed by Jian He
2015-08-11 18:25:11 -07:00
Jian He
fa1d84ae27
YARN-3887. Support changing Application priority during runtime. Contributed by Sunil G
2015-08-10 20:51:54 -07:00
Wangda Tan
cf9d3c9256
YARN-3873. PendingApplications in LeafQueue should also use OrderingPolicy. (Sunil G via wangda)
2015-08-10 14:54:55 -07:00
Rohith Sharma K S
b6265d39c5
YARN-3948. Display Application Priority in RM Web UI.(Sunil G via rohithsharmaks)
2015-08-07 10:43:41 +05:30
Carlo Curino
8572a5a14b
YARN-3974. Refactor the reservation system test cases to use parameterized base test. (subru via curino)
2015-08-02 01:55:31 -07:00
Arun Suresh
154c9d2e42
YARN-3961. Expose pending, running and reserved containers of a queue in REST api and yarn top (adhoot via asuresh)
2015-08-05 23:14:14 -07:00
rohithsharmaks
df9e7280db
YARN-3992. TestApplicationPriority.testApplicationPriorityAllocation fails intermittently. (Contributed by Sunil G)
2015-08-06 10:43:37 +05:30
Jian He
ba2313d614
YARN-3983. Refactored CapacityScheduleri#FiCaSchedulerApp to easier extend container allocation logic. Contributed by Wangda Tan
2015-08-05 13:47:40 -07:00
Arun Suresh
f271d37735
YARN-3736. Add RMStateStore apis to store and load accepted reservations for failover (adhoot via asuresh)
2015-08-05 12:57:12 -07:00
Xuan
0306d902f5
YARN-3543. ApplicationReport should be able to tell whether the Application is AM managed or not. Contributed by Rohith Sharma K S
2015-08-03 15:46:00 -07:00
Jonathan Eagles
3cd02b9522
YARN-3978. Configurably turn off the saving of container info in Generic AHS (Eric Payne via jeagles)
2015-08-03 10:38:05 -05:00
Jason Lowe
32e490b6c0
YARN-3990. AsyncDispatcher may overloaded with RMAppNodeUpdateEvent when Node is connected/disconnected. Contributed by Bibin A Chundatt
2015-07-31 17:37:24 +00:00
Zhihai Xu
ab80e27703
YARN-433. When RM is catching up with node updates then it should not expire acquired containers. Contributed by Xuan Gong
2015-07-30 21:57:11 -07:00
Wangda Tan
91b42e7d6e
YARN-3971. Skip RMNodeLabelsManager#checkRemoveFromClusterNodeLabelsOfQueue on nodelabel recovery. (Bibin A Chundatt via wangda)
2015-07-30 10:00:31 -07:00
ccurino
156f24ead0
YARN-3656. LowCost: A Cost-Based Placement Agent for YARN Reservations. (Jonathan Yaniv and Ishai Menache via curino)
2015-07-25 07:39:47 -07:00
Jian He
83fe34ac08
YARN-3026. Move application-specific container allocation logic from LeafQueue to FiCaSchedulerApp. Contributed by Wangda Tan
2015-07-24 14:00:25 -07:00
Karthik Kambatla
d19d187753
YARN-3957. FairScheduler NPE In FairSchedulerQueueInfo causing scheduler page to return 500. (Anubhav Dhoot via kasha)
2015-07-24 11:44:37 -07:00
carlo curino
0fcb4a8cf2
YARN-3969. Allow jobs to be submitted to reservation that is active but does not have any allocations. (subru via curino)
2015-07-23 19:33:59 -07:00
Robert Kanter
1d3026e7b3
YARN-3900. Protobuf layout of yarn_security_token causes errors in other protos that include it (adhoot via rkanter)
2015-07-23 14:46:54 -07:00
Wangda Tan
3bba180051
YARN-3941. Proportional Preemption policy should try to avoid sending duplicate PREEMPT_CONTAINER event to scheduler. (Sunil G via wangda)
2015-07-23 10:07:57 -07:00
Wangda Tan
76ec26de80
YARN-3932. SchedulerApplicationAttempt#getResourceUsageReport and UserInfo should based on total-used-resources. (Bibin A Chundatt via wangda)
2015-07-22 11:54:02 -07:00
Wangda Tan
c39ca541f4
YARN-2003. Support for Application priority : Changes in RM and Capacity Scheduler. (Sunil G via wangda)
2015-07-21 09:57:23 -07:00
Arun Suresh
9b272ccae7
YARN-3535. Scheduler must re-request container resources when RMContainer transitions from ALLOCATED to KILLED (rohithsharma and peng.zhang via asuresh)
2015-07-17 04:31:34 -07:00
Wangda Tan
3540d5fe4b
YARN-3885. ProportionalCapacityPreemptionPolicy doesn't preempt if queue is more than 2 level. (Ajith S via wangda)
2015-07-16 16:13:32 -07:00
Arun Suresh
ac94ba3e18
YARN-3453. Ensure preemption logic in FairScheduler uses DominantResourceCalculator in DRF queues to prevent unnecessary thrashing. (asuresh)
2015-07-14 00:23:55 -07:00
Wangda Tan
5ed1fead6b
YARN-3894. RM startup should fail for wrong CS xml NodeLabel capacity configuration. (Bibin A Chundatt via wangda)
2015-07-12 21:52:11 -07:00
Wangda Tan
1df39c1efc
YARN-3849. Too much of preemption activity causing continuos killing of containers across queues. (Sunil G via wangda)
2015-07-11 10:26:46 -07:00
Zhijie Shen
1ea36299a4
YARN-3116. RM notifies NM whether a container is an AM container or normal task container. Contributed by Giovanni Matteo Fumarola.
2015-07-10 18:58:10 -07:00
Ming Ma
08244264c0
YARN-3445. Cache runningApps in RMNode for getting running apps on given NodeId. (Junping Du via mingma)
2015-07-10 08:30:10 -07:00
carlo curino
0e602fa3a1
YARN-3800. Reduce storage footprint for ReservationAllocation. Contributed by Anubhav Dhoot.
2015-07-09 16:51:59 -07:00
Wangda Tan
0e4b06690f
YARN-3508. Prevent processing preemption events on the main RM dispatcher. (Varun Saxena via wangda)
2015-07-01 17:32:22 -07:00
Devaraj K
80a68d6056
YARN-3830. AbstractYarnScheduler.createReleaseCache may try to clean a null attempt. Contributed by nijel.
2015-07-01 19:03:44 +05:30
Xuan
fe6c1bd73a
YARN-2871. TestRMRestart#testRMRestartGetApplicationList sometime fails in trunk. Contributed by zhihai xu
2015-06-26 19:43:59 -07:00
Xuan
5b5bb8dcdc
YARN-3802. Two RMNodes for the same NodeId are used in RM sometimes after NM is reconnected. Contributed by zhihai xu
2015-06-18 14:37:49 -07:00
Xuan
a826d432f9
YARN-3804. Both RM are on standBy state when kerberos user not in yarn.admin.acl. Contributed by Varun Saxena
2015-06-17 16:23:27 -07:00
Devaraj K
d8dcfa98e3
YARN-3794. TestRMEmbeddedElector fails because of ambiguous LOG reference. Contributed by Chengbing Liu.
2015-06-12 13:42:49 +05:30
Xuan
5583f88bf7
YARN-3785. Support for Resource as an argument during submitApp call in MockRM test class. Contributed by Sunil G
2015-06-10 21:40:48 -07:00
Jian He
960b8f19ca
YARN-2716. Refactor ZKRMStateStore retry code with Apache Curator. Contributed by Karthik Kambatla
2015-06-08 14:50:58 -07:00
Karthik Kambatla
bd69ea408f
YARN-3655. FairScheduler: potential livelock due to maxAMShare limitation and container reservation. (Zhihai Xu via kasha)
2015-06-07 11:37:52 -07:00
Xuan
3e000a919f
YARN-1462. AHS API and other AHS changes to handle tags for completed MR jobs. Contributed by Xuan Gong
2015-06-05 12:48:52 -07:00
Karthik Kambatla
75885852cc
YARN-3259. FairScheduler: Trigger fairShare updates on node events. (Anubhav Dhoot via kasha)
2015-06-05 09:39:41 -07:00
Jian He
6ad4e59cfc
YARN-3764. CapacityScheduler should forbid moving LeafQueue from one parent to another. Contributed by Wangda Tan
2015-06-04 10:52:59 -07:00
Wangda Tan
ebd797c48f
YARN-3733. Fix DominantRC#compare() does not work as expected if cluster resource is empty. (Rohith Sharmaks via wangda)
2015-06-04 10:22:57 -07:00
Junping Du
d7e7f6aa03
YARN-41. The RM should handle the graceful shutdown of the NM. Contributed by Devaraj K.
2015-06-04 04:59:27 -07:00
Xuan
5766a04428
YARN-3749. We should make a copy of configuration when init MiniYARNCluster with multiple RMs. Contributed by Chun Chen
2015-06-03 17:20:15 -07:00
Zhijie Shen
bc85959edd
Revert "YARN-1462. Made RM write application tags to timeline server and exposed them to users via generic history web UI and REST API. Contributed by Xuan Gong."
This reverts commit 4a9ec1a824.
2015-06-03 14:15:56 -07:00
Zhijie Shen
4a9ec1a824
YARN-1462. Made RM write application tags to timeline server and exposed them to users via generic history web UI and REST API. Contributed by Xuan Gong.
2015-05-30 21:05:36 -07:00
Vinod Kumar Vavilapalli
9acd24fec4
Fixed more FilesSystemRMStateStore issues. Contributed by Vinod Kumar Vavilapalli.
2015-05-28 15:25:56 -07:00
Allen Wittenauer
d6e3164d4a
YARN-2355. MAX_APP_ATTEMPTS_ENV may no longer be a useful env var for a container (Darrell Taylor via aw)
2015-05-27 16:40:56 -07:00
Wangda Tan
ec0a852a37
YARN-3647. RMWebServices api's should use updated api from CommonNodeLabelsManager to get NodeLabel object. (Sunil G via wangda)
2015-05-27 12:51:20 -07:00
Wangda Tan
cdbd66be11
YARN-3686. CapacityScheduler should trim default_node_label_expression. (Sunil G via wangda)
2015-05-26 15:58:47 -07:00
Jian He
10732d515f
YARN-3632. Ordering policy should be allowed to reorder an application when demand changes. Contributed by Craig Welch
2015-05-26 12:00:51 -07:00
Tsuyoshi Ozawa
9a3d617b63
YARN-2336. Fair scheduler's REST API returns a missing '[' bracket JSON for deep queue tree. Contributed by Kenji Kikushima and Akira Ajisaka.
2015-05-26 19:07:40 +09:00
Karthik Kambatla
4513761869
YARN-3675. FairScheduler: RM quits when node removal races with continuous-scheduling on the same node. (Anubhav Dhoot via kasha)
2015-05-21 13:44:42 -07:00
Jian He
8966d42179
YARN-3609. Load node labels from storage inside RM serviceStart. Contributed by Wangda Tan
2015-05-20 16:30:07 -07:00
Wangda Tan
563eb1ad2a
YARN-3583. Support of NodeLabel object instead of plain String in YarnClient side. (Sunil G via wangda)
2015-05-19 16:54:38 -07:00
Wangda Tan
b37da52a1c
YARN-3565. NodeHeartbeatRequest/RegisterNodeManagerRequest should use NodeLabel object instead of String. (Naganarasimha G R via wangda)
2015-05-19 16:34:17 -07:00
Jason Lowe
f7e051c431
YARN-2421. RM still allocates containers to an app in the FINISHING state. Contributed by Chang Li
2015-05-15 22:09:30 +00:00
Vinod Kumar Vavilapalli
9a2a9553ee
Fixing HDFS state-store. Contributed by Arun Suresh.
2015-05-14 16:13:51 -07:00
Junping Du
15ccd967ee
YARN-3505. Node's Log Aggregation Report with SUCCEED should not cached in RMApps. Contributed by Xuan Gong.
2015-05-14 10:58:12 -07:00
Wangda Tan
0e85044e26
YARN-3362. Add node label usage in RM CapacityScheduler web UI. (Naganarasimha G R via wangda)
2015-05-13 17:00:36 -07:00
Wangda Tan
7f19e7a254
YARN-3521. Support return structured NodeLabel objects in REST API (Sunil G via wangda)
2015-05-13 13:43:17 -07:00
Wangda Tan
341a476812
YARN-2921. Fix MockRM/MockAM#waitForState sleep too long. (Tsuyoshi Ozawa via wangda)
2015-05-13 13:06:07 -07:00
Karthik Kambatla
a60f78e98e
YARN-3395. FairScheduler: Trim whitespaces when using username for queuename. (Zhihai Xu via kasha)
2015-05-09 15:41:20 -07:00
Karthik Kambatla
70fb37cd79
YARN-1287. Consolidate MockClocks. (Sebastian Wong and Anubhav Dhoot via kasha)
2015-05-09 14:34:54 -07:00
Karthik Kambatla
2fb44c8aaf
YARN-3271. FairScheduler: Move tests related to max-runnable-apps from TestFairScheduler to TestAppRunnability. (nijel via kasha)
2015-05-08 16:39:10 -07:00
Jian He
f489a4ec96
YARN-2918. RM should not fail on startup if queue's configured labels do not exist in cluster-node-labels. Contributed by Wangda Tan
2015-05-07 17:35:41 -07:00
Akira Ajisaka
918af8efff
YARN-3577. Misspelling of threshold in log4j.properties for tests. Contributed by Brahma Reddy Battula.
2015-05-07 13:33:03 +09:00
Vinod Kumar Vavilapalli
4c7b9b6abe
YARN-3385. Fixed a race-condition in ResourceManager's ZooKeeper based state-store to avoid crashing on duplicate deletes. Contributed by Zhihai Xu.
2015-05-06 17:51:17 -07:00
Junping Du
31b627b2a8
YARN-3580. [JDK8] TestClientRMService.testGetLabelsToNodes fails. Contributed by Robert Kanter.
2015-05-06 16:51:05 -07:00
Jian He
e4c3b52c89
YARN-3343. Increased TestCapacitySchedulerNodeLabelUpdate#testNodeUpdate timeout. Contributed by Rohith Sharmaks
2015-05-05 11:33:47 -07:00
Jian He
d701acc9c6
YARN-2725. Added test cases of retrying creating znode in ZKRMStateStore. Contributed by Tsuyoshi Ozawa
2015-05-04 16:13:29 -07:00
Gera Shegalov
f8204e241d
YARN-2893. AMLaucher: sporadic job failures due to EOFException in readTokenStorageStream. (Zhihai Xu via gera)
2015-05-01 18:18:55 -07:00
Wangda Tan
e2e8f77118
YARN-3564. Fix TestContainerAllocation.testAMContainerAllocationWhenDNSUnavailable fails randomly. (Jian He via wangda)
2015-04-30 11:03:19 -07:00
Jian He
4c1af156ae
YARN-3533. Test: Fix launchAM in MockRM to wait for attempt to be scheduled. Contributed by Anubhav Dhoot
2015-04-29 14:50:01 -07:00
tgraves
2e215484bd
YARN-3517. RM web ui for dumping scheduler logs should be for admins only (Varun Vasudev via tgraves)
2015-04-29 21:25:42 +00:00
Karthik Kambatla
8f82970e0c
YARN-3485. FairScheduler headroom calculation doesn't consider maxResources for Fifo and FairShare policies. (kasha)
2015-04-28 21:00:35 -07:00
Wangda Tan
db1b674b50
YARN-2740. Fix NodeLabelsManager to properly handle node label modifications when distributed node label configuration enabled. (Naganarasimha G R via wangda)
2015-04-27 16:24:38 -07:00
Jian He
d497f6ea2b
YARN-2498. Respect labels in preemption policy of capacity scheduler for inter-queue preemption. Contributed by Wangda Tan
2015-04-24 17:03:13 -07:00
Jian He
d03dcb9635
YARN-3387. Previous AM's container completed status couldn't pass to current AM if AM and RM restarted during the same time. Contributed by Sandflee
2015-04-24 12:13:29 -07:00
Vinod Kumar Vavilapalli
f5fe35e297
YARN-3413. Changed Nodelabel attributes (like exclusivity) to be settable only via addToClusterNodeLabels but not changeable at runtime. (Wangda Tan via vinodkv)
2015-04-23 11:19:55 -07:00
Wangda Tan
395205444e
YARN-3319. Implement a FairOrderingPolicy. (Craig Welch via wangda)
2015-04-23 10:47:15 -07:00
tgraves
189a63a719
YARN-3434. Interaction between reservations and userlimit can result in significant ULF violation
2015-04-23 14:39:25 +00:00
Junping Du
fad9d7e85b
New parameter of CLI for decommissioning node gracefully in RMAdmin CLI. Contributed by Devaraj K
2015-04-22 10:07:20 -07:00
Jian He
bdd90110e6
YARN-3494. Expose AM resource limit and usage in CS QueueMetrics. Contributed by Rohith Sharmaks
2015-04-21 20:06:20 -07:00
Wangda Tan
e71d0d87d9
YARN-3410. YARN admin should be able to remove individual application records from RMStateStore. (Rohith Sharmaks via wangda)
2015-04-21 17:51:22 -07:00
Wangda Tan
44872b76fc
YARN-3463. Integrate OrderingPolicy Framework with CapacityScheduler. (Craig Welch via wangda)
2015-04-20 17:12:32 -07:00
Wangda Tan
f65eeb412d
YARN-3493. RM fails to come up with error "Failed to load/recover state" when mem settings are changed. (Jian He via wangda)
2015-04-17 17:11:22 -07:00
Jian He
d573f09fb9
YARN-2696. Queue sorting in CapacityScheduler should consider node label. Contributed by Wangda Tan
2015-04-17 13:36:59 -07:00
Junping Du
1db355a875
YARN-1402. Update related Web UI and CLI with exposing client API to check log aggregation status. Contributed by Xuan Gong.
2015-04-17 13:18:59 -07:00
Jian He
bb6dde68f1
YARN-3021. YARN's delegation-token handling disallows certain trust setups to operate properly over DistCp. Contributed by Yongjun Zhang
2015-04-16 19:43:37 -07:00
Jian He
1b89a3e173
YARN-3354. Add node label expression in ContainerTokenIdentifier to support RM recovery. Contributed by Wangda Tan
2015-04-15 13:57:06 -07:00
Tsuyoshi Ozawa
e48cedc663
YARN-3326. Support RESTful API for getLabelsToNodes. Contributed by Naganarasimha G R.
2015-04-15 14:03:55 -05:00
Wangda Tan
5004e75332
YARN-3318. Create Initial OrderingPolicy Framework and FifoOrderingPolicy. (Craig Welch via wangda)
2015-04-15 09:56:32 -07:00
Jian He
0fefda645b
YARN-3361. CapacityScheduler side changes to support non-exclusive node labels. Contributed by Wangda Tan
2015-04-14 11:45:58 -07:00
Jian He
b46ee1e7a3
YARN-3266. RMContext#inactiveNodes should have NodeId as map key. Contributed by Chengbing Liu
2015-04-14 10:54:22 -07:00
Jian He
a1afbc48b5
YARN-3472. Fixed possible leak in DelegationTokenRenewer#allTokens. Contributed by Rohith Sharmaks
2015-04-13 14:07:17 -07:00
Junping Du
92431c9617
YARN-1376. NM need to notify the log aggregation status to RM through Node heartbeat. Contributed by Xuan Gong.
2015-04-10 08:56:18 -07:00
Xuan
afa5d4715a
YARN-3293. Track and display capacity scheduler health metrics in web UI. Contributed by Varun Vasudev
2015-04-09 23:38:04 -07:00
Vinod Kumar Vavilapalli
9c5911294e
YARN-3055. Fixed ResourceManager's DelegationTokenRenewer to not stop token renewal of applications part of a bigger workflow. Contributed by Daryn Sharp.
2015-04-09 13:08:53 -07:00
Robert Kanter
99b08a748e
YARN-2429. TestAMRMTokens.testTokenExpiry fails Intermittently with error message:Invalid AMRMToken (zxu via rkanter)
2015-04-06 14:11:20 -07:00
Tsuyoshi Ozawa
53959e69f7
TestFairScheduler.testContinuousScheduling fails Intermittently. Contributed by Zhihai Xu.
2015-04-06 20:19:13 +09:00
Sandy Ryza
6a6a59db7f
YARN-3415. Non-AM containers can be counted towards amResourceUsage of a fairscheduler queue (Zhihai Xu via Sandy Ryza)
2015-04-02 13:56:08 -07:00
Xuan
4728bdfa15
YARN-3248. Display count of nodes blacklisted by apps in the web UI. Contributed by Varun Vasudev
2015-04-01 04:19:18 -07:00
Karthik Kambatla
79f7f2aabf
YARN-3412. RM tests should use MockRM where possible. (kasha)
2015-03-31 09:14:15 -07:00
Wangda Tan
2a945d24f7
YARN-2495. Allow admin specify labels from each NM (Distributed configuration for node label). (Naganarasimha G R via wangda)
2015-03-30 12:05:21 -07:00
Karthik Kambatla
2bc097cd14
YARN-3241. FairScheduler handles invalid queue names inconsistently. (Zhihai Xu via kasha)
2015-03-23 13:22:03 -07:00
cnauroth
6ca1f12024
YARN-3336. FileSystem memory leak in DelegationTokenRenewer.
2015-03-23 10:45:50 -07:00
Jian He
e1feb4ea1a
YARN-3345. Add non-exclusive node label API. Contributed by Wangda Tan
2015-03-20 19:04:38 -07:00
Jian He
586348e4cb
YARN-3356. Capacity Scheduler FiCaSchedulerApp should use ResourceUsage to track used-resources-by-label. Contributed by Wangda Tan
2015-03-20 13:54:01 -07:00
Devaraj K
93d0f4acc8
YARN-3357. Move TestFifoScheduler to FIFO package. Contributed by Rohith Sharmaks.
2015-03-19 12:16:52 +05:30
Jian He
658097d6da
YARN-3273. Improve scheduler UI to facilitate scheduling analysis and debugging. Contributed Rohith Sharmaks
2015-03-17 21:30:23 -07:00
Tsuyoshi Ozawa
3bc72cc16d
YARN-3205. FileSystemRMStateStore should disable FileSystem Cache to avoid get a Filesystem with an old configuration. Contributed by Zhihai Xu.
2015-03-18 11:53:19 +09:00
Jian He
968425e9f7
YARN-3305. Normalize AM resource request on app submission. Contributed by Rohith Sharmaks
2015-03-17 13:49:59 -07:00
Jian He
487374b7fe
YARN-3243. CapacityScheduler should pass headroom from parent to children to make sure ParentQueue obey its capacity limits. Contributed by Wangda Tan.
2015-03-17 10:24:23 -07:00
Tsuyoshi Ozawa
7522a643fa
YARN-3349. Treat all exceptions as failure in TestFSRMStateStore#testFSRMStateStoreClientRetry. Contributed by Zhihai Xu.
2015-03-17 08:09:55 +09:00
Vinod Kumar Vavilapalli
863079bb87
YARN-3154. Added additional APIs in LogAggregationContext to avoid aggregating running logs of application when rolling is enabled. Contributed by Xuan Gong.
2015-03-12 13:32:29 -07:00
Zhijie Shen
85f6d67fa7
YARN-1884. Added nodeHttpAddress into ContainerReport and fixed the link to NM web page. Contributed by Xuan Gong.
2015-03-11 19:35:19 -07:00
Jason Lowe
27e8ea820f
YARN-3275. CapacityScheduler: Preemption happening on non-preemptable queues. Contributed by Eric Payne
2015-03-06 22:37:26 +00:00
Jian He
95bfd087dc
YARN-1809. Synchronize RM and TimeLineServer Web-UIs. Contributed by Zhijie Shen and Xuan Gong
2015-03-05 21:20:09 -08:00
Karthik Kambatla
8d88691d16
YARN-3242. Asynchrony in ZK-close can lead to ZKRMStateStore watcher receiving events for old client. (Zhihai Xu via kasha)
2015-03-04 19:49:05 -08:00
Karthik Kambatla
22426a1c9f
YARN-3231. FairScheduler: Changing queueMaxRunningApps interferes with pending jobs. (Siqi Li via kasha)
2015-03-04 18:06:58 -08:00
Jian He
b2f1ec312e
YARN-3222. Fixed NPE on RMNodeImpl#ReconnectNodeTransition when a node is reconnected with a different port. Contributed by Rohith Sharmaks
2015-03-03 16:28:28 -08:00
Wangda Tan
e17e5ba9d7
YARN-3272. Surface container locality info in RM web UI (Jian He via wangda)
2015-03-03 11:49:01 -08:00
Vinod Kumar Vavilapalli
14dd647c55
YARN-3265. Fixed a deadlock in CapacityScheduler by always passing a queue's available resource-limit from the parent queue. Contributed by Wangda Tan.
2015-03-02 17:52:47 -08:00
Wangda Tan
edcecedc1c
YARN-3262. Surface application outstanding resource requests table in RM web UI. (Jian He via wangda)
2015-02-27 16:13:32 -08:00
Tsuyoshi Ozawa
01a1621930
YARN-2820. Retry in FileSystemRMStateStore when FS's operations fail due to IOException. Contributed by Zhihai Xu.
2015-02-28 00:56:44 +09:00
Devaraj K
0d4296f0e0
YARN-3256. TestClientToAMTokens#testClientTokenRace is not running against all Schedulers even when using ParameterizedSchedulerTestBase. Contributed by Anubhav Dhoot.
2015-02-26 15:45:41 +05:30
Tsuyoshi Ozawa
6cbd9f1113
YARN-3247. TestQueueMappings should use CapacityScheduler explicitly. Contributed by Zhihai Xu.
2015-02-25 10:38:11 +09:00
Xuan
fe7a302473
YARN-2797. TestWorkPreservingRMRestart should use ParametrizedSchedulerTestBase. Contributed by Karthik Kambatla
2015-02-21 19:17:29 -08:00
Jason Lowe
a64dd3d24b
YARN-3194. RM should handle NMContainerStatuses sent by NM while registering if NM is Reconnected node. Contributed by Rohith
2015-02-20 15:10:10 +00:00
Jian He
c0d9b93953
YARN-933. Fixed InvalidStateTransitonException at FINAL_SAVING state in RMApp. Contributed by Rohith Sharmaks
2015-02-19 15:42:39 -08:00
Wangda Tan
d49ae725d5
YARN-3076. Add API/Implementation to YarnClient to retrieve label-to-node mapping (Varun Saxena via wangda)
2015-02-19 11:00:57 -08:00
Jian He
1c03376300
YARN-1514. Utility to benchmark ZKRMStateStore#loadState for RM HA. Contributed by Tsuyoshi OZAWA
2015-02-18 16:06:55 -08:00
Jian He
f5da5566d9
YARN-3132. RMNodeLabelsManager should remove node from node-to-label mapping when node becomes deactivated. Contributed by Wangda Tan
2015-02-18 11:51:51 -08:00
Jian He
18297e0972
YARN-3104. Fixed RM to not generate new AMRM tokens on every heartbeat between rolling and activation. Contributed by Jason Lowe
2015-02-12 16:02:45 -08:00
Jian He
18a594257e
YARN-3124. Fixed CS LeafQueue/ParentQueue to use QueueCapacities to track capacities-by-label. Contributed by Wangda Tan
2015-02-12 14:58:09 -08:00
Xuan
65c69e296e
YARN-3151. On Failover tracking url wrong in application cli for KILLED application. Contributed by Rohith
2015-02-11 21:19:48 -08:00
Zhijie Shen
d5855c0e46
YARN-2246. Made the proxy tracking URL always be http(s)://proxy addr:port/proxy/<appId> to avoid duplicate sections. Contributed by Devaraj K.
2015-02-10 15:24:01 -08:00
Zhijie Shen
23bf6c7207
YARN-3100. Made YARN authorization pluggable. Contributed by Jian He.
2015-02-09 20:34:56 -08:00
Jian He
0af6a99a3f
YARN-3094. Reset timer for liveness monitors after RM recovery. Contributed by Jun Gong
2015-02-09 13:47:08 -08:00
Karthik Kambatla
7e42088abf
YARN-2990. FairScheduler's delay-scheduling always waits for node-local and rack-local delays, even for off-rack-only requests. (kasha)
2015-02-08 22:48:10 -08:00
Jason Lowe
da2fb2bc46
YARN-3143. RM Apps REST API can return NPE or entries missing id and other fields. Contributed by Jason Lowe
2015-02-06 21:47:32 +00:00
Jian He
c1957fef29
YARN-2694. Ensure only single node label specified in ResourceRequest. Contributed by Wangda Tan
2015-02-06 11:34:20 -08:00
Jason Lowe
69c8a7f45b
YARN-1582. Capacity Scheduler: add a maximum-allocation-mb setting per queue. Contributed by Thomas Graves
2015-02-05 19:28:49 +00:00
Sandy Ryza
b6466deac6
YARN-3101. In Fair Scheduler, fix canceling of reservations for exceeding max share (Anubhav Dhoot via Sandy Ryza)
2015-02-05 09:39:28 -08:00
Zhijie Shen
e5b56e2af6
YARN-2543. Made resource usage be published to the timeline server too. Contributed by Naganarasimha G R.
2015-02-03 17:34:22 -08:00
Wangda Tan
5bd984691b
YARN-3075. NodeLabelsManager implementation to retrieve label to node mapping (Varun Saxena via wangda)
2015-02-03 12:52:42 -08:00
Jian He
21d80b3dd9
YARN-3098. Created common QueueCapacities class in Capacity Scheduler to track capacities-by-labels of queues. Contributed by Wangda Tan
2015-02-03 11:43:12 -08:00
Jason Lowe
a761bf8726
YARN-3085. Application summary should include the application type. Contributed by Rohith
2015-02-03 14:56:34 +00:00
Jian He
054a947989
YARN-3077. Fixed RM to create zk root path recursively. Contributed by Chun Chen
2015-01-30 17:34:49 -08:00
Jian He
86358221fc
YARN-3099. Capacity Scheduler LeafQueue/ParentQueue should use ResourceUsage to track used-resources-by-label. Contributed by Wangda Tan
2015-01-30 15:15:20 -08:00
Wangda Tan
7882bc0f14
YARN-3079. Scheduler should also update maximumAllocation when updateNodeResource. (Zhihai Xu via wangda)
2015-01-28 21:54:38 -08:00
Wangda Tan
18741adf97
YARN-2932. Add entry for preemptable status (enabled/disabled) to scheduler web UI and queue initialize/refresh logging. (Eric Payne via wangda)
2015-01-27 15:36:09 -08:00
Jian He
6f9fe76918
YARN-3092. Created a common ResourceUsage class to track labeled resource usages in Capacity Scheduler. Contributed by Wangda Tan
2015-01-26 15:38:00 -08:00
Tsuyoshi Ozawa
24aa462673
YARN-2800. Remove MemoryNodeLabelsStore and add a way to enable/disable node labels feature. Contributed by Wangda Tan.
2015-01-23 20:37:05 +09:00
Wangda Tan
0a2d3e717d
YARN-2933. Capacity Scheduler preemption policy should only consider capacity without labels temporarily. Contributed by Mayank Bansal
2015-01-19 16:48:50 -08:00
Junping Du
5d1cca34fa
YARN-3064. TestRMRestart/TestContainerResourceUsage/TestNodeManagerResync failure with allocation timeout. (Contributed by Jian He)
2015-01-16 00:08:36 -08:00
Jian He
c53420f583
YARN-2637. Fixed max-am-resource-percent calculation in CapacityScheduler when activating applications. Contributed by Craig Welch
2015-01-13 17:32:07 -08:00
Robert Kanter
ae7bf31fe1
YARN-3027. Scheduler should use totalAvailable resource from node instead of availableResource for maxAllocation. (adhoot via rkanter)
2015-01-12 10:47:52 -08:00
Zhijie Shen
60103fca04
YARN-2427. Added the API of moving apps between queues in RM web services. Contributed by Varun Vasudev.
2015-01-06 14:37:44 -08:00
Karthik Kambatla
0c4b112677
YARN-2881. [YARN-2574] Implement PlanFollower for FairScheduler. (Anubhav Dhoot via kasha)
2015-01-06 04:42:10 +05:30
Zhijie Shen
562a701945
YARN-2958. Made RMStateStore not update the last sequence number when updating the delegation token. Contributed by Varun Saxena.
2015-01-05 13:33:07 -08:00
Tsuyoshi Ozawa
ddc5be48fc
YARN-2922. ConcurrentModificationException in CapacityScheduler's LeafQueue. Contributed by Rohith Sharmaks.
2015-01-05 00:08:31 +09:00
Karthik Kambatla
e7257acd8a
YARN-2998. Abstract out scheduler independent PlanFollower components. (Anubhav Dhoot via kasha)
2014-12-30 19:55:24 -08:00
Jian He
e2351c7ae2
YARN-2987. Fixed ClientRMService#getQueueInfo to check against queue and app ACLs. Contributed by Varun Saxena
2014-12-30 17:15:37 -08:00
Jian He
b7442bf92e
YARN-2493. Added node-labels page on RM web UI. Contributed by Wangda Tan
2014-12-30 16:49:01 -08:00
Jian He
746ad6e989
Revert "YARN-2492(wrong jira number). Added node-labels page on RM web UI. Contributed by Wangda Tan"
...
This reverts commit 5f57b904f5.
2014-12-30 16:48:49 -08:00
Jian He
5f57b904f5
YARN-2492. Added node-labels page on RM web UI. Contributed by Wangda Tan
2014-12-30 15:38:28 -08:00
Jian He
4f18018b7a
YARN-2946. Fixed potential deadlock in RMStateStore. Contributed by Rohith Sharmaks
2014-12-23 22:14:29 -08:00
Jian He
149512a837
YARN-2837. Support TimeLine server to recover delegation token when restarting. Contributed by Zhijie Shen
2014-12-23 18:25:37 -08:00
Jian He
0d89859b51
YARN-2340. Fixed NPE when queue is stopped during RM restart. Contributed by Rohith Sharmaks
2014-12-22 21:53:22 -08:00
Jian He
fdf042dfff
YARN-2920. Changed CapacityScheduler to kill containers on nodes where node labels are changed. Contributed by Wangda Tan
2014-12-22 16:51:15 -08:00
Karthik Kambatla
24ee9e3431
YARN-2975. FSLeafQueue app lists are accessed without required locks. (kasha)
2014-12-20 12:17:50 -08:00
Jian He
808cba3821
YARN-2952. Fixed incorrect version check in StateStore. Contributed by Rohith Sharmaks
2014-12-19 16:56:30 -08:00
Karthik Kambatla
a22ffc3188
YARN-2738. [YARN-2574] Add FairReservationSystem for FairScheduler. (Anubhav Dhoot via kasha)
2014-12-19 15:37:12 -08:00
Jason Lowe
0402bada19
YARN-2964. RM prematurely cancels tokens for jobs that submit jobs (oozie). Contributed by Jian He
2014-12-18 23:28:18 +00:00
Steve Loughran
3681de2039
YARN-2912 Jersey Tests failing with port in use. (varun saxena via stevel)
2014-12-12 17:11:07 +00:00
Jian He
2ed90a57fd
YARN-2930. Fixed TestRMRestart#testRMRestartRecoveringNodeLabelManager intermittent failure. Contributed by Wangda Tan
2014-12-09 16:48:04 -08:00
Karthik Kambatla
a2e07a5456
YARN-2910. FSLeafQueue can throw ConcurrentModificationException. (Wilfred Spiegelenburg via kasha)
2014-12-09 14:00:31 -08:00