Wangda Tan
|
3bba180051
|
YARN-3941. Proportional Preemption policy should try to avoid sending duplicate PREEMPT_CONTAINER event to scheduler. (Sunil G via wangda)
|
2015-07-23 10:07:57 -07:00 |
|
Wangda Tan
|
76ec26de80
|
YARN-3932. SchedulerApplicationAttempt#getResourceUsageReport and UserInfo should based on total-used-resources. (Bibin A Chundatt via wangda)
|
2015-07-22 11:54:02 -07:00 |
|
Wangda Tan
|
c39ca541f4
|
YARN-2003. Support for Application priority : Changes in RM and Capacity Scheduler. (Sunil G via wangda)
|
2015-07-21 09:57:23 -07:00 |
|
Arun Suresh
|
9b272ccae7
|
YARN-3535. Scheduler must re-request container resources when RMContainer transitions from ALLOCATED to KILLED (rohithsharma and peng.zhang via asuresh)
|
2015-07-17 04:31:34 -07:00 |
|
Wangda Tan
|
3540d5fe4b
|
YARN-3885. ProportionalCapacityPreemptionPolicy doesn't preempt if queue is more than 2 level. (Ajith S via wangda)
|
2015-07-16 16:13:32 -07:00 |
|
Arun Suresh
|
ac94ba3e18
|
YARN-3453. Ensure preemption logic in FairScheduler uses DominantResourceCalculator in DRF queues to prevent unnecessary thrashing. (asuresh)
|
2015-07-14 00:23:55 -07:00 |
|
Wangda Tan
|
5ed1fead6b
|
YARN-3894. RM startup should fail for wrong CS xml NodeLabel capacity configuration. (Bibin A Chundatt via wangda)
|
2015-07-12 21:52:11 -07:00 |
|
Wangda Tan
|
1df39c1efc
|
YARN-3849. Too much of preemption activity causing continuos killing of containers across queues. (Sunil G via wangda)
|
2015-07-11 10:26:46 -07:00 |
|
Zhijie Shen
|
1ea36299a4
|
YARN-3116. RM notifies NM whether a container is an AM container or normal task container. Contributed by Giovanni Matteo Fumarola.
|
2015-07-10 18:58:10 -07:00 |
|
Ming Ma
|
08244264c0
|
YARN-3445. Cache runningApps in RMNode for getting running apps on given NodeId. (Junping Du via mingma)
|
2015-07-10 08:30:10 -07:00 |
|
carlo curino
|
0e602fa3a1
|
YARN-3800. Reduce storage footprint for ReservationAllocation. Contributed by Anubhav Dhoot.
|
2015-07-09 16:51:59 -07:00 |
|
Wangda Tan
|
0e4b06690f
|
YARN-3508. Prevent processing preemption events on the main RM dispatcher. (Varun Saxena via wangda)
|
2015-07-01 17:32:22 -07:00 |
|
Devaraj K
|
80a68d6056
|
YARN-3830. AbstractYarnScheduler.createReleaseCache may try to clean a
null attempt. Contributed by nijel.
|
2015-07-01 19:03:44 +05:30 |
|
Xuan
|
fe6c1bd73a
|
YARN-2871. TestRMRestart#testRMRestartGetApplicationList sometime fails
in trunk. Contributed by zhihai xu
|
2015-06-26 19:43:59 -07:00 |
|
Xuan
|
5b5bb8dcdc
|
YARN-3802. Two RMNodes for the same NodeId are used in RM sometimes
after NM is reconnected. Contributed by zhihai xu
|
2015-06-18 14:37:49 -07:00 |
|
Xuan
|
a826d432f9
|
YARN-3804. Both RM are on standBy state when kerberos user not in yarn.admin.acl. Contributed by Varun Saxena
|
2015-06-17 16:23:27 -07:00 |
|
Devaraj K
|
d8dcfa98e3
|
YARN-3794. TestRMEmbeddedElector fails because of ambiguous LOG reference.
Contributed by Chengbing Liu.
|
2015-06-12 13:42:49 +05:30 |
|
Xuan
|
5583f88bf7
|
YARN-3785. Support for Resource as an argument during submitApp call in
MockRM test class. Contributed by Sunil G
|
2015-06-10 21:40:48 -07:00 |
|
Jian He
|
960b8f19ca
|
YARN-2716. Refactor ZKRMStateStore retry code with Apache Curator. Contributed by Karthik Kambatla
|
2015-06-08 14:50:58 -07:00 |
|
Karthik Kambatla
|
bd69ea408f
|
YARN-3655. FairScheduler: potential livelock due to maxAMShare limitation and container reservation. (Zhihai Xu via kasha)
|
2015-06-07 11:37:52 -07:00 |
|
Xuan
|
3e000a919f
|
YARN-1462. AHS API and other AHS changes to handle tags for completed MR jobs. Contributed by Xuan Gong
|
2015-06-05 12:48:52 -07:00 |
|
Karthik Kambatla
|
75885852cc
|
YARN-3259. FairScheduler: Trigger fairShare updates on node events. (Anubhav Dhoot via kasha)
|
2015-06-05 09:39:41 -07:00 |
|
Jian He
|
6ad4e59cfc
|
YARN-3764. CapacityScheduler should forbid moving LeafQueue from one parent to another. Contributed by Wangda Tan
|
2015-06-04 10:52:59 -07:00 |
|
Wangda Tan
|
ebd797c48f
|
YARN-3733. Fix DominantRC#compare() does not work as expected if cluster resource is empty. (Rohith Sharmaks via wangda)
|
2015-06-04 10:22:57 -07:00 |
|
Junping Du
|
d7e7f6aa03
|
YARN-41. The RM should handle the graceful shutdown of the NM. Contributed by Devaraj K.
|
2015-06-04 04:59:27 -07:00 |
|
Xuan
|
5766a04428
|
YARN-3749. We should make a copy of configuration when init
MiniYARNCluster with multiple RMs. Contributed by Chun Chen
|
2015-06-03 17:20:15 -07:00 |
|
Zhijie Shen
|
bc85959edd
|
Revert "YARN-1462. Made RM write application tags to timeline server and exposed them to users via generic history web UI and REST API. Contributed by Xuan Gong."
This reverts commit 4a9ec1a824 .
|
2015-06-03 14:15:56 -07:00 |
|
Zhijie Shen
|
4a9ec1a824
|
YARN-1462. Made RM write application tags to timeline server and exposed them to users via generic history web UI and REST API. Contributed by Xuan Gong.
|
2015-05-30 21:05:36 -07:00 |
|
Vinod Kumar Vavilapalli
|
9acd24fec4
|
Fixed more FilesSystemRMStateStore issues. Contributed by Vinod Kumar Vavilapalli.
|
2015-05-28 15:25:56 -07:00 |
|
Allen Wittenauer
|
d6e3164d4a
|
YARN-2355. MAX_APP_ATTEMPTS_ENV may no longer be a useful env var for a container (Darrell Taylor via aw)
|
2015-05-27 16:40:56 -07:00 |
|
Wangda Tan
|
ec0a852a37
|
YARN-3647. RMWebServices api's should use updated api from CommonNodeLabelsManager to get NodeLabel object. (Sunil G via wangda)
|
2015-05-27 12:51:20 -07:00 |
|
Wangda Tan
|
cdbd66be11
|
YARN-3686. CapacityScheduler should trim default_node_label_expression. (Sunil G via wangda)
|
2015-05-26 15:58:47 -07:00 |
|
Jian He
|
10732d515f
|
YARN-3632. Ordering policy should be allowed to reorder an application when demand changes. Contributed by Craig Welch
|
2015-05-26 12:00:51 -07:00 |
|
Tsuyoshi Ozawa
|
9a3d617b63
|
YARN-2336. Fair scheduler's REST API returns a missing '[' bracket JSON for deep queue tree. Contributed by Kenji Kikushima and Akira Ajisaka.
|
2015-05-26 19:07:40 +09:00 |
|
Karthik Kambatla
|
4513761869
|
YARN-3675. FairScheduler: RM quits when node removal races with continuous-scheduling on the same node. (Anubhav Dhoot via kasha)
|
2015-05-21 13:44:42 -07:00 |
|
Jian He
|
8966d42179
|
YARN-3609. Load node labels from storage inside RM serviceStart. Contributed by Wangda Tan
|
2015-05-20 16:30:07 -07:00 |
|
Wangda Tan
|
563eb1ad2a
|
YARN-3583. Support of NodeLabel object instead of plain String in YarnClient side. (Sunil G via wangda)
|
2015-05-19 16:54:38 -07:00 |
|
Wangda Tan
|
b37da52a1c
|
YARN-3565. NodeHeartbeatRequest/RegisterNodeManagerRequest should use NodeLabel object instead of String. (Naganarasimha G R via wangda)
|
2015-05-19 16:34:17 -07:00 |
|
Jason Lowe
|
f7e051c431
|
YARN-2421. RM still allocates containers to an app in the FINISHING state. Contributed by Chang Li
|
2015-05-15 22:09:30 +00:00 |
|
Vinod Kumar Vavilapalli
|
9a2a9553ee
|
Fixing HDFS state-store. Contributed by Arun Suresh.
|
2015-05-14 16:13:51 -07:00 |
|
Junping Du
|
15ccd967ee
|
YARN-3505. Node's Log Aggregation Report with SUCCEED should not cached in RMApps. Contributed by Xuan Gong.
|
2015-05-14 10:58:12 -07:00 |
|
Wangda Tan
|
0e85044e26
|
YARN-3362. Add node label usage in RM CapacityScheduler web UI. (Naganarasimha G R via wangda)
|
2015-05-13 17:00:36 -07:00 |
|
Wangda Tan
|
7f19e7a254
|
YARN-3521. Support return structured NodeLabel objects in REST API (Sunil G via wangda)
|
2015-05-13 13:43:17 -07:00 |
|
Wangda Tan
|
341a476812
|
YARN-2921. Fix MockRM/MockAM#waitForState sleep too long. (Tsuyoshi Ozawa via wangda)
|
2015-05-13 13:06:07 -07:00 |
|
Karthik Kambatla
|
a60f78e98e
|
YARN-3395. FairScheduler: Trim whitespaces when using username for queuename. (Zhihai Xu via kasha)
|
2015-05-09 15:41:20 -07:00 |
|
Karthik Kambatla
|
70fb37cd79
|
YARN-1287. Consolidate MockClocks. (Sebastian Wong and Anubhav Dhoot via kasha)
|
2015-05-09 14:34:54 -07:00 |
|
Karthik Kambatla
|
2fb44c8aaf
|
YARN-3271. FairScheduler: Move tests related to max-runnable-apps from TestFairScheduler to TestAppRunnability. (nijel via kasha)
|
2015-05-08 16:39:10 -07:00 |
|
Jian He
|
f489a4ec96
|
YARN-2918. RM should not fail on startup if queue's configured labels do not exist in cluster-node-labels. Contributed by Wangda Tan
|
2015-05-07 17:35:41 -07:00 |
|
Akira Ajisaka
|
918af8efff
|
YARN-3577. Misspelling of threshold in log4j.properties for tests. Contributed by Brahma Reddy Battula.
|
2015-05-07 13:33:03 +09:00 |
|
Vinod Kumar Vavilapalli
|
4c7b9b6abe
|
YARN-3385. Fixed a race-condition in ResourceManager's ZooKeeper based state-store to avoid crashing on duplicate deletes. Contributed by Zhihai Xu.
|
2015-05-06 17:51:17 -07:00 |
|
Junping Du
|
31b627b2a8
|
YARN-3580. [JDK8] TestClientRMService.testGetLabelsToNodes fails. Contributed by Robert Kanter.
|
2015-05-06 16:51:05 -07:00 |
|
Jian He
|
e4c3b52c89
|
YARN-3343. Increased TestCapacitySchedulerNodeLabelUpdate#testNodeUpdate timeout. Contributed by Rohith Sharmaks
|
2015-05-05 11:33:47 -07:00 |
|
Jian He
|
d701acc9c6
|
YARN-2725. Added test cases of retrying creating znode in ZKRMStateStore. Contributed by Tsuyoshi Ozawa
|
2015-05-04 16:13:29 -07:00 |
|
Gera Shegalov
|
f8204e241d
|
YARN-2893. AMLaucher: sporadic job failures due to EOFException in readTokenStorageStream. (Zhihai Xu via gera)
|
2015-05-01 18:18:55 -07:00 |
|
Wangda Tan
|
e2e8f77118
|
YARN-3564. Fix TestContainerAllocation.testAMContainerAllocationWhenDNSUnavailable fails randomly. (Jian He via wangda)
|
2015-04-30 11:03:19 -07:00 |
|
Jian He
|
4c1af156ae
|
YARN-3533. Test: Fix launchAM in MockRM to wait for attempt to be scheduled. Contributed by Anubhav Dhoot
|
2015-04-29 14:50:01 -07:00 |
|
tgraves
|
2e215484bd
|
YARN-3517. RM web ui for dumping scheduler logs should be for admins only (Varun Vasudev via tgraves)
|
2015-04-29 21:25:42 +00:00 |
|
Karthik Kambatla
|
8f82970e0c
|
YARN-3485. FairScheduler headroom calculation doesn't consider maxResources for Fifo and FairShare policies. (kasha)
|
2015-04-28 21:00:35 -07:00 |
|
Wangda Tan
|
db1b674b50
|
YARN-2740. Fix NodeLabelsManager to properly handle node label modifications when distributed node label configuration enabled. (Naganarasimha G R via wangda)
|
2015-04-27 16:24:38 -07:00 |
|
Jian He
|
d497f6ea2b
|
YARN-2498. Respect labels in preemption policy of capacity scheduler for inter-queue preemption. Contributed by Wangda Tan
|
2015-04-24 17:03:13 -07:00 |
|
Jian He
|
d03dcb9635
|
YARN-3387. Previous AM's container completed status couldn't pass to current AM if AM and RM restarted during the same time. Contributed by Sandflee
|
2015-04-24 12:13:29 -07:00 |
|
Vinod Kumar Vavilapalli
|
f5fe35e297
|
YARN-3413. Changed Nodelabel attributes (like exclusivity) to be settable only via addToClusterNodeLabels but not changeable at runtime. (Wangda Tan via vinodkv)
|
2015-04-23 11:19:55 -07:00 |
|
Wangda Tan
|
395205444e
|
YARN-3319. Implement a FairOrderingPolicy. (Craig Welch via wangda)
|
2015-04-23 10:47:15 -07:00 |
|
tgraves
|
189a63a719
|
YARN-3434. Interaction between reservations and userlimit can result in significant ULF violation
|
2015-04-23 14:39:25 +00:00 |
|
Junping Du
|
fad9d7e85b
|
New parameter of CLI for decommissioning node gracefully in RMAdmin CLI. Contributed by Devaraj K
|
2015-04-22 10:07:20 -07:00 |
|
Jian He
|
bdd90110e6
|
YARN-3494. Expose AM resource limit and usage in CS QueueMetrics. Contributed by Rohith Sharmaks
|
2015-04-21 20:06:20 -07:00 |
|
Wangda Tan
|
e71d0d87d9
|
YARN-3410. YARN admin should be able to remove individual application records from RMStateStore. (Rohith Sharmaks via wangda)
|
2015-04-21 17:51:22 -07:00 |
|
Wangda Tan
|
44872b76fc
|
YARN-3463. Integrate OrderingPolicy Framework with CapacityScheduler. (Craig Welch via wangda)
|
2015-04-20 17:12:32 -07:00 |
|
Wangda Tan
|
f65eeb412d
|
YARN-3493. RM fails to come up with error "Failed to load/recover state" when mem settings are changed. (Jian He via wangda)
|
2015-04-17 17:11:22 -07:00 |
|
Jian He
|
d573f09fb9
|
YARN-2696. Queue sorting in CapacityScheduler should consider node label. Contributed by Wangda Tan
|
2015-04-17 13:36:59 -07:00 |
|
Junping Du
|
1db355a875
|
YARN-1402. Update related Web UI and CLI with exposing client API to check log aggregation status. Contributed by Xuan Gong.
|
2015-04-17 13:18:59 -07:00 |
|
Jian He
|
bb6dde68f1
|
YARN-3021. YARN's delegation-token handling disallows certain trust setups to operate properly over DistCp. Contributed by Yongjun Zhang
|
2015-04-16 19:43:37 -07:00 |
|
Jian He
|
1b89a3e173
|
YARN-3354. Add node label expression in ContainerTokenIdentifier to support RM recovery. Contributed by Wangda Tan
|
2015-04-15 13:57:06 -07:00 |
|
Tsuyoshi Ozawa
|
e48cedc663
|
YARN-3326. Support RESTful API for getLabelsToNodes. Contributed by Naganarasimha G R.
|
2015-04-15 14:03:55 -05:00 |
|
Wangda Tan
|
5004e75332
|
YARN-3318. Create Initial OrderingPolicy Framework and FifoOrderingPolicy. (Craig Welch via wangda)
|
2015-04-15 09:56:32 -07:00 |
|
Jian He
|
0fefda645b
|
YARN-3361. CapacityScheduler side changes to support non-exclusive node labels. Contributed by Wangda Tan
|
2015-04-14 11:45:58 -07:00 |
|
Jian He
|
b46ee1e7a3
|
YARN-3266. RMContext#inactiveNodes should have NodeId as map key. Contributed by Chengbing Liu
|
2015-04-14 10:54:22 -07:00 |
|
Jian He
|
a1afbc48b5
|
YARN-3472. Fixed possible leak in DelegationTokenRenewer#allTokens. Contributed by Rohith Sharmaks
|
2015-04-13 14:07:17 -07:00 |
|
Junping Du
|
92431c9617
|
YARN-1376. NM need to notify the log aggregation status to RM through Node heartbeat. Contributed by Xuan Gong.
|
2015-04-10 08:56:18 -07:00 |
|
Xuan
|
afa5d4715a
|
YARN-3293. Track and display capacity scheduler health metrics in web
UI. Contributed by Varun Vasudev
|
2015-04-09 23:38:04 -07:00 |
|
Vinod Kumar Vavilapalli
|
9c5911294e
|
YARN-3055. Fixed ResourceManager's DelegationTokenRenewer to not stop token renewal of applications part of a bigger workflow. Contributed by Daryn Sharp.
|
2015-04-09 13:08:53 -07:00 |
|
Robert Kanter
|
99b08a748e
|
YARN-2429. TestAMRMTokens.testTokenExpiry fails Intermittently with error message:Invalid AMRMToken (zxu via rkanter)
|
2015-04-06 14:11:20 -07:00 |
|
Tsuyoshi Ozawa
|
53959e69f7
|
TestFairScheduler.testContinuousScheduling fails Intermittently. Contributed by Zhihai Xu.
|
2015-04-06 20:19:13 +09:00 |
|
Sandy Ryza
|
6a6a59db7f
|
YARN-3415. Non-AM containers can be counted towards amResourceUsage of a fairscheduler queue (Zhihai Xu via Sandy Ryza)
|
2015-04-02 13:56:08 -07:00 |
|
Xuan
|
4728bdfa15
|
YARN-3248. Display count of nodes blacklisted by apps in the web UI.
Contributed by Varun Vasudev
|
2015-04-01 04:19:18 -07:00 |
|
Karthik Kambatla
|
79f7f2aabf
|
YARN-3412. RM tests should use MockRM where possible. (kasha)
|
2015-03-31 09:14:15 -07:00 |
|
Wangda Tan
|
2a945d24f7
|
YARN-2495. Allow admin specify labels from each NM (Distributed configuration for node label). (Naganarasimha G R via wangda)
|
2015-03-30 12:05:21 -07:00 |
|
Karthik Kambatla
|
2bc097cd14
|
YARN-3241. FairScheduler handles invalid queue names inconsistently. (Zhihai Xu via kasha)
|
2015-03-23 13:22:03 -07:00 |
|
cnauroth
|
6ca1f12024
|
YARN-3336. FileSystem memory leak in DelegationTokenRenewer.
|
2015-03-23 10:45:50 -07:00 |
|
Jian He
|
e1feb4ea1a
|
YARN-3345. Add non-exclusive node label API. Contributed by Wangda Tan
|
2015-03-20 19:04:38 -07:00 |
|
Jian He
|
586348e4cb
|
YARN-3356. Capacity Scheduler FiCaSchedulerApp should use ResourceUsage to track used-resources-by-label. Contributed by Wangda Tan
|
2015-03-20 13:54:01 -07:00 |
|
Devaraj K
|
93d0f4acc8
|
YARN-3357. Move TestFifoScheduler to FIFO package. Contributed by Rohith
Sharmaks.
|
2015-03-19 12:16:52 +05:30 |
|
Jian He
|
658097d6da
|
YARN-3273. Improve scheduler UI to facilitate scheduling analysis and debugging. Contributed Rohith Sharmaks
|
2015-03-17 21:30:23 -07:00 |
|
Tsuyoshi Ozawa
|
3bc72cc16d
|
YARN-3205. FileSystemRMStateStore should disable FileSystem Cache to avoid get a Filesystem with an old configuration. Contributed by Zhihai Xu.
|
2015-03-18 11:53:19 +09:00 |
|
Jian He
|
968425e9f7
|
YARN-3305. Normalize AM resource request on app submission. Contributed by Rohith Sharmaks
|
2015-03-17 13:49:59 -07:00 |
|
Jian He
|
487374b7fe
|
YARN-3243. CapacityScheduler should pass headroom from parent to children to make sure ParentQueue obey its capacity limits. Contributed by Wangda Tan.
|
2015-03-17 10:24:23 -07:00 |
|
Tsuyoshi Ozawa
|
7522a643fa
|
YARN-3349. Treat all exceptions as failure in TestFSRMStateStore#testFSRMStateStoreClientRetry. Contributed by Zhihai Xu.
|
2015-03-17 08:09:55 +09:00 |
|
Vinod Kumar Vavilapalli
|
863079bb87
|
YARN-3154. Added additional APIs in LogAggregationContext to avoid aggregating running logs of application when rolling is enabled. Contributed by Xuan Gong.
|
2015-03-12 13:32:29 -07:00 |
|
Zhijie Shen
|
85f6d67fa7
|
YARN-1884. Added nodeHttpAddress into ContainerReport and fixed the link to NM web page. Contributed by Xuan Gong.
|
2015-03-11 19:35:19 -07:00 |
|
Jason Lowe
|
27e8ea820f
|
YARN-3275. CapacityScheduler: Preemption happening on non-preemptable queues. Contributed by Eric Payne
|
2015-03-06 22:37:26 +00:00 |
|
Jian He
|
95bfd087dc
|
YARN-1809. Synchronize RM and TimeLineServer Web-UIs. Contributed by Zhijie Shen and Xuan Gong
|
2015-03-05 21:20:09 -08:00 |
|
Karthik Kambatla
|
8d88691d16
|
YARN-3242. Asynchrony in ZK-close can lead to ZKRMStateStore watcher receiving events for old client. (Zhihai Xu via kasha)
|
2015-03-04 19:49:05 -08:00 |
|
Karthik Kambatla
|
22426a1c9f
|
YARN-3231. FairScheduler: Changing queueMaxRunningApps interferes with pending jobs. (Siqi Li via kasha)
|
2015-03-04 18:06:58 -08:00 |
|
Jian He
|
b2f1ec312e
|
YARN-3222. Fixed NPE on RMNodeImpl#ReconnectNodeTransition when a node is reconnected with a different port. Contributed by Rohith Sharmaks
|
2015-03-03 16:28:28 -08:00 |
|
Wangda Tan
|
e17e5ba9d7
|
YARN-3272. Surface container locality info in RM web UI (Jian He via wangda)
|
2015-03-03 11:49:01 -08:00 |
|
Vinod Kumar Vavilapalli
|
14dd647c55
|
YARN-3265. Fixed a deadlock in CapacityScheduler by always passing a queue's available resource-limit from the parent queue. Contributed by Wangda Tan.
|
2015-03-02 17:52:47 -08:00 |
|
Wangda Tan
|
edcecedc1c
|
YARN-3262. Surface application outstanding resource requests table in RM web UI. (Jian He via wangda)
|
2015-02-27 16:13:32 -08:00 |
|
Tsuyoshi Ozawa
|
01a1621930
|
YARN-2820. Retry in FileSystemRMStateStore when FS's operations fail due to IOException. Contributed by Zhihai Xu.
|
2015-02-28 00:56:44 +09:00 |
|
Devaraj K
|
0d4296f0e0
|
YARN-3256. TestClientToAMTokens#testClientTokenRace is not running against
all Schedulers even when using ParameterizedSchedulerTestBase. Contributed
by Anubhav Dhoot.
|
2015-02-26 15:45:41 +05:30 |
|
Tsuyoshi Ozawa
|
6cbd9f1113
|
YARN-3247. TestQueueMappings should use CapacityScheduler explicitly. Contributed by Zhihai Xu.
|
2015-02-25 10:38:11 +09:00 |
|
Xuan
|
fe7a302473
|
YARN-2797. TestWorkPreservingRMRestart should use
ParametrizedSchedulerTestBase. Contributed by Karthik Kambatla
|
2015-02-21 19:17:29 -08:00 |
|
Jason Lowe
|
a64dd3d24b
|
YARN-3194. RM should handle NMContainerStatuses sent by NM while registering if NM is Reconnected node. Contributed by Rohith
|
2015-02-20 15:10:10 +00:00 |
|
Jian He
|
c0d9b93953
|
YARN-933. Fixed InvalidStateTransitonException at FINAL_SAVING state in RMApp. Contributed by Rohith Sharmaks
|
2015-02-19 15:42:39 -08:00 |
|
Wangda Tan
|
d49ae725d5
|
YARN-3076. Add API/Implementation to YarnClient to retrieve label-to-node mapping (Varun Saxena via wangda)
|
2015-02-19 11:00:57 -08:00 |
|
Jian He
|
1c03376300
|
YARN-1514. Utility to benchmark ZKRMStateStore#loadState for RM HA. Contributed by Tsuyoshi OZAWA
|
2015-02-18 16:06:55 -08:00 |
|
Jian He
|
f5da5566d9
|
YARN-3132. RMNodeLabelsManager should remove node from node-to-label mapping when node becomes deactivated. Contributed by Wangda Tan
|
2015-02-18 11:51:51 -08:00 |
|
Jian He
|
18297e0972
|
YARN-3104. Fixed RM to not generate new AMRM tokens on every heartbeat between rolling and activation. Contributed by Jason Lowe
|
2015-02-12 16:02:45 -08:00 |
|
Jian He
|
18a594257e
|
YARN-3124. Fixed CS LeafQueue/ParentQueue to use QueueCapacities to track capacities-by-label. Contributed by Wangda Tan
|
2015-02-12 14:58:09 -08:00 |
|
Xuan
|
65c69e296e
|
YARN-3151. On Failover tracking url wrong in application cli for KILLED
application. Contributed by Rohith
|
2015-02-11 21:19:48 -08:00 |
|
Zhijie Shen
|
d5855c0e46
|
YARN-2246. Made the proxy tracking URL always be http(s)://proxy addr:port/proxy/<appId> to avoid duplicate sections. Contributed by Devaraj K.
|
2015-02-10 15:24:01 -08:00 |
|
Zhijie Shen
|
23bf6c7207
|
YARN-3100. Made YARN authorization pluggable. Contributed by Jian He.
|
2015-02-09 20:34:56 -08:00 |
|
Jian He
|
0af6a99a3f
|
YARN-3094. Reset timer for liveness monitors after RM recovery. Contributed by Jun Gong
|
2015-02-09 13:47:08 -08:00 |
|
Karthik Kambatla
|
7e42088abf
|
YARN-2990. FairScheduler's delay-scheduling always waits for node-local and rack-local delays, even for off-rack-only requests. (kasha)
|
2015-02-08 22:48:10 -08:00 |
|
Jason Lowe
|
da2fb2bc46
|
YARN-3143. RM Apps REST API can return NPE or entries missing id and other fields. Contributed by Jason Lowe
|
2015-02-06 21:47:32 +00:00 |
|
Jian He
|
c1957fef29
|
YARN-2694. Ensure only single node label specified in ResourceRequest. Contributed by Wangda Tan
|
2015-02-06 11:34:20 -08:00 |
|
Jason Lowe
|
69c8a7f45b
|
YARN-1582. Capacity Scheduler: add a maximum-allocation-mb setting per queue. Contributed by Thomas Graves
|
2015-02-05 19:28:49 +00:00 |
|
Sandy Ryza
|
b6466deac6
|
YARN-3101. In Fair Scheduler, fix canceling of reservations for exceeding max share (Anubhav Dhoot via Sandy Ryza)
|
2015-02-05 09:39:28 -08:00 |
|
Zhijie Shen
|
e5b56e2af6
|
YARN-2543. Made resource usage be published to the timeline server too. Contributed by Naganarasimha G R.
|
2015-02-03 17:34:22 -08:00 |
|
Wangda Tan
|
5bd984691b
|
YARN-3075. NodeLabelsManager implementation to retrieve label to node mapping (Varun Saxena via wangda)
|
2015-02-03 12:52:42 -08:00 |
|
Jian He
|
21d80b3dd9
|
YARN-3098. Created common QueueCapacities class in Capacity Scheduler to track capacities-by-labels of queues. Contributed by Wangda Tan
|
2015-02-03 11:43:12 -08:00 |
|
Jason Lowe
|
a761bf8726
|
YARN-3085. Application summary should include the application type. Contributed by Rohith
|
2015-02-03 14:56:34 +00:00 |
|
Jian He
|
054a947989
|
YARN-3077. Fixed RM to create zk root path recursively. Contributed by Chun Chen
|
2015-01-30 17:34:49 -08:00 |
|
Jian He
|
86358221fc
|
YARN-3099. Capacity Scheduler LeafQueue/ParentQueue should use ResourceUsage to track used-resources-by-label. Contributed by Wangda Tan
|
2015-01-30 15:15:20 -08:00 |
|
Wangda Tan
|
7882bc0f14
|
YARN-3079. Scheduler should also update maximumAllocation when updateNodeResource. (Zhihai Xu via wangda)
|
2015-01-28 21:54:38 -08:00 |
|
Wangda Tan
|
18741adf97
|
YARN-2932. Add entry for preemptable status (enabled/disabled) to scheduler web UI and queue initialize/refresh logging. (Eric Payne via wangda)
|
2015-01-27 15:36:09 -08:00 |
|
Jian He
|
6f9fe76918
|
YARN-3092. Created a common ResourceUsage class to track labeled resource usages in Capacity Scheduler. Contributed by Wangda Tan
|
2015-01-26 15:38:00 -08:00 |
|
Tsuyoshi Ozawa
|
24aa462673
|
YARN-2800. Remove MemoryNodeLabelsStore and add a way to enable/disable node labels feature. Contributed by Wangda Tan.
|
2015-01-23 20:37:05 +09:00 |
|
Wangda Tan
|
0a2d3e717d
|
YARN-2933. Capacity Scheduler preemption policy should only consider capacity without labels temporarily. Contributed by Mayank Bansal
|
2015-01-19 16:48:50 -08:00 |
|
Junping Du
|
5d1cca34fa
|
YARN-3064. TestRMRestart/TestContainerResourceUsage/TestNodeManagerResync failure with allocation timeout. (Contributed by Jian He)
|
2015-01-16 00:08:36 -08:00 |
|
Jian He
|
c53420f583
|
YARN-2637. Fixed max-am-resource-percent calculation in CapacityScheduler when activating applications. Contributed by Craig Welch
|
2015-01-13 17:32:07 -08:00 |
|
Robert Kanter
|
ae7bf31fe1
|
YARN-3027. Scheduler should use totalAvailable resource from node instead of availableResource for maxAllocation. (adhoot via rkanter)
|
2015-01-12 10:47:52 -08:00 |
|
Zhijie Shen
|
60103fca04
|
YARN-2427. Added the API of moving apps between queues in RM web services. Contributed by Varun Vasudev.
|
2015-01-06 14:37:44 -08:00 |
|
Karthik Kambatla
|
0c4b112677
|
YARN-2881. [YARN-2574] Implement PlanFollower for FairScheduler. (Anubhav Dhoot via kasha)
|
2015-01-06 04:42:10 +05:30 |
|
Zhijie Shen
|
562a701945
|
YARN-2958. Made RMStateStore not update the last sequence number when updating the delegation token. Contributed by Varun Saxena.
|
2015-01-05 13:33:07 -08:00 |
|
Tsuyoshi Ozawa
|
ddc5be48fc
|
YARN-2922. ConcurrentModificationException in CapacityScheduler's LeafQueue. Contributed by Rohith Sharmaks.
|
2015-01-05 00:08:31 +09:00 |
|
Karthik Kambatla
|
e7257acd8a
|
YARN-2998. Abstract out scheduler independent PlanFollower components. (Anubhav Dhoot via kasha)
|
2014-12-30 19:55:24 -08:00 |
|
Jian He
|
e2351c7ae2
|
YARN-2987. Fixed ClientRMService#getQueueInfo to check against queue and app ACLs. Contributed by Varun Saxena
|
2014-12-30 17:15:37 -08:00 |
|
Jian He
|
b7442bf92e
|
YARN-2493. Added node-labels page on RM web UI. Contributed by Wangda Tan
|
2014-12-30 16:49:01 -08:00 |
|
Jian He
|
746ad6e989
|
Revert "YARN-2492(wrong jira number). Added node-labels page on RM web UI. Contributed by Wangda Tan"
This reverts commit 5f57b904f5 .
|
2014-12-30 16:48:49 -08:00 |
|
Jian He
|
5f57b904f5
|
YARN-2492. Added node-labels page on RM web UI. Contributed by Wangda Tan
|
2014-12-30 15:38:28 -08:00 |
|
Jian He
|
4f18018b7a
|
YARN-2946. Fixed potential deadlock in RMStateStore. Contributed by Rohith Sharmaks
|
2014-12-23 22:14:29 -08:00 |
|
Jian He
|
149512a837
|
YARN-2837. Support TimeLine server to recover delegation token when restarting. Contributed by Zhijie Shen
|
2014-12-23 18:25:37 -08:00 |
|
Jian He
|
0d89859b51
|
YARN-2340. Fixed NPE when queue is stopped during RM restart. Contributed by Rohith Sharmaks
|
2014-12-22 21:53:22 -08:00 |
|
Jian He
|
fdf042dfff
|
YARN-2920. Changed CapacityScheduler to kill containers on nodes where node labels are changed. Contributed by Wangda Tan
|
2014-12-22 16:51:15 -08:00 |
|
Karthik Kambatla
|
24ee9e3431
|
YARN-2975. FSLeafQueue app lists are accessed without required locks. (kasha)
|
2014-12-20 12:17:50 -08:00 |
|
Jian He
|
808cba3821
|
YARN-2952. Fixed incorrect version check in StateStore. Contributed by Rohith Sharmaks
|
2014-12-19 16:56:30 -08:00 |
|
Karthik Kambatla
|
a22ffc3188
|
YARN-2738. [YARN-2574] Add FairReservationSystem for FairScheduler. (Anubhav Dhoot via kasha)
|
2014-12-19 15:37:12 -08:00 |
|
Jason Lowe
|
0402bada19
|
YARN-2964. RM prematurely cancels tokens for jobs that submit jobs (oozie). Contributed by Jian He
|
2014-12-18 23:28:18 +00:00 |
|
Steve Loughran
|
3681de2039
|
YARN-2912 Jersey Tests failing with port in use. (varun saxena via stevel)
|
2014-12-12 17:11:07 +00:00 |
|
Jian He
|
2ed90a57fd
|
YARN-2930. Fixed TestRMRestart#testRMRestartRecoveringNodeLabelManager intermittent failure. Contributed by Wangda Tan
|
2014-12-09 16:48:04 -08:00 |
|
Karthik Kambatla
|
a2e07a5456
|
YARN-2910. FSLeafQueue can throw ConcurrentModificationException. (Wilfred Spiegelenburg via kasha)
|
2014-12-09 14:00:31 -08:00 |
|
Jian He
|
e69af836f3
|
YARN-2869. CapacityScheduler should trim sub queue names when parse configuration. Contributed by Wangda Tan
|
2014-12-05 17:33:39 -08:00 |
|
Jason Lowe
|
4b13082199
|
YARN-2056. Disable preemption at Queue level. Contributed by Eric Payne
|
2014-12-05 21:06:48 +00:00 |
|
Jian He
|
258623ff8b
|
YARN-2301. Improved yarn container command. Contributed by Naganarasimha G R
|
2014-12-04 12:53:18 -08:00 |
|
Jian He
|
73fbb3c66b
|
YARN-2880. Added a test to make sure node labels will be recovered if RM restart is enabled. Contributed by Rohith Sharmaks
|
2014-12-03 17:14:52 -08:00 |
|
Jian He
|
392c3aaea8
|
YARN-2894. Fixed a bug regarding application view acl when RM fails over. Contributed by Rohith Sharmaks
|
2014-12-02 17:16:35 -08:00 |
|
Jian He
|
52bcefca8b
|
YARN-2136. Changed RMStateStore to ignore store opearations when fenced. Contributed by Varun Saxena
|
2014-12-02 10:54:48 -08:00 |
|
Jian He
|
a7fba0bc28
|
YARN-2765. Added leveldb-based implementation for RMStateStore. Contributed by Jason Lowe
|
2014-12-01 16:38:25 -08:00 |
|
Junping Du
|
c732ed760e
|
YARN-2907. SchedulerNode#toString should print all resource detail instead of only memory. (Contributed by Rohith)
|
2014-12-01 05:38:22 -08:00 |
|
Jian He
|
5805a81efb
|
YARN-2404. Removed ApplicationAttemptState and ApplicationState class in RMStateStore. Contributed by Tsuyoshi OZAWA
|
2014-11-25 12:48:22 -08:00 |
|
Sandy Ryza
|
a128cca305
|
YARN-2669. FairScheduler: queue names shouldn't allow periods (Wei Yan via Sandy Ryza)
|
2014-11-21 16:06:41 -08:00 |
|
Karthik Kambatla
|
3114d4731d
|
YARN-2604. Scheduler should consider max-allocation-* in conjunction with the largest node. (Robert Kanter via kasha)
|
2014-11-21 10:32:28 -08:00 |
|
Karthik Kambatla
|
a9a0cc3679
|
YARN-2315. FairScheduler: Set current capacity in addition to capacity. (Zhihai Xu via kasha)
|
2014-11-19 20:15:40 -08:00 |
|
Karthik Kambatla
|
c90fb84aaa
|
YARN-2802. ClusterMetrics to include AM launch and register delays. (Zhihai Xu via kasha)
|
2014-11-19 19:50:12 -08:00 |
|
Jian He
|
9cb8b75ba5
|
YARN-2865. Fixed RM to always create a new RMContext when transtions from StandBy to Active. Contributed by Rohith Sharmaks
|
2014-11-19 19:48:52 -08:00 |
|
Karthik Kambatla
|
2fce6d6141
|
YARN-2690. [YARN-2574] Make ReservationSystem and its dependent classes independent of Scheduler type. (Anubhav Dhoot via kasha)
|
2014-11-17 16:45:57 -08:00 |
|
Jason Lowe
|
81c9d17af8
|
YARN-2414. RM web UI: app page will crash if app is failed before any attempt has been created. Contributed by Wangda Tan
|
2014-11-17 21:15:48 +00:00 |
|
Sandy Ryza
|
1a47f890ba
|
YARN-2811. In Fair Scheduler, reservation fulfillments shouldn't ignore max share (Siqi Li via Sandy Ryza)
|
2014-11-14 15:18:56 -08:00 |
|
Jian He
|
d005404ef7
|
YARN-2856. Fixed RMAppImpl to handle ATTEMPT_KILLED event at ACCEPTED state on app recovery. Contributed by Rohith Sharmaks
|
2014-11-13 15:34:26 -08:00 |
|
Vinod Kumar Vavilapalli
|
3651fe1b08
|
YARN-2853. Fixed a bug in ResourceManager causing apps to hang when the user kill request races with ApplicationMaster finish. Contributed by Jian He.
|
2014-11-13 08:13:03 -08:00 |
|
Jason Lowe
|
f8aefa5e9c
|
YARN-2780. Log aggregated resource allocation in rm-appsummary.log. Contributed by Eric Payne
|
2014-11-12 17:01:15 +00:00 |
|
Vinod Kumar Vavilapalli
|
e76faebc95
|
YARN-2834. Fixed ResourceManager to ignore token-renewal failures on recovery consistent with the (somewhat incorrect) behaviour in the non-recovery case. Contributed by Jian He.
|
2014-11-09 18:56:58 -08:00 |
|
Arun C. Murthy
|
43cd07b408
|
YARN-2830. Add backwords compatible ContainerId.newInstance constructor. Contributed by Jonathan Eagles.
|
2014-11-09 14:57:37 -08:00 |
|
Zhijie Shen
|
9a4e0d343e
|
YARN-2505. Supported get/add/remove/change labels in RM REST API. Contributed by Craig Welch.
|
2014-11-07 20:35:46 -08:00 |
|
Vinod Kumar Vavilapalli
|
4cfd5bc7c1
|
YARN-2753. Fixed a bunch of bugs in the NodeLabelsManager classes. Contributed by Zhihai xu.
|
2014-11-07 14:15:53 -08:00 |
|
Vinod Kumar Vavilapalli
|
2ac1be7dec
|
YARN-2824. Fixed Capacity Scheduler to not crash when some node-labels are not mapped to queues by making default capacities per label to be zero. Contributed by Wangda Tan.
|
2014-11-07 10:39:37 -08:00 |
|
Xuan
|
1e97f2f094
|
YARN-2810. TestRMProxyUsersConf fails on Windows VMs. Contributed by Varun Vasudev
|
2014-11-07 09:44:43 -08:00 |
|
Vinod Kumar Vavilapalli
|
a5657182a7
|
YARN-2823. Fixed ResourceManager app-attempt state machine to inform schedulers about previous finished attempts of a running appliation to avoid expectation mismatch w.r.t transferred containers. Contributed by Jian He.
|
2014-11-07 09:28:36 -08:00 |
|
Vinod Kumar Vavilapalli
|
a3839a9fbf
|
YARN-2744. Fixed CapacityScheduler to validate node-labels correctly against queues. Contributed by Wangda Tan.
|
2014-11-06 17:28:12 -08:00 |
|
Jian He
|
395275af86
|
YARN-2579. Fixed a deadlock issue when EmbeddedElectorService and FatalEventDispatcher try to transition RM to StandBy at the same time. Contributed by Rohith Sharmaks
|
2014-11-05 16:59:54 -08:00 |
|
Zhijie Shen
|
b4c951ab83
|
YARN-2767. Added a test case to verify that http static user cannot kill or submit apps in the secure mode. Contributed by Varun Vasudev.
|
2014-11-05 10:57:38 -08:00 |
|
Karthik Kambatla
|
b2cd269802
|
YARN-2010. Handle app-recovery failures gracefully. (Jian He and Karthik Kambatla via kasha)
|
2014-11-04 17:45:24 -08:00 |
|
Vinod Kumar Vavilapalli
|
ec6cbece8e
|
YARN-2795. Fixed ResourceManager to not crash loading node-label data from HDFS in secure mode. Contributed by Wangda Tan.
|
2014-11-03 13:44:06 -08:00 |
|
Zhijie Shen
|
27715ec63b
|
YARN-2785. Fixed intermittent TestContainerResourceUsage failure. Contributed by Varun Vasudev.
|
2014-11-02 15:20:40 -08:00 |
|
Vinod Kumar Vavilapalli
|
e0233c16eb
|
YARN-2698. Moved some node label APIs to be correctly placed in client protocol. Contributed by Wangda Tan.
|
2014-10-30 22:59:31 -07:00 |
|
Karthik Kambatla
|
179cab81e0
|
YARN-2712. TestWorkPreservingRMRestart: Augment FS tests with queue and headroom checks. (Tsuyoshi Ozawa via kasha)
|
2014-10-30 00:29:07 -07:00 |
|
Karthik Kambatla
|
782971ae7a
|
YARN-2742. FairSchedulerConfiguration should allow extra spaces between value and unit. (Wei Yan via kasha)
|
2014-10-29 10:24:57 -07:00 |
|
Jian He
|
d5e0a09721
|
YARN-2503. Added node lablels in web UI. Contributed by Wangda Tan
|
2014-10-28 17:57:54 -07:00 |
|
Vinod Kumar Vavilapalli
|
a16d022ca4
|
YARN-2704. Changed ResourceManager to optionally obtain tokens itself for the sake of localization and log-aggregation for long-running services. Contributed by Jian He.
|
2014-10-27 15:49:47 -07:00 |
|
Vinod Kumar Vavilapalli
|
0186645505
|
YARN-2743. Fixed a bug in ResourceManager that was causing RMDelegationToken identifiers to be tampered and thus causing app submission failures in secure mode. Contributed by Jian He.
|
2014-10-26 11:14:34 -07:00 |
|
Jian He
|
5864dd99a4
|
YARN-1915. Fixed a race condition that client could use the ClientToAMToken to contact with AM before AM actually receives the ClientToAMTokenMasterKey. Contributed by Jason Lowe
|
2014-10-24 22:47:56 -07:00 |
|
Zhijie Shen
|
0f3b6900be
|
YARN-2209. Replaced AM resync/shutdown command with corresponding exceptions and made related MR changes. Contributed by Jian He.
|
2014-10-23 21:56:03 -07:00 |
|
Vinod Kumar Vavilapalli
|
c0e034336c
|
YARN-2715. Fixed ResourceManager to respect common configurations for proxy users/groups beyond just the YARN level config. Contributed by Zhijie Shen.
|
2014-10-21 20:09:40 -07:00 |
|
Vinod Kumar Vavilapalli
|
39063cd36f
|
YARN-2676. Enhanced Timeline auth-filter to support proxy users. Contributed by Zhijie Shen.
|
2014-10-17 22:02:50 -07:00 |
|
Vinod Kumar Vavilapalli
|
e9c66e8fd2
|
YARN-2705. Fixed bugs in ResourceManager node-label manager that were causing test-failures: added a dummy in-memory labels-manager. Contributed by Wangda Tan.
|
2014-10-17 18:26:12 -07:00 |
|
Jian He
|
c3de2412eb
|
YARN-1879. Marked Idempotent/AtMostOnce annotations to ApplicationMasterProtocol for RM fail over. Contributed by Tsuyoshi OZAWA
|
2014-10-17 16:35:27 -07:00 |
|
Jian He
|
a6aa6e42ca
|
YARN-2588. Standby RM fails to transitionToActive if previous transitionToActive failed with ZK exception. Contributed by Rohith Sharmaks
|
2014-10-17 10:54:24 -07:00 |
|
Vinod Kumar Vavilapalli
|
abae63caf9
|
YARN-2699. Fixed a bug in CommonNodeLabelsManager that caused tests to fail when using ephemeral ports on NodeIDs. Contributed by Wangda Tan.
|
2014-10-17 08:58:08 -07:00 |
|
Vinod Kumar Vavilapalli
|
b3056c266a
|
YARN-2685. Fixed a bug in CommonNodeLabelsManager that caused wrong resource tracking per label when a host runs multiple node-managers. Contributed by Wangda Tan.
|
2014-10-15 18:47:26 -07:00 |
|
Vinod Kumar Vavilapalli
|
f2ea555ac6
|
YARN-2496. Enhanced Capacity Scheduler to have basic support for allocating resources based on node-labels. Contributed by Wangda Tan.
YARN-2500. Ehnaced ResourceManager to support schedulers allocating resources based on node-labels. Contributed by Wangda Tan.
|
2014-10-15 18:33:06 -07:00 |
|
Jian He
|
0af1a2b5bc
|
YARN-2312. Deprecated old ContainerId#getId API and updated MapReduce to use ContainerId#getContainerId instead. Contributed by Tsuyoshi OZAWA
|
2014-10-15 15:22:07 -07:00 |
|
Zhijie Shen
|
1220bb72d4
|
YARN-2656. Made RM web services authentication filter support proxy user. Contributed by Varun Vasudev and Zhijie Shen.
|
2014-10-14 21:50:46 -07:00 |
|
Zhijie Shen
|
cdce88376a
|
HADOOP-11181. Generalized o.a.h.s.t.d.DelegationTokenManager to handle all sub-classes of AbstractDelegationTokenIdentifier. Contributed by Zhijie Shen.
|
2014-10-14 11:35:38 -07:00 |
|
Karthik Kambatla
|
da709a2eac
|
YARN-2641. Decommission nodes on -refreshNodes instead of next NM-RM heartbeat. (Zhihai Xu via kasha)
|
2014-10-13 16:23:04 -07:00 |
|
Jian He
|
f9680d9a16
|
YARN-2308. Changed CapacityScheduler to explicitly throw exception if the queue
to which the apps were submitted is changed across RM restart. Contributed by Craig Welch & Chang Li
|
2014-10-13 14:09:04 -07:00 |
|
Zhijie Shen
|
4aed2d8e91
|
YARN-2651. Spun off LogRollingInterval from LogAggregationContext. Contributed by Xuan Gong.
|
2014-10-13 10:54:09 -07:00 |
|
Vinod Kumar Vavilapalli
|
db7f165319
|
YARN-2494. Added NodeLabels Manager internal API and implementation. Contributed by Wangda Tan.
|
2014-10-10 11:44:21 -07:00 |
|
Jian He
|
e16e25ab1b
|
YARN-2649. Fixed TestAMRMRPCNodeUpdates test failure. Contributed by Ming Ma
|
2014-10-08 10:58:51 -07:00 |
|
Jian He
|
30d56fdbb4
|
YARN-1857. CapacityScheduler headroom doesn't account for other AM's running. Contributed by Chen He and Craig Welch
|
2014-10-07 13:45:04 -07:00 |
|
Jian He
|
519e5a7dd2
|
YARN-2644. Fixed CapacityScheduler to return up-to-date headroom when AM allocates. Contributed by Craig Welch
|
2014-10-06 15:48:46 -07:00 |
|
Jian He
|
ea26cc0b4a
|
YARN-2615. Changed ClientToAMTokenIdentifier/RM(Timeline)DelegationTokenIdentifier to use protobuf as payload. Contributed by Junping Du
|
2014-10-06 10:47:43 -07:00 |
|
subru
|
a2986234be
|
YARN-2611. Fixing jenkins findbugs warning and TestRMWebServicesCapacitySched for branch YARN-1051. Contributed by Subru Krishnan and Carlo Curino.
(cherry picked from commit c47464aba407d1dafe10be23fe454f0489cc4367)
|
2014-10-03 15:43:23 -07:00 |
|
subru
|
5e10a13bb4
|
YARN-2576. Making test patch pass in branch. Contributed by Subru Krishnan and Carlo Curino.
(cherry picked from commit 90ac0be86b898aefec5471db4027554c8e1b310c)
|
2014-10-03 15:43:13 -07:00 |
|
subru
|
6261f7cc69
|
YARN-2080. Integrating reservation system with ResourceManager and client-RM protocol. Contributed by Subru Krishnan and Carlo Curino.
(cherry picked from commit 8baeaead8532898163f1006276b731a237b1a559)
Conflicts:
hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/main/java/org/apache/hadoop/yarn/server/resourcemanager/rmapp/RMAppImpl.java
|
2014-10-03 15:42:43 -07:00 |
|
subru
|
3418c56bcf
|
YARN-1712. Plan follower that synchronizes the current state of reservation subsystem with the scheduler. Contributed by Subru Krishnan and Carlo Curino.
(cherry picked from commit 169085319b8b76641f8b9f6840a3fef06d221e2b)
|
2014-10-03 15:42:10 -07:00 |
|
carlo curino
|
b6df0dddcd
|
YARN-1711. Policy to enforce instantaneous and over-time quotas on user reservation. Contributed by Carlo Curino and Subru Krishnan.
(cherry picked from commit c4918cb4cb5a267a8cfd6eace28fcfe7ad6174e8)
|
2014-10-03 15:42:03 -07:00 |
|
carlo curino
|
f66ffcf832
|
YARN-1710. Logic to find allocations within a Plan that satisfy user ReservationRequest(s). Contributed by Carlo Curino and Subru Krishnan.
(cherry picked from commit aef7928899b37262773f3dc117157bb746bf8918)
|
2014-10-03 15:41:57 -07:00 |
|
subru
|
cf4b34282a
|
YARN-1709. In-memory data structures used to track resources over time to enable reservations.
(cherry picked from commit 0d8b2cd88b958b1e602fd4ea4078ef8d4742a7c3)
|
2014-10-03 15:41:51 -07:00 |
|
carlo curino
|
1c6950354f
|
YARN-2475. Logic for responding to capacity drops for the ReservationSystem. Contributed by Carlo Curino and Subru Krishnan.
(cherry picked from commit f83a07f266f2c5e6eead554d8a331ed7e75e10d5)
|
2014-10-03 15:41:21 -07:00 |
|
carlo curino
|
eb3e40b833
|
YARN-1707. Introduce APIs to add/remove/resize queues in the CapacityScheduler. Contributed by Carlo Curino and Subru Krishnan
(cherry picked from commit aac47fda7fecda9fc18ade34d633eca895865a70)
Conflicts:
hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/main/java/org/apache/hadoop/yarn/server/resourcemanager/scheduler/capacity/CapacityScheduler.java
|
2014-10-03 15:41:02 -07:00 |
|
Karthik Kambatla
|
80d11eb68e
|
YARN-2635. TestRM, TestRMRestart, TestClientToAMTokens should run with both CS and FS. (Wei Yan and kasha via kasha)
|
2014-10-03 11:49:49 -07:00 |
|
Jian He
|
054f285526
|
YARN-2628. Capacity scheduler with DominantResourceCalculator carries out reservation even though slots are free. Contributed by Varun Vasudev
|
2014-10-02 15:13:33 -07:00 |
|
Karthik Kambatla
|
5e0b49da9c
|
YARN-2254. TestRMWebServicesAppsModification should run against both CS and FS. (Zhihai Xu via kasha)
|
2014-10-02 10:15:04 -07:00 |
|
Zhijie Shen
|
52bbe0f11b
|
YARN-2630. Prevented previous AM container status from being acquired by the current restarted AM. Contributed by Jian He.
|
2014-10-01 15:38:11 -07:00 |
|
Jian He
|
bbff96be48
|
YARN-2602. Fixed possible NPE in ApplicationHistoryManagerOnTimelineStore. Contributed by Zhijie Shen
|
2014-09-30 16:44:17 -07:00 |
|
Jason Lowe
|
9c22065109
|
YARN-1769. CapacityScheduler: Improve reservations. Contributed by Thomas Graves
|
2014-09-29 14:12:18 +00:00 |
|
Jian He
|
5391919b09
|
YARN-668. Changed NMTokenIdentifier/AMRMTokenIdentifier/ContainerTokenIdentifier to use protobuf object as the payload. Contributed by Junping Du.
|
2014-09-26 17:48:41 -07:00 |
|
Jason Lowe
|
8269bfa613
|
YARN-2523. ResourceManager UI showing negative value for "Decommissioned Nodes" field. Contributed by Rohith
|
2014-09-25 22:37:05 +00:00 |
|
Zhijie Shen
|
72b0881ca6
|
YARN-2546. Made REST API for application creation/submission use numeric and boolean types instead of the string of them. Contributed by Varun Vasudev.
|
2014-09-24 17:57:32 -07:00 |
|
Zhijie Shen
|
c86674a3a4
|
YARN-2581. Passed LogAggregationContext to NM via ContainerTokenIdentifier. Contributed by Xuan Gong.
|
2014-09-24 17:50:26 -07:00 |
|
Karthik Kambatla
|
f5578207d2
|
YARN-2252. Intermittent failure of TestFairScheduler.testContinuousScheduling. (Ratandeep Ratti and kasha via kasha)
|
2014-09-23 00:03:16 -07:00 |
|
Karthik Kambatla
|
568d3dc2bb
|
YARN-1959. Fix headroom calculation in FairScheduler. (Anubhav Dhoot via kasha)
|
2014-09-22 23:49:39 -07:00 |
|
Karthik Kambatla
|
43efdd30b5
|
YARN-2539. FairScheduler: Set the default value for maxAMShare to 0.5. (Wei Yan via kasha)
|
2014-09-22 16:09:52 -07:00 |
|
Jian He
|
0a641496c7
|
YARN-1372. Ensure all completed containers are reported to the AMs across RM restart. Contributed by Anubhav Dhoot
|
2014-09-22 10:30:53 -07:00 |
|
Karthik Kambatla
|
9721e2c1fe
|
YARN-2453. TestProportionalCapacityPreemptionPolicy fails with FairScheduler. (Zhihai Xu via kasha)
|
2014-09-21 23:13:45 -07:00 |
|
Karthik Kambatla
|
c50fc92502
|
YARN-2452. TestRMApplicationHistoryWriter fails with FairScheduler. (Zhihai Xu via kasha)
|
2014-09-21 13:15:04 -07:00 |
|
Jian He
|
444acf8ea7
|
YARN-2565. Fixed RM to not use FileSystemApplicationHistoryStore unless explicitly set. Contributed by Zhijie Shen
|
2014-09-19 11:26:29 -07:00 |
|
Zhijie Shen
|
6fe5c6b746
|
YARN-2568. Fixed the potential test failures due to race conditions when RM work-preserving recovery is enabled. Contributed by Jian He.
|
2014-09-18 21:56:56 -07:00 |
|
Jason Lowe
|
a337f0e354
|
YARN-2561. MR job client cannot reconnect to AM after NM restart. Contributed by Junping Du
|
2014-09-18 21:34:40 +00:00 |
|
Jason Lowe
|
9ea7b6c063
|
YARN-2363. Submitted applications occasionally lack a tracking URL. Contributed by Jason Lowe
|
2014-09-18 20:13:16 +00:00 |
|
Vinod Kumar Vavilapalli
|
485c96e3cb
|
YARN-2001. Added a time threshold for RM to wait before starting container allocations after restart/failover. Contributed by Jian He.
|
2014-09-18 11:03:12 -07:00 |
|
Jian He
|
ee21b13cbd
|
YARN-2559. Fixed NPE in SystemMetricsPublisher when retrieving FinalApplicationStatus. Contributed by Zhijie Shen
|
2014-09-17 21:44:15 -07:00 |
|
junping_du
|
90a0c03f0a
|
YARN-1250. Generic history service should support application-acls. (Contributed by Zhijie Shen)
|
2014-09-16 18:20:49 -07:00 |
|
Vinod Kumar Vavilapalli
|
14e2639fd0
|
YARN-611. Added an API to let apps specify an interval beyond which AM failures should be ignored towards counting max-attempts. Contributed by Xuan Gong.
|
2014-09-13 18:04:05 -07:00 |
|
XuanGong
|
e65ae575a0
|
YARN-2456. Possible livelock in CapacityScheduler when RM is recovering
apps. Contributed by Jian He
|
2014-09-12 15:21:46 -07:00 |
|
Jian He
|
3122daa802
|
YARN-2229. Changed the integer field of ContainerId to be long type. Contributed by Tsuyoshi OZAWA
|
2014-09-12 10:33:33 -07:00 |
|
junping_du
|
6b8b1608e6
|
YARN-2033. Merging generic-history into the Timeline Store (Contributed by Zhijie Shen)
|
2014-09-12 10:04:51 +08:00 |
|
Karthik Kambatla
|
c11ada5ea6
|
YARN-2534. FairScheduler: Potential integer overflow calculating totalMaxShare. (Zhihai Xu via kasha)
|
2014-09-11 12:06:06 -07:00 |
|
Jian He
|
83be3ad444
|
YARN-415. Capture aggregate memory allocation at the app-level for chargeback. Contributed by Eric Payne & Andrey Klochkov
|
2014-09-10 18:20:54 -07:00 |
|
Jian He
|
cbfe26370b
|
YARN-2158. Fixed TestRMWebServicesAppsModification#testSingleAppKill test failure. Contributed by Varun Vasudev
|
2014-09-10 12:47:34 -07:00 |
|
XUAN
|
47bdfa044a
|
YARN-2459. RM crashes if App gets rejected for any reason and HA is enabled. Contributed by Jian He
|
2014-09-10 11:44:41 -07:00 |
|
Vinod Kumar Vavilapalli
|
b67d5ba784
|
YARN-2448. Changed ApplicationMasterProtocol to expose RM-recognized resource types to the AMs. Contributed by Varun Vasudev.
|
2014-09-10 10:15:47 -07:00 |
|
Karthik Kambatla
|
3072c83b38
|
YARN-1458. FairScheduler: Zero weight can lead to livelock. (Zhihai Xu via kasha)
|
2014-09-10 08:26:14 -07:00 |
|
Karthik Kambatla
|
1dcaba9a7a
|
YARN-2394. FairScheduler: Configure fairSharePreemptionThreshold per queue. (Wei Yan via kasha)
|
2014-09-03 10:27:36 -07:00 |
|
Karthik Kambatla
|
0f34e6f387
|
YARN-2395. FairScheduler: Preemption timeout should be configurable per queue. (Wei Yan via kasha)
|
2014-08-30 01:17:13 -07:00 |
|
Jian He
|
5c14bc426b
|
YARN-1506. Changed RMNode/SchedulerNode to update resource with event notification. Contributed by Junping Du
|
2014-08-29 23:05:51 -07:00 |
|
Jian He
|
c686aa3533
|
YARN-2447. RM web service app submission doesn't pass secrets correctly. Contributed by Varun Vasudev
|
2014-08-29 11:40:47 -07:00 |
|
Karthik Kambatla
|
fa80ca49bd
|
YARN-2405. NPE in FairSchedulerAppsBlock. (Tsuyoshi Ozawa via kasha)
|
2014-08-28 23:21:37 -07:00 |
|
Karthik Kambatla
|
d16bfd1d0f
|
YARN-1326. RM should log using RMStore at startup time. (Tsuyoshi Ozawa via kasha)
|
2014-08-27 01:43:58 -07:00 |
|
Karthik Kambatla
|
0097b15e21
|
YARN-2393. FairScheduler: Add the notion of steady fair share. (Wei Yan via kasha)
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1619845 13f79535-47bb-0310-9956-ffa450edef68
|
2014-08-22 15:44:47 +00:00 |
|
Jason Darrell Lowe
|
4236c6600e
|
YARN-2434. RM should not recover containers from previously failed attempt when AM restart is not enabled. Contributed by Jian He
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1619614 13f79535-47bb-0310-9956-ffa450edef68
|
2014-08-21 22:41:34 +00:00 |
|
Zhijie Shen
|
f6a778c372
|
YARN-2249. Avoided AM release requests being lost on work preserving RM restart. Contributed by Jian He.
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1618972 13f79535-47bb-0310-9956-ffa450edef68
|
2014-08-19 20:33:49 +00:00 |
|
Jian He
|
375c221960
|
YARN-2409. RM ActiveToStandBy transition missing stoping previous rmDispatcher. Contributed by Rohith
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1618915 13f79535-47bb-0310-9956-ffa450edef68
|
2014-08-19 17:49:39 +00:00 |
|
Jian He
|
519c4be95a
|
YARN-2411. Support simple user and group mappings to queues. Contributed by Ram Venkatesh
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1618542 13f79535-47bb-0310-9956-ffa450edef68
|
2014-08-18 06:08:45 +00:00 |
|
Jian He
|
c3084d6c16
|
YARN-2389. Added functionality for schedulers to kill all applications in a queue. Contributed by Subramaniam Venkatraman Krishnan
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1618294 13f79535-47bb-0310-9956-ffa450edef68
|
2014-08-15 23:53:57 +00:00 |
|
Jian He
|
7360cec692
|
YARN-2378. Added support for moving applications across queues in CapacityScheduler. Contributed by Subramaniam Venkatraman Krishnan
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1618106 13f79535-47bb-0310-9956-ffa450edef68
|
2014-08-15 06:00:31 +00:00 |
|
Zhijie Shen
|
a9023c2736
|
YARN-2397. Avoided loading two authentication filters for RM and TS web interfaces. Contributed by Varun Vasudev.
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1618054 13f79535-47bb-0310-9956-ffa450edef68
|
2014-08-14 21:17:20 +00:00 |
|
Karthik Kambatla
|
5197f8c3c5
|
YARN-1370. Fair scheduler to re-populate container allocation state. (Anubhav Dhoot via kasha)
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1617645 13f79535-47bb-0310-9956-ffa450edef68
|
2014-08-13 01:38:59 +00:00 |
|
Karthik Kambatla
|
4239695588
|
YARN-2399. Delete old versions of files. FairScheduler: Merge AppSchedulable and FSSchedulerApp into FSAppAttempt. (kasha)
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1617619 13f79535-47bb-0310-9956-ffa450edef68
|
2014-08-12 22:51:57 +00:00 |
|
Karthik Kambatla
|
486e718fc1
|
YARN-2399. FairScheduler: Merge AppSchedulable and FSSchedulerApp into FSAppAttempt. (kasha)
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1617600 13f79535-47bb-0310-9956-ffa450edef68
|
2014-08-12 21:43:27 +00:00 |
|
Junping Du
|
c2febdcbaa
|
YARN-1337. Recover containers upon nodemanager restart. (Contributed by Jason Lowe)
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1617448 13f79535-47bb-0310-9956-ffa450edef68
|
2014-08-12 10:56:13 +00:00 |
|
Jian He
|
c4dc685343
|
YARN-2138. Cleaned up notifyDone* APIs in RMStateStore. Contributed by Varun Saxena
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1617341 13f79535-47bb-0310-9956-ffa450edef68
|
2014-08-11 18:24:24 +00:00 |
|
Xuan Gong
|
946be75704
|
YARN-2400: Addendum fix for TestAMRestart failure. Contributed by Jian He
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1617333 13f79535-47bb-0310-9956-ffa450edef68
|
2014-08-11 17:42:53 +00:00 |
|
Xuan Gong
|
743f7f30da
|
YARN-2400. Fixed TestAMRestart fails intermittently. Contributed by Jian He:
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1617028 13f79535-47bb-0310-9956-ffa450edef68
|
2014-08-09 23:31:11 +00:00 |
|
Karthik Kambatla
|
a7643f4de7
|
YARN-2026. Fair scheduler: Consider only active queues for computing fairshare. (Ashwin Shankar via kasha)
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1616915 13f79535-47bb-0310-9956-ffa450edef68
|
2014-08-09 02:10:00 +00:00 |
|
Xuan Gong
|
eeb4acd955
|
YARN-2212: ApplicationMaster needs to find a way to update the AMRMToken periodically. Contributed by Xuan Gong
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1616892 13f79535-47bb-0310-9956-ffa450edef68
|
2014-08-08 21:38:24 +00:00 |
|
Karthik Kambatla
|
14864e9c7c
|
YARN-2352. FairScheduler: Collect metrics on duration of critical methods that affect performance. (kasha)
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1616769 13f79535-47bb-0310-9956-ffa450edef68
|
2014-08-08 14:17:54 +00:00 |
|
Jian He
|
8437df8ba9
|
YARN-2008. Fixed CapacityScheduler to calculate headroom based on max available capacity instead of configured max capacity. Contributed by Craig Welch
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1616580 13f79535-47bb-0310-9956-ffa450edef68
|
2014-08-07 20:00:04 +00:00 |
|
Karthik Kambatla
|
8feddc4c84
|
YARN-2359. Application hangs when it fails to launch AM container. (Zhihai Xu via kasha)
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1616375 13f79535-47bb-0310-9956-ffa450edef68
|
2014-08-07 00:06:17 +00:00 |
|
Junping Du
|
b8f151231b
|
YARN-1354. Recover applications upon nodemanager restart. (Contributed by Jason Lowe)
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1615550 13f79535-47bb-0310-9956-ffa450edef68
|
2014-08-04 13:25:37 +00:00 |
|
Xuan Gong
|
e52f67e389
|
YARN-1994. Expose YARN/MR endpoints on multiple interfaces. Contributed by Craig Welch, Milan Potocnik,and Arpit Agarwal
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1614981 13f79535-47bb-0310-9956-ffa450edef68
|
2014-07-31 20:06:02 +00:00 |
|
Zhijie Shen
|
1d6e178144
|
YARN-2347. Consolidated RMStateVersion and NMDBSchemaVersion into Version in yarn-server-common. Contributed by Junping Du.
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1614838 13f79535-47bb-0310-9956-ffa450edef68
|
2014-07-31 09:27:43 +00:00 |
|
Karthik Kambatla
|
c0b49ff107
|
YARN-2328. FairScheduler: Verify update and continuous scheduling threads are stopped when the scheduler is stopped. (kasha)
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1614432 13f79535-47bb-0310-9956-ffa450edef68
|
2014-07-29 17:41:52 +00:00 |
|
Zhijie Shen
|
d6532d3a77
|
YARN-2247. Made RM web services authenticate users via kerberos and delegation token. Contributed by Varun Vasudev.
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1613821 13f79535-47bb-0310-9956-ffa450edef68
|
2014-07-27 17:55:06 +00:00 |
|
Jian He
|
d4fec34933
|
YARN-2211. Persist AMRMToken master key in RMStateStore for RM recovery. Contributed by Xuan Gong
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1613515 13f79535-47bb-0310-9956-ffa450edef68
|
2014-07-25 20:42:37 +00:00 |
|
Karthik Kambatla
|
1e553858f9
|
YARN-2214. FairScheduler: preemptContainerPreCheck() in FSParentQueue delays convergence towards fairness. (Ashwin Shankar via kasha)
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1613459 13f79535-47bb-0310-9956-ffa450edef68
|
2014-07-25 16:13:07 +00:00 |
|
Jason Darrell Lowe
|
28fca92521
|
YARN-2147. client lacks delegation token exception details when application submit fails. Contributed by Chen He
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1612950 13f79535-47bb-0310-9956-ffa450edef68
|
2014-07-23 21:40:57 +00:00 |
|
Sanford Ryza
|
c88402f36d
|
YARN-2313. Livelock can occur in FairScheduler when there are lots of running apps (Tsuyoshi Ozawa via Sandy Ryza)
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1612769 13f79535-47bb-0310-9956-ffa450edef68
|
2014-07-23 05:00:52 +00:00 |
|
Karthik Kambatla
|
ff77582991
|
YARN-2273. NPE in ContinuousScheduling thread when we lose a node. (Wei Yan via kasha)
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1612720 13f79535-47bb-0310-9956-ffa450edef68
|
2014-07-22 22:44:38 +00:00 |
|
Zhijie Shen
|
eac0701c96
|
YARN-2319. Made the MiniKdc instance start/close before/after the class of TestRMWebServicesDelegationTokens. Contributed by Wenwu Peng.
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1612588 13f79535-47bb-0310-9956-ffa450edef68
|
2014-07-22 15:15:29 +00:00 |
|