Sandy Ryza
|
a128cca305
|
YARN-2669. FairScheduler: queue names shouldn't allow periods (Wei Yan via Sandy Ryza)
|
2014-11-21 16:06:41 -08:00 |
|
Karthik Kambatla
|
3114d4731d
|
YARN-2604. Scheduler should consider max-allocation-* in conjunction with the largest node. (Robert Kanter via kasha)
|
2014-11-21 10:32:28 -08:00 |
|
Karthik Kambatla
|
a9a0cc3679
|
YARN-2315. FairScheduler: Set current capacity in addition to capacity. (Zhihai Xu via kasha)
|
2014-11-19 20:15:40 -08:00 |
|
Karthik Kambatla
|
c90fb84aaa
|
YARN-2802. ClusterMetrics to include AM launch and register delays. (Zhihai Xu via kasha)
|
2014-11-19 19:50:12 -08:00 |
|
Jian He
|
9cb8b75ba5
|
YARN-2865. Fixed RM to always create a new RMContext when transtions from StandBy to Active. Contributed by Rohith Sharmaks
|
2014-11-19 19:48:52 -08:00 |
|
Karthik Kambatla
|
2fce6d6141
|
YARN-2690. [YARN-2574] Make ReservationSystem and its dependent classes independent of Scheduler type. (Anubhav Dhoot via kasha)
|
2014-11-17 16:45:57 -08:00 |
|
Jason Lowe
|
81c9d17af8
|
YARN-2414. RM web UI: app page will crash if app is failed before any attempt has been created. Contributed by Wangda Tan
|
2014-11-17 21:15:48 +00:00 |
|
Jian He
|
9b860661a5
|
YARN-2432. RMStateStore should process the pending events before close. Contributed by Varun Saxena
|
2014-11-14 17:24:09 -08:00 |
|
Sandy Ryza
|
1a47f890ba
|
YARN-2811. In Fair Scheduler, reservation fulfillments shouldn't ignore max share (Siqi Li via Sandy Ryza)
|
2014-11-14 15:18:56 -08:00 |
|
Jian He
|
d005404ef7
|
YARN-2856. Fixed RMAppImpl to handle ATTEMPT_KILLED event at ACCEPTED state on app recovery. Contributed by Rohith Sharmaks
|
2014-11-13 15:34:26 -08:00 |
|
Vinod Kumar Vavilapalli
|
3651fe1b08
|
YARN-2853. Fixed a bug in ResourceManager causing apps to hang when the user kill request races with ApplicationMaster finish. Contributed by Jian He.
|
2014-11-13 08:13:03 -08:00 |
|
Jason Lowe
|
f8aefa5e9c
|
YARN-2780. Log aggregated resource allocation in rm-appsummary.log. Contributed by Eric Payne
|
2014-11-12 17:01:15 +00:00 |
|
Vinod Kumar Vavilapalli
|
e76faebc95
|
YARN-2834. Fixed ResourceManager to ignore token-renewal failures on recovery consistent with the (somewhat incorrect) behaviour in the non-recovery case. Contributed by Jian He.
|
2014-11-09 18:56:58 -08:00 |
|
Arun C. Murthy
|
43cd07b408
|
YARN-2830. Add backwords compatible ContainerId.newInstance constructor. Contributed by Jonathan Eagles.
|
2014-11-09 14:57:37 -08:00 |
|
Zhijie Shen
|
9a4e0d343e
|
YARN-2505. Supported get/add/remove/change labels in RM REST API. Contributed by Craig Welch.
|
2014-11-07 20:35:46 -08:00 |
|
Vinod Kumar Vavilapalli
|
df36edf751
|
YARN-2826. Fixed user-groups mappings' refresh bug caused by YARN-2826. Contributed by Wangda Tan.
|
2014-11-07 19:43:55 -08:00 |
|
Vinod Kumar Vavilapalli
|
4cfd5bc7c1
|
YARN-2753. Fixed a bunch of bugs in the NodeLabelsManager classes. Contributed by Zhihai xu.
|
2014-11-07 14:15:53 -08:00 |
|
Vinod Kumar Vavilapalli
|
2ac1be7dec
|
YARN-2824. Fixed Capacity Scheduler to not crash when some node-labels are not mapped to queues by making default capacities per label to be zero. Contributed by Wangda Tan.
|
2014-11-07 10:39:37 -08:00 |
|
Xuan
|
1e97f2f094
|
YARN-2810. TestRMProxyUsersConf fails on Windows VMs. Contributed by Varun Vasudev
|
2014-11-07 09:44:43 -08:00 |
|
Vinod Kumar Vavilapalli
|
a5657182a7
|
YARN-2823. Fixed ResourceManager app-attempt state machine to inform schedulers about previous finished attempts of a running appliation to avoid expectation mismatch w.r.t transferred containers. Contributed by Jian He.
|
2014-11-07 09:28:36 -08:00 |
|
Vinod Kumar Vavilapalli
|
a3839a9fbf
|
YARN-2744. Fixed CapacityScheduler to validate node-labels correctly against queues. Contributed by Wangda Tan.
|
2014-11-06 17:28:12 -08:00 |
|
Jian He
|
395275af86
|
YARN-2579. Fixed a deadlock issue when EmbeddedElectorService and FatalEventDispatcher try to transition RM to StandBy at the same time. Contributed by Rohith Sharmaks
|
2014-11-05 16:59:54 -08:00 |
|
Vinod Kumar Vavilapalli
|
834e931d8e
|
YARN-2805. Fixed ResourceManager to load HA configs correctly before kerberos login. Contributed by Wangda Tan.
|
2014-11-05 15:30:33 -08:00 |
|
Zhijie Shen
|
b4c951ab83
|
YARN-2767. Added a test case to verify that http static user cannot kill or submit apps in the secure mode. Contributed by Varun Vasudev.
|
2014-11-05 10:57:38 -08:00 |
|
Karthik Kambatla
|
b2cd269802
|
YARN-2010. Handle app-recovery failures gracefully. (Jian He and Karthik Kambatla via kasha)
|
2014-11-04 17:45:24 -08:00 |
|
Vinod Kumar Vavilapalli
|
ec6cbece8e
|
YARN-2795. Fixed ResourceManager to not crash loading node-label data from HDFS in secure mode. Contributed by Wangda Tan.
|
2014-11-03 13:44:06 -08:00 |
|
Zhijie Shen
|
27715ec63b
|
YARN-2785. Fixed intermittent TestContainerResourceUsage failure. Contributed by Varun Vasudev.
|
2014-11-02 15:20:40 -08:00 |
|
Vinod Kumar Vavilapalli
|
d1828d9443
|
YARN-2779. Fixed ResourceManager to not require delegation tokens for communicating with Timeline Service. Contributed by Zhijie Shen.
|
2014-10-30 23:16:19 -07:00 |
|
Vinod Kumar Vavilapalli
|
e0233c16eb
|
YARN-2698. Moved some node label APIs to be correctly placed in client protocol. Contributed by Wangda Tan.
|
2014-10-30 22:59:31 -07:00 |
|
Karthik Kambatla
|
179cab81e0
|
YARN-2712. TestWorkPreservingRMRestart: Augment FS tests with queue and headroom checks. (Tsuyoshi Ozawa via kasha)
|
2014-10-30 00:29:07 -07:00 |
|
Karthik Kambatla
|
782971ae7a
|
YARN-2742. FairSchedulerConfiguration should allow extra spaces between value and unit. (Wei Yan via kasha)
|
2014-10-29 10:24:57 -07:00 |
|
Jian He
|
d5e0a09721
|
YARN-2503. Added node lablels in web UI. Contributed by Wangda Tan
|
2014-10-28 17:57:54 -07:00 |
|
Vinod Kumar Vavilapalli
|
a16d022ca4
|
YARN-2704. Changed ResourceManager to optionally obtain tokens itself for the sake of localization and log-aggregation for long-running services. Contributed by Jian He.
|
2014-10-27 15:49:47 -07:00 |
|
Xuan
|
ce1a4419a6
|
YARN-2726. CapacityScheduler should explicitly log when an accessible label has no capacity. Contributed by Wangda Tan
|
2014-10-27 10:26:21 -07:00 |
|
Vinod Kumar Vavilapalli
|
0186645505
|
YARN-2743. Fixed a bug in ResourceManager that was causing RMDelegationToken identifiers to be tampered and thus causing app submission failures in secure mode. Contributed by Jian He.
|
2014-10-26 11:14:34 -07:00 |
|
Jian He
|
5864dd99a4
|
YARN-1915. Fixed a race condition that client could use the ClientToAMToken to contact with AM before AM actually receives the ClientToAMTokenMasterKey. Contributed by Jason Lowe
|
2014-10-24 22:47:56 -07:00 |
|
Zhijie Shen
|
0f3b6900be
|
YARN-2209. Replaced AM resync/shutdown command with corresponding exceptions and made related MR changes. Contributed by Jian He.
|
2014-10-23 21:56:03 -07:00 |
|
Zhijie Shen
|
7e3b5e6f5c
|
YARN-2721. Suppress NodeExist exception thrown by ZKRMStateStore when it retries creating znode. Contributed by Jian He.
|
2014-10-21 21:49:17 -07:00 |
|
Vinod Kumar Vavilapalli
|
c0e034336c
|
YARN-2715. Fixed ResourceManager to respect common configurations for proxy users/groups beyond just the YARN level config. Contributed by Zhijie Shen.
|
2014-10-21 20:09:40 -07:00 |
|
Vinod Kumar Vavilapalli
|
8256766498
|
YARN-2504. Enhanced RM Admin CLI to support management of node-labels. Contribyted by Wangda Tan.
|
2014-10-18 12:07:40 -07:00 |
|
Vinod Kumar Vavilapalli
|
39063cd36f
|
YARN-2676. Enhanced Timeline auth-filter to support proxy users. Contributed by Zhijie Shen.
|
2014-10-17 22:02:50 -07:00 |
|
Vinod Kumar Vavilapalli
|
e9c66e8fd2
|
YARN-2705. Fixed bugs in ResourceManager node-label manager that were causing test-failures: added a dummy in-memory labels-manager. Contributed by Wangda Tan.
|
2014-10-17 18:26:12 -07:00 |
|
Jian He
|
c3de2412eb
|
YARN-1879. Marked Idempotent/AtMostOnce annotations to ApplicationMasterProtocol for RM fail over. Contributed by Tsuyoshi OZAWA
|
2014-10-17 16:35:27 -07:00 |
|
Jian He
|
a6aa6e42ca
|
YARN-2588. Standby RM fails to transitionToActive if previous transitionToActive failed with ZK exception. Contributed by Rohith Sharmaks
|
2014-10-17 10:54:24 -07:00 |
|
Vinod Kumar Vavilapalli
|
abae63caf9
|
YARN-2699. Fixed a bug in CommonNodeLabelsManager that caused tests to fail when using ephemeral ports on NodeIDs. Contributed by Wangda Tan.
|
2014-10-17 08:58:08 -07:00 |
|
Vinod Kumar Vavilapalli
|
b3056c266a
|
YARN-2685. Fixed a bug in CommonNodeLabelsManager that caused wrong resource tracking per label when a host runs multiple node-managers. Contributed by Wangda Tan.
|
2014-10-15 18:47:26 -07:00 |
|
Vinod Kumar Vavilapalli
|
f2ea555ac6
|
YARN-2496. Enhanced Capacity Scheduler to have basic support for allocating resources based on node-labels. Contributed by Wangda Tan.
YARN-2500. Ehnaced ResourceManager to support schedulers allocating resources based on node-labels. Contributed by Wangda Tan.
|
2014-10-15 18:33:06 -07:00 |
|
Jian He
|
0af1a2b5bc
|
YARN-2312. Deprecated old ContainerId#getId API and updated MapReduce to use ContainerId#getContainerId instead. Contributed by Tsuyoshi OZAWA
|
2014-10-15 15:22:07 -07:00 |
|
Zhijie Shen
|
1220bb72d4
|
YARN-2656. Made RM web services authentication filter support proxy user. Contributed by Varun Vasudev and Zhijie Shen.
|
2014-10-14 21:50:46 -07:00 |
|
Zhijie Shen
|
cdce88376a
|
HADOOP-11181. Generalized o.a.h.s.t.d.DelegationTokenManager to handle all sub-classes of AbstractDelegationTokenIdentifier. Contributed by Zhijie Shen.
|
2014-10-14 11:35:38 -07:00 |
|
Karthik Kambatla
|
da709a2eac
|
YARN-2641. Decommission nodes on -refreshNodes instead of next NM-RM heartbeat. (Zhihai Xu via kasha)
|
2014-10-13 16:23:04 -07:00 |
|
Jian He
|
f9680d9a16
|
YARN-2308. Changed CapacityScheduler to explicitly throw exception if the queue
to which the apps were submitted is changed across RM restart. Contributed by Craig Welch & Chang Li
|
2014-10-13 14:09:04 -07:00 |
|
Zhijie Shen
|
4aed2d8e91
|
YARN-2651. Spun off LogRollingInterval from LogAggregationContext. Contributed by Xuan Gong.
|
2014-10-13 10:54:09 -07:00 |
|
Vinod Kumar Vavilapalli
|
db7f165319
|
YARN-2494. Added NodeLabels Manager internal API and implementation. Contributed by Wangda Tan.
|
2014-10-10 11:44:21 -07:00 |
|
Jian He
|
e16e25ab1b
|
YARN-2649. Fixed TestAMRMRPCNodeUpdates test failure. Contributed by Ming Ma
|
2014-10-08 10:58:51 -07:00 |
|
Jian He
|
30d56fdbb4
|
YARN-1857. CapacityScheduler headroom doesn't account for other AM's running. Contributed by Chen He and Craig Welch
|
2014-10-07 13:45:04 -07:00 |
|
Jian He
|
519e5a7dd2
|
YARN-2644. Fixed CapacityScheduler to return up-to-date headroom when AM allocates. Contributed by Craig Welch
|
2014-10-06 15:48:46 -07:00 |
|
Jian He
|
ea26cc0b4a
|
YARN-2615. Changed ClientToAMTokenIdentifier/RM(Timeline)DelegationTokenIdentifier to use protobuf as payload. Contributed by Junping Du
|
2014-10-06 10:47:43 -07:00 |
|
Chris Douglas
|
c8212bacb1
|
YARN-1051. Add a system for creating reservations of cluster capacity.
Contributed by Subru Krishnan and Carlo Curino.
|
2014-10-03 17:09:02 -07:00 |
|
subru
|
a2986234be
|
YARN-2611. Fixing jenkins findbugs warning and TestRMWebServicesCapacitySched for branch YARN-1051. Contributed by Subru Krishnan and Carlo Curino.
(cherry picked from commit c47464aba407d1dafe10be23fe454f0489cc4367)
|
2014-10-03 15:43:23 -07:00 |
|
subru
|
5e10a13bb4
|
YARN-2576. Making test patch pass in branch. Contributed by Subru Krishnan and Carlo Curino.
(cherry picked from commit 90ac0be86b898aefec5471db4027554c8e1b310c)
|
2014-10-03 15:43:13 -07:00 |
|
subru
|
6261f7cc69
|
YARN-2080. Integrating reservation system with ResourceManager and client-RM protocol. Contributed by Subru Krishnan and Carlo Curino.
(cherry picked from commit 8baeaead8532898163f1006276b731a237b1a559)
Conflicts:
hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/main/java/org/apache/hadoop/yarn/server/resourcemanager/rmapp/RMAppImpl.java
|
2014-10-03 15:42:43 -07:00 |
|
subru
|
3418c56bcf
|
YARN-1712. Plan follower that synchronizes the current state of reservation subsystem with the scheduler. Contributed by Subru Krishnan and Carlo Curino.
(cherry picked from commit 169085319b8b76641f8b9f6840a3fef06d221e2b)
|
2014-10-03 15:42:10 -07:00 |
|
carlo curino
|
b6df0dddcd
|
YARN-1711. Policy to enforce instantaneous and over-time quotas on user reservation. Contributed by Carlo Curino and Subru Krishnan.
(cherry picked from commit c4918cb4cb5a267a8cfd6eace28fcfe7ad6174e8)
|
2014-10-03 15:42:03 -07:00 |
|
carlo curino
|
f66ffcf832
|
YARN-1710. Logic to find allocations within a Plan that satisfy user ReservationRequest(s). Contributed by Carlo Curino and Subru Krishnan.
(cherry picked from commit aef7928899b37262773f3dc117157bb746bf8918)
|
2014-10-03 15:41:57 -07:00 |
|
subru
|
cf4b34282a
|
YARN-1709. In-memory data structures used to track resources over time to enable reservations.
(cherry picked from commit 0d8b2cd88b958b1e602fd4ea4078ef8d4742a7c3)
|
2014-10-03 15:41:51 -07:00 |
|
carlo curino
|
1c6950354f
|
YARN-2475. Logic for responding to capacity drops for the ReservationSystem. Contributed by Carlo Curino and Subru Krishnan.
(cherry picked from commit f83a07f266f2c5e6eead554d8a331ed7e75e10d5)
|
2014-10-03 15:41:21 -07:00 |
|
carlo curino
|
eb3e40b833
|
YARN-1707. Introduce APIs to add/remove/resize queues in the CapacityScheduler. Contributed by Carlo Curino and Subru Krishnan
(cherry picked from commit aac47fda7fecda9fc18ade34d633eca895865a70)
Conflicts:
hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/main/java/org/apache/hadoop/yarn/server/resourcemanager/scheduler/capacity/CapacityScheduler.java
|
2014-10-03 15:41:02 -07:00 |
|
Karthik Kambatla
|
80d11eb68e
|
YARN-2635. TestRM, TestRMRestart, TestClientToAMTokens should run with both CS and FS. (Wei Yan and kasha via kasha)
|
2014-10-03 11:49:49 -07:00 |
|
Jian He
|
054f285526
|
YARN-2628. Capacity scheduler with DominantResourceCalculator carries out reservation even though slots are free. Contributed by Varun Vasudev
|
2014-10-02 15:13:33 -07:00 |
|
Karthik Kambatla
|
5e0b49da9c
|
YARN-2254. TestRMWebServicesAppsModification should run against both CS and FS. (Zhihai Xu via kasha)
|
2014-10-02 10:15:04 -07:00 |
|
Zhijie Shen
|
52bbe0f11b
|
YARN-2630. Prevented previous AM container status from being acquired by the current restarted AM. Contributed by Jian He.
|
2014-10-01 15:38:11 -07:00 |
|
Jian He
|
bbff96be48
|
YARN-2602. Fixed possible NPE in ApplicationHistoryManagerOnTimelineStore. Contributed by Zhijie Shen
|
2014-09-30 16:44:17 -07:00 |
|
Karthik Kambatla
|
14d60dadc2
|
YARN-2594. Potential deadlock in RM when querying ApplicationResourceUsageReport. (Wangda Tan via kasha)
|
2014-09-30 16:12:27 -07:00 |
|
Zhijie Shen
|
9582a50176
|
YARN-2627. Added the info logs of attemptFailuresValidityInterval and number of previous failed attempts. Contributed by Xuan Gong.
|
2014-09-30 12:57:45 -07:00 |
|
Jason Lowe
|
9c22065109
|
YARN-1769. CapacityScheduler: Improve reservations. Contributed by Thomas Graves
|
2014-09-29 14:12:18 +00:00 |
|
Jian He
|
5391919b09
|
YARN-668. Changed NMTokenIdentifier/AMRMTokenIdentifier/ContainerTokenIdentifier to use protobuf object as the payload. Contributed by Junping Du.
|
2014-09-26 17:48:41 -07:00 |
|
Karthik Kambatla
|
f4357240a6
|
YARN-2608. FairScheduler: Potential deadlocks in loading alloc files and clock access. (Wei Yan via kasha)
|
2014-09-25 17:42:51 -07:00 |
|
Jason Lowe
|
8269bfa613
|
YARN-2523. ResourceManager UI showing negative value for "Decommissioned Nodes" field. Contributed by Rohith
|
2014-09-25 22:37:05 +00:00 |
|
Zhijie Shen
|
72b0881ca6
|
YARN-2546. Made REST API for application creation/submission use numeric and boolean types instead of the string of them. Contributed by Varun Vasudev.
|
2014-09-24 17:57:32 -07:00 |
|
Zhijie Shen
|
c86674a3a4
|
YARN-2581. Passed LogAggregationContext to NM via ContainerTokenIdentifier. Contributed by Xuan Gong.
|
2014-09-24 17:50:26 -07:00 |
|
Karthik Kambatla
|
39c87344e1
|
YARN-2596. TestWorkPreservingRMRestart fails with FairScheduler. (kasha)
|
2014-09-24 15:23:31 -07:00 |
|
Karthik Kambatla
|
f5578207d2
|
YARN-2252. Intermittent failure of TestFairScheduler.testContinuousScheduling. (Ratandeep Ratti and kasha via kasha)
|
2014-09-23 00:03:16 -07:00 |
|
Karthik Kambatla
|
568d3dc2bb
|
YARN-1959. Fix headroom calculation in FairScheduler. (Anubhav Dhoot via kasha)
|
2014-09-22 23:49:39 -07:00 |
|
Karthik Kambatla
|
43efdd30b5
|
YARN-2539. FairScheduler: Set the default value for maxAMShare to 0.5. (Wei Yan via kasha)
|
2014-09-22 16:09:52 -07:00 |
|
Karthik Kambatla
|
23e17ce0ff
|
YARN-2540. FairScheduler: Queue filters not working on scheduler page in RM UI. (Ashwin Shankar via kasha)
|
2014-09-22 15:40:47 -07:00 |
|
Jian He
|
0a641496c7
|
YARN-1372. Ensure all completed containers are reported to the AMs across RM restart. Contributed by Anubhav Dhoot
|
2014-09-22 10:30:53 -07:00 |
|
Karthik Kambatla
|
9721e2c1fe
|
YARN-2453. TestProportionalCapacityPreemptionPolicy fails with FairScheduler. (Zhihai Xu via kasha)
|
2014-09-21 23:13:45 -07:00 |
|
Karthik Kambatla
|
c50fc92502
|
YARN-2452. TestRMApplicationHistoryWriter fails with FairScheduler. (Zhihai Xu via kasha)
|
2014-09-21 13:15:04 -07:00 |
|
Jian He
|
444acf8ea7
|
YARN-2565. Fixed RM to not use FileSystemApplicationHistoryStore unless explicitly set. Contributed by Zhijie Shen
|
2014-09-19 11:26:29 -07:00 |
|
Zhijie Shen
|
6fe5c6b746
|
YARN-2568. Fixed the potential test failures due to race conditions when RM work-preserving recovery is enabled. Contributed by Jian He.
|
2014-09-18 21:56:56 -07:00 |
|
Jason Lowe
|
a337f0e354
|
YARN-2561. MR job client cannot reconnect to AM after NM restart. Contributed by Junping Du
|
2014-09-18 21:34:40 +00:00 |
|
Jason Lowe
|
9ea7b6c063
|
YARN-2363. Submitted applications occasionally lack a tracking URL. Contributed by Jason Lowe
|
2014-09-18 20:13:16 +00:00 |
|
Vinod Kumar Vavilapalli
|
485c96e3cb
|
YARN-2001. Added a time threshold for RM to wait before starting container allocations after restart/failover. Contributed by Jian He.
|
2014-09-18 11:03:12 -07:00 |
|
Jian He
|
ee21b13cbd
|
YARN-2559. Fixed NPE in SystemMetricsPublisher when retrieving FinalApplicationStatus. Contributed by Zhijie Shen
|
2014-09-17 21:44:15 -07:00 |
|
junping_du
|
90a0c03f0a
|
YARN-1250. Generic history service should support application-acls. (Contributed by Zhijie Shen)
|
2014-09-16 18:20:49 -07:00 |
|
Vinod Kumar Vavilapalli
|
14e2639fd0
|
YARN-611. Added an API to let apps specify an interval beyond which AM failures should be ignored towards counting max-attempts. Contributed by Xuan Gong.
|
2014-09-13 18:04:05 -07:00 |
|
XuanGong
|
e65ae575a0
|
YARN-2456. Possible livelock in CapacityScheduler when RM is recovering
apps. Contributed by Jian He
|
2014-09-12 15:21:46 -07:00 |
|
Jian He
|
3122daa802
|
YARN-2229. Changed the integer field of ContainerId to be long type. Contributed by Tsuyoshi OZAWA
|
2014-09-12 10:33:33 -07:00 |
|
Jason Lowe
|
78b048393a
|
YARN-2484. FileSystemRMStateStore#readFile/writeFile should close FSData(In|Out)putStream in final block. Contributed by Tsuyoshi OZAWA
|
2014-09-12 14:09:14 +00:00 |
|