Commit Graph

1410 Commits

Author SHA1 Message Date
Karthik Kambatla
c684f2b007 YARN-4729. SchedulerApplicationAttempt#getTotalRequiredResources can throw an NPE. (kasha) 2016-02-24 18:33:57 -08:00
Sangjin Lee
553b591ba0 YARN-4722. AsyncDispatcher logs redundant event queue sizes (Jason Lowe via sjlee) 2016-02-24 09:29:41 -08:00
Junping Du
9ed17f181d YARN-3223. Resource update during NM graceful decommission. Contributed by Brook Zhou. 2016-02-23 03:30:26 -08:00
Tsuyoshi Ozawa
0e12114c9c YARN-4648. Move preemption related tests from TestFairScheduler to TestFairSchedulerPreemption. Contributed by Kai Sasaki. 2016-02-23 19:50:08 +09:00
Junping Du
3fab88540f YARN-4386. refreshNodesGracefully() should send recommission event to active RMNodes only. Contributed by Kuhu Shukla. 2016-02-22 07:04:19 -08:00
Sangjin Lee
7de70680fe YARN-4690. Skip object allocation in FSAppAttempt#getResourceUsage when possible (Ming Ma via sjlee) 2016-02-17 20:55:21 -08:00
Karthik Kambatla
2ab4c476ed YARN-4689. FairScheduler: Cleanup preemptContainer to be more readable. (Kai Sasaki via kasha) 2016-02-17 18:16:15 -08:00
Arun Suresh
23f937e3b7 YARN-2575. Create separate ACLs for Reservation create/update/delete/list ops (Sean Po via asuresh) 2016-02-11 10:47:43 -08:00
Varun Vasudev
fa00d3e205 YARN-4655. Log uncaught exceptions/errors in various thread pools in YARN. Contributed by Sidharta Seethana. 2016-02-11 12:06:42 +05:30
Jian He
d16b17b4d2 YARN-4138. Roll back container resource allocation after resource increase token expires. Contributed by Meng Ding 2016-02-11 10:06:27 +08:00
=
b706cbc1bc YARN-4420. Add REST API for List Reservations (Sean Po via curino) 2016-02-10 10:19:26 -08:00
Arun Suresh
5cf5c41a89 YARN-4360. Improve GreedyReservationAgent to support "early" allocations, and performance improvements (curino via asuresh) 2016-02-10 09:11:15 -08:00
Devaraj K
565af873d5 YARN-4667. RM Admin CLI for refreshNodesResources throws NPE when nothing
is configured. Contributed by Naganarasimha G R.
2016-02-08 15:01:54 +05:30
Varun Vasudev
22a2b2231d YARN-4669. Fix logging statements in resource manager's Application class. Contributed by Sidharta Seethana. 2016-02-04 13:51:25 +05:30
Varun Vasudev
308d63f382 YARN-4307. Display blacklisted nodes for AM container in the RM web UI. Contributed by Naganarasimha G R. 2016-02-04 13:32:54 +05:30
Varun Vasudev
1adb64e09b YARN-4625. Make ApplicationSubmissionContext and ApplicationSubmissionContextInfo more consistent. Contributed by Xuan Gong. 2016-02-03 16:26:28 +05:30
Wangda Tan
9875325d5c YARN-4340. Add list API to reservation system. (Sean Po via wangda) 2016-02-02 10:17:33 +08:00
Jason Lowe
ed55950164 YARN-3102. Decommisioned Nodes not listed in Web UI. Contributed by Kuhu Shukla 2016-02-01 23:15:26 +00:00
Rohith Sharma K S
2673cbaf55 YARN-4615. Fix random test failure in TestAbstractYarnScheduler#testResourceRequestRecoveryToTheRightAppAttempt. (Sunil G via rohithsharmaks) 2016-02-01 10:43:56 +05:30
Jason Lowe
772ea7b41b YARN-4428. Redirect RM page to AHS page when AHS turned on and RM page is not available. Contributed by Chang Li 2016-01-29 21:48:54 +00:00
Jian He
f4a57d4a53 YARN-4617. LeafQueue#pendingOrderingPolicy should always use fixed ordering policy instead of using same as active applications ordering policy. Contributed by Rohith Sharma K S 2016-01-29 12:22:23 -08:00
Devaraj K
a277bdc9ed YARN-4411. RMAppAttemptImpl#createApplicationAttemptReport throws
IllegalArgumentException. Contributed by Bibin A Chundatt and yarntime.
2016-01-29 13:51:37 +05:30
Jian He
7f46636495 YARN-4519. Potential deadlock of CapacityScheduler between decrease container and assign containers. Contributed by Meng Ding 2016-01-28 14:51:00 -08:00
Rohith Sharma K S
ef343be82b YARN-4633. Fix random test failure in TestRMRestart#testRMRestartAfterPreemption. (Bibin A Chundatt via rohithsharmaks) 2016-01-28 21:53:45 +05:30
Karthik Kambatla
fb238d7e5d YARN-4462. FairScheduler: Disallow preemption from a queue. (Tao Jie via kasha) 2016-01-27 12:29:06 -08:00
Rohith Sharma K S
c01bee0108 YARN-4573. Fix test failure in TestRMAppTransitions#testAppRunningKill and testAppKilledKilled. (Takashi Ohnishi via rohithsharmaks) 2016-01-27 08:23:02 +05:30
rohithsharmaks
10dc2c0493 YARN-4613. Fix test failure in TestClientRMService#testGetClusterNodes. (Takashi Ohnishi via rohithsharmaks) 2016-01-24 23:36:15 +05:30
rohithsharmaks
99829eb221 YARN-4614. Fix random failure in TestApplicationPriority#testApplicationPriorityAllocationWithChangeInPriority. (Sunil G via rohithsharmaks) 2016-01-23 07:56:57 +05:30
rohithsharmaks
d6258b33a7 YARN-4497. RM might fail to restart when recovering apps whose attempts are missing. (Jun Gong via rohithsharmaks) 2016-01-22 20:27:38 +05:30
Akira Ajisaka
8f58f742ae YARN-4605. Spelling mistake in the help message of "yarn applicationattempt" command. Contributed by Weiwei Yang. 2016-01-22 19:43:06 +09:00
Rohith Sharma K S
e30668106d YARN-4584. RM startup failure when AM attempts greater than max-attempts. (Bibin A Chundatt via rohithsharmaks) 2016-01-22 10:14:46 +05:30
Jason Lowe
468a53b22f YARN-4610. Reservations continue looking for one app causes other apps to starve. Contributed by Jason Lowe 2016-01-21 18:31:29 +00:00
Karthik Kambatla
4992398aee YARN-4603. FairScheduler should mention user requested queuename in error message when failed in queue ACL check. (Tao Jie via kasha) 2016-01-21 17:40:59 +01:00
Wangda Tan
5ff5f67332 YARN-4557. Fix improper Queues sorting in PartitionedQueueComparator when accessible-node-labels=*. (Naganarasimha G R via wangda) 2016-01-21 11:21:06 +08:00
Xuan
890a2ebd1a YARN-4559. Make leader elector and zk store share the same curator
client. Contributed by Jian He
2016-01-20 14:48:10 -08:00
Jian He
edc43a9097 YARN-4565. Fix a bug that leads to AM resource limit not hornored when sizeBasedWeight enabled for FairOrderingPolicy. Contributed by Wangda Tan 2016-01-18 21:04:36 -08:00
Wangda Tan
a44ce3f14f YARN-4502. Fix two AM containers get allocated when AM restart. (Vinod Kumar Vavilapalli via wangda) 2016-01-19 09:30:04 +08:00
Wangda Tan
150f5ae034 Revert "YARN-4502. Fix two AM containers get allocated when AM restart. (Vinod Kumar Vavilapalli via wangda)"
This reverts commit 3fe5728563.

Conflicts:
	hadoop-yarn-project/CHANGES.txt
2016-01-19 09:27:36 +08:00
Jian He
f385851141 YARN-4596. SystemMetricPublisher should not swallow error messages from TimelineClient#putEntities. Contributed by Li Lu 2016-01-18 16:58:39 -08:00
Karthik Kambatla
d40859fab1 YARN-4526. Make SystemClock singleton so AppSchedulingInfo could use it. (kasha) 2016-01-18 10:58:14 +01:00
Wangda Tan
3fe5728563 YARN-4502. Fix two AM containers get allocated when AM restart. (Vinod Kumar Vavilapalli via wangda)
(cherry picked from commit 805a9ed85e)
2016-01-18 17:06:05 +08:00
Wangda Tan
adf260a728 Revert "YARN-4502. Fix two AM containers get allocated when AM restart. (Vinod Kumar Vavilapalli via wangda)"
This reverts commit 805a9ed85e.
2016-01-18 16:50:45 +08:00
Wangda Tan
b08ecf5c75 YARN-4304. AM max resource configuration per partition to be displayed/updated correctly in UI and in various partition related metrics. (Sunil G via wangda) 2016-01-18 11:11:32 +08:00
Wangda Tan
805a9ed85e YARN-4502. Fix two AM containers get allocated when AM restart. (Vinod Kumar Vavilapalli via wangda) 2016-01-18 11:04:25 +08:00
Wangda Tan
9523648d57 YARN-4538. QueueMetrics pending cores and memory metrics wrong. (Bibin A Chundatt via wangda) 2016-01-18 10:57:14 +08:00
rohithsharmaks
f7736f464f YARN-4389. Allow application to enable or disable am blacklisting. (Sunil G via rohithsharmaks) 2016-01-15 21:38:26 +05:30
Karthik Kambatla
9d04f26d4c YARN-3446. FairScheduler headroom calculation should exclude nodes in the blacklist. (Zhihai Xu via kasha) 2016-01-14 08:33:23 -08:00
Karthik Kambatla
321072ba81 YARN-4551. Address the duplication between StatusUpdateWhenHealthy and StatusUpdateWhenUnhealthy transitions. (Sunil G via kasha) 2016-01-13 12:09:34 -08:00
Wangda Tan
c0537bcd2c YARN-4571. Make app id/name available to the yarn authorizer provider for better auditing. (Jian He via wangda) 2016-01-13 13:18:31 +08:00
Akira Ajisaka
da1e3e3c57 YARN-4567. javadoc failing on java 8. Contributed by Steve Loughran. This closes #67. 2016-01-12 15:12:17 +09:00
Wangda Tan
9e792da014 YARN-4582. Label-related invalid resource request exception should be able to properly handled by application. (Bibin A Chundatt via wangda) 2016-01-12 12:53:31 +08:00
Jian He
5fab4ec31c Missing file for YARN-4580 2016-01-11 17:00:44 -08:00
Jian He
b8942be888 YARN-4537. Pull out priority comparison from fifocomparator and use compound comparator for FifoOrdering policy. Contributed by Rohith Sharma K S 2016-01-11 16:44:28 -08:00
Jian He
109e528ef5 YARN-4479. Change CS LeafQueue pendingOrderingPolicy to hornor recovered apps. Contributed by Rohith Sharma K S 2016-01-08 15:51:10 -08:00
Xuan
89022f8d4b YARN-4438. Implement RM leader election with curator. Contributed by Jian He 2016-01-07 14:33:06 -08:00
Junping Du
c1462a67ff YARN-4546. ResourceManager crash due to scheduling opportunity overflow. Contributed by Jason Lowe. 2016-01-06 05:49:24 -08:00
rohithsharmaks
6da6d87872 YARN-4535. Fix checkstyle error in CapacityScheduler.java (Naganarasimha G R via rohithsharmaks) 2016-01-05 12:09:57 +05:30
Wangda Tan
4e4b3a8465 YARN-4524. Cleanup AppSchedulingInfo. (Karthik Kambatla via wangda)
(cherry picked from commit 05fa852d7567b7590d6b53bbf925f8f424736514)
2015-12-30 15:39:34 -08:00
Wangda Tan
8310b2e9ff YARN-4522. Queue acl can be checked at app submission. (Jian He via wangda) 2015-12-30 15:30:12 -08:00
Junping Du
223ce323bb YARN-1382. Remove unusableRMNodesConcurrentSet (never used) in NodeListManager to get rid of memory leak. Contributed by Rohith Sharma K S. 2015-12-30 07:52:07 -08:00
Jian He
5273413411 YARN-3480. Remove attempts that are beyond max-attempt limit from state store. Contributed by Jun Gong 2015-12-29 15:58:39 -08:00
Wangda Tan
561abb9fee YARN-4315. NaN in Queue percentage for cluster apps page. (Bibin A Chundatt via wangda) 2015-12-29 13:28:00 -08:00
Jian He
d0a22bae9b YARN-4417. Make RM and Timeline-server REST APIs more consistent. Contributed by Wangda Tan 2015-12-28 15:52:45 -08:00
Karthik Kambatla
0af492b4bd YARN-4156. TestAMRestart#testAMBlacklistPreventsRestartOnSameNode assumes CapacityScheduler. (Anubhav Dhoot via kasha) 2015-12-23 17:52:36 -08:00
rohithsharmaks
8c180a13c8 YARN-4109. Exception on RM scheduler page loading with labels. (Mohammad Shahid Khan via rohithsharmaks) 2015-12-23 09:12:32 +05:30
Arun Suresh
e88422df45 YARN-4477. FairScheduler: Handle condition which can result in an infinite loop in attemptScheduling. (Tao Jie via asuresh) 2015-12-21 22:41:09 -08:00
Wangda Tan
bc038b382c YARN-4454. NM to nodelabel mapping going wrong after RM restart. (Bibin A Chundatt via wangda) 2015-12-21 11:30:13 -08:00
Jian He
85c2466048 YARN-4164. Changed updateApplicationPriority API to return the updated application priority. Contributed by Rohith Sharma K S 2015-12-18 14:13:48 -08:00
Junping Du
1de56b0448 YARN-3226. UI changes for decommissioning node. Contributed by Sunil G. 2015-12-17 15:20:17 -08:00
Jason Lowe
91828fef6b YARN-4461. Redundant nodeLocalityDelay log in LeafQueue. Contributed by Eric Payne 2015-12-16 23:22:31 +00:00
Wangda Tan
9b856d9787 YARN-4416. Deadlock due to synchronised get Methods in AbstractCSQueue. (Naganarasimha G R via wangda) 2015-12-16 13:22:37 -08:00
Wangda Tan
7faa406f27 YARN-4225. Add preemption status to yarn queue -status for capacity scheduler. (Eric Payne via wangda) 2015-12-16 13:19:40 -08:00
Wangda Tan
79c41b1d83 YARN-4293. ResourceUtilization should be a part of yarn node CLI. (Sunil G via wangda) 2015-12-16 13:18:19 -08:00
Junping Du
50bd067e1d YARN-4452. NPE when submit Unmanaged application. Contributed by Naganarasimha G R. 2015-12-16 10:57:39 -08:00
Zhihai Xu
2aaed10327 YARN-4440. FSAppAttempt#getAllowedLocalityLevelByTime should init the lastScheduler time. Contributed by Lin Yiqun 2015-12-15 00:17:21 -08:00
Jian He
1cb3299b48 YARN-4403. (AM/NM/Container)LivelinessMonitor should use monotonic time when calculating period. Contributed by Junping Du 2015-12-14 13:51:23 -08:00
Wangda Tan
07b0fb996a YARN-4418. AM Resource Limit per partition can be updated to ResourceUsage as well. (Sunil G via wangda) 2015-12-14 11:24:30 -08:00
Wangda Tan
6cb0af3c39 YARN-3946. Update exact reason as to why a submitted app is in ACCEPTED state to app's diagnostic message. (Naganarasimha G R via wangda) 2015-12-14 10:52:46 -08:00
Arun Suresh
7fb212e5e6 YARN-4358 addendum patch to fix javadoc error 2015-12-12 22:22:55 -08:00
rohithsharmaks
a5e2e1ecb0 YARN-4421. Remove dead code in RmAppImpl.RMAppRecoveredTransition. (Daniel Templeton via rohithsharmaks) 2015-12-09 11:31:51 +05:30
Wangda Tan
7e4715186d YARN-4424. Fix deadlock in RMAppImpl. (Jian he via wangda) 2015-12-08 14:25:16 -08:00
Chris Douglas
9f50e13d5d YARN-4248. Followup patch adding asf-licence exclusions for json test files 2015-12-08 12:08:04 -08:00
=
c25a635459 YARN-4248. REST API for submit/update/delete Reservations. (curino) 2015-12-07 13:33:28 -08:00
Jonathan Eagles
4ff973f96a YARN-4422. Generic AHS sometimes doesn't show started, node, or logs on App page (Eric Payne via jeagles) 2015-12-07 15:04:48 -06:00
Xuan
4546c7582b YARN-4392. ApplicationCreatedEvent event time resets after RM
restart/failover. Contributed by Naganarasimha G R and Xuan Gong
2015-12-07 12:24:55 -08:00
Steve Loughran
65f395226b HADOOP-12321. Make JvmPauseMonitor an AbstractService. (Sunil G via Stevel) [includes HDFS-8947 MAPREDUCE-6462 and YARN-4072] 2015-12-06 17:43:35 +00:00
Arun Suresh
742632e346 YARN-4358. Reservation System: Improve relationship between SharingPolicy and ReservationAgent. (Carlo Curino via asuresh) 2015-12-05 21:26:16 -08:00
Jian He
755dda8dd8 YARN-4405. Support node label store in non-appendable file system. Contributed by Wangda Tan 2015-12-03 17:45:31 -08:00
Wangda Tan
a2c3bfc8c1 YARN-4292. ResourceUtilization should be a part of NodeInfo REST API. (Sunil G via wangda) 2015-12-03 14:28:32 -08:00
Jian He
9f77ccad73 YARN-3840. Resource Manager web ui issue when sorting application by id (with application having id > 9999). Contributed by Mohammad Shahid Khan and Varun Saxena 2015-12-03 12:48:50 -08:00
Jian He
6b9a5beb2b YARN-4398. Remove unnecessary synchronization in RMStateStore. Contributed by Ning Ding 2015-12-02 11:07:18 -08:00
Tsuyoshi Ozawa
28dfe721b8 YARN-4387. Fix typo in FairScheduler log message. Contributed by Xin Wang. 2015-11-24 19:24:01 +09:00
Karthik Kambatla
52948bb20b YARN-3980. Plumb resource-utilization info in node heartbeat through to the scheduler. (Inigo Goiri via kasha) 2015-11-24 13:47:17 +05:30
Jian He
8676a118a1 YARN-4349. Support CallerContext in YARN. Contributed by Wangda Tan 2015-11-23 17:19:48 -08:00
Jason Lowe
d36b6e045f YARN-4344. NMs reconnecting with changed capabilities can lead to wrong cluster resource calculations. Contributed by Varun Vasudev 2015-11-23 20:30:26 +00:00
Arun Suresh
da1016365a YARN-3454. Add efficient merge operation to RLESparseResourceAllocation (Carlo Curino via asuresh) 2015-11-21 09:59:41 -08:00
Wangda Tan
2346fa3141 YARN-3769. Consider user limit when calculating total pending resource for preemption policy in Capacity Scheduler. (Eric Payne via wangda) 2015-11-20 15:55:50 -08:00
Jason Lowe
060cdcbe5d YARN-4374. RM capacity scheduler UI rounds user limit factor. Contributed by Chang Li 2015-11-20 23:12:29 +00:00
Arun Suresh
6a61928fb7 YARN-4184. Remove update reservation state api from state store as its not used by ReservationSystem (Sean Po via asuresh) 2015-11-17 15:50:34 -08:00
Jian He
fcd7888029 Revert "YARN-3840. Resource Manager web ui issue when sorting application by id (with application having id > 9999) Contributed by Mohammad Shahid Khan"
This reverts commit 8fbea531d7.

Conflicts:
	hadoop-yarn-project/CHANGES.txt
2015-11-16 20:18:44 -08:00
Wangda Tan
7f55a18071 YARN-4347. Resource manager fails with Null pointer exception. (Jian He via wangda) 2015-11-12 11:23:40 -08:00
Wangda Tan
796638d9bc YARN-4287. Capacity Scheduler: Rack Locality improvement (Nathan Roberts via wangda) 2015-11-12 11:09:37 -08:00
Vinod Kumar Vavilapalli (I am also known as @tshooter.)
6351d3fa63 YARN-4183. Reverting the patch to fix behaviour change.
Revert "YARN-4183. Enabling generic application history forces every job to get a timeline service delegation token (jeagles)"

This reverts commit c293c58954.
2015-11-11 10:40:43 -08:00
Jian He
8fbea531d7 YARN-3840. Resource Manager web ui issue when sorting application by id (with application having id > 9999) Contributed by Mohammad Shahid Khan 2015-11-09 10:43:45 -08:00
Jian He
e5b1733e04 YARN-4127. RM fail with noAuth error if switched from failover to non-failover. Contributed by Varun Saxena 2015-10-29 15:42:57 -07:00
Jonathan Eagles
c293c58954 YARN-4183. Enabling generic application history forces every job to get a timeline service delegation token (jeagles) 2015-10-29 16:41:10 -05:00
Arun Suresh
58d1df585c YARN-4310. FairScheduler: Log skipping reservation messages at DEBUG level (asuresh) 2015-10-29 13:42:09 -07:00
Rohith Sharma K S
656c8f9527 YARN-4130. Duplicate declaration of ApplicationId in RMAppManager#submitApplication method. (Kai Sasaki via rohithsharmaks) 2015-10-29 12:22:44 +05:30
Wangda Tan
56e4f6237a YARN-3216. Max-AM-Resource-Percentage should respect node labels. (Sunil G via wangda) 2015-10-26 16:44:39 -07:00
Wangda Tan
6f606214e7 YARN-4169. Fix racing condition of TestNodeStatusUpdaterForLabels. (Naganarasimha G R via wangda) 2015-10-26 16:36:34 -07:00
Wangda Tan
3cc73773eb YARN-4285. Display resource usage as percentage of queue and cluster in the RM UI (Varun Vasudev via wangda) 2015-10-26 13:07:39 -07:00
Jason Lowe
33a03af3c3 YARN-4284. condition for AM blacklisting is too narrow. Contributed by Sangjin Lee 2015-10-26 19:53:03 +00:00
Rohith Sharma K S
5acdde4744 YARN-2729. Support script based NodeLabelsProvider Interface in Distributed Node Label Configuration Setup. (Naganarasimha G R via rohithsharmaks) 2015-10-26 15:42:42 +05:30
Arun Suresh
ab8eb8770c YARN-3738. Add support for recovery of reserved apps running under dynamic queues (subru via asuresh) 2015-10-24 22:53:10 -07:00
Akira Ajisaka
7781fe1b9e YARN-4294. [JDK8] Fix javadoc errors caused by wrong reference and illegal tag. (aajisaka) 2015-10-24 11:54:42 +09:00
Jason Lowe
d3a34a4f38 YARN-4041. Slow delegation token renewal can severely prolong RM recovery. Contributed by Sunil G 2015-10-23 20:57:01 +00:00
Ming Ma
934d96a334 YARN-2913. Fair scheduler should have ability to set MaxResourceDefault for each queue. (Siqi Li via mingma) 2015-10-23 08:36:33 -07:00
Jonathan Eagles
f8adeb712d YARN-4009. CORS support for ResourceManager REST API. ( Varun Vasudev via jeagles) 2015-10-23 10:34:08 -05:00
Junping Du
0fce5f9a49 YARN-4243. Add retry on establishing Zookeeper conenction in EmbeddedElectorService#serviceInit. Contributed by Xuan Gong. 2015-10-22 13:41:09 -07:00
Zhihai Xu
960201b79b YARN-4256. YARN fair scheduler vcores with decimal values. Contributed by Jun Gong 2015-10-22 12:28:03 -07:00
Anubhav Dhoot
2798723a54 YARN-3739. Add reservation system recovery to RM recovery process. Contributed by Subru Krishnan. 2015-10-22 06:51:00 -07:00
Arun Suresh
506d1b1dbc YARN-3985. Make ReservationSystem persist state using RMStateStore reservation APIs. (adhoot via asuresh) 2015-10-20 16:46:14 -07:00
Arun Suresh
7e2837f830 YARN-4270. Limit application resource reservation on nodes for non-node/rack specific requests (asuresh) 2015-10-19 20:00:38 -07:00
Jian He
f9da5cdb2b YARN-4170. AM need to be notified with priority in AllocateResponse. Contributed by Sunil G 2015-10-16 15:26:27 -07:00
Wangda Tan
4337b263aa YARN-4162. CapacityScheduler: Add resource usage by partition and queue capacity by partition to REST API. (Naganarasimha G R via wangda) 2015-10-16 15:06:28 -07:00
Jian He
cf23f2c2b5 YARN-4000. RM crashes with NPE if leaf queue becomes parent queue during restart. Contributed by Varun Saxena 2015-10-15 17:12:46 -07:00
rohithsharmaks
d6c8bad869 YARN-4250. NPE in AppSchedulingInfo#isRequestLabelChanged. (Brahma Reddy Battula via rohithsharmaks) 2015-10-14 16:11:34 +05:30
Jian He
9849c8b386 YARN-4230. RM crashes with NPE when increasing container resource if there is no headroom left. Contributed by Meng Ding 2015-10-12 11:51:33 -07:00
Zhihai Xu
049c6e8dc0 YARN-4201. AMBlacklist does not work for minicluster. Contributed by Jun Gong. 2015-10-12 00:14:25 -07:00
Devaraj K
db93047881 YARN-3964. Support NodeLabelsProvider at Resource Manager side.
Contributed by Dian Fu.
2015-10-11 11:21:29 +05:30
Wangda Tan
def374e666 YARN-4140. RM container allocation delayed incase of app submitted to Nodelabel partition. (Bibin A Chundatt via wangda) 2015-10-09 16:38:59 -07:00
Karthik Kambatla
4aa9b3e75c MAPREDUCE-6302. Incorrect headroom can lead to a deadlock between map and reduce allocations. (kasha) 2015-10-09 07:37:39 -07:00
Jason Lowe
a0bca2b5ad YARN-261. Ability to fail AM attempts. Contributed by Andrey Klochkov and Rohith Sharma K S 2015-10-09 14:17:38 +00:00
Rohith Sharma K S
8f195387a4 YARN-4235. FairScheduler PrimaryGroup does not handle empty groups returned for a user. (Anubhav Dhoot via rohithsharmaks) 2015-10-09 10:09:26 +05:30
Rohith Sharma K S
3793cbe4c3 YARN-4228. FileSystemRMStateStore use IOUtils#close instead of fs#close. (Bibin A Chundatt via rohithsharmaks) 2015-10-07 10:12:14 +05:30
Rohith Sharma K S
9156fc60c6 YARN-4209. RMStateStore FENCED state doesn’t work due to updateFencedState called by stateMachine.doTransition. (Zhihai Xu via rohithsharmaks) 2015-10-07 09:34:59 +05:30
Wangda Tan
29a582ada0 YARN-4215. RMNodeLabels Manager Need to verify and replace node labels for the only modified Node Label Mappings in the request. (Naganarasimha G R via wangda) 2015-10-06 11:56:04 -07:00
Harsh J
c918f7be5e HADOOP-12458. Retries is typoed to spell Retires in parts of hadoop-yarn and hadoop-common. Contributed by Neelesh Srinivas Salian. 2015-10-03 18:37:58 +05:30
Xuan
8f08532bde YARN-1897. CLI and core support for signal container functionality. Contributed by Ming Ma 2015-10-02 18:50:47 -07:00
Karthik Kambatla
a0b5a0a419 YARN-4066. Large number of queues choke fair scheduler. (Johan Gustavsson via kasha) 2015-09-29 07:55:34 -07:00
Anubhav Dhoot
9735afe967 YARN-4180. AMLauncher does not retry on failures when talking to NM. (adhoot) 2015-09-28 16:13:41 -07:00
Jason Lowe
9f53a95ff6 YARN-4141. Runtime Application Priority change should not throw exception for applications at finishing states. Contributed by Sunil G 2015-09-28 22:55:20 +00:00
Anubhav Dhoot
fb2e525c07 YARN-4204. ConcurrentModificationException in FairSchedulerQueueInfo. (adhoot) 2015-09-28 09:05:45 -07:00
Rohith Sharma K S
a9aafad12b YARN-4044. Running applications information changes such as movequeue is not published to TimeLine server. (Sunil G via rohithsharmaks) 2015-09-24 12:13:22 +05:30
Jian He
b3f6b641dc YARN-4171. Fix findbugs warnings in YARN-1197 branch. Contributed by Wangda Tan 2015-09-23 13:29:38 -07:00
Jian He
89cab1ba5f YARN-1651. CapacityScheduler side changes to support container resize. Contributed by Wangda Tan 2015-09-23 13:29:38 -07:00
Jian He
5f5a968d65 YARN-3867. ContainerImpl changes to support container resizing. Contributed by Meng Ding 2015-09-23 13:29:37 -07:00
Jian He
83a18add10 YARN-1449. AM-NM protocol changes to support container resizing. Contributed by Meng Ding & Wangda Tan) 2015-09-23 13:29:36 -07:00
Tsuyoshi Ozawa
dfd807afab HADOOP-12428. Fix inconsistency between log-level guards and statements. Contributed by Jagadesh Kiran N and Jackie Chang. 2015-09-22 12:54:29 +09:00
Rohith Sharma K S
c9cb6a5960 YARN-4167. NPE on RMActiveServices#serviceStop when store is null. (Bibin A Chundatt via rohithsharmaks) 2015-09-21 09:59:30 +05:30
Arun Suresh
94dec5a916 YARN-3920. FairScheduler container reservation on a node should be configurable to limit it to large containers (adhoot via asuresh) 2015-09-18 14:02:55 -07:00
Wangda Tan
9bc913a35c YARN-3212. RMNode State Transition Update with DECOMMISSIONING state. (Junping Du via wangda) 2015-09-18 10:04:17 -07:00
Rohith Sharma K S
723c31d45b YARN-4135. Improve the assertion message in MockRM while failing after waiting for the state.(Nijel S F via rohithsharmaks) 2015-09-18 08:44:10 +05:30
Jian He
6c6e734f0b YARN-4034. Render cluster Max Priority in scheduler metrics in RM web UI. Contributed by Rohith Sharma K S 2015-09-17 14:55:50 +08:00
Jian He
452079af8b YARN-4078. Add getPendingResourceRequestForAttempt in YarnScheduler interface. Contributed by Naganarasimha G R 2015-09-16 14:59:20 +08:00
Wangda Tan
ae5308fe1d YARN-3717. Expose app/am/queue's node-label-expression to RM web UI / CLI / REST-API. (Naganarasimha G R via wangda) 2015-09-15 11:40:50 -07:00
Junping Du
73e3a49eb0 YARN-313. Add Admin API for supporting node resource configuration in command line. (Contributed by Inigo Goiri, Kenji Kikushima and Junping Du) 2015-09-15 07:56:47 -07:00
Jian He
5468baa80a YARN-3635. Refactored current queue mapping implementation in CapacityScheduler to use a generic PlacementManager framework. Contributed by Wangda Tan 2015-09-15 15:39:20 +08:00
Jian He
e1b1d7e4ae YARN-4126. RM should not issue delegation tokens in unsecure mode. Contributed by Bibin A Chundatt 2015-09-14 14:09:19 +08:00
Karthik Kambatla
332b520a48 YARN-3697. FairScheduler: ContinuousSchedulingThread can fail to shutdown. (Zhihai Xu via kasha) 2015-09-13 18:07:43 -07:00
Karthik Kambatla
81df7b586a YARN-2005. Blacklisting support for scheduling AMs. (Anubhav Dhoot via kasha) 2015-09-13 17:03:15 -07:00
Steve Loughran
7269906254 HADOOP-12087. [JDK8] Fix javadoc errors caused by incorrect or illegal tags. (Akira AJISAKA via stevel). 2015-09-13 14:25:26 +01:00
Robert Kanter
ea4bb2749f YARN-4145. Make RMHATestBase abstract so its not run when running all tests under that namespace (adhoot via rkanter) 2015-09-11 11:46:10 -07:00
Jian He
6f72f1e600 YARN-2884. Added a proxy service in NM to proxy the the communication between AM and RM. Contributed by Kishore Chaliparambil 2015-09-08 09:35:46 +08:00
Xuan
9b78e6e33d YARN-4087. Followup fixes after YARN-2019 regarding RM behavior when
state-store error occurs. Contributed by Jian He
2015-09-07 17:45:47 -07:00
Wangda Tan
bcc85e3bab YARN-4024. YARN RM should avoid unnecessary resolving IP when NMs doing heartbeat. (Hong Zhiguo via wangda) 2015-09-04 15:13:53 -07:00
Jason Lowe
6eaca2e363 YARN-4105. Capacity Scheduler headroom for DRF is wrong. Contributed by Chang Li 2015-09-04 15:30:53 +00:00
Varun Vasudev
40d222e862 YARN-4103. RM WebServices missing scheme for appattempts logLinks. Contributed by Jonathan Eagles. 2015-09-04 14:31:51 +05:30
Varun Vasudev
b469ac531a YARN-3970. Add REST api support for Application Priority. Contributed by Naganarasimha G R. 2015-09-03 16:40:10 +05:30
Jian He
09c64ba1ba YARN-4101. RM should print alert messages if Zookeeper and Resourcemanager gets connection issue. Contributed by Xuan Gong 2015-09-02 17:45:23 -07:00
Rohith Sharma K S
7d6687fe76 YARN-3893. Both RM in active state when Admin#transitionToActive failure from refeshAll() (Bibin A Chundatt via rohithsharmaks) 2015-09-02 15:22:48 +05:30
Varun Vasudev
bf669b6d9f YARN-4082. Container shouldn't be killed when node's label updated. Contributed by Wangda Tan. 2015-09-01 14:19:11 +05:30
Jian He
a3fd2ccc86 YARN-4092. Fixed UI redirection to print useful messages when both RMs are in standby mode. Contributed by Xuan Gong 2015-08-31 17:33:24 -07:00
Junping Du
beb65c9465 YARN-1556. NPE getting application report with a null appId. Contributed by Weiwei Yang. 2015-08-28 05:57:34 -07:00
Jian He
a9c8ea71aa YARN-3250. Support admin cli interface in for Application Priority. Contributed by Rohith Sharma K S 2015-08-27 13:25:53 -07:00
Jian He
57c7ae1aff YARN-4014. Support user cli interface in for Application Priority. Contributed by Rohith Sharma K S 2015-08-24 20:36:44 -07:00
Rohith Sharma K S
feaf034994 YARN-3896. RMNode transitioned from RUNNING to REBOOTED because its response id has not been reset synchronously. (Jun Gong via rohithsharmaks) 2015-08-24 11:25:07 +05:30
Xuan
37e1c3d82a YARN-221. NM should provide a way for AM to tell it not to aggregate
logs. Contributed by Ming Ma
2015-08-22 16:25:24 -07:00
Rohith Sharma K S
22de7c1dca YARN-3986. getTransferredContainers in AbstractYarnScheduler should be present in YarnScheduler interface 2015-08-21 10:51:11 +05:30
Xuan
22dc5fc209 YARN-4028. AppBlock page key update and diagnostics value null on
recovery. Contributed by Bibin A Chundatt
2015-08-18 22:53:03 -07:00
Zhihai Xu
3a76a010b8 YARN-3857: Memory leak in ResourceManager with SIMPLE mode. Contributed by mujunchao. 2015-08-18 10:36:40 -07:00
Jian He
0a030546e2 YARN-3987. Send AM container completed msg to NM once AM finishes. Contributed by sandflee 2015-08-13 16:22:53 -07:00
Jian He
7a445fcfab YARN-4047. ClientRMService getApplications has high scheduler lock contention. Contributed by Jason Lowe 2015-08-13 16:02:57 -07:00
Jian He
e5003be907 YARN-4026. Refactored ContainerAllocator to accept a list of priorites rather than a single priority. Contributed by Wangda Tan 2015-08-12 15:07:50 -07:00
rohithsharmaks
1c12adb71f YARN-4023. Publish Application Priority to TimelineServer. (Sunil G via rohithsharmaks) 2015-08-12 14:45:41 +05:30
Xuan
3ae716fa69 YARN-3999. RM hangs on draing events. Contributed by Jian He 2015-08-11 18:25:11 -07:00
Jian He
fa1d84ae27 YARN-3887. Support changing Application priority during runtime. Contributed by Sunil G 2015-08-10 20:51:54 -07:00
Wangda Tan
cf9d3c9256 YARN-3873. PendingApplications in LeafQueue should also use OrderingPolicy. (Sunil G via wangda) 2015-08-10 14:54:55 -07:00
Wangda Tan
4bc42d76e7 YARN-3966. Fix excessive loggings in CapacityScheduler. (Jian He via wangda) 2015-08-07 09:46:57 -07:00
Rohith Sharma K S
b6265d39c5 YARN-3948. Display Application Priority in RM Web UI.(Sunil G via rohithsharmaks) 2015-08-07 10:43:41 +05:30
Carlo Curino
8572a5a14b YARN-3974. Refactor the reservation system test cases to use parameterized base test. (subru via curino) 2015-08-02 01:55:31 -07:00
Junping Du
cfee02b3bd YARN-4019. Add JvmPauseMonitor to ResourceManager and NodeManager. Contributed by Robert Kanter. 2015-08-06 06:49:45 -07:00
Arun Suresh
154c9d2e42 YARN-3961. Expose pending, running and reserved containers of a queue in REST api and yarn top (adhoot via asuresh) 2015-08-05 23:14:14 -07:00
rohithsharmaks
df9e7280db YARN-3992. TestApplicationPriority.testApplicationPriorityAllocation fails intermittently. (Contributed by Sunil G) 2015-08-06 10:43:37 +05:30
Jian He
ba2313d614 YARN-3983. Refactored CapacityScheduleri#FiCaSchedulerApp to easier extend container allocation logic. Contributed by Wangda Tan 2015-08-05 13:47:40 -07:00
Arun Suresh
f271d37735 YARN-3736. Add RMStateStore apis to store and load accepted reservations for failover (adhoot via asuresh) 2015-08-05 12:57:12 -07:00
Xuan
0306d902f5 YARN-3543. ApplicationReport should be able to tell whether the
Application is AM managed or not. Contributed by Rohith Sharma K S
2015-08-03 15:46:00 -07:00
Jonathan Eagles
3cd02b9522 YARN-3978. Configurably turn off the saving of container info in Generic AHS (Eric Payne via jeagles) 2015-08-03 10:38:05 -05:00
Jason Lowe
32e490b6c0 YARN-3990. AsyncDispatcher may overloaded with RMAppNodeUpdateEvent when Node is connected/disconnected. Contributed by Bibin A Chundatt 2015-07-31 17:37:24 +00:00
Zhihai Xu
ab80e27703 YARN-433. When RM is catching up with node updates then it should not expire acquired containers. Contributed by Xuan Gong 2015-07-30 21:57:11 -07:00
Wangda Tan
91b42e7d6e YARN-3971. Skip RMNodeLabelsManager#checkRemoveFromClusterNodeLabelsOfQueue on nodelabel recovery. (Bibin A Chundatt via wangda) 2015-07-30 10:00:31 -07:00
Karthik Kambatla
5205a330b3 YARN-2768. Avoid cloning Resource in FSAppAttempt#updateDemand. (Hong Zhiguo via kasha) 2015-07-29 09:42:32 -07:00
Jian He
3572ebd738 YARN-3846. RM Web UI queue filter is not working for sub queue. Contributed by Mohammad Shahid Khan 2015-07-27 17:12:05 -07:00
ccurino
156f24ead0 YARN-3656. LowCost: A Cost-Based Placement Agent for YARN Reservations. (Jonathan Yaniv and Ishai Menache via curino) 2015-07-25 07:39:47 -07:00
Wangda Tan
a3bd7b4a59 YARN-3973. Recent changes to application priority management break reservation system from YARN-1051 (Carlo Curino via wangda) 2015-07-24 16:44:18 -07:00
Jian He
83fe34ac08 YARN-3026. Move application-specific container allocation logic from LeafQueue to FiCaSchedulerApp. Contributed by Wangda Tan 2015-07-24 14:00:25 -07:00
Karthik Kambatla
d19d187753 YARN-3957. FairScheduler NPE In FairSchedulerQueueInfo causing scheduler page to return 500. (Anubhav Dhoot via kasha) 2015-07-24 11:44:37 -07:00
carlo curino
0fcb4a8cf2 YARN-3969. Allow jobs to be submitted to reservation that is active but does not have any allocations. (subru via curino) 2015-07-23 19:33:59 -07:00
Rohith Sharma K S
e202efaf93 YARN-3845. Scheduler page does not render RGBA color combinations in IE11. (Contributed by Mohammad Shahid Khan) 2015-07-24 12:43:06 +05:30
Robert Kanter
1d3026e7b3 YARN-3900. Protobuf layout of yarn_security_token causes errors in other protos that include it (adhoot via rkanter) 2015-07-23 14:46:54 -07:00
Wangda Tan
3bba180051 YARN-3941. Proportional Preemption policy should try to avoid sending duplicate PREEMPT_CONTAINER event to scheduler. (Sunil G via wangda) 2015-07-23 10:07:57 -07:00
Junping Du
ee98d6354b YARN-2019. Retrospect on decision of making RM crashed if any exception throw in ZKRMStateStore. Contributed by Jian He. 2015-07-22 17:52:35 -07:00
Wangda Tan
76ec26de80 YARN-3932. SchedulerApplicationAttempt#getResourceUsageReport and UserInfo should based on total-used-resources. (Bibin A Chundatt via wangda) 2015-07-22 11:54:02 -07:00
Wangda Tan
c39ca541f4 YARN-2003. Support for Application priority : Changes in RM and Capacity Scheduler. (Sunil G via wangda) 2015-07-21 09:57:23 -07:00
Arun Suresh
9b272ccae7 YARN-3535. Scheduler must re-request container resources when RMContainer transitions from ALLOCATED to KILLED (rohithsharma and peng.zhang via asuresh) 2015-07-17 04:31:34 -07:00
Wangda Tan
3540d5fe4b YARN-3885. ProportionalCapacityPreemptionPolicy doesn't preempt if queue is more than 2 level. (Ajith S via wangda) 2015-07-16 16:13:32 -07:00
Arun Suresh
ac94ba3e18 YARN-3453. Ensure preemption logic in FairScheduler uses DominantResourceCalculator in DRF queues to prevent unnecessary thrashing. (asuresh) 2015-07-14 00:23:55 -07:00
Akira Ajisaka
19295b36d9 YARN-3381. Fix typo InvalidStateTransitonException. Contributed by Brahma Reddy Battula. 2015-07-13 17:52:13 +09:00
Wangda Tan
5ed1fead6b YARN-3894. RM startup should fail for wrong CS xml NodeLabel capacity configuration. (Bibin A Chundatt via wangda) 2015-07-12 21:52:11 -07:00
Wangda Tan
1df39c1efc YARN-3849. Too much of preemption activity causing continuos killing of containers across queues. (Sunil G via wangda) 2015-07-11 10:26:46 -07:00
Zhijie Shen
1ea36299a4 YARN-3116. RM notifies NM whether a container is an AM container or normal task container. Contributed by Giovanni Matteo Fumarola. 2015-07-10 18:58:10 -07:00
Ming Ma
08244264c0 YARN-3445. Cache runningApps in RMNode for getting running apps on given NodeId. (Junping Du via mingma) 2015-07-10 08:30:10 -07:00
Xuan
5214876792 YARN-3888. ApplicationMaster link is broken in RM WebUI when appstate is
NEW. Contributed by Bibin A Chundatt
2015-07-09 21:37:33 -07:00
carlo curino
0e602fa3a1 YARN-3800. Reduce storage footprint for ReservationAllocation. Contributed by Anubhav Dhoot. 2015-07-09 16:51:59 -07:00
Jian He
c9dd2cada0 YARN-3892. Fixed NPE on RMStateStore#serviceStop when CapacityScheduler#serviceInit fails. Contributed by Bibin A Chundatt 2015-07-07 14:16:21 -07:00
Devaraj K
37d7395773 YARN-3875. FSSchedulerNode#reserveResource() doesn't print Application Id
properly in log. Contributed by Bibin A Chundatt.
2015-07-02 10:20:31 +05:30
Wangda Tan
0e4b06690f YARN-3508. Prevent processing preemption events on the main RM dispatcher. (Varun Saxena via wangda) 2015-07-01 17:32:22 -07:00
Devaraj K
80a68d6056 YARN-3830. AbstractYarnScheduler.createReleaseCache may try to clean a
null attempt. Contributed by nijel.
2015-07-01 19:03:44 +05:30
Devaraj K
b543d1a390 YARN-3859. LeafQueue doesn't print user properly for application add.
Contributed by Varun Saxena.
2015-06-28 10:04:50 +05:30
Xuan
fe6c1bd73a YARN-2871. TestRMRestart#testRMRestartGetApplicationList sometime fails
in trunk. Contributed by zhihai xu
2015-06-26 19:43:59 -07:00
Devaraj K
57f1a01eda YARN-3826. Race condition in ResourceTrackerService leads to wrong
diagnostics messages. Contributed by Chengbing Liu.
2015-06-25 16:13:59 +05:30
rohithsharmaks
dd4b387d96 YARN-3790. usedResource from rootQueue metrics may get stale data for FS scheduler after recovering the container (Zhihai Xu via rohithsharmaks) 2015-06-24 23:00:14 +05:30
Jason Lowe
2a20dd9b61 YARN-3809. Failed to launch new attempts because ApplicationMasterLauncher's threads all hang. Contributed by Jun Gong 2015-06-24 16:23:48 +00:00
Robert Kanter
99271b7621 YARN-3835. hadoop-yarn-server-resourcemanager test package bundles core-site.xml, yarn-site.xml (vamsee via rkanter) 2015-06-22 18:02:27 -07:00
Xuan
5b5bb8dcdc YARN-3802. Two RMNodes for the same NodeId are used in RM sometimes
after NM is reconnected. Contributed by zhihai xu
2015-06-18 14:37:49 -07:00
Xuan
a826d432f9 YARN-3804. Both RM are on standBy state when kerberos user not in yarn.admin.acl. Contributed by Varun Saxena 2015-06-17 16:23:27 -07:00
Devaraj K
b039e69bb0 YARN-3789. Improve logs for LeafQueue#activateApplications(). Contributed
by Bibin A Chundatt.
2015-06-16 14:03:22 +05:30
Devaraj K
d8dcfa98e3 YARN-3794. TestRMEmbeddedElector fails because of ambiguous LOG reference.
Contributed by Chengbing Liu.
2015-06-12 13:42:49 +05:30
Xuan
5583f88bf7 YARN-3785. Support for Resource as an argument during submitApp call in
MockRM test class. Contributed by Sunil G
2015-06-10 21:40:48 -07:00
Xuan
2b2465dfac YARN-3778. Fix Yarn resourcemanger CLI usage. Contributed by Brahma Reddy Battula 2015-06-08 15:43:03 -07:00
Jian He
960b8f19ca YARN-2716. Refactor ZKRMStateStore retry code with Apache Curator. Contributed by Karthik Kambatla 2015-06-08 14:50:58 -07:00
Devaraj K
c7ee6c151c YARN-3780. Should use equals when compare Resource in
RMNodeImpl#ReconnectNodeTransition. Contributed by zhihai xu.
2015-06-08 11:54:55 +05:30
Karthik Kambatla
bd69ea408f YARN-3655. FairScheduler: potential livelock due to maxAMShare limitation and container reservation. (Zhihai Xu via kasha) 2015-06-07 11:37:52 -07:00
Xuan
3e000a919f YARN-1462. AHS API and other AHS changes to handle tags for completed MR jobs. Contributed by Xuan Gong 2015-06-05 12:48:52 -07:00
Karthik Kambatla
75885852cc YARN-3259. FairScheduler: Trigger fairShare updates on node events. (Anubhav Dhoot via kasha) 2015-06-05 09:39:41 -07:00
Jian He
1970ca7cbc YARN-2392. Add more diags about app retry limits on AM failures. Contributed by Steve Loughran 2015-06-04 11:14:09 -07:00
Jian He
6ad4e59cfc YARN-3764. CapacityScheduler should forbid moving LeafQueue from one parent to another. Contributed by Wangda Tan 2015-06-04 10:52:59 -07:00
Wangda Tan
ebd797c48f YARN-3733. Fix DominantRC#compare() does not work as expected if cluster resource is empty. (Rohith Sharmaks via wangda) 2015-06-04 10:22:57 -07:00
Junping Du
d7e7f6aa03 YARN-41. The RM should handle the graceful shutdown of the NM. Contributed by Devaraj K. 2015-06-04 04:59:27 -07:00
Xuan
5766a04428 YARN-3749. We should make a copy of configuration when init
MiniYARNCluster with multiple RMs. Contributed by Chun Chen
2015-06-03 17:20:15 -07:00