Commit Graph

1854 Commits

Author SHA1 Message Date
Jian He
0dd9bcab97 YARN-4811. Generate histograms in ContainerMetrics for actual container resource usage 2016-03-31 14:28:13 -07:00
Robert Kanter
7a021471c3 YARN-4639. Remove dead code in TestDelegationTokenRenewer added in YARN-3055 (templedf via rkanter) 2016-03-31 13:09:09 -07:00
Allen Wittenauer
0a74610d1c HADOOP-11393. Revert HADOOP_PREFIX, go back to HADOOP_HOME (aw) 2016-03-31 07:51:05 -07:00
Jian He
60e4116bf1 YARN-4822. Refactor existing Preemption Policy of CS for easier adding new approach to select preemption candidates. Contributed by Wangda Tan 2016-03-30 12:43:52 -07:00
Wangda Tan
fc055a3cbe YARN-4865. Track Reserved resources in ResourceUsage and QueueCapacities. (Sunil G via wangda) 2016-03-29 17:07:55 -07:00
Xuan
80182809ae YARN-4863. AHS Security login should be in serviceInit() instead of serviceStart(). Contributed by Junping Du 2016-03-28 22:17:45 -07:00
Jason Lowe
948b758070 YARN-4773. Log aggregation performs extraneous filesystem operations when rolling log aggregation is disabled. Contributed by Jun Gong 2016-03-28 23:00:56 +00:00
Jian He
524bc3c33a YARN-998. Keep NM resource updated through dynamic resource config for RM/NM restart. Contributed by Junping Du 2016-03-28 11:12:33 -07:00
Jian He
55ae143923 YARN-4117. End to end unit test with mini YARN cluster for AMRMProxy Service. Contributed by Giovanni Matteo Fumarola 2016-03-27 20:22:12 -07:00
Karthik Kambatla
49ff54c860 YARN-4805. Don't go through all schedulers in ParameterizedTestBase. (kasha) 2016-03-26 21:45:13 -07:00
Arun Suresh
00bebb7e58 YARN-4823. Refactor the nested reservation id field in listReservation to simple string field. (subru via asuresh) 2016-03-25 15:54:38 -07:00
Arun Suresh
d82e797b65 YARN-4825. Remove redundant code in ClientRMService::listReservations. (subru via asuresh) 2016-03-24 09:59:55 -07:00
Allen Wittenauer
b1394d6307 YARN-4850. test-fair-scheduler.xml isn't valid xml (Yufei Gu via aw) 2016-03-24 08:15:58 -07:00
Junping Du
19b645c938 YARN-4820. ResourceManager web redirects in HA mode drops query parameters. Contributed by Varun Vasudev. 2016-03-23 19:34:30 -07:00
Eric Payne
92b7e0d413 YARN-4686. MiniYARNCluster.start() returns before cluster is completely started. Contributed by Eric Badger. 2016-03-18 16:12:47 +00:00
Junping Du
ca8106d2dd YARN-4785. inconsistent value type of the type field for LeafQueueInfo in response of RM REST API. 2016-03-17 09:04:41 -07:00
Karthik Kambatla
f84af8bd58 YARN-4812. TestFairScheduler#testContinuousScheduling fails intermittently. (kasha) 2016-03-17 05:54:06 -07:00
Wangda Tan
ae14e5d07f YARN-4108. CapacityScheduler: Improve preemption to only kill containers that would satisfy the incoming request. (Wangda Tan)
(cherry picked from commit 7e8c9beb41)
2016-03-16 17:02:33 -07:00
Wangda Tan
fa7a43529d Revert "CapacityScheduler: Improve preemption to only kill containers that would satisfy the incoming request. (Wangda Tan)"
This reverts commit 7e8c9beb41.
2016-03-16 17:02:10 -07:00
Wangda Tan
7e8c9beb41 CapacityScheduler: Improve preemption to only kill containers that would satisfy the incoming request. (Wangda Tan) 2016-03-16 16:59:59 -07:00
Karthik Kambatla
3ef5500783 YARN-4560. Make scheduler error checking message more user friendly. (Ray Chiang via kasha) 2016-03-15 23:45:01 -07:00
Robert Kanter
22ca176dfe TestResourceLocalizationService.testPublicResourceInitializesLocalDir fails Intermittently due to IllegalArgumentException from cleanup (templedf via rkanter) 2016-03-15 10:05:10 -07:00
Karthik Kambatla
20d389ce61 YARN-4719. Add a helper library to maintain node state and allows common queries. (kasha) 2016-03-14 14:19:05 -07:00
Junping Du
f291d82cd4 YARN-4545. Allow YARN distributed shell to use ATS v1.5 APIs. Li Lu via junping_du 2016-03-14 08:28:38 -07:00
Li Lu
017d2c127b YARN-4696. Improving EntityGroupFSTimelineStore on exception handling, test setup, and concurrency.
This commit amends commit d49cfb3504 with a missed test file.
2016-03-10 13:02:28 -08:00
Li Lu
d49cfb3504 YARN-4696. Improving EntityGroupFSTimelineStore on exception handling, test setup, and concurrency. (Steve Loughran via gtcarrera9) 2016-03-10 10:51:55 -08:00
Wangda Tan
0233d4e0ee YARN-4465. SchedulerUtils#validateRequest for Label check should happen only when nodelabel enabled. (Bibin A Chundatt via wangda) 2016-03-08 14:27:03 -08:00
Jian He
3c33158d1c YARN-4764. Application submission fails when submitted queue is not available in scheduler xml. Contributed by Bibin A Chundatt 2016-03-08 13:07:57 -08:00
Vinod Kumar Vavilapalli
b2661765a5 YARN-4762. Fixed CgroupHandler's creation and usage to avoid NodeManagers crashing when LinuxContainerExecutor is enabled. (Sidharta Seethana via vinodkv) 2016-03-07 11:08:17 -08:00
Jason Lowe
4163e36c2b YARN-4760. proxy redirect to history server uses wrong URL. Contributed by Eric Badger 2016-03-07 15:56:33 +00:00
Jason Lowe
059caf9989 YARN-4744. Too many signal to container failure in case of LCE. Contributed by Sidharta Seethana 2016-03-07 15:40:01 +00:00
Varun Vasudev
8ed2e060e8 YARN-4245. Generalize config file handling in container-executor. Contributed by Sidharta Seethana. 2016-03-07 16:18:35 +05:30
Varun Vasudev
e51a8c1056 YARN-4737. Add CSRF filter support in YARN. Contributed by Jonathan Maron. 2016-03-07 15:26:44 +05:30
Zhihai Xu
e1ccc9622b YARN-4761. NMs reconnecting with changed capabilities can lead to wrong cluster resource calculations on fair scheduler. Contributed by Sangjin Lee 2016-03-06 19:46:09 -08:00
Rohith Sharma K S
19ee185907 YARN-4763. RMApps Page crashes with NPE. (Bibin A Chundatt via rohithsharmaks) 2016-03-05 13:02:57 +05:30
Jian He
5c465df904 YARN-4671. There is no need to acquire CS lock when completing a container. Contributed by Meng Ding 2016-03-01 13:14:12 -08:00
Jian He
d93c22ec27 YARN-4748. ApplicationHistoryManagerOnTimelineStore should not swallow exceptions on generateApplicationReport. Contributed by Li Lu 2016-02-29 18:19:09 -08:00
Karthik Kambatla
9dafaaaf0d YARN-4704. TestResourceManager#testResourceAllocation() fails when using FairScheduler. (Yufei Gu via kasha) 2016-02-29 16:10:12 -08:00
Haohui Mai
0fa54d45b1 HADOOP-12813. Migrate TestRPC and related codes to rebase on ProtobufRpcEngine. Contributed by Kai Zheng. 2016-02-29 11:41:00 -08:00
Jason Lowe
c58a6d53c5 YARN-4731. container-executor should not follow symlinks in recursive_unlink_children. Contributed by Colin Patrick McCabe 2016-02-29 15:24:35 +00:00
Rohith Sharma K S
e0b14f26f5 YARN-4566. Fix test failure in TestMiniYarnClusterNodeUtilization. (Takashi Ohnishi via rohithsharmaks) 2016-02-29 10:38:14 +08:00
Karthik Kambatla
f9692770a5 YARN-4718. Rename variables in SchedulerNode to reduce ambiguity post YARN-1011. (Inigo Goiri via kasha) 2016-02-28 09:35:59 -08:00
Jason Lowe
6b0f813e89 YARN-4723. NodesListManager$UnknownNodeId ClassCastException. Contributed by Kuhu Shukla 2016-02-26 20:24:50 +00:00
Ming Ma
7f3139e54d YARN-4720. Skip unnecessary NN operations in log aggregation. (Jun Gong via mingma) 2016-02-26 08:40:05 -08:00
Robert Kanter
d7fdec1e6b YARN-4579. Allow DefaultContainerExecutor container log directory permissions to be configurable (rchiang via rkanter) 2016-02-25 16:36:38 -08:00
Karthik Kambatla
c684f2b007 YARN-4729. SchedulerApplicationAttempt#getTotalRequiredResources can throw an NPE. (kasha) 2016-02-24 18:33:57 -08:00
Robert Kanter
954dd57043 YARN-4697. NM aggregation thread pool is not bound by limits (haibochen via rkanter) 2016-02-24 15:00:24 -08:00
Sangjin Lee
553b591ba0 YARN-4722. AsyncDispatcher logs redundant event queue sizes (Jason Lowe via sjlee) 2016-02-24 09:29:41 -08:00
Jason Lowe
d284e187b8 YARN-2046. Out of band heartbeats are sent only on container kill and possibly too early. Contributed by Ming Ma 2016-02-23 20:49:09 +00:00
Junping Du
9ed17f181d YARN-3223. Resource update during NM graceful decommission. Contributed by Brook Zhou. 2016-02-23 03:30:26 -08:00
Tsuyoshi Ozawa
0e12114c9c YARN-4648. Move preemption related tests from TestFairScheduler to TestFairSchedulerPreemption. Contributed by Kai Sasaki. 2016-02-23 19:50:08 +09:00
Varun Vasudev
140cb5d745 YARN-4709. NMWebServices produces incorrect JSON for containers. Contributed by Varun Saxena. 2016-02-23 12:29:25 +05:30
Junping Du
3fab88540f YARN-4386. refreshNodesGracefully() should send recommission event to active RMNodes only. Contributed by Kuhu Shukla. 2016-02-22 07:04:19 -08:00
Sangjin Lee
7de70680fe YARN-4690. Skip object allocation in FSAppAttempt#getResourceUsage when possible (Ming Ma via sjlee) 2016-02-17 20:55:21 -08:00
Karthik Kambatla
2ab4c476ed YARN-4689. FairScheduler: Cleanup preemptContainer to be more readable. (Kai Sasaki via kasha) 2016-02-17 18:16:15 -08:00
Arun Suresh
23f937e3b7 YARN-2575. Create separate ACLs for Reservation create/update/delete/list ops (Sean Po via asuresh) 2016-02-11 10:47:43 -08:00
Varun Vasudev
fa00d3e205 YARN-4655. Log uncaught exceptions/errors in various thread pools in YARN. Contributed by Sidharta Seethana. 2016-02-11 12:06:42 +05:30
Jian He
d16b17b4d2 YARN-4138. Roll back container resource allocation after resource increase token expires. Contributed by Meng Ding 2016-02-11 10:06:27 +08:00
=
b706cbc1bc YARN-4420. Add REST API for List Reservations (Sean Po via curino) 2016-02-10 10:19:26 -08:00
Arun Suresh
5cf5c41a89 YARN-4360. Improve GreedyReservationAgent to support "early" allocations, and performance improvements (curino via asuresh) 2016-02-10 09:11:15 -08:00
Devaraj K
565af873d5 YARN-4667. RM Admin CLI for refreshNodesResources throws NPE when nothing
is configured. Contributed by Naganarasimha G R.
2016-02-08 15:01:54 +05:30
Varun Vasudev
22a2b2231d YARN-4669. Fix logging statements in resource manager's Application class. Contributed by Sidharta Seethana. 2016-02-04 13:51:25 +05:30
Varun Vasudev
308d63f382 YARN-4307. Display blacklisted nodes for AM container in the RM web UI. Contributed by Naganarasimha G R. 2016-02-04 13:32:54 +05:30
Jason Lowe
fa328e2d39 YARN-4594. container-executor fails to remove directory tree when chmod required. Contributed by Colin Patrick McCabe 2016-02-03 17:21:12 +00:00
Varun Vasudev
1adb64e09b YARN-4625. Make ApplicationSubmissionContext and ApplicationSubmissionContextInfo more consistent. Contributed by Xuan Gong. 2016-02-03 16:26:28 +05:30
Varun Vasudev
1cd55e0c17 YARN-4649. Add additional logging to some NM state store operations. Contributed by Sidharta Seethana 2016-02-02 09:25:23 +05:30
Wangda Tan
9875325d5c YARN-4340. Add list API to reservation system. (Sean Po via wangda) 2016-02-02 10:17:33 +08:00
Jason Lowe
ed55950164 YARN-3102. Decommisioned Nodes not listed in Web UI. Contributed by Kuhu Shukla 2016-02-01 23:15:26 +00:00
Rohith Sharma K S
2673cbaf55 YARN-4615. Fix random test failure in TestAbstractYarnScheduler#testResourceRequestRecoveryToTheRightAppAttempt. (Sunil G via rohithsharmaks) 2016-02-01 10:43:56 +05:30
Jason Lowe
772ea7b41b YARN-4428. Redirect RM page to AHS page when AHS turned on and RM page is not available. Contributed by Chang Li 2016-01-29 21:48:54 +00:00
Jian He
f4a57d4a53 YARN-4617. LeafQueue#pendingOrderingPolicy should always use fixed ordering policy instead of using same as active applications ordering policy. Contributed by Rohith Sharma K S 2016-01-29 12:22:23 -08:00
Karthik Kambatla
c9a09d6926 YARN-4647. Make RegisterNodeManagerRequestPBImpl thread-safe. (kasha) 2016-01-29 08:12:54 -08:00
Devaraj K
a277bdc9ed YARN-4411. RMAppAttemptImpl#createApplicationAttemptReport throws
IllegalArgumentException. Contributed by Bibin A Chundatt and yarntime.
2016-01-29 13:51:37 +05:30
Rohith Sharma K S
ac68666803 YARN-4543. Fix random test failure in TestNodeStatusUpdater.testStopReentrant. (Akihiro Suda via rohithsharmaks) 2016-01-29 12:29:54 +05:30
Rohith Sharma K S
09d831c95b YARN-4219. addendum patch to fix javadoc errors 2016-01-29 11:51:47 +05:30
Jian He
7f46636495 YARN-4519. Potential deadlock of CapacityScheduler between decrease container and assign containers. Contributed by Meng Ding 2016-01-28 14:51:00 -08:00
Xuan
9fab22b366 YARN-4219. New levelDB cache storage for timeline v1.5. Contributed by
Li Lu
2016-01-28 14:24:22 -08:00
Jason Lowe
61382ff8fa YARN-4643. Container recovery is broken with delegating container runtime. Contributed by Sidharta Seethana 2016-01-28 18:59:35 +00:00
Rohith Sharma K S
ef343be82b YARN-4633. Fix random test failure in TestRMRestart#testRMRestartAfterPreemption. (Bibin A Chundatt via rohithsharmaks) 2016-01-28 21:53:45 +05:30
Karthik Kambatla
fb238d7e5d YARN-4462. FairScheduler: Disallow preemption from a queue. (Tao Jie via kasha) 2016-01-27 12:29:06 -08:00
Rohith Sharma K S
c01bee0108 YARN-4573. Fix test failure in TestRMAppTransitions#testAppRunningKill and testAppKilledKilled. (Takashi Ohnishi via rohithsharmaks) 2016-01-27 08:23:02 +05:30
Vinod Kumar Vavilapalli (I am also known as @tshooter.)
2085e60a96 YARN-3542. Refactored existing CPU cgroups support to use the newer and integrated ResourceHandler mechanism, and also deprecated the old LCEResourceHandler inteface hierarchy. Contributed by Varun Vasudev. 2016-01-25 16:19:36 -08:00
Jian He
992dd2f783 YARN-4520. Finished app info is unnecessarily persisted in NM state-store if container is acquired but not lunched on this node. Contributed by sandflee 2016-01-25 15:36:14 -08:00
rohithsharmaks
10dc2c0493 YARN-4613. Fix test failure in TestClientRMService#testGetClusterNodes. (Takashi Ohnishi via rohithsharmaks) 2016-01-24 23:36:15 +05:30
rohithsharmaks
99829eb221 YARN-4614. Fix random failure in TestApplicationPriority#testApplicationPriorityAllocationWithChangeInPriority. (Sunil G via rohithsharmaks) 2016-01-23 07:56:57 +05:30
Xuan
618bfd6ac2 YARN-4496. Improve HA ResourceManager Failover detection on the client.
Contributed by Jian He
2016-01-22 18:20:38 -08:00
Jason Lowe
46e5ea81e0 YARN-4598. Invalid event: RESOURCE_FAILED at CONTAINER_CLEANEDUP_AFTER_KILL. Contributed by tangshangwen 2016-01-22 21:55:01 +00:00
rohithsharmaks
d6258b33a7 YARN-4497. RM might fail to restart when recovering apps whose attempts are missing. (Jun Gong via rohithsharmaks) 2016-01-22 20:27:38 +05:30
Akira Ajisaka
8f58f742ae YARN-4605. Spelling mistake in the help message of "yarn applicationattempt" command. Contributed by Weiwei Yang. 2016-01-22 19:43:06 +09:00
Varun Vasudev
b41a7e89d1 YARN-4578. Directories that are mounted in docker containers need to be more restrictive/container-specific. Contributed by Sidharta Seethana. 2016-01-22 14:43:14 +05:30
Rohith Sharma K S
e30668106d YARN-4584. RM startup failure when AM attempts greater than max-attempts. (Bibin A Chundatt via rohithsharmaks) 2016-01-22 10:14:46 +05:30
Jason Lowe
468a53b22f YARN-4610. Reservations continue looking for one app causes other apps to starve. Contributed by Jason Lowe 2016-01-21 18:31:29 +00:00
Karthik Kambatla
4992398aee YARN-4603. FairScheduler should mention user requested queuename in error message when failed in queue ACL check. (Tao Jie via kasha) 2016-01-21 17:40:59 +01:00
Wangda Tan
5ff5f67332 YARN-4557. Fix improper Queues sorting in PartitionedQueueComparator when accessible-node-labels=*. (Naganarasimha G R via wangda) 2016-01-21 11:21:06 +08:00
Xuan
890a2ebd1a YARN-4559. Make leader elector and zk store share the same curator
client. Contributed by Jian He
2016-01-20 14:48:10 -08:00
Wangda Tan
89d1fd5dac HADOOP-12356. Fix computing CPU usage statistics on Windows. (Inigo Goiri via wangda) 2016-01-19 21:27:38 +08:00
Jian He
edc43a9097 YARN-4565. Fix a bug that leads to AM resource limit not hornored when sizeBasedWeight enabled for FairOrderingPolicy. Contributed by Wangda Tan 2016-01-18 21:04:36 -08:00
Wangda Tan
a44ce3f14f YARN-4502. Fix two AM containers get allocated when AM restart. (Vinod Kumar Vavilapalli via wangda) 2016-01-19 09:30:04 +08:00
Wangda Tan
150f5ae034 Revert "YARN-4502. Fix two AM containers get allocated when AM restart. (Vinod Kumar Vavilapalli via wangda)"
This reverts commit 3fe5728563.

Conflicts:
	hadoop-yarn-project/CHANGES.txt
2016-01-19 09:27:36 +08:00
Jian He
f385851141 YARN-4596. SystemMetricPublisher should not swallow error messages from TimelineClient#putEntities. Contributed by Li Lu 2016-01-18 16:58:39 -08:00