Karthik Kambatla
185c3d4de1
YARN-4807. MockAM#waitForState sleep duration is too long. (Yufei Gu via kasha)
2016-04-27 09:43:23 -07:00
Jian He
4beff01354
YARN-4983. JVM and UGI metrics disappear after RM transitioned to standby mode
2016-04-26 21:00:17 -07:00
Arun Suresh
341888a0aa
YARN-4412. Create ClusterMonitor to compute ordered list of preferred NMs for OPPORTUNITIC containers. (asuresh)
2016-04-26 20:12:12 -07:00
Karthik Kambatla
1a3f1482e2
YARN-4795. ContainerMetrics drops records. (Daniel Templeton via kasha)
2016-04-26 06:15:36 -07:00
Karthik Kambatla
4b1dcbbe0c
YARN-1297. FairScheduler: Move some logs to debug and check if debug logging is enabled
2016-04-26 05:10:09 -07:00
Arun Suresh
c282a08f38
YARN-2885. Create AMRMProxy request interceptor and ContainerAllocator to distribute OPPORTUNISTIC containers to appropriate Nodes (asuresh)
...
(cherry picked from commit 2bf025278a318b0452fdc9ece4427b4c42124e39)
2016-04-24 22:38:33 -07:00
Jing Zhao
63e5412f1a
HDFS-9427. HDFS should not default to ephemeral ports. Contributed by Xiaobing Zhou.
2016-04-22 15:14:40 -07:00
Wangda Tan
7cb3a3da96
YARN-4846. Fix random failures for TestCapacitySchedulerPreemption#testPreemptionPolicyShouldRespectAlreadyMarkedKillableContainers. (Bibin A Chundatt via wangda)
2016-04-22 11:40:32 -07:00
Eric Payne
3dce486d88
YARN-4556. TestFifoScheduler.testResourceOverCommit fails. Contributed by Akihiro Suda
2016-04-21 21:16:47 +00:00
Li Lu
7c6339f66a
YARN-4968. A couple of AM retry unit tests need to wait SchedulerApplicationAttempt stopped. (Wangda Tan via gtcarrera9)
2016-04-21 13:25:33 -07:00
Karthik Kambatla
170c4fd4cd
YARN-4784. Fairscheduler: defaultQueueSchedulingPolicy should not accept FIFO. (Yufei Gu via kasha)
2016-04-20 23:58:12 -07:00
Wangda Tan
33fd95a99c
YARN-4890. Unit test intermittent failure: TestNodeLabelContainerAllocation#testQueueUsedCapacitiesUpdate. (Sunil G via wangda)
2016-04-20 17:37:38 -07:00
Karthik Kambatla
c8172f5f14
YARN-2883. Queuing of container requests in the NM. (Konstantinos Karanasos and Arun Suresh via kasha)
2016-04-20 09:55:50 -07:00
Wangda Tan
fdc46bfb37
YARN-4934. Reserved Resource for QueueMetrics needs to be handled correctly in few cases. (Sunil G via wangda)
2016-04-16 22:47:41 -07:00
Jason Lowe
69f3d428d5
YARN-4940. yarn node -list -all failed if RM start with decommissioned node. Contributed by sandflee
2016-04-15 20:36:45 +00:00
Jason Lowe
3150ae8108
YARN-4924. NM recovery race can lead to container not cleaned up. Contributed by sandflee
2016-04-14 19:17:14 +00:00
Robert Kanter
0d9194df00
YARN-4541. Change log message in LocalizedResource#handle() to DEBUG (rchiang via rkanter)
2016-04-13 17:45:18 -07:00
Xuan
e0cb426758
YARN-4886. Add HDFS caller context for EntityGroupFSTimelineStore. Contributed by Li Lu
2016-04-13 10:38:17 -07:00
Naganarasimha
437e9d6475
YARN-4810. NM applicationpage cause internal error 500. Contributed by Bibin A Chundatt.
2016-04-12 17:59:46 +05:30
Vinod Kumar Vavilapalli
44bbc50d91
YARN-4168. Fixed a failing test TestLogAggregationService.testLocalFileDeletionOnDiskFull. Contributed by Takashi Ohnishi.
2016-04-11 12:11:14 -07:00
Jason Lowe
2a5da97f81
Revert "YARN-4311. Removing nodes from include and exclude lists will not remove them from decommissioned nodes list. Contributed by Kuhu Shukla"
...
This reverts commit 1cbcd4a491
.
2016-04-11 15:51:01 +00:00
Junping Du
08ddb3ac6d
YARN-4928. Some yarn.server.timeline.* tests fail on Windows attempting to use a test root path containing a colon. Contributed by Gergely Novák.
2016-04-11 08:48:32 -07:00
Akira Ajisaka
1ff27f9d12
YARN-4630. Remove useless boxing/unboxing code. Contributed by Kousuke Saruta.
2016-04-11 14:55:03 +09:00
Akira Ajisaka
1b78b2ba17
YARN-4938. MiniYarnCluster should not request transitionToActive to RM on non-HA environment. Contributed by Eric Badger.
2016-04-11 01:31:49 +09:00
Karthik Kambatla
ff95fd547b
YARN-4927. TestRMHA#testTransitionedToActiveRefreshFail fails with FairScheduler. (Bibin A Chundatt via kasha)
2016-04-09 10:31:02 -07:00
Wangda Tan
ec06957941
YARN-3215. Respect labels in CapacityScheduler when computing headroom. (Naganarasimha G R via wangda)
2016-04-08 15:33:04 -07:00
Jian He
9cb0c963d2
YARN-4740. AM may not receive the container complete msg when it restarts. Contributed by Jun Gong
2016-04-08 11:20:35 -07:00
Karthik Kambatla
e82f961a39
YARN-4756. Unnecessary wait in Node Status Updater during reboot. (Eric Badger via kasha)
2016-04-07 17:05:29 -07:00
Jian He
93bacda08b
YARN-4769. Add support for CSRF header in the dump capacity scheduler logs and kill app buttons in RM web UI. Contributed by Varun Vasudev
2016-04-06 16:13:47 -07:00
Varun Vasudev
b41e65e5bc
YARN-4906. Capture container start/finish time in container metrics. Contributed by Jian He.
2016-04-06 13:41:33 +05:30
Wangda Tan
21eb428448
YARN-4699. Scheduler UI and REST o/p is not in sync when -replaceLabelsOnNode is used to change label of a node. (Sunil G via wangda)
2016-04-05 16:24:11 -07:00
Junping Du
0005816743
YARN-4916. TestNMProxy.tesNMProxyRPCRetry fails. Contributed by Tibor Kiss.
2016-04-05 09:01:08 -07:00
Junping Du
6be28bcc46
YARN-4893. Fix some intermittent test failures in TestRMAdminService. Contributed by Brahma Reddy Battula.
2016-04-05 06:57:54 -07:00
Jason Lowe
1cbcd4a491
YARN-4311. Removing nodes from include and exclude lists will not remove them from decommissioned nodes list. Contributed by Kuhu Shukla
2016-04-05 13:40:19 +00:00
Rohith Sharma K S
776b549e2a
YARN-4609. RM Nodes list page takes too much time to load. Contributed by Bibin A Chundatt
2016-04-05 14:47:25 +05:30
Rohith Sharma K S
552237d4a3
YARN-4880. Running TestZKRMStateStorePerf with real zookeeper cluster throws NPE. Contributed by Sunil G
2016-04-05 14:26:19 +05:30
naganarasimha
5092c94195
YARN-4746. yarn web services should convert parse failures of appId, appAttemptId and containerId to 400. Contributed by Bibin A Chundatt
2016-04-04 16:25:03 +05:30
Rohith Sharma K S
1e6f92977d
YARN-4607. Pagination support for AppAttempt page TotalOutstandingResource Requests table. Contributed by Bibin A Chundatt
2016-04-04 08:09:29 +05:30
Wangda Tan
12b11e2e68
YARN-4634. Scheduler UI/Metrics need to consider cases like non-queue label mappings. (Sunil G via wangda)
2016-03-31 14:35:18 -07:00
Jian He
0dd9bcab97
YARN-4811. Generate histograms in ContainerMetrics for actual container resource usage
2016-03-31 14:28:13 -07:00
Robert Kanter
7a021471c3
YARN-4639. Remove dead code in TestDelegationTokenRenewer added in YARN-3055 (templedf via rkanter)
2016-03-31 13:09:09 -07:00
Allen Wittenauer
0a74610d1c
HADOOP-11393. Revert HADOOP_PREFIX, go back to HADOOP_HOME (aw)
2016-03-31 07:51:05 -07:00
Jian He
60e4116bf1
YARN-4822. Refactor existing Preemption Policy of CS for easier adding new approach to select preemption candidates. Contributed by Wangda Tan
2016-03-30 12:43:52 -07:00
Wangda Tan
fc055a3cbe
YARN-4865. Track Reserved resources in ResourceUsage and QueueCapacities. (Sunil G via wangda)
2016-03-29 17:07:55 -07:00
Xuan
80182809ae
YARN-4863. AHS Security login should be in serviceInit() instead of serviceStart(). Contributed by Junping Du
2016-03-28 22:17:45 -07:00
Jason Lowe
948b758070
YARN-4773. Log aggregation performs extraneous filesystem operations when rolling log aggregation is disabled. Contributed by Jun Gong
2016-03-28 23:00:56 +00:00
Jian He
524bc3c33a
YARN-998. Keep NM resource updated through dynamic resource config for RM/NM restart. Contributed by Junping Du
2016-03-28 11:12:33 -07:00
Jian He
55ae143923
YARN-4117. End to end unit test with mini YARN cluster for AMRMProxy Service. Contributed by Giovanni Matteo Fumarola
2016-03-27 20:22:12 -07:00
Karthik Kambatla
49ff54c860
YARN-4805. Don't go through all schedulers in ParameterizedTestBase. (kasha)
2016-03-26 21:45:13 -07:00
Arun Suresh
00bebb7e58
YARN-4823. Refactor the nested reservation id field in listReservation to simple string field. (subru via asuresh)
2016-03-25 15:54:38 -07:00
Arun Suresh
d82e797b65
YARN-4825. Remove redundant code in ClientRMService::listReservations. (subru via asuresh)
2016-03-24 09:59:55 -07:00
Allen Wittenauer
b1394d6307
YARN-4850. test-fair-scheduler.xml isn't valid xml (Yufei Gu via aw)
2016-03-24 08:15:58 -07:00
Junping Du
19b645c938
YARN-4820. ResourceManager web redirects in HA mode drops query parameters. Contributed by Varun Vasudev.
2016-03-23 19:34:30 -07:00
Eric Payne
92b7e0d413
YARN-4686. MiniYARNCluster.start() returns before cluster is completely started. Contributed by Eric Badger.
2016-03-18 16:12:47 +00:00
Junping Du
ca8106d2dd
YARN-4785. inconsistent value type of the type field for LeafQueueInfo in response of RM REST API.
2016-03-17 09:04:41 -07:00
Karthik Kambatla
f84af8bd58
YARN-4812. TestFairScheduler#testContinuousScheduling fails intermittently. (kasha)
2016-03-17 05:54:06 -07:00
Wangda Tan
ae14e5d07f
YARN-4108. CapacityScheduler: Improve preemption to only kill containers that would satisfy the incoming request. (Wangda Tan)
...
(cherry picked from commit 7e8c9beb41
)
2016-03-16 17:02:33 -07:00
Wangda Tan
fa7a43529d
Revert "CapacityScheduler: Improve preemption to only kill containers that would satisfy the incoming request. (Wangda Tan)"
...
This reverts commit 7e8c9beb41
.
2016-03-16 17:02:10 -07:00
Wangda Tan
7e8c9beb41
CapacityScheduler: Improve preemption to only kill containers that would satisfy the incoming request. (Wangda Tan)
2016-03-16 16:59:59 -07:00
Karthik Kambatla
3ef5500783
YARN-4560. Make scheduler error checking message more user friendly. (Ray Chiang via kasha)
2016-03-15 23:45:01 -07:00
Robert Kanter
22ca176dfe
TestResourceLocalizationService.testPublicResourceInitializesLocalDir fails Intermittently due to IllegalArgumentException from cleanup (templedf via rkanter)
2016-03-15 10:05:10 -07:00
Karthik Kambatla
20d389ce61
YARN-4719. Add a helper library to maintain node state and allows common queries. (kasha)
2016-03-14 14:19:05 -07:00
Junping Du
f291d82cd4
YARN-4545. Allow YARN distributed shell to use ATS v1.5 APIs. Li Lu via junping_du
2016-03-14 08:28:38 -07:00
Li Lu
017d2c127b
YARN-4696. Improving EntityGroupFSTimelineStore on exception handling, test setup, and concurrency.
...
This commit amends commit d49cfb3504
with a missed test file.
2016-03-10 13:02:28 -08:00
Li Lu
d49cfb3504
YARN-4696. Improving EntityGroupFSTimelineStore on exception handling, test setup, and concurrency. (Steve Loughran via gtcarrera9)
2016-03-10 10:51:55 -08:00
Wangda Tan
0233d4e0ee
YARN-4465. SchedulerUtils#validateRequest for Label check should happen only when nodelabel enabled. (Bibin A Chundatt via wangda)
2016-03-08 14:27:03 -08:00
Jian He
3c33158d1c
YARN-4764. Application submission fails when submitted queue is not available in scheduler xml. Contributed by Bibin A Chundatt
2016-03-08 13:07:57 -08:00
Vinod Kumar Vavilapalli
b2661765a5
YARN-4762. Fixed CgroupHandler's creation and usage to avoid NodeManagers crashing when LinuxContainerExecutor is enabled. (Sidharta Seethana via vinodkv)
2016-03-07 11:08:17 -08:00
Jason Lowe
4163e36c2b
YARN-4760. proxy redirect to history server uses wrong URL. Contributed by Eric Badger
2016-03-07 15:56:33 +00:00
Jason Lowe
059caf9989
YARN-4744. Too many signal to container failure in case of LCE. Contributed by Sidharta Seethana
2016-03-07 15:40:01 +00:00
Varun Vasudev
8ed2e060e8
YARN-4245. Generalize config file handling in container-executor. Contributed by Sidharta Seethana.
2016-03-07 16:18:35 +05:30
Varun Vasudev
e51a8c1056
YARN-4737. Add CSRF filter support in YARN. Contributed by Jonathan Maron.
2016-03-07 15:26:44 +05:30
Zhihai Xu
e1ccc9622b
YARN-4761. NMs reconnecting with changed capabilities can lead to wrong cluster resource calculations on fair scheduler. Contributed by Sangjin Lee
2016-03-06 19:46:09 -08:00
Rohith Sharma K S
19ee185907
YARN-4763. RMApps Page crashes with NPE. (Bibin A Chundatt via rohithsharmaks)
2016-03-05 13:02:57 +05:30
Jian He
5c465df904
YARN-4671. There is no need to acquire CS lock when completing a container. Contributed by Meng Ding
2016-03-01 13:14:12 -08:00
Jian He
d93c22ec27
YARN-4748. ApplicationHistoryManagerOnTimelineStore should not swallow exceptions on generateApplicationReport. Contributed by Li Lu
2016-02-29 18:19:09 -08:00
Karthik Kambatla
9dafaaaf0d
YARN-4704. TestResourceManager#testResourceAllocation() fails when using FairScheduler. (Yufei Gu via kasha)
2016-02-29 16:10:12 -08:00
Haohui Mai
0fa54d45b1
HADOOP-12813. Migrate TestRPC and related codes to rebase on ProtobufRpcEngine. Contributed by Kai Zheng.
2016-02-29 11:41:00 -08:00
Jason Lowe
c58a6d53c5
YARN-4731. container-executor should not follow symlinks in recursive_unlink_children. Contributed by Colin Patrick McCabe
2016-02-29 15:24:35 +00:00
Rohith Sharma K S
e0b14f26f5
YARN-4566. Fix test failure in TestMiniYarnClusterNodeUtilization. (Takashi Ohnishi via rohithsharmaks)
2016-02-29 10:38:14 +08:00
Karthik Kambatla
f9692770a5
YARN-4718. Rename variables in SchedulerNode to reduce ambiguity post YARN-1011. (Inigo Goiri via kasha)
2016-02-28 09:35:59 -08:00
Jason Lowe
6b0f813e89
YARN-4723. NodesListManager$UnknownNodeId ClassCastException. Contributed by Kuhu Shukla
2016-02-26 20:24:50 +00:00
Ming Ma
7f3139e54d
YARN-4720. Skip unnecessary NN operations in log aggregation. (Jun Gong via mingma)
2016-02-26 08:40:05 -08:00
Robert Kanter
d7fdec1e6b
YARN-4579. Allow DefaultContainerExecutor container log directory permissions to be configurable (rchiang via rkanter)
2016-02-25 16:36:38 -08:00
Karthik Kambatla
c684f2b007
YARN-4729. SchedulerApplicationAttempt#getTotalRequiredResources can throw an NPE. (kasha)
2016-02-24 18:33:57 -08:00
Robert Kanter
954dd57043
YARN-4697. NM aggregation thread pool is not bound by limits (haibochen via rkanter)
2016-02-24 15:00:24 -08:00
Sangjin Lee
553b591ba0
YARN-4722. AsyncDispatcher logs redundant event queue sizes (Jason Lowe via sjlee)
2016-02-24 09:29:41 -08:00
Jason Lowe
d284e187b8
YARN-2046. Out of band heartbeats are sent only on container kill and possibly too early. Contributed by Ming Ma
2016-02-23 20:49:09 +00:00
Junping Du
9ed17f181d
YARN-3223. Resource update during NM graceful decommission. Contributed by Brook Zhou.
2016-02-23 03:30:26 -08:00
Tsuyoshi Ozawa
0e12114c9c
YARN-4648. Move preemption related tests from TestFairScheduler to TestFairSchedulerPreemption. Contributed by Kai Sasaki.
2016-02-23 19:50:08 +09:00
Varun Vasudev
140cb5d745
YARN-4709. NMWebServices produces incorrect JSON for containers. Contributed by Varun Saxena.
2016-02-23 12:29:25 +05:30
Junping Du
3fab88540f
YARN-4386. refreshNodesGracefully() should send recommission event to active RMNodes only. Contributed by Kuhu Shukla.
2016-02-22 07:04:19 -08:00
Sangjin Lee
7de70680fe
YARN-4690. Skip object allocation in FSAppAttempt#getResourceUsage when possible (Ming Ma via sjlee)
2016-02-17 20:55:21 -08:00
Karthik Kambatla
2ab4c476ed
YARN-4689. FairScheduler: Cleanup preemptContainer to be more readable. (Kai Sasaki via kasha)
2016-02-17 18:16:15 -08:00
Arun Suresh
23f937e3b7
YARN-2575. Create separate ACLs for Reservation create/update/delete/list ops (Sean Po via asuresh)
2016-02-11 10:47:43 -08:00
Varun Vasudev
fa00d3e205
YARN-4655. Log uncaught exceptions/errors in various thread pools in YARN. Contributed by Sidharta Seethana.
2016-02-11 12:06:42 +05:30
Jian He
d16b17b4d2
YARN-4138. Roll back container resource allocation after resource increase token expires. Contributed by Meng Ding
2016-02-11 10:06:27 +08:00
=
b706cbc1bc
YARN-4420. Add REST API for List Reservations (Sean Po via curino)
2016-02-10 10:19:26 -08:00
Arun Suresh
5cf5c41a89
YARN-4360. Improve GreedyReservationAgent to support "early" allocations, and performance improvements (curino via asuresh)
2016-02-10 09:11:15 -08:00
Devaraj K
565af873d5
YARN-4667. RM Admin CLI for refreshNodesResources throws NPE when nothing
...
is configured. Contributed by Naganarasimha G R.
2016-02-08 15:01:54 +05:30
Varun Vasudev
22a2b2231d
YARN-4669. Fix logging statements in resource manager's Application class. Contributed by Sidharta Seethana.
2016-02-04 13:51:25 +05:30
Varun Vasudev
308d63f382
YARN-4307. Display blacklisted nodes for AM container in the RM web UI. Contributed by Naganarasimha G R.
2016-02-04 13:32:54 +05:30
Jason Lowe
fa328e2d39
YARN-4594. container-executor fails to remove directory tree when chmod required. Contributed by Colin Patrick McCabe
2016-02-03 17:21:12 +00:00
Varun Vasudev
1adb64e09b
YARN-4625. Make ApplicationSubmissionContext and ApplicationSubmissionContextInfo more consistent. Contributed by Xuan Gong.
2016-02-03 16:26:28 +05:30
Varun Vasudev
1cd55e0c17
YARN-4649. Add additional logging to some NM state store operations. Contributed by Sidharta Seethana
2016-02-02 09:25:23 +05:30
Wangda Tan
9875325d5c
YARN-4340. Add list API to reservation system. (Sean Po via wangda)
2016-02-02 10:17:33 +08:00
Jason Lowe
ed55950164
YARN-3102. Decommisioned Nodes not listed in Web UI. Contributed by Kuhu Shukla
2016-02-01 23:15:26 +00:00
Rohith Sharma K S
2673cbaf55
YARN-4615. Fix random test failure in TestAbstractYarnScheduler#testResourceRequestRecoveryToTheRightAppAttempt. (Sunil G via rohithsharmaks)
2016-02-01 10:43:56 +05:30
Jason Lowe
772ea7b41b
YARN-4428. Redirect RM page to AHS page when AHS turned on and RM page is not available. Contributed by Chang Li
2016-01-29 21:48:54 +00:00
Jian He
f4a57d4a53
YARN-4617. LeafQueue#pendingOrderingPolicy should always use fixed ordering policy instead of using same as active applications ordering policy. Contributed by Rohith Sharma K S
2016-01-29 12:22:23 -08:00
Karthik Kambatla
c9a09d6926
YARN-4647. Make RegisterNodeManagerRequestPBImpl thread-safe. (kasha)
2016-01-29 08:12:54 -08:00
Devaraj K
a277bdc9ed
YARN-4411. RMAppAttemptImpl#createApplicationAttemptReport throws
...
IllegalArgumentException. Contributed by Bibin A Chundatt and yarntime.
2016-01-29 13:51:37 +05:30
Rohith Sharma K S
ac68666803
YARN-4543. Fix random test failure in TestNodeStatusUpdater.testStopReentrant. (Akihiro Suda via rohithsharmaks)
2016-01-29 12:29:54 +05:30
Rohith Sharma K S
09d831c95b
YARN-4219. addendum patch to fix javadoc errors
2016-01-29 11:51:47 +05:30
Jian He
7f46636495
YARN-4519. Potential deadlock of CapacityScheduler between decrease container and assign containers. Contributed by Meng Ding
2016-01-28 14:51:00 -08:00
Xuan
9fab22b366
YARN-4219. New levelDB cache storage for timeline v1.5. Contributed by
...
Li Lu
2016-01-28 14:24:22 -08:00
Jason Lowe
61382ff8fa
YARN-4643. Container recovery is broken with delegating container runtime. Contributed by Sidharta Seethana
2016-01-28 18:59:35 +00:00
Rohith Sharma K S
ef343be82b
YARN-4633. Fix random test failure in TestRMRestart#testRMRestartAfterPreemption. (Bibin A Chundatt via rohithsharmaks)
2016-01-28 21:53:45 +05:30
Karthik Kambatla
fb238d7e5d
YARN-4462. FairScheduler: Disallow preemption from a queue. (Tao Jie via kasha)
2016-01-27 12:29:06 -08:00
Rohith Sharma K S
c01bee0108
YARN-4573. Fix test failure in TestRMAppTransitions#testAppRunningKill and testAppKilledKilled. (Takashi Ohnishi via rohithsharmaks)
2016-01-27 08:23:02 +05:30
Vinod Kumar Vavilapalli (I am also known as @tshooter.)
2085e60a96
YARN-3542. Refactored existing CPU cgroups support to use the newer and integrated ResourceHandler mechanism, and also deprecated the old LCEResourceHandler inteface hierarchy. Contributed by Varun Vasudev.
2016-01-25 16:19:36 -08:00
Jian He
992dd2f783
YARN-4520. Finished app info is unnecessarily persisted in NM state-store if container is acquired but not lunched on this node. Contributed by sandflee
2016-01-25 15:36:14 -08:00
rohithsharmaks
10dc2c0493
YARN-4613. Fix test failure in TestClientRMService#testGetClusterNodes. (Takashi Ohnishi via rohithsharmaks)
2016-01-24 23:36:15 +05:30
rohithsharmaks
99829eb221
YARN-4614. Fix random failure in TestApplicationPriority#testApplicationPriorityAllocationWithChangeInPriority. (Sunil G via rohithsharmaks)
2016-01-23 07:56:57 +05:30
Xuan
618bfd6ac2
YARN-4496. Improve HA ResourceManager Failover detection on the client.
...
Contributed by Jian He
2016-01-22 18:20:38 -08:00
Jason Lowe
46e5ea81e0
YARN-4598. Invalid event: RESOURCE_FAILED at CONTAINER_CLEANEDUP_AFTER_KILL. Contributed by tangshangwen
2016-01-22 21:55:01 +00:00
rohithsharmaks
d6258b33a7
YARN-4497. RM might fail to restart when recovering apps whose attempts are missing. (Jun Gong via rohithsharmaks)
2016-01-22 20:27:38 +05:30
Akira Ajisaka
8f58f742ae
YARN-4605. Spelling mistake in the help message of "yarn applicationattempt" command. Contributed by Weiwei Yang.
2016-01-22 19:43:06 +09:00
Varun Vasudev
b41a7e89d1
YARN-4578. Directories that are mounted in docker containers need to be more restrictive/container-specific. Contributed by Sidharta Seethana.
2016-01-22 14:43:14 +05:30
Rohith Sharma K S
e30668106d
YARN-4584. RM startup failure when AM attempts greater than max-attempts. (Bibin A Chundatt via rohithsharmaks)
2016-01-22 10:14:46 +05:30
Jason Lowe
468a53b22f
YARN-4610. Reservations continue looking for one app causes other apps to starve. Contributed by Jason Lowe
2016-01-21 18:31:29 +00:00
Karthik Kambatla
4992398aee
YARN-4603. FairScheduler should mention user requested queuename in error message when failed in queue ACL check. (Tao Jie via kasha)
2016-01-21 17:40:59 +01:00
Wangda Tan
5ff5f67332
YARN-4557. Fix improper Queues sorting in PartitionedQueueComparator when accessible-node-labels=*. (Naganarasimha G R via wangda)
2016-01-21 11:21:06 +08:00
Xuan
890a2ebd1a
YARN-4559. Make leader elector and zk store share the same curator
...
client. Contributed by Jian He
2016-01-20 14:48:10 -08:00
Wangda Tan
89d1fd5dac
HADOOP-12356. Fix computing CPU usage statistics on Windows. (Inigo Goiri via wangda)
2016-01-19 21:27:38 +08:00
Jian He
edc43a9097
YARN-4565. Fix a bug that leads to AM resource limit not hornored when sizeBasedWeight enabled for FairOrderingPolicy. Contributed by Wangda Tan
2016-01-18 21:04:36 -08:00
Wangda Tan
a44ce3f14f
YARN-4502. Fix two AM containers get allocated when AM restart. (Vinod Kumar Vavilapalli via wangda)
2016-01-19 09:30:04 +08:00
Wangda Tan
150f5ae034
Revert "YARN-4502. Fix two AM containers get allocated when AM restart. (Vinod Kumar Vavilapalli via wangda)"
...
This reverts commit 3fe5728563
.
Conflicts:
hadoop-yarn-project/CHANGES.txt
2016-01-19 09:27:36 +08:00
Jian He
f385851141
YARN-4596. SystemMetricPublisher should not swallow error messages from TimelineClient#putEntities. Contributed by Li Lu
2016-01-18 16:58:39 -08:00
Karthik Kambatla
d40859fab1
YARN-4526. Make SystemClock singleton so AppSchedulingInfo could use it. (kasha)
2016-01-18 10:58:14 +01:00
Wangda Tan
3fe5728563
YARN-4502. Fix two AM containers get allocated when AM restart. (Vinod Kumar Vavilapalli via wangda)
...
(cherry picked from commit 805a9ed85e
)
2016-01-18 17:06:05 +08:00
Wangda Tan
adf260a728
Revert "YARN-4502. Fix two AM containers get allocated when AM restart. (Vinod Kumar Vavilapalli via wangda)"
...
This reverts commit 805a9ed85e
.
2016-01-18 16:50:45 +08:00
Wangda Tan
b08ecf5c75
YARN-4304. AM max resource configuration per partition to be displayed/updated correctly in UI and in various partition related metrics. (Sunil G via wangda)
2016-01-18 11:11:32 +08:00
Wangda Tan
805a9ed85e
YARN-4502. Fix two AM containers get allocated when AM restart. (Vinod Kumar Vavilapalli via wangda)
2016-01-18 11:04:25 +08:00
Wangda Tan
9523648d57
YARN-4538. QueueMetrics pending cores and memory metrics wrong. (Bibin A Chundatt via wangda)
2016-01-18 10:57:14 +08:00
Junping Du
02f597c5db
YARN-4265. Provide new timeline plugin storage to support fine-grained entity caching. Contributed by Li Lu and Jason Lowe
2016-01-17 17:37:40 -08:00
Colin Patrick Mccabe
b2c155f810
HADOOP-12712. Fix some cmake plugin and native build warnings (cmccabe)
2016-01-15 12:29:18 -08:00
Junping Du
fc6d3a3b23
YARN-4581. AHS writer thread leak makes RM crash while RM is recovering. Contributed by sandflee.
2016-01-15 08:40:56 -08:00
rohithsharmaks
f7736f464f
YARN-4389. Allow application to enable or disable am blacklisting. (Sunil G via rohithsharmaks)
2016-01-15 21:38:26 +05:30
Colin Patrick Mccabe
b1ed28fa77
HADOOP-8887. Use a Maven plugin to build the native code using CMake (cmccabe)
2016-01-14 11:02:34 -08:00
Karthik Kambatla
9d04f26d4c
YARN-3446. FairScheduler headroom calculation should exclude nodes in the blacklist. (Zhihai Xu via kasha)
2016-01-14 08:33:23 -08:00
Varun Vasudev
3ddb92bd30
YARN-4553. Add cgroups support for docker containers. Contributed by Sidharta Seethana.
2016-01-14 14:29:29 +05:30
Karthik Kambatla
321072ba81
YARN-4551. Address the duplication between StatusUpdateWhenHealthy and StatusUpdateWhenUnhealthy transitions. (Sunil G via kasha)
2016-01-13 12:09:34 -08:00
Wangda Tan
c0537bcd2c
YARN-4571. Make app id/name available to the yarn authorizer provider for better auditing. (Jian He via wangda)
2016-01-13 13:18:31 +08:00
Jason Lowe
13de8359a1
YARN-4414. Nodemanager connection errors are retried at multiple levels. Contributed by Chang Li
2016-01-12 15:56:15 +00:00
Akira Ajisaka
da1e3e3c57
YARN-4567. javadoc failing on java 8. Contributed by Steve Loughran. This closes #67 .
2016-01-12 15:12:17 +09:00
Wangda Tan
9e792da014
YARN-4582. Label-related invalid resource request exception should be able to properly handled by application. (Bibin A Chundatt via wangda)
2016-01-12 12:53:31 +08:00
Jian He
5fab4ec31c
Missing file for YARN-4580
2016-01-11 17:00:44 -08:00
Jian He
b8942be888
YARN-4537. Pull out priority comparison from fifocomparator and use compound comparator for FifoOrdering policy. Contributed by Rohith Sharma K S
2016-01-11 16:44:28 -08:00
Jian He
109e528ef5
YARN-4479. Change CS LeafQueue pendingOrderingPolicy to hornor recovered apps. Contributed by Rohith Sharma K S
2016-01-08 15:51:10 -08:00
Xuan
89022f8d4b
YARN-4438. Implement RM leader election with curator. Contributed by Jian He
2016-01-07 14:33:06 -08:00
Akira Ajisaka
52b77577c4
YARN-4544. All the log messages about rolling monitoring interval are shown with WARN level. Contributed by Takashi Ohnishi.
2016-01-08 01:12:46 +09:00
Steve Loughran
07d1cb612c
YARN-4550. Some tests in TestContainerLanch fails on non-english locale environment. (Takashi Ohnishi via stevel)
2016-01-07 14:30:20 +00:00
rohithsharmaks
791c1639ae
YARN-4393. Fix intermittent test failure for TestResourceLocalizationService#testFailedDirsResourceRelease (Varun Saxana via rohithsharmaks)
2016-01-07 09:38:47 +05:30
Junping Du
c1462a67ff
YARN-4546. ResourceManager crash due to scheduling opportunity overflow. Contributed by Jason Lowe.
2016-01-06 05:49:24 -08:00
rohithsharmaks
6da6d87872
YARN-4535. Fix checkstyle error in CapacityScheduler.java (Naganarasimha G R via rohithsharmaks)
2016-01-05 12:09:57 +05:30
Rohith Sharma K S
f9e36dea96
YARN-4530. LocalizedResource trigger a NPE Cause the NodeManager exit. (tangshangwen via rohithsharmaks)
2016-01-04 09:58:13 +05:30
Wangda Tan
4e4b3a8465
YARN-4524. Cleanup AppSchedulingInfo. (Karthik Kambatla via wangda)
...
(cherry picked from commit 05fa852d7567b7590d6b53bbf925f8f424736514)
2015-12-30 15:39:34 -08:00
Wangda Tan
8310b2e9ff
YARN-4522. Queue acl can be checked at app submission. (Jian He via wangda)
2015-12-30 15:30:12 -08:00
Junping Du
223ce323bb
YARN-1382. Remove unusableRMNodesConcurrentSet (never used) in NodeListManager to get rid of memory leak. Contributed by Rohith Sharma K S.
2015-12-30 07:52:07 -08:00
Jian He
5273413411
YARN-3480. Remove attempts that are beyond max-attempt limit from state store. Contributed by Jun Gong
2015-12-29 15:58:39 -08:00
Wangda Tan
561abb9fee
YARN-4315. NaN in Queue percentage for cluster apps page. (Bibin A Chundatt via wangda)
2015-12-29 13:28:00 -08:00
Jian He
d0a22bae9b
YARN-4417. Make RM and Timeline-server REST APIs more consistent. Contributed by Wangda Tan
2015-12-28 15:52:45 -08:00
Gera Shegalov
2c17b81569
YARN-2934. Improve handling of container's stderr. (Naganarasimha G R via gera)
2015-12-24 23:48:05 -08:00
Karthik Kambatla
0af492b4bd
YARN-4156. TestAMRestart#testAMBlacklistPreventsRestartOnSameNode assumes CapacityScheduler. (Anubhav Dhoot via kasha)
2015-12-23 17:52:36 -08:00
Junping Du
882f2f0464
YARN-4234. New put APIs in TimelineClient for ats v1.5. Contributed by Xuan Gong.
2015-12-23 05:26:51 -08:00
rohithsharmaks
8c180a13c8
YARN-4109. Exception on RM scheduler page loading with labels. (Mohammad Shahid Khan via rohithsharmaks)
2015-12-23 09:12:32 +05:30
Arun Suresh
e88422df45
YARN-4477. FairScheduler: Handle condition which can result in an infinite loop in attemptScheduling. (Tao Jie via asuresh)
2015-12-21 22:41:09 -08:00
Wangda Tan
bc038b382c
YARN-4454. NM to nodelabel mapping going wrong after RM restart. (Bibin A Chundatt via wangda)
2015-12-21 11:30:13 -08:00
Uma Mahesh
0f82b5d878
YARN-4480. Clean up some inappropriate imports. (Kai Zheng via umamahesh)
2015-12-19 23:10:13 -08:00
Jian He
85c2466048
YARN-4164. Changed updateApplicationPriority API to return the updated application priority. Contributed by Rohith Sharma K S
2015-12-18 14:13:48 -08:00
Junping Du
1de56b0448
YARN-3226. UI changes for decommissioning node. Contributed by Sunil G.
2015-12-17 15:20:17 -08:00
Vinod Kumar Vavilapalli (I am also known as @tshooter.)
4e7d32c0db
YARN-1856. Added cgroups based memory monitoring for containers as another alternative to custom memory-monitoring. Contributed by Varun Vasudev.
2015-12-17 12:13:03 -08:00
Jason Lowe
91828fef6b
YARN-4461. Redundant nodeLocalityDelay log in LeafQueue. Contributed by Eric Payne
2015-12-16 23:22:31 +00:00
Wangda Tan
9b856d9787
YARN-4416. Deadlock due to synchronised get Methods in AbstractCSQueue. (Naganarasimha G R via wangda)
2015-12-16 13:22:37 -08:00
Wangda Tan
7faa406f27
YARN-4225. Add preemption status to yarn queue -status for capacity scheduler. (Eric Payne via wangda)
2015-12-16 13:19:40 -08:00
Wangda Tan
79c41b1d83
YARN-4293. ResourceUtilization should be a part of yarn node CLI. (Sunil G via wangda)
2015-12-16 13:18:19 -08:00
Junping Du
50bd067e1d
YARN-4452. NPE when submit Unmanaged application. Contributed by Naganarasimha G R.
2015-12-16 10:57:39 -08:00
Zhihai Xu
2aaed10327
YARN-4440. FSAppAttempt#getAllowedLocalityLevelByTime should init the lastScheduler time. Contributed by Lin Yiqun
2015-12-15 00:17:21 -08:00
Xuan
d8a45425eb
YARN-4439. Clarify NMContainerStatus#toString method. Contributed by
...
Jian He
2015-12-14 15:41:28 -08:00
Jian He
915cd6c3f4
YARN-4402. TestNodeManagerShutdown And TestNodeManagerResync fails with bind exception. Contributed by Brahma Reddy Battula
2015-12-14 14:59:01 -08:00
Jian He
1cb3299b48
YARN-4403. (AM/NM/Container)LivelinessMonitor should use monotonic time when calculating period. Contributed by Junping Du
2015-12-14 13:51:23 -08:00
Wangda Tan
07b0fb996a
YARN-4418. AM Resource Limit per partition can be updated to ResourceUsage as well. (Sunil G via wangda)
2015-12-14 11:24:30 -08:00
Wangda Tan
dfcbbddb09
YARN-4309. Add container launch related debug information to container logs when a container fails. (Varun Vasudev via wangda)
2015-12-14 11:13:22 -08:00
Wangda Tan
6cb0af3c39
YARN-3946. Update exact reason as to why a submitted app is in ACCEPTED state to app's diagnostic message. (Naganarasimha G R via wangda)
2015-12-14 10:52:46 -08:00
Arun Suresh
7fb212e5e6
YARN-4358 addendum patch to fix javadoc error
2015-12-12 22:22:55 -08:00
rohithsharmaks
a5e2e1ecb0
YARN-4421. Remove dead code in RmAppImpl.RMAppRecoveredTransition. (Daniel Templeton via rohithsharmaks)
2015-12-09 11:31:51 +05:30
rohithsharmaks
15c3e7ffe3
YARN-4431. Not necessary to do unRegisterNM() if NM get stop due to failed to connect to RM. (Junpin Du via rohithsharmaks)
2015-12-09 10:50:43 +05:30
Wangda Tan
7e4715186d
YARN-4424. Fix deadlock in RMAppImpl. (Jian he via wangda)
2015-12-08 14:25:16 -08:00
Chris Douglas
9f50e13d5d
YARN-4248. Followup patch adding asf-licence exclusions for json test files
2015-12-08 12:08:04 -08:00
=
c25a635459
YARN-4248. REST API for submit/update/delete Reservations. (curino)
2015-12-07 13:33:28 -08:00
Jonathan Eagles
4ff973f96a
YARN-4422. Generic AHS sometimes doesn't show started, node, or logs on App page (Eric Payne via jeagles)
2015-12-07 15:04:48 -06:00
Xuan
4546c7582b
YARN-4392. ApplicationCreatedEvent event time resets after RM
...
restart/failover. Contributed by Naganarasimha G R and Xuan Gong
2015-12-07 12:24:55 -08:00
rohithsharmaks
01a641bc44
YARN-3456. Improve handling of incomplete TimelineEntities. (Varun Saxena via rohithsharmaks)
2015-12-07 15:16:56 +05:30
Steve Loughran
65f395226b
HADOOP-12321. Make JvmPauseMonitor an AbstractService. (Sunil G via Stevel) [includes HDFS-8947 MAPREDUCE-6462 and YARN-4072]
2015-12-06 17:43:35 +00:00
Arun Suresh
742632e346
YARN-4358. Reservation System: Improve relationship between SharingPolicy and ReservationAgent. (Carlo Curino via asuresh)
2015-12-05 21:26:16 -08:00
Jian He
755dda8dd8
YARN-4405. Support node label store in non-appendable file system. Contributed by Wangda Tan
2015-12-03 17:45:31 -08:00
Wangda Tan
a2c3bfc8c1
YARN-4292. ResourceUtilization should be a part of NodeInfo REST API. (Sunil G via wangda)
2015-12-03 14:28:32 -08:00
Jian He
9f77ccad73
YARN-3840. Resource Manager web ui issue when sorting application by id (with application having id > 9999). Contributed by Mohammad Shahid Khan and Varun Saxena
2015-12-03 12:48:50 -08:00
Junping Du
62e9348bc1
YARN-4408. Fix issue that NodeManager still reports negative running containers. Contributed by Robert Kanter.
2015-12-03 06:36:37 -08:00
Jian He
6b9a5beb2b
YARN-4398. Remove unnecessary synchronization in RMStateStore. Contributed by Ning Ding
2015-12-02 11:07:18 -08:00
Tsuyoshi Ozawa
0656d2dc83
YARN-4380. TestResourceLocalizationService.testDownloadingResourcesOnContainerKill fails intermittently. Contributed by Varun Saxena.
2015-11-26 01:10:02 +09:00
Jason Lowe
4ac6799d4a
YARN-4132. Separate configs for nodemanager to resourcemanager connection timeout and retries. Contributed by Chang Li
2015-11-24 22:35:37 +00:00
Tsuyoshi Ozawa
28dfe721b8
YARN-4387. Fix typo in FairScheduler log message. Contributed by Xin Wang.
2015-11-24 19:24:01 +09:00
Karthik Kambatla
52948bb20b
YARN-3980. Plumb resource-utilization info in node heartbeat through to the scheduler. (Inigo Goiri via kasha)
2015-11-24 13:47:17 +05:30
Jian He
8676a118a1
YARN-4349. Support CallerContext in YARN. Contributed by Wangda Tan
2015-11-23 17:19:48 -08:00
Jason Lowe
d36b6e045f
YARN-4344. NMs reconnecting with changed capabilities can lead to wrong cluster resource calculations. Contributed by Varun Vasudev
2015-11-23 20:30:26 +00:00
Arun Suresh
da1016365a
YARN-3454. Add efficient merge operation to RLESparseResourceAllocation (Carlo Curino via asuresh)
2015-11-21 09:59:41 -08:00
Wangda Tan
2346fa3141
YARN-3769. Consider user limit when calculating total pending resource for preemption policy in Capacity Scheduler. (Eric Payne via wangda)
2015-11-20 15:55:50 -08:00
Jason Lowe
060cdcbe5d
YARN-4374. RM capacity scheduler UI rounds user limit factor. Contributed by Chang Li
2015-11-20 23:12:29 +00:00
Xuan
f114e728da
YARN-2859.addendum: fix the remaining issue from the previous patch
2015-11-19 10:08:38 -08:00
Arun Suresh
6a61928fb7
YARN-4184. Remove update reservation state api from state store as its not used by ReservationSystem (Sean Po via asuresh)
2015-11-17 15:50:34 -08:00
Jian He
fcd7888029
Revert "YARN-3840. Resource Manager web ui issue when sorting application by id (with application having id > 9999) Contributed by Mohammad Shahid Khan"
...
This reverts commit 8fbea531d7
.
Conflicts:
hadoop-yarn-project/CHANGES.txt
2015-11-16 20:18:44 -08:00
Junping Du
855d52927b
YARN-4354. Public resource localization fails with NPE. Contributed by Jason Lowe.
2015-11-15 04:43:57 -08:00
Wangda Tan
7f55a18071
YARN-4347. Resource manager fails with Null pointer exception. (Jian He via wangda)
2015-11-12 11:23:40 -08:00
Wangda Tan
796638d9bc
YARN-4287. Capacity Scheduler: Rack Locality improvement (Nathan Roberts via wangda)
2015-11-12 11:09:37 -08:00
Vinod Kumar Vavilapalli (I am also known as @tshooter.)
6351d3fa63
YARN-4183. Reverting the patch to fix behaviour change.
...
Revert "YARN-4183. Enabling generic application history forces every job to get a timeline service delegation token (jeagles)"
This reverts commit c293c58954
.
2015-11-11 10:40:43 -08:00
Jian He
8fbea531d7
YARN-3840. Resource Manager web ui issue when sorting application by id (with application having id > 9999) Contributed by Mohammad Shahid Khan
2015-11-09 10:43:45 -08:00
Tsuyoshi Ozawa
ce31b22739
YARN-4320. TestJobHistoryEventHandler fails as AHS in MiniYarnCluster no longer binds to default port 8188. Contributed by Varun Saxena.
2015-10-30 18:00:20 +09:00
Jian He
e5b1733e04
YARN-4127. RM fail with noAuth error if switched from failover to non-failover. Contributed by Varun Saxena
2015-10-29 15:42:57 -07:00
Jonathan Eagles
c293c58954
YARN-4183. Enabling generic application history forces every job to get a timeline service delegation token (jeagles)
2015-10-29 16:41:10 -05:00
Arun Suresh
58d1df585c
YARN-4310. FairScheduler: Log skipping reservation messages at DEBUG level (asuresh)
2015-10-29 13:42:09 -07:00
Jason Lowe
e2267de207
YARN-2902. Killing a container that is localizing can orphan resources in the DOWNLOADING state. Contributed by Varun Saxena
2015-10-29 16:34:25 +00:00
Rohith Sharma K S
656c8f9527
YARN-4130. Duplicate declaration of ApplicationId in RMAppManager#submitApplication method. (Kai Sasaki via rohithsharmaks)
2015-10-29 12:22:44 +05:30
Xuan
27414dac66
YARN-2859. ApplicationHistoryServer binds to default port 8188 in
...
MiniYARNCluster. Contributed by Vinod Kumar Vavilapalli
2015-10-28 10:32:16 -07:00
Tsuyoshi Ozawa
96677bef00
YARN-3573. MiniMRYarnCluster constructor that starts the timeline server using a boolean should be marked deprecated. Contributed by Brahma Reddy Battula.
2015-10-27 12:35:01 +09:00
Akira Ajisaka
8a68630dd1
YARN-4300. [JDK8] Fix javadoc errors caused by wrong tags. (aajisaka)
2015-10-27 10:28:01 +09:00
Wangda Tan
56e4f6237a
YARN-3216. Max-AM-Resource-Percentage should respect node labels. (Sunil G via wangda)
2015-10-26 16:44:39 -07:00
Wangda Tan
6f606214e7
YARN-4169. Fix racing condition of TestNodeStatusUpdaterForLabels. (Naganarasimha G R via wangda)
2015-10-26 16:36:34 -07:00
Wangda Tan
3cc73773eb
YARN-4285. Display resource usage as percentage of queue and cluster in the RM UI (Varun Vasudev via wangda)
2015-10-26 13:07:39 -07:00
Jason Lowe
33a03af3c3
YARN-4284. condition for AM blacklisting is too narrow. Contributed by Sangjin Lee
2015-10-26 19:53:03 +00:00
Rohith Sharma K S
5acdde4744
YARN-2729. Support script based NodeLabelsProvider Interface in Distributed Node Label Configuration Setup. (Naganarasimha G R via rohithsharmaks)
2015-10-26 15:42:42 +05:30
Tsuyoshi Ozawa
ce60b4fc8b
YARN-3528. Tests with 12345 as hard-coded port break jenkins. Contributed by Brahma Reddy Battula.
2015-10-26 16:45:11 +09:00
Allen Wittenauer
1aa735c188
YARN-3724. Use POSIX nftw(3) instead of fts(3) (Alan Burlison via aw)
2015-10-25 21:43:23 -07:00
Arun Suresh
ab8eb8770c
YARN-3738. Add support for recovery of reserved apps running under dynamic queues (subru via asuresh)
2015-10-24 22:53:10 -07:00
Akira Ajisaka
7781fe1b9e
YARN-4294. [JDK8] Fix javadoc errors caused by wrong reference and illegal tag. (aajisaka)
2015-10-24 11:54:42 +09:00
Jason Lowe
d3a34a4f38
YARN-4041. Slow delegation token renewal can severely prolong RM recovery. Contributed by Sunil G
2015-10-23 20:57:01 +00:00
Ming Ma
934d96a334
YARN-2913. Fair scheduler should have ability to set MaxResourceDefault for each queue. (Siqi Li via mingma)
2015-10-23 08:36:33 -07:00
Jonathan Eagles
f8adeb712d
YARN-4009. CORS support for ResourceManager REST API. ( Varun Vasudev via jeagles)
2015-10-23 10:34:08 -05:00
Junping Du
0fce5f9a49
YARN-4243. Add retry on establishing Zookeeper conenction in EmbeddedElectorService#serviceInit. Contributed by Xuan Gong.
2015-10-22 13:41:09 -07:00