Commit Graph

1380 Commits

Author SHA1 Message Date
Vinod Kumar Vavilapalli
d1828d9443 YARN-2779. Fixed ResourceManager to not require delegation tokens for communicating with Timeline Service. Contributed by Zhijie Shen. 2014-10-30 23:16:19 -07:00
Vinod Kumar Vavilapalli
e0233c16eb YARN-2698. Moved some node label APIs to be correctly placed in client protocol. Contributed by Wangda Tan. 2014-10-30 22:59:31 -07:00
Steve Loughran
81fe8e4147 YARN-2677 registry punycoding of usernames doesn't fix all usernames to be DNS-valid (stevel) 2014-10-30 22:32:32 +00:00
Jason Lowe
73e626ad91 YARN-2755. NM fails to clean up usercache_DEL_<timestamp> dirs after YARN-661. Contributed by Siqi Li 2014-10-30 15:10:27 +00:00
Karthik Kambatla
179cab81e0 YARN-2712. TestWorkPreservingRMRestart: Augment FS tests with queue and headroom checks. (Tsuyoshi Ozawa via kasha) 2014-10-30 00:29:07 -07:00
Zhijie Shen
a8c1202220 YARN-2769. Fixed the problem that timeline domain is not set in distributed shell AM when using shell_command on Windows. Contributed by Varun Vasudev. 2014-10-29 12:12:30 -07:00
Karthik Kambatla
782971ae7a YARN-2742. FairSchedulerConfiguration should allow extra spaces between value and unit. (Wei Yan via kasha) 2014-10-29 10:24:57 -07:00
Zhijie Shen
ec63a3ffbd YARN-2747. Fixed the test failure of TestAggregatedLogFormat when native I/O is enabled. Contributed by Xuan Gong. 2014-10-28 22:43:40 -07:00
Jian He
d5e0a09721 YARN-2503. Added node lablels in web UI. Contributed by Wangda Tan 2014-10-28 17:57:54 -07:00
Zhijie Shen
8984e9b177 YARN-2741. Made NM web UI serve logs on the drive other than C: on Windows. Contributed by Craig Welch. 2014-10-28 14:11:19 -07:00
Xuan
69f79bee8b YARN-2758. Update TestApplicationHistoryClientService to use the new generic history store. Contributed by Zhijie Shen 2014-10-28 13:28:42 -07:00
Xuan
0d3e7e2bd6 RN-2279. Add UTs to cover timeline server authentication. Contributed by Zhijie Shen 2014-10-28 11:16:49 -07:00
Karthik Kambatla
ade3727ecb YARN-2760. Remove 'experimental' from FairScheduler docs. (Harsh J via kasha) 2014-10-28 08:49:14 -07:00
Jian He
f6b963fdfc YARN-2502. Changed DistributedShell to support node labels. Contributed by Wangda Tan 2014-10-27 20:13:00 -07:00
Vinod Kumar Vavilapalli
a16d022ca4 YARN-2704. Changed ResourceManager to optionally obtain tokens itself for the sake of localization and log-aggregation for long-running services. Contributed by Jian He. 2014-10-27 15:49:47 -07:00
Jian He
c05b581a55 YARN-2591. Fixed AHSWebServices to return FORBIDDEN(403) if the request user doesn't have access to the history data. Contributed by Zhijie Shen 2014-10-27 10:45:53 -07:00
Xuan
ce1a4419a6 YARN-2726. CapacityScheduler should explicitly log when an accessible label has no capacity. Contributed by Wangda Tan 2014-10-27 10:26:21 -07:00
Zhijie Shen
caecd9fffe YARN-2734. Skipped sub-folders in the local log dir when aggregating logs. Contributed by Xuan Gong. 2014-10-26 12:57:31 -07:00
Vinod Kumar Vavilapalli
0186645505 YARN-2743. Fixed a bug in ResourceManager that was causing RMDelegationToken identifiers to be tampered and thus causing app submission failures in secure mode. Contributed by Jian He. 2014-10-26 11:14:34 -07:00
Xuan
65d95b1a52 YARN-2723. Fix rmadmin -replaceLabelsOnNode does not correctly parse
port. Contributed by Naganarasimha G R
2014-10-25 12:45:14 -07:00
Karthik Kambatla
c51e53d7aa YARN-2183. [YARN-1492] Cleaner service for cache manager. (Chris Trezzo and Sangjin Lee via kasha) 2014-10-25 10:31:06 -07:00
Jian He
f44cf99599 YARN-2314. Disable ContainerManagementProtocolProxy cache by default to prevent creating thousands of threads in a large cluster. Contributed by Jason Lowe 2014-10-24 23:05:16 -07:00
Jian He
5864dd99a4 YARN-1915. Fixed a race condition that client could use the ClientToAMToken to contact with AM before AM actually receives the ClientToAMTokenMasterKey. Contributed by Jason Lowe 2014-10-24 22:47:56 -07:00
Karthik Kambatla
683897fd02 YARN-2713. "RM Home" link in NM should point to one of the RMs in an HA setup. (kasha) 2014-10-24 15:00:12 -07:00
Zhijie Shen
f81dc3f995 YARN-2703. Added logUploadedTime into LogValue for better display. Contributed by Xuan Gong. 2014-10-24 14:10:46 -07:00
Zhijie Shen
e31f0a6558 YARN-2724. Skipped uploading a local log file to HDFS if exception is raised when opening it. Contributed by Xuan Gong. 2014-10-24 11:13:44 -07:00
Zhijie Shen
0f3b6900be YARN-2209. Replaced AM resync/shutdown command with corresponding exceptions and made related MR changes. Contributed by Jian He. 2014-10-23 21:56:03 -07:00
Zhijie Shen
b94b8b30f2 YARN-2732. Fixed syntax error in SecureContainer.apt.vm. Contributed by Jian He. 2014-10-22 18:39:29 -07:00
Jian He
3b12fd6cfb YARN-2198. Remove the need to run NodeManager as privileged account for Windows Secure Container Executor. Contributed by Remus Rusanu 2014-10-22 15:57:46 -07:00
Steve Loughran
90e5ca24fb YARN-2700 TestSecureRMRegistryOperations failing on windows: auth problems 2014-10-22 21:13:52 +01:00
Steve Loughran
85a88649c3 YARN-2692 ktutil test hanging on some machines/ktutil versions (stevel) 2014-10-22 14:35:31 +01:00
Zhijie Shen
7e3b5e6f5c YARN-2721. Suppress NodeExist exception thrown by ZKRMStateStore when it retries creating znode. Contributed by Jian He. 2014-10-21 21:49:17 -07:00
Vinod Kumar Vavilapalli
c0e034336c YARN-2715. Fixed ResourceManager to respect common configurations for proxy users/groups beyond just the YARN level config. Contributed by Zhijie Shen. 2014-10-21 20:09:40 -07:00
Zhijie Shen
b2942762d7 YARN-2709. Made timeline client getDelegationToken API retry if ConnectException happens. Contributed by Li Lu. 2014-10-21 16:06:39 -07:00
cnauroth
6637e3cf95 YARN-2720. Windows: Wildcard classpath variables not expanded against resources contained in archives. Contributed by Craig Welch. 2014-10-21 12:33:21 -07:00
Jason Lowe
6f2028bd15 YARN-90. NodeManager should identify failed disks becoming good again. Contributed by Varun Vasudev 2014-10-21 17:31:13 +00:00
Zhijie Shen
171f2376d2 YARN-2717. Avoided duplicate logging when container logs are not found. Contributed by Xuan Gong. 2014-10-20 22:21:37 -07:00
Jian He
4a78a75228 Missing file for YARN-1879 2014-10-20 21:14:19 -07:00
Jian He
4fa1fb3193 Missing file for YARN-2701 2014-10-20 19:57:45 -07:00
Jian He
2839365f23 YARN-2701. Potential race condition in startLocalizer when using LinuxContainerExecutor. Contributed by Xuan Gong 2014-10-20 18:45:47 -07:00
Zhijie Shen
e90718fa5a YARN-2582. Fixed Log CLI and Web UI for showing aggregated logs of LRS. Contributed Xuan Gong. 2014-10-20 13:38:59 -07:00
Zhijie Shen
89427419a3 YARN-2673. Made timeline client put APIs retry if ConnectException happens. Contributed by Li Lu. 2014-10-20 12:22:17 -07:00
Vinod Kumar Vavilapalli
8256766498 YARN-2504. Enhanced RM Admin CLI to support management of node-labels. Contribyted by Wangda Tan. 2014-10-18 12:07:40 -07:00
Vinod Kumar Vavilapalli
39063cd36f YARN-2676. Enhanced Timeline auth-filter to support proxy users. Contributed by Zhijie Shen. 2014-10-17 22:02:50 -07:00
Vinod Kumar Vavilapalli
e9c66e8fd2 YARN-2705. Fixed bugs in ResourceManager node-label manager that were causing test-failures: added a dummy in-memory labels-manager. Contributed by Wangda Tan. 2014-10-17 18:26:12 -07:00
Jian He
c3de2412eb YARN-1879. Marked Idempotent/AtMostOnce annotations to ApplicationMasterProtocol for RM fail over. Contributed by Tsuyoshi OZAWA 2014-10-17 16:35:27 -07:00
Jian He
a6aa6e42ca YARN-2588. Standby RM fails to transitionToActive if previous transitionToActive failed with ZK exception. Contributed by Rohith Sharmaks 2014-10-17 10:54:24 -07:00
Vinod Kumar Vavilapalli
abae63caf9 YARN-2699. Fixed a bug in CommonNodeLabelsManager that caused tests to fail when using ephemeral ports on NodeIDs. Contributed by Wangda Tan. 2014-10-17 08:58:08 -07:00
Jian He
0fd0ebae64 YARN-2682. Updated WindowsSecureContainerExecutor to not use DefaultContainerExecutor#getFirstApplicationDir and use getWorkingDir() instead. Contributed by Zhihai Xu 2014-10-16 18:14:34 -07:00
Jian He
233d446be1 YARN-2621. Simplify the output when the user doesn't have the access for getDomain(s). Contributed by Zhijie Shen 2014-10-16 16:37:21 -07:00
Steve Loughran
6f43491c03 YARN-2689 TestSecureRMRegistryOperations failing on windows: secure ZK won't start 2014-10-16 14:22:02 -07:00
Vinod Kumar Vavilapalli
b3056c266a YARN-2685. Fixed a bug in CommonNodeLabelsManager that caused wrong resource tracking per label when a host runs multiple node-managers. Contributed by Wangda Tan. 2014-10-15 18:47:26 -07:00
Vinod Kumar Vavilapalli
f2ea555ac6 YARN-2496. Enhanced Capacity Scheduler to have basic support for allocating resources based on node-labels. Contributed by Wangda Tan.
YARN-2500. Ehnaced ResourceManager to support schedulers allocating resources based on node-labels. Contributed by Wangda Tan.
2014-10-15 18:33:06 -07:00
Jian He
0af1a2b5bc YARN-2312. Deprecated old ContainerId#getId API and updated MapReduce to use ContainerId#getContainerId instead. Contributed by Tsuyoshi OZAWA 2014-10-15 15:22:07 -07:00
Zhijie Shen
1220bb72d4 YARN-2656. Made RM web services authentication filter support proxy user. Contributed by Varun Vasudev and Zhijie Shen. 2014-10-14 21:50:46 -07:00
Zhijie Shen
cdce88376a HADOOP-11181. Generalized o.a.h.s.t.d.DelegationTokenManager to handle all sub-classes of AbstractDelegationTokenIdentifier. Contributed by Zhijie Shen. 2014-10-14 11:35:38 -07:00
Karthik Kambatla
cc93e7e683 YARN-2566. DefaultContainerExecutor should pick a working directory randomly. (Zhihai Xu via kasha) 2014-10-13 16:32:01 -07:00
Karthik Kambatla
da709a2eac YARN-2641. Decommission nodes on -refreshNodes instead of next NM-RM heartbeat. (Zhihai Xu via kasha) 2014-10-13 16:23:04 -07:00
Jian He
f9680d9a16 YARN-2308. Changed CapacityScheduler to explicitly throw exception if the queue
to which the apps were submitted is changed across RM restart. Contributed by Craig Welch & Chang Li
2014-10-13 14:09:04 -07:00
Jason Lowe
a56ea01002 YARN-2377. Localization exception stack traces are not passed as diagnostic info. Contributed by Gera Shegalov 2014-10-13 18:31:16 +00:00
Zhijie Shen
4aed2d8e91 YARN-2651. Spun off LogRollingInterval from LogAggregationContext. Contributed by Xuan Gong. 2014-10-13 10:54:09 -07:00
Jason Lowe
344a10ad5e YARN-2667. Fix the release audit warning caused by hadoop-yarn-registry. Contributed by Yi Liu 2014-10-13 15:37:50 +00:00
Steve Loughran
ac64ff77cf YARN-2668 yarn-registry JAR won't link against ZK 3.4.5. (stevel) 2014-10-11 18:34:26 -07:00
Vinod Kumar Vavilapalli
a5ec3d0809 YARN-2501. Enhanced AMRMClient library to support requests against node labels. Contributed by Wangda Tan. 2014-10-10 19:57:39 -07:00
Vinod Kumar Vavilapalli
db7f165319 YARN-2494. Added NodeLabels Manager internal API and implementation. Contributed by Wangda Tan. 2014-10-10 11:44:21 -07:00
Zhijie Shen
cb81bac002 YARN-2583. Modified AggregatedLogDeletionService to be able to delete rolling aggregated logs. Contributed by Xuan Gong. 2014-10-10 00:11:30 -07:00
cnauroth
d3afd730ac YARN-2662. TestCgroupsLCEResourcesHandler leaks file descriptors. Contributed by Chris Nauroth. 2014-10-09 22:44:44 -07:00
Zhijie Shen
e532ed8faa YARN-2617. Fixed ApplicationSubmissionContext to still set resource for backward compatibility. Contributed by Wangda Tan. 2014-10-09 17:57:50 -07:00
Karthik Kambatla
4f426fe223 YARN-2180. [YARN-1492] In-memory backing store for cache manager. (Chris Trezzo via kasha) 2014-10-09 17:16:06 -07:00
Vinod Kumar Vavilapalli
596702a025 YARN-2544. Added admin-API objects for using node-labels. Contributed by Wangda Tan. 2014-10-09 15:46:24 -07:00
Zhijie Shen
1d4612f5ad YARN-2629. Made the distributed shell use the domain-based timeline ACLs. Contributed by Zhijie Shen. 2014-10-09 12:59:47 -07:00
Vinod Kumar Vavilapalli
180afa2f86 YARN-2493. Added user-APIs for using node-labels. Contributed by Wangda Tan. 2014-10-09 11:26:26 -07:00
Mayank Bansal
df3becf080 YARN-2598 GHS should show N/A instead of null for the inaccessible information (Zhijie Shen via mayank) 2014-10-08 17:27:53 -07:00
Steve Loughran
6a326711aa YARN-913 service registry: YARN-2652 add hadoop-yarn-registry package under hadoop-yarn 2014-10-08 13:02:25 -07:00
Jian He
e16e25ab1b YARN-2649. Fixed TestAMRMRPCNodeUpdates test failure. Contributed by Ming Ma 2014-10-08 10:58:51 -07:00
Jian He
30d56fdbb4 YARN-1857. CapacityScheduler headroom doesn't account for other AM's running. Contributed by Chen He and Craig Welch 2014-10-07 13:45:04 -07:00
Jian He
519e5a7dd2 YARN-2644. Fixed CapacityScheduler to return up-to-date headroom when AM allocates. Contributed by Craig Welch 2014-10-06 15:48:46 -07:00
Jian He
ea26cc0b4a YARN-2615. Changed ClientToAMTokenIdentifier/RM(Timeline)DelegationTokenIdentifier to use protobuf as payload. Contributed by Junping Du 2014-10-06 10:47:43 -07:00
Chris Douglas
c8212bacb1 YARN-1051. Add a system for creating reservations of cluster capacity.
Contributed by Subru Krishnan and Carlo Curino.
2014-10-03 17:09:02 -07:00
subru
a2986234be YARN-2611. Fixing jenkins findbugs warning and TestRMWebServicesCapacitySched for branch YARN-1051. Contributed by Subru Krishnan and Carlo Curino.
(cherry picked from commit c47464aba407d1dafe10be23fe454f0489cc4367)
2014-10-03 15:43:23 -07:00
subru
5e10a13bb4 YARN-2576. Making test patch pass in branch. Contributed by Subru Krishnan and Carlo Curino.
(cherry picked from commit 90ac0be86b898aefec5471db4027554c8e1b310c)
2014-10-03 15:43:13 -07:00
subru
6261f7cc69 YARN-2080. Integrating reservation system with ResourceManager and client-RM protocol. Contributed by Subru Krishnan and Carlo Curino.
(cherry picked from commit 8baeaead8532898163f1006276b731a237b1a559)

Conflicts:
	hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/main/java/org/apache/hadoop/yarn/server/resourcemanager/rmapp/RMAppImpl.java
2014-10-03 15:42:43 -07:00
subru
3418c56bcf YARN-1712. Plan follower that synchronizes the current state of reservation subsystem with the scheduler. Contributed by Subru Krishnan and Carlo Curino.
(cherry picked from commit 169085319b8b76641f8b9f6840a3fef06d221e2b)
2014-10-03 15:42:10 -07:00
carlo curino
b6df0dddcd YARN-1711. Policy to enforce instantaneous and over-time quotas on user reservation. Contributed by Carlo Curino and Subru Krishnan.
(cherry picked from commit c4918cb4cb5a267a8cfd6eace28fcfe7ad6174e8)
2014-10-03 15:42:03 -07:00
carlo curino
f66ffcf832 YARN-1710. Logic to find allocations within a Plan that satisfy user ReservationRequest(s). Contributed by Carlo Curino and Subru Krishnan.
(cherry picked from commit aef7928899b37262773f3dc117157bb746bf8918)
2014-10-03 15:41:57 -07:00
subru
cf4b34282a YARN-1709. In-memory data structures used to track resources over time to enable reservations.
(cherry picked from commit 0d8b2cd88b958b1e602fd4ea4078ef8d4742a7c3)
2014-10-03 15:41:51 -07:00
subru
c9266df404 YARN-1708. Public YARN APIs for creating/updating/deleting reservations.
(cherry picked from commit 3f2e3b275bcf29264a112c5d2f4eae289fe7f52c)
2014-10-03 15:41:39 -07:00
carlo curino
1c6950354f YARN-2475. Logic for responding to capacity drops for the ReservationSystem. Contributed by Carlo Curino and Subru Krishnan.
(cherry picked from commit f83a07f266f2c5e6eead554d8a331ed7e75e10d5)
2014-10-03 15:41:21 -07:00
carlo curino
eb3e40b833 YARN-1707. Introduce APIs to add/remove/resize queues in the CapacityScheduler. Contributed by Carlo Curino and Subru Krishnan
(cherry picked from commit aac47fda7fecda9fc18ade34d633eca895865a70)

Conflicts:
	hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/main/java/org/apache/hadoop/yarn/server/resourcemanager/scheduler/capacity/CapacityScheduler.java
2014-10-03 15:41:02 -07:00
Vinod Kumar Vavilapalli
34cdcaad71 YARN-2468. Enhanced NodeManager to support log handling APIs (YARN-2569) for use by long running services. Contributed by Xuan Gong. 2014-10-03 12:15:40 -07:00
Karthik Kambatla
80d11eb68e YARN-2635. TestRM, TestRMRestart, TestClientToAMTokens should run with both CS and FS. (Wei Yan and kasha via kasha) 2014-10-03 11:49:49 -07:00
Jian He
eb6ce5e97c YARN-2562. Changed ContainerId#toString() to be more readable. Contributed by Tsuyoshi OZAWA 2014-10-03 11:04:02 -07:00
Jian He
054f285526 YARN-2628. Capacity scheduler with DominantResourceCalculator carries out reservation even though slots are free. Contributed by Varun Vasudev 2014-10-02 15:13:33 -07:00
Zhijie Shen
1c93025a1b YARN-2527. Fixed the potential NPE in ApplicationACLsManager and added test cases for it. Contributed by Benoy Antony. 2014-10-02 14:56:13 -07:00
Jason Lowe
29f520052e YARN-2624. Resource Localization fails on a cluster due to existing cache directories. Contributed by Anubhav Dhoot 2014-10-02 17:39:34 +00:00
Karthik Kambatla
5e0b49da9c YARN-2254. TestRMWebServicesAppsModification should run against both CS and FS. (Zhihai Xu via kasha) 2014-10-02 10:15:04 -07:00
Jian He
3ef1cf187f YARN-2617. Fixed NM to not send duplicate container status whose app is not running. Contributed by Jun Gong 2014-10-02 10:04:09 -07:00
junping_du
c7cee9b455 YARN-1979. TestDirectoryCollection fails when the umask is unusual. (Contributed by Vinod Kumar Vavilapalli and Tsuyoshi OZAWA) 2014-10-02 08:02:59 -07:00
Vinod Kumar Vavilapalli
9e40de6af7 YARN-2446. Augmented Timeline service APIs to start taking in domains as a parameter while posting entities and events. Contributed by Zhijie Shen. 2014-10-01 17:52:03 -07:00
junping_du
0708827a93 YARN-2613. Support retry in NMClient for rolling-upgrades. (Contributed by Jian He) 2014-10-01 16:50:30 -07:00