Commit Graph

374 Commits

Author SHA1 Message Date
bshashikant
41440ec890 HDDS-2210. ContainerStateMachine should not be marked unhealthy if applyTransaction fails with closed container exception(#1552). 2019-10-02 10:11:01 +05:30
Mukul Kumar Singh
8efd25b33a HDDS-1615. ManagedChannel references are being leaked in ReplicationSupervisor.java. Contributed by Mukul Kumar Singh. (#1547) 2019-10-01 14:31:15 +05:30
bshashikant
98ca07ebed HDDS-2207. Update Ratis to latest snapshot. Contributed by Shashikant Bannerjee. (#1550) 2019-09-30 20:36:27 +05:30
Shashikant Banerjee
a530ac3f50 HDDS-2153. Add a config to tune max pending requests in Ratis leader
Closes #1474
2019-09-30 15:42:04 +02:00
Doroszlai, Attila
9bf7a6e5b2 HDDS-2149. Replace findbugs with spotbugs
Signed-off-by: Anu Engineer <aengineer@apache.org>
2019-09-26 22:56:05 -07:00
Xiaoyu Yao
d072d3304c HDDS-2020. Remove mTLS from Ozone GRPC. Contributed by Xiaoyu Yao.
Signed-off-by: Anu Engineer <aengineer@apache.org>
2019-09-19 22:43:00 -07:00
Elek, Márton
126ef77a81 HDDS-2154. Fix Checkstyle issues (#1475) 2019-09-19 11:30:33 -07:00
Bharat Viswanadham
6d4b20c047 HDDS-2143. Rename classes under package org.apache.hadoop.utils
Closes #1465
2019-09-18 16:51:39 +02:00
bshashikant
7f9073132d HDDS-2117. ContainerStateMachine#writeStateMachineData times out. (#1430) 2019-09-17 16:49:25 +05:30
Doroszlai, Attila
e54977f888 HDDS-2132. TestKeyValueContainer is failing (#1457). 2019-09-17 14:31:33 +05:30
Vivek Ratnavel Subramanian
66bd1681f8 HDDS-2107. Datanodes should retry forever to connect to SCM in an unsecure environment (#1424) 2019-09-16 12:58:16 -07:00
avijayanhwx
8024818d33 HDDS-1786 : Datanodes takeSnapshot should delete previously created snapshots (#1163) 2019-09-13 23:20:31 +05:30
Shashikant Banerjee
fe8cdf0ab8 HDDS-2076. Read fails because the block cannot be located in the container (#1410)
Signed-off-by: Nanda kumar <nanda@apache.org>
2019-09-12 21:18:14 +05:30
Lokesh Jain
c3beeb7761 HDDS-2048: State check during container state transition in datanode should be lock protected (#1375) 2019-09-10 14:14:52 +05:30
Mukul Kumar Singh
d69b811ddd HDDS-2102. HddsVolumeChecker should use java optional in place of Guava optional. Contributed by Mukul Kumar Singh. (#1416) 2019-09-09 14:17:28 -07:00
Shashikant Banerjee
469165e6f2 HDDS-1843. Undetectable corruption after restart of a datanode. Contributed by Shashikant Banerjee(#1364). 2019-09-09 22:43:20 +05:30
Vivek Ratnavel Subramanian
3b9584d12b HDDS-2087. Remove the hard coded config key in ChunkManager (#1409) 2019-09-08 20:44:07 -07:00
Lokesh Jain
6e4cdf89ef HDDS-1561: Mark OPEN containers as QUASI_CLOSED as part of Ratis groupRemove (#1401) 2019-09-06 13:15:49 +05:30
Anu Engineer
acbea8d976 HDDS-1708. Add container scrubber metrics.
Contributed by Hrishikesh Gadre.
2019-09-05 14:33:06 -07:00
Nanda kumar
2b16d5377c HDDS-1898. GrpcReplicationService#download cannot replicate the container. (#1326) 2019-09-05 18:12:36 +05:30
Hrishikesh Gadre
f347c348d8 Add support for checksum verification in data scrubber
Signed-off-by: Anu Engineer <aengineer@apache.org>
2019-09-04 20:31:49 -07:00
avijayanhwx
b53d19a343 HDDS-1783 : Latency metric for applyTransaction in ContainerStateMachine (#1363). 2019-09-03 15:18:50 +05:30
Doroszlai, Attila
a2d083f2c5 HDDS-1413. Attempt to fix TestCloseContainerCommandHandler by adjusting timeouts
Signed-off-by: Anu Engineer <aengineer@apache.org>
2019-08-30 18:17:08 -07:00
Xiaoyu Yao
ec34cee5e3 HDDS-2047. Datanodes fail to come up after 10 retries in a secure env… (#1379) 2019-08-30 09:27:37 -07:00
Márton Elek
371c9eb6a6 Revert "HDDS-1596. Create service endpoint to download configuration from SCM."
This reverts commit c0499bd704.
2019-08-29 09:25:03 +02:00
Doroszlai, Attila
872cdf48a6 HDDS-1941. Unused executor in SimpleContainerDownloader (#1367) 2019-08-28 14:59:49 -07:00
supratimdeka
1407414a52 HDDS-1094. Performance test infrastructure : skip writing user data on Datanode. Contributed by Supratim Deka (#1323) 2019-08-28 10:05:20 -07:00
Márton Elek
c0499bd704 HDDS-1596. Create service endpoint to download configuration from SCM.
Signed-off-by: Anu Engineer <aengineer@apache.org>
2019-08-28 09:13:46 -07:00
bshashikant
5d31a4eff7 HDDS-1753. Datanode unable to find chunk while replication data using ratis. (#1318) 2019-08-28 10:54:44 +05:30
Vivek Ratnavel Subramanian
b1eee8b52e HDDS-1946. CertificateClient should not persist keys/certs to ozone.m… (#1311) 2019-08-27 17:29:27 -07:00
Doroszlai, Attila
0883ce1021 HDDS-2026. Overlapping chunk region cannot be read concurrently
Signed-off-by: Anu Engineer <aengineer@apache.org>
2019-08-27 15:34:58 -07:00
Shashikant Banerjee
66cfa482c4 HDDS-1610. applyTransaction failure should not be lost on restart. Contributed by Shashikant Banerjee(#1226). 2019-08-27 23:38:43 +05:30
Shashikant Banerjee
ce8eb1283a Revert "HDDS-1610. applyTransaction failure should not be lost on restart. Contributed by Shashikant Banerjee."
This reverts commit 62445021d5 as it has unintended changes in DirectoryWithSnapshotFeature class..
2019-08-27 23:23:44 +05:30
Lokesh Jain
4379370fb1 HDDS-1981: Datanode should sync db when container is moved to CLOSED or QUASI_CLOSED state (#1319) 2019-08-27 10:22:03 +05:30
avijayanhwx
28fb4b527a HDDS-2008 : Wrong package for RatisHelper class in hadoop-hdds/common module. (#1333) 2019-08-22 10:57:22 -07:00
Shashikant Banerjee
62445021d5 HDDS-1610. applyTransaction failure should not be lost on restart. Contributed by Shashikant Banerjee. 2019-08-20 14:37:01 +05:30
Aravindan Vijayan
fc229b6490 HDDS-1832 : Improve logging for PipelineActions handling in SCM and datanode. (Change to Error logging)
Signed-off-by: Anu Engineer <aengineer@apache.org>
2019-08-13 22:04:31 -07:00
supratimdeka
f8ea6e1ce1 HDDS-1798. Propagate failure in writeStateMachineData to Ratis. Contributed by Supratim Deka (#1113) 2019-08-05 13:43:41 +05:30
avijayanhwx
6e8c8413ee HDDS-1832 : Improve logging for PipelineActions handling in SCM and datanode. (#1200) 2019-08-02 22:31:00 +05:30
Doroszlai, Attila
946e301731 HDDS-1878. checkstyle error in ContainerStateMachine (#1195) 2019-08-02 22:27:54 +05:30
Lokesh Jain
d4ab9aea6f HDDS-1816: ContainerStateMachine should limit number of pending apply transactions. Adds a config, uses snapshot threshold default value. (#1150) 2019-07-31 13:26:24 +05:30
Eric Yang
d023663e3e HDDS-1833. Moved RefCountedDB stacktrace to log level trace.
Contributed by Siddharth Wagle
2019-07-29 12:05:24 -04:00
Doroszlai, Attila
278390f5bf HDDS-1710. Fix checkstyle errors
Signed-off-by: Anu Engineer <aengineer@apache.org>
2019-07-23 07:56:30 -07:00
Márton Elek
c533b79c32 HDDS-1710. Publish JVM metrics via Hadoop metrics
Signed-off-by: Anu Engineer <aengineer@apache.org>
2019-07-22 16:27:50 -07:00
Doroszlai, Attila
c958eddcf4 HDDS-1811. Prometheus metrics are broken.
Signed-off-by: Anu Engineer <aengineer@apache.org>
2019-07-22 14:12:06 -07:00
Sammi Chen
69a46a95bb HDDS-1713. ReplicationManager fail to find proper node topology based… (#1112) 2019-07-19 15:45:26 -07:00
Shashikant Banerjee
20cf50c6d0 HDDS-1654. Ensure container state on datanode gets synced to disk whennever state change happens. Cotributed by Shashikant Banerjee. (#923) 2019-07-18 17:09:05 +05:30
Lokesh Jain
23e9bebe13 HDDS-1767: ContainerStateMachine should have its own executors for executing applyTransaction calls (#1087) 2019-07-18 14:48:13 +05:30
Nanda kumar
1502e6a22f HDDS-1756. DeleteContainerCommandHandler fails with NPE. Contributed by Nanda kumar(#1095). 2019-07-16 15:06:41 +05:30
Mukul Kumar Singh
0976f6fc30 HDDS-1766. ContainerStateMachine is unable to increment lastAppliedTermIndex. Contributed by Mukul Kumar Singh. (#1072) 2019-07-14 10:53:51 +05:30