From d532cb0e10cbb12ec4e1ac86022559e838e0bd73 Mon Sep 17 00:00:00 2001 From: zeekling Date: Mon, 21 Aug 2023 00:00:53 +0800 Subject: [PATCH 1/3] =?UTF-8?q?checkpoint=E7=9B=B8=E5=85=B3?= MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit --- basic/checkpoint.md | 76 +++++++++++++++++++++++++++++++++++++++++++++ 1 file changed, 76 insertions(+) create mode 100644 basic/checkpoint.md diff --git a/basic/checkpoint.md b/basic/checkpoint.md new file mode 100644 index 0000000..ccb149a --- /dev/null +++ b/basic/checkpoint.md @@ -0,0 +1,76 @@ + +# 常见报错 + +## The maximum number of queued checkpoint requests exceeded + +未完成的Checkpoint排队超过了1000个。需要查看作业是否存在被压等。一般情况下作业被压会导致checkpoint失败。 + + +## Periodic checkpoint scheduler is shut down + + + +## The minimum time between checkpoints is still pending + + +## Not all required tasks are currently running + + +## An Exception occurred while triggering the checkpoint. + + +## Asynchronous task checkpoint failed. + + +## The checkpoint was aborted due to exception of other subtasks sharing the ChannelState file + + +## Checkpoint expired before completing + + +## Checkpoint has been subsumed + + +## Checkpoint was declined + + +## Checkpoint was declined (tasks not ready) + + +## Checkpoint was declined (task is closing) + + +## Checkpoint was canceled because a barrier from newer checkpoint was received + + +## Task received cancellation from one of its inputs + + +## Checkpoint was declined because one input stream is finished + + +## CheckpointCoordinator shutdown + + +## Checkpoint Coordinator is suspending + + +## FailoverRegion is restarting + + +## Task has failed + + +## Task local checkpoint failure + + +## Unknown task for the checkpoint to notify + + +## Failure to finalize checkpoint + + +## Trigger checkpoint failure + + + -- 2.45.2 From 2c717c8768d9778417ce176b034453b444a8424c Mon Sep 17 00:00:00 2001 From: zeekling Date: Fri, 25 Aug 2023 23:43:08 +0800 Subject: [PATCH 2/3] =?UTF-8?q?=E6=9B=B4=E6=96=B0=E7=9B=AE=E5=BD=95?= MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit --- basic/README.md | 5 +++-- 1 file changed, 3 insertions(+), 2 deletions(-) diff --git a/basic/README.md b/basic/README.md index d56ee3a..d8f9ad3 100644 --- a/basic/README.md +++ b/basic/README.md @@ -7,6 +7,7 @@ - [slot相关](./slot相关.md) - [Flink基本架构](./Flink基本架构.md) - [旁路输出](./旁路输出.md) - - +- [BlobServer](./blobServer.md) +- [广播](./broadcast.md) +- [Checkpoint](./checkpoint.md) -- 2.45.2 From 274f00381257692f62a501b9526c6c864849499c Mon Sep 17 00:00:00 2001 From: zeekling Date: Sat, 26 Aug 2023 00:11:18 +0800 Subject: [PATCH 3/3] =?UTF-8?q?checkpoint=E7=9B=B8=E5=85=B3?= MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit --- basic/checkpoint.md | 3 +++ 1 file changed, 3 insertions(+) diff --git a/basic/checkpoint.md b/basic/checkpoint.md index ccb149a..a17bb24 100644 --- a/basic/checkpoint.md +++ b/basic/checkpoint.md @@ -15,6 +15,9 @@ ## Not all required tasks are currently running +部分算子任务已经完成,但是如果在维表join场景下,flink 1.13版本之前可能无法恢复checkpoint + + ## An Exception occurred while triggering the checkpoint. -- 2.45.2