close
Skip to content

[hotfix][client] Retry to initial cluster when encounter TimeoutException#2055

Merged
wuchong merged 1 commit into
apache:mainfrom
swuferhong:init-hotfix
Nov 30, 2025
Merged

[hotfix][client] Retry to initial cluster when encounter TimeoutException#2055
wuchong merged 1 commit into
apache:mainfrom
swuferhong:init-hotfix

Conversation

@swuferhong
Copy link
Copy Markdown
Contributor

@swuferhong swuferhong commented Nov 30, 2025

Purpose

Linked issue: close #xxx

java.lang.IllegalStateException: Failed to initialize fluss client connection to bootstrap servers: [xxx..com/xxx:80]. 
Reason: null
	at org.apache.fluss.client.metadata.MetadataUpdater.initializeCluster(MetadataUpdater.java:319)
	at org.apache.fluss.client.metadata.MetadataUpdater.<init>(MetadataUpdater.java:72)
	at org.apache.fluss.client.FlussConnection.<init>(FlussConnection.java:85)
	at org.apache.fluss.client.ConnectionFactory.createConnection(ConnectionFactory.java:66)
	at org.apache.fluss.flink.source.reader.FlinkSourceSplitReader.<init>(FlinkSourceSplitReader.java:127)
	at org.apache.fluss.flink.source.reader.FlinkSourceReader.lambda$new$0(FlinkSourceReader.java:69)
	at org.apache.flink.connector.base.source.reader.fetcher.SplitFetcherManager.createSplitFetcher(SplitFetcherManager.java:196)
	at org.apache.flink.connector.base.source.reader.fetcher.SingleThreadFetcherManager.addSplits(SingleThreadFetcherManager.java:107)
	at org.apache.flink.connector.base.source.reader.SourceReaderBase.addSplits(SourceReaderBase.java:258)
	at org.apache.flink.streaming.api.operators.SourceOperator.open(SourceOperator.java:380)
	at org.apache.flink.streaming.runtime.tasks.RegularOperatorChain.initializeStateAndOpenOperators(RegularOperatorChain.java:107)
	at org.apache.flink.streaming.runtime.tasks.StreamTask.restoreGates(StreamTask.java:842)
	at org.apache.flink.streaming.runtime.tasks.StreamTaskActionExecutor$1.call(StreamTaskActionExecutor.java:55)
	at org.apache.flink.streaming.runtime.tasks.StreamTask.restoreInternal(StreamTask.java:789)
	at org.apache.flink.streaming.runtime.tasks.StreamTask.restore(StreamTask.java:746)
	at org.apache.flink.runtime.taskmanager.Task.runWithSystemExitMonitoring(Task.java:968)
	at org.apache.flink.runtime.taskmanager.Task.restoreAndInvoke(Task.java:937)
	at org.apache.flink.runtime.taskmanager.Task.doRun(Task.java:754)
	at org.apache.flink.runtime.taskmanager.Task.run(Task.java:569)
	at java.lang.Thread.run(Thread.java:879)
Caused by: java.util.concurrent.TimeoutException
	at java.util.concurrent.CompletableFuture.timedGet(CompletableFuture.java:1784)
	at java.util.concurrent.CompletableFuture.get(CompletableFuture.java:1928)
	at org.apache.fluss.client.utils.MetadataUtils.sendMetadataRequestAndRebuildCluster(MetadataUtils.java:163)
	at org.apache.fluss.client.utils.MetadataUtils.sendMetadataRequestAndRebuildCluster(MetadataUtils.java:68)
	at org.apache.fluss.client.metadata.MetadataUpdater.tryToInitializeCluster(MetadataUpdater.java:372)
	at org.apache.fluss.client.metadata.MetadataUpdater.tryToInitializeClusterWithRetries(MetadataUpdater.java:335)
	at org.apache.fluss.client.metadata.MetadataUpdater.initializeCluster(MetadataUpdater.java:292)
	... 19 more

Brief change log

Tests

API and Format

Documentation

@swuferhong swuferhong changed the title [hotfix][client] Retry to initial cluster when encounter TimeoutExcep… [hotfix][client] Retry to initial cluster when encounter TimeoutException Nov 30, 2025
@wuchong wuchong merged commit b7203a7 into apache:main Nov 30, 2025
5 checks passed
@swuferhong swuferhong deleted the init-hotfix branch December 18, 2025 11:59
Ugbot pushed a commit to Ugbot/fluss that referenced this pull request Apr 26, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants