Fail to recover a partition which was temporarily delegated to the internal broker #4804
Closed
7 tasks done
Is there any progress?
emasab added a commit that referenced this issue on Dec 14, 2024
… leader epoch (#4901)
Failing tests including for #4796 and #4804
Closes #4796 and #4804
CHANGELOG
Fix for the correct expected RPC code in test 0139
Apply same fix to metadata update operation too
Don't change rktp state to active when there's no leader but wait it's available to validate it
Comment about excluded -1 value
Description
When a partition is absent from metadata it is delegated to the internal broker `:0/internal`. If the partition comes back later, with the same leader as before, it's no longer delegated back from the internal broker.
This makes the partition unusable, causing producing and consuming from said partition to stall.
The issue is likely a result of the change 6584ed7 in v2.4.0 (PR #4680).
This change requires that the epoch has changed (`leader_epoch > rktp->rktp_leader_epoch`) before calling `rd_kafka_toppar_broker_update()`, which is not the case in this situation. This commit also affects the related testcase 107, which fails.
This related test scenario only seems to pass if the commit 6584ed7 is reverted.
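The scenario above can be replayed with a small standalone model. This is only a sketch under stated assumptions: `toppar_model_t`, `INTERNAL_BROKER_ID`, and all function names below are hypothetical and are not librdkafka code. `on_leader_strict()` mirrors the guard described above, while `on_leader_relaxed()` is one possible way to avoid the stall for illustration; it is not claimed to be the fix applied upstream.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical stand-in for the internal broker ":0/internal". */
#define INTERNAL_BROKER_ID (-1)

/* Simplified, hypothetical model of a partition (not librdkafka's rktp). */
typedef struct {
    int32_t leader_epoch; /* last leader epoch seen in metadata */
    int32_t broker_id;    /* broker the partition is currently delegated to */
} toppar_model_t;

/* Partition is absent from metadata: park it on the internal broker.
 * The stored leader epoch is left untouched. */
void on_partition_absent(toppar_model_t *tp) {
    tp->broker_id = INTERNAL_BROKER_ID;
}

/* Guard as described in the issue: only update the delegation when the
 * reported epoch is strictly newer than the stored one. */
bool on_leader_strict(toppar_model_t *tp, int32_t leader_id,
                      int32_t leader_epoch) {
    if (leader_epoch > tp->leader_epoch) {
        tp->leader_epoch = leader_epoch;
        tp->broker_id = leader_id;
        return true;
    }
    return false; /* same epoch: delegation is left as-is */
}

/* Hypothetical relaxed guard: also re-delegate when the partition is
 * parked on the internal broker and the epoch is unchanged. */
bool on_leader_relaxed(toppar_model_t *tp, int32_t leader_id,
                       int32_t leader_epoch) {
    bool parked = (tp->broker_id == INTERNAL_BROKER_ID);
    if (leader_epoch > tp->leader_epoch ||
        (parked && leader_epoch == tp->leader_epoch)) {
        tp->leader_epoch = leader_epoch;
        tp->broker_id = leader_id;
        return true;
    }
    return false;
}

/* Replays: leader 1 at epoch 5 -> absent from metadata -> leader 1 at
 * epoch 5 again. Returns the broker the partition ends up delegated to. */
int32_t simulate_reappearance(bool relaxed) {
    toppar_model_t tp = { .leader_epoch = 5, .broker_id = 1 };

    on_partition_absent(&tp);
    assert(tp.broker_id == INTERNAL_BROKER_ID);

    if (relaxed)
        on_leader_relaxed(&tp, 1, 5);
    else
        on_leader_strict(&tp, 1, 5);
    return tp.broker_id;
}
```

In this model, `simulate_reappearance(false)` leaves the partition on `INTERNAL_BROKER_ID`, which corresponds to the stall reported here, while `simulate_reappearance(true)` returns it to broker 1.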
Logs where the issue can be seen:
Running test 107:
Checklist
librdkafka version: v2.4.0
Apache Kafka version: 3.6.2
Operating system: SLES
Logs (with debug=.. as necessary) from librdkafka

CC @emasab