Change execution scavenger to call admin delete #3526

yux0 · 2022-10-22T20:05:36Z

What changed?
Change execution scavenger to call admin delete

Why?
Admin delete is more powerful than the workflow API delete. In this case, we know the data is expired, so we should delete it regardless of its state.

How did you test it?
Local test.

Potential risks
No. This is de

Is hotfix candidate?
Yes

dnr · 2022-10-24T18:08:59Z

service/worker/service.go

-func (s *Service) initScanner() {
+func (s *Service) initScanner() error {
+	currentCluster := s.clusterMetadata.GetCurrentClusterName()
+	adminClient, err := s.clientBean.GetRemoteAdminClient(currentCluster)


I think we should add a GetAdminClient to ClientBean to keep it similar to Get[Remote]FrontendClient, since frontend and admin clients work the same way.

Also, I noticed that Get/SetFrontendClient in clientBean.go don't do proper locking.

I plan to include this in the next available patch, so I created an issue to track it: #3532.

Do you plan to fix the locking bug in this PR?

No. I don't see any callers of those setters. I will remove them in another PR.
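For reference, the two points raised in this thread (a `GetAdminClient` accessor that mirrors the frontend one, and getters/setters that lock properly) could look roughly like the sketch below. This is not the actual `ClientBean` code: `AdminClient`, `fakeAdminClient`, `clientBean`, `newClientBean`, and `SetRemoteAdminClient` are hypothetical stand-ins, and only the names `GetAdminClient` and `GetRemoteAdminClient` come from the discussion above.

```go
package main

import (
	"fmt"
	"sync"
)

// AdminClient is a hypothetical stand-in for the real admin service client.
type AdminClient interface {
	Name() string
}

// fakeAdminClient is a trivial implementation used only for illustration.
type fakeAdminClient struct{ cluster string }

func (c fakeAdminClient) Name() string { return "admin@" + c.cluster }

// clientBean is an illustrative sketch, not Temporal's actual ClientBean.
type clientBean struct {
	mu             sync.RWMutex
	currentCluster string
	adminClients   map[string]AdminClient
}

func newClientBean(currentCluster string) *clientBean {
	return &clientBean{
		currentCluster: currentCluster,
		adminClients:   make(map[string]AdminClient),
	}
}

// SetRemoteAdminClient (hypothetical) takes the write lock before mutating the map.
func (b *clientBean) SetRemoteAdminClient(cluster string, c AdminClient) {
	b.mu.Lock()
	defer b.mu.Unlock()
	b.adminClients[cluster] = c
}

// GetRemoteAdminClient takes the read lock so it is safe against concurrent setters.
func (b *clientBean) GetRemoteAdminClient(cluster string) (AdminClient, error) {
	b.mu.RLock()
	defer b.mu.RUnlock()
	c, ok := b.adminClients[cluster]
	if !ok {
		return nil, fmt.Errorf("unknown cluster name: %q", cluster)
	}
	return c, nil
}

// GetAdminClient is the suggested convenience accessor for the current cluster.
func (b *clientBean) GetAdminClient() (AdminClient, error) {
	return b.GetRemoteAdminClient(b.currentCluster)
}

func main() {
	bean := newClientBean("active")
	bean.SetRemoteAdminClient("active", fakeAdminClient{cluster: "active"})
	c, err := bean.GetAdminClient()
	if err != nil {
		panic(err)
	}
	fmt.Println(c.Name())
}
```

With an accessor like this, `initScanner` would not need to look up the current cluster name itself before asking for the admin client.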

yycptt · 2022-10-26T18:45:00Z

service/worker/scanner/executions/scavenger.go

-	executorPoolSize         = 4
 	executorPollInterval     = time.Minute
-	executorMaxDeferredTasks = 10000


So we are increasing the number of workers and the task buffer size? Are they limiting the speed of the scavenger?

I think the pool size is the bottleneck. The larger max deferred task limit will allow it to scan clusters with a large number of shards.
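To illustrate how the two knobs interact, here is a minimal sketch of a fixed-size worker pool in front of a bounded task buffer. It is a simplified model, not the scavenger's real executor: `runPool` and its parameters are hypothetical, and the real executor may defer or retry tasks rather than reject them. The pool size caps how many tasks run concurrently, while the buffer size caps how many tasks (for example, one per shard) can be queued at once.

```go
package main

import (
	"fmt"
	"sync"
	"sync/atomic"
)

// runPool queues up to maxDeferred tasks, then drains the queue with poolSize
// worker goroutines. It returns how many tasks ran and how many were rejected
// because the buffer was already full. Hypothetical helper for illustration.
func runPool(poolSize, maxDeferred int, tasks []func()) (ran, rejected int) {
	queue := make(chan func(), maxDeferred)

	// Enqueue before starting the workers so that the rejection count is deterministic.
	for _, t := range tasks {
		select {
		case queue <- t:
		default:
			rejected++ // buffer full: this task cannot be accepted
		}
	}
	close(queue)

	var wg sync.WaitGroup
	var done int64
	for i := 0; i < poolSize; i++ {
		wg.Add(1)
		go func() {
			defer wg.Done()
			for t := range queue {
				t()
				atomic.AddInt64(&done, 1)
			}
		}()
	}
	wg.Wait()
	return int(done), rejected
}

func main() {
	tasks := make([]func(), 5)
	for i := range tasks {
		tasks[i] = func() {}
	}
	ran, rejected := runPool(2, 3, tasks)
	fmt.Printf("ran=%d rejected=%d\n", ran, rejected)
}
```

In this model, a cluster with more shards than the buffer can hold would have some shard tasks turned away, which is the case the larger limit is meant to avoid.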

yycptt · 2022-10-26T18:47:46Z

service/worker/scanner/executions/task.go

+			case *serviceerror.NotFound,
+				*serviceerror.NamespaceNotFound:
+				t.logger.Error("Garbage data in DB after namespace is deleted", tag.WorkflowNamespaceID(executionInfo.GetNamespaceId()))
+				// We cannot do much in this case. It just ignores this error.


hmmm, looks like we still need a way to clean up in this case. Admin DeleteWorkflowExecution should probably take in namespaceID instead of namespaceName?

Maybe create a task for this?

yycptt · 2022-10-26T18:51:42Z

service/worker/service.go

-func (s *Service) initScanner() {
+func (s *Service) initScanner() error {
+	currentCluster := s.clusterMetadata.GetCurrentClusterName()
+	adminClient, err := s.clientBean.GetRemoteAdminClient(currentCluster)


Do you plan to fix the locking bug in this PR?

* Change execution scavenger to call admin delete

Change execution scavenger to call admin delete

7cb0e6a

yux0 requested a review from a team as a code owner October 22, 2022 20:05

Clean up code

a1055d7

dnr reviewed Oct 24, 2022


fix test

68d0701

yux0 added the release/1.18.4 label Oct 25, 2022

Update execution scavenger worker count

07b8c5f

yycptt reviewed Oct 26, 2022


yycptt approved these changes Oct 26, 2022


Merge branch 'master' into admin-delete

70b338a

yux0 merged commit 119478a into temporalio:master Oct 26, 2022

yux0 deleted the admin-delete branch October 26, 2022 19:55

dnr pushed a commit that referenced this pull request Oct 31, 2022

Change execution scavenger to call admin delete (#3526)

69e2bb9

* Change execution scavenger to call admin delete
