Skip to content
New issue

Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? # to your account

Log around config.FromConnStr to diagnose slow DNS SRV resolution #6255

Merged
merged 3 commits into from
May 19, 2023

Conversation

bbrks
Copy link
Member

@bbrks bbrks commented May 18, 2023

config.FromConnStr performs SRV DNS lookups, which under some circumstances can cause slow startup times for Sync Gateway.

Log around this function to diganose and log at warning when it takes longer than 10 seconds.

Pre-review checklist

  • Removed debug logging (fmt.Print, log.Print, ...)
  • Logging sensitive data? Make sure it's tagged (e.g. base.UD(docID), base.MD(dbName))
  • Updated relevant information in the API specifications (such as endpoint descriptions, schemas, ...) in docs/api

Integration Tests

  • n/a

@bbrks bbrks changed the title Log at debug around config.FromConnStr to diagnose slow DNS resolutio… Log around config.FromConnStr to diagnose slow DNS SRV resolution May 18, 2023
Copy link
Collaborator

@torcolvin torcolvin left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think having slow DNS is relatively common.

But we actually do this in many places, usually at startup or DB startup:

connStrError := agentConfig.FromConnStr(connStr)

And anywhere a cluster connection is established (bootstrap) https://github.com/couchbase/gocb/blob/da582ed180ca768052410105b5e4d4394f80e674/client.go#L105

I think having any logging for this is fine, but it will not be comprehensive.

@@ -41,6 +41,9 @@ const kStatsReportInterval = time.Hour
const kDefaultSlowQueryWarningThreshold = 500 // ms
const KDefaultNumShards = 16

// fromConnStrWarningThreshold determines the amount of time it should take before we warn about parsing a connstr (mostly for DNS SRV resolution)
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I

Suggested change
// fromConnStrWarningThreshold determines the amount of time it should take before we warn about parsing a connstr (mostly for DNS SRV resolution)
// fromConnStrWarningThreshold determines the amount of time it should take before we warn about parsing a connstr (mostly for DNS resolution)

I think there's other DNS resolution and the call happens to check if SRV records are there but it can do other things.

torcolvin
torcolvin previously approved these changes May 18, 2023
Copy link
Collaborator

@torcolvin torcolvin left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I don't think you need to address the comments, I think this would be helpful enough for users and it's not worth putting more time in it.

@bbrks
Copy link
Member Author

bbrks commented May 18, 2023

Didn't see the other method used as it was on a different type - a text search instead of AST search found it :)

@bbrks bbrks merged commit cb6aef7 into master May 19, 2023
@bbrks bbrks deleted the log_around_connstr_parse_for_dns_resolution branch May 19, 2023 12:38
# for free to join this conversation on GitHub. Already have an account? # to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants