Skip to content
New issue

Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? # to your account

Faster flywire_partner_summary queries #205

Merged
merged 7 commits into from
Aug 14, 2024
Merged

Conversation

jefferis
Copy link
Collaborator

  • chunking eg 20 neurons at a time and sending to cave can give 10x speedups
  • but need to be careful about CAVE row limits
  • awkwardly you can't just increase your offset and get more rows
  • in the process implemented offset/limit arguments for generic CAVE queries
  • ... and the ability to fetch all rows of a query (but this doesn't work for the synapse table!)

* instead of doing one neuron at a time, idea is to make some large chunks
* plan was then to repeat these chunk requests if they overflowed
* however then realised that I can't tell if there's an overflow because some rows are filtered out before returning
* instead I think I should add logic to the basic cave_query to fetch more rows using the offset option
* but this doesn't work for synapse table (since it returns rows in random order)
* some testing is still in order when row limits are exceeded
* otherwise looks concerning
* but probably need to sort results at the end to be sure
@jefferis
Copy link
Collaborator Author

this has been failing to run due to auth issues that are unrelated to content of PR

@jefferis jefferis merged commit 43405c6 into master Aug 14, 2024
0 of 3 checks passed
@jefferis jefferis deleted the feature/faster-cave-query branch August 14, 2024 03:57
# for free to join this conversation on GitHub. Already have an account? # to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant