
Add LRU eviction mechanisms for streamed file chunks #11304

Merged
merged 9 commits into from
Oct 13, 2023

Conversation

rtibbles
Member

@rtibbles rtibbles commented Sep 26, 2023

Summary

  • Adds a utility to get stats for all ChunkedFile objects in the file system
  • Adds ability to evict a certain number of bytes of the least recently used files (but will always delete the entire ChunkedFile rather than trying to pick and choose chunks)
  • Fixes a small bug in the StorageCalculation object that was double counting incomplete_downloads
  • Adds additional logic to content request processing to attempt to evict files from the streamed file cache when we run out of space
  • Adds capability to the new utility class to limit the total size of files to a set maximum
  • Does a minor refactor to put all core scheduled tasks into a single plugin that always gets invoked (to prevent the Android app continually having to keep this up to date: https://github.com/learningequality/kolibri-installer-android/blob/develop/src/kolibri_tasks.py)
  • Adds a new option to limit the size of the streamed files cache, defaults to 500MB
  • Adds a scheduled task that runs daily to limit the files cache to this maximum
  • Adds a slight optimization to prevent os.walk from descending into the chunked file directories at first
  • Updates the size calculations to include the files from the diskcache folder
  • Updates the scheduled task to run hourly, rather than daily

References

Fixes #9389

Reviewer guidance

Any concerns about the new option? Should the default value be lower?

I haven't benchmarked the os.walk performance on a large installed system, so I'm not sure how slow it would get; there may be room for optimization there.

This also ignores the small diskcache that is used inside each ChunkedFile when counting how much space is used - should I include that as well?


Testing checklist

  • Contributor has fully tested the PR manually
  • If there are any front-end changes, before/after screenshots are included
  • Critical user journeys are covered by Gherkin stories
  • Critical and brittle code paths are covered by unit tests

PR process

  • PR has the correct target branch and milestone
  • PR has 'needs review' or 'work-in-progress' label
  • If PR is ready for review, a reviewer has been added. (Don't use 'Assignees')
  • If this is an important user-facing change, PR or related issue has a 'changelog' label
  • If this includes an internal dependency change, a link to the diff is provided

Reviewer checklist

  • Automated test coverage is satisfactory
  • PR is fully functional
  • PR has been tested for accessibility regressions
  • External dependency files were updated if necessary (yarn and pip)
  • Documentation is updated
  • Contributor is in AUTHORS.md

@rtibbles
Member Author

Looks like my tests are assuming certain things about the file system (and I forgot to update the tests for the small tweak to server.py).

@rtibbles
Member Author

OK, hopefully I've fixed the test issues now - but I wouldn't bet against the Github Actions environment still doing something a little weird, possibly!

Member

@nucleogenesis nucleogenesis left a comment


This also ignores the small diskcache that is used inside each ChunkedFile for the purposes of counting how much is used - should I include this instead?

Does this mean we'll always be undercounting by some handful of bytes because the diskcache isn't accounted for? Looks like diskcache has a mechanism for eviction that defaults to "least recently stored", but maybe we can just set it to LRU and not have to worry about it beyond that?
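For illustration, the distinction drawn here - evicting by insertion order ("least recently stored") versus by access order (LRU) - can be sketched with a stdlib-only cache. TinyCache is a hypothetical illustration, not a diskcache or Kolibri API:

```python
from collections import OrderedDict


class TinyCache:
    """Size-capped cache supporting two eviction policies.

    policy="stored": evict the oldest insertion (least recently stored).
    policy="used": evict the least recently accessed entry (LRU).
    """

    def __init__(self, maxsize, policy="used"):
        self.maxsize = maxsize
        self.policy = policy
        self._data = OrderedDict()

    def get(self, key):
        value = self._data[key]
        if self.policy == "used":
            # Reads refresh recency only under LRU.
            self._data.move_to_end(key)
        return value

    def set(self, key, value):
        if key in self._data:
            self._data.move_to_end(key)
        self._data[key] = value
        while len(self._data) > self.maxsize:
            self._data.popitem(last=False)  # evict from the cold end
```

Under "stored", a read does not protect an entry from eviction; under "used", it does - which is why the policy choice matters for a streaming cache.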

@rtibbles
Member Author

Does this mean we'll always be undercounting by some handful of bytes because the diskcache isn't accounted for?

Yes - all we are using diskcache for is to cache a few things about the file: the total file size, and other relevant information we may have previously received in the HEAD request. We are also using it to coordinate download locks for chunks, so that multiple threads/processes can gain an exclusive lock to download a specific chunk of a file (and hence prevent duplication). Importantly, these diskcache instances are created on a per-ChunkedFile basis, so the eviction mechanisms aren't terribly useful to us for these purposes.
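A single-process sketch of that per-chunk locking idea (the real implementation uses per-ChunkedFile diskcache instances so locks also coordinate across processes; ChunkLocks and its methods are illustrative, not Kolibri APIs):

```python
import threading


class ChunkLocks:
    """One lock per chunk index, so concurrent downloaders of the same
    file never fetch the same chunk twice (illustrative sketch only)."""

    def __init__(self):
        self._guard = threading.Lock()  # protects the lock registry
        self._locks = {}
        self.downloaded = set()

    def download_chunk(self, index, fetch):
        # Atomically get-or-create the lock for this chunk.
        with self._guard:
            lock = self._locks.setdefault(index, threading.Lock())
        with lock:
            if index in self.downloaded:
                return False  # another thread already fetched this chunk
            fetch(index)
            self.downloaded.add(index)
            return True
```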

I think I probably should just go and count the diskcache file size into the total size, as it will give us a more accurate count of how much space we are freeing up by evicting the file.

Include diskcache directory in file size counts.
@rtibbles
Member Author

Have updated this to include the diskcache in the overall file size.

Comment on lines +132 to +134
for dirpath, _, filenames in os.walk(chunked_file_dir):
    for file in filenames:
        file_path = os.path.join(dirpath, file)
Member


Curious how this is accounting for the diskcache differently?

Member Author


It's now doing an os.walk of the entire ChunkedFile dir (including the directory that diskcache is using) so it's enumerating all the files.

Previously, it was only listing the files in the top directory, which was all the chunk files, plus the directory used for diskcache (which it was ignoring because it was a directory).

Member

@jredrejo jredrejo left a comment


I've checked the code and tested it, and everything works correctly.
However, I think this implementation could be problematic on servers with a small disk (thinking for example of Raspberry Pi servers running from an SD card). The scheduled task to clean the cache might not be enough to reclaim all the space cached by users remotely browsing Studio.

I think we should either provide an option to disable this caching, or at least run the cleaning task more frequently. Given that the cache has no limit on how much it can grow on disk between cleanups, a 24-hour interval could be too long in some cases.
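The cleanup the scheduled task performs (cap the cache at a maximum, evicting least recently used entries first) can be sketched as follows; limit_cache_size is a hypothetical helper, not the Kolibri task itself:

```python
import shutil


def limit_cache_size(stats, max_bytes):
    """Evict least-recently-used entries until the total fits in max_bytes.

    ``stats`` is a list of (path, size, last_access) tuples, e.g. from a
    directory scan; returns the paths that were evicted.
    """
    total = sum(size for _, size, _ in stats)
    evicted = []
    # Oldest access time first: these are the first candidates to go.
    for path, size, _ in sorted(stats, key=lambda s: s[2]):
        if total <= max_bytes:
            break
        shutil.rmtree(path, ignore_errors=True)
        total -= size
        evicted.append(path)
    return evicted
```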

@rtibbles
Member Author

Have updated now to run the task hourly.

@rtibbles rtibbles merged commit ef9a11b into learningequality:release-v0.16.x Oct 13, 2023
@rtibbles rtibbles deleted the stale_chunks branch October 13, 2023 14:34
Labels
DEV: backend (Python, databases, networking, filesystem...) · SIZE: medium · TODO: needs review (waiting for review)
Development

Successfully merging this pull request may close these issues.

Implement an LRU cache mechanism for cached remotely streamed content
3 participants