
Can you consider optimizing the relocation process for superblocks? #376

Open · Johnxjj opened this issue Feb 6, 2020 · 6 comments
Johnxjj commented Feb 6, 2020

The current problem is that once the superblock is used up, a superblock relocation is performed, and this process takes a lot of time, which shows up as a freeze. I have read the issues raised by many people before: they all report occasionally large processing times for individual API calls. I tracked mine down specifically to the superblock relocation process, which re-reads each entry, and does so several times. The smaller LFS_READ_SIZE is, the more reads it takes.
One solution is to allocate a piece of memory, read the superblock data in one go, and parse it from there, instead of reading the entries multiple times. For users, stable timing is more important than memory.
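
A minimal sketch of this idea, assuming the standard lfs_config read callback and a 4096-byte block size; the buffer and helper names here are hypothetical and not part of littlefs:

#include "lfs.h"

// Hypothetical helper: pull an entire metadata block into RAM with one
// bulk read, so entries can then be parsed out of the RAM copy instead
// of issuing many small reads through cfg->read.
static uint8_t block_buf[4096];   // assumes block_size == 4096

static int read_whole_block(const struct lfs_config *cfg, lfs_block_t block) {
    // one read covering the whole block; 4096 is a multiple of read_size
    return cfg->read(cfg, block, 0, block_buf, cfg->block_size);
}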

@e107steved

Stable time is not always more important than memory; the tradeoff will depend on a lot of application-specific things. If this feature were to be added, it would need to be optional. A single chip micro on its own is often RAM-poor.

Maybe in the future some flags could allow selection of the tradeoffs to be applied: code size, speed, RAM usage, etc.


Johnxjj commented Feb 6, 2020

@e107steved You are right! Different application scenarios have different needs.

geky commented Feb 11, 2020

Ah yeah, this issue is caused by a naive check for whether or not we have enough space to expand the superblock.

Explanation:

  • The superblock is a bit special in that every relocation also expands the superblock chain. This is important to avoid wearing out the superblock early.

    Normally the superblock always resides at blocks 0 and 1, but if we're writing to the root directory a lot, this could force the superblock to wear out early. So, if we notice a lot of writing (if a relocation happens), we instead build a linked list of blocks pointing to our real superblock. This creates an exponentially growing level of indirection that avoids the superblock wear problem.

    There is some more info here (though not a lot)
    https://github.com/ARMmbed/littlefs/blob/master/SPEC.md#0x0ff-lfs_type_superblock

    The problem is that sometimes we don't want to expand. What if we only have a couple of blocks left in our filesystem? If we expand we risk running out of space and the extra superblock life is useless.

    So before we expand the superblock, we also check if we have > half the space free. Otherwise we assume it is too risky and don't expand.
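
Roughly, the decision can be sketched like this (a sketch, not the exact code in lfs.c; lfs_fs_size is littlefs's public function that reports the number of blocks currently in use):

#include "lfs.h"
#include <stdbool.h>

// Sketch of the expansion decision: only expand the superblock chain
// if less than half of the filesystem's blocks are in use.
static bool should_expand_superblock(lfs_t *lfs, const struct lfs_config *cfg) {
    lfs_ssize_t used = lfs_fs_size(lfs);   // counts blocks in use
    if (used < 0) {
        return false;   // error while sizing: play it safe, don't expand
    }
    return (lfs_size_t)used < cfg->block_count / 2;
}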

So we check the size of the filesystem before we expand the superblock. The problem is that finding the size of the filesystem right now involves scanning the entire filesystem:
https://github.com/ARMmbed/littlefs/blob/master/lfs.c#L1491
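
For reference, the same kind of block count can be reproduced with the public lfs_fs_traverse API, which visits every block reachable from the filesystem; that is exactly why the cost grows with the amount of data stored (the helper names below are mine, not littlefs's):

#include "lfs.h"

// counting callback for lfs_fs_traverse: bumps a counter for every
// block reachable from the filesystem (metadata pairs, file data, etc.)
static int count_block(void *data, lfs_block_t block) {
    (void)block;
    *(lfs_size_t *)data += 1;
    return 0;
}

// returns the number of blocks in use, or a negative error code;
// the cost is a full scan of the filesystem, which is the slow part
static lfs_ssize_t blocks_in_use(lfs_t *lfs) {
    lfs_size_t count = 0;
    int err = lfs_fs_traverse(lfs, count_block, &count);
    return err ? err : (lfs_ssize_t)count;
}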

That being said, this process should only happen 1 maybe 2 times.

It may be possible to fix this in the short-term by storing the number of free blocks somewhere. Though that would mean we'd end up doing the same filesystem scan, just at mount time? I'm not sure there's an easy fix.

My current plan is to fix this as a part of allocator improvements necessary to fix #75, which has related issues.


Johnxjj commented Mar 16, 2020

@geky I have observed that this time is spent on lines 1441-1483 and lines 1528-1658 of lfs.c. During this time, LFS needs to go to the flash to read the superblock data, more than 1,200 reads in total at 1.5-2 ms each, so the total time adds up. I think there is something wrong with this process: all the reads target the contents of the same superblock, so why not read it in advance and then fetch it from RAM each time? That would greatly increase the speed, just like reading a file, where the entire block of data is first read into RAM and then each access can be served directly from RAM.
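
One way to confirm numbers like these is to wrap the block device read callback and count calls; a sketch, assuming your own lfs_block_read driver and a platform-specific microsecond timer (platform_time_us here is a placeholder name):

#include "lfs.h"
#include <stdint.h>

// placeholders for the real driver read and a microsecond timer
extern int lfs_block_read(const struct lfs_config *c, lfs_block_t block,
                          lfs_off_t off, void *buffer, lfs_size_t size);
extern uint64_t platform_time_us(void);

// simple instrumentation: count read calls and total time spent in them
static uint32_t read_calls;
static uint64_t read_time_us;

static int lfs_block_read_traced(const struct lfs_config *c, lfs_block_t block,
                                 lfs_off_t off, void *buffer, lfs_size_t size) {
    uint64_t start = platform_time_us();
    int err = lfs_block_read(c, block, off, buffer, size);
    read_time_us += platform_time_us() - start;
    read_calls++;
    return err;
}

// point g_cfg.read at lfs_block_read_traced instead of lfs_block_read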


Johnxjj commented Mar 16, 2020

That is, my code never reached block_cycles == 100. Here is my configuration:
g_cfg.read = lfs_block_read;
g_cfg.prog = lfs_block_prog;
g_cfg.erase = lfs_block_erase;
g_cfg.sync = lfs_block_sync;

/* block device configuration */
g_cfg.read_size = LFS_READ_SIZE;              // 128
g_cfg.prog_size = LFS_PROG_SIZE;              // 128
g_cfg.block_size = LFS_FLASH_SECTOR_SIZE;     // 4096
g_cfg.block_count = ullfs_flash_sector_count; // 256
g_cfg.block_cycles = LFS_BLOCK_CYCLES;        // 100
g_cfg.cache_size = LFS_CACHE_SIZE;            // 4096
g_cfg.lookahead_size = LFS_LOOKAHEAD_SIZE;    // 256/8

g_cfg.read_buffer = uclfs_read_buf;           // uclfs_read_buf[4096]
g_cfg.prog_buffer = uclfs_prog_buf;           // uclfs_prog_buf[4096]
g_cfg.lookahead_buffer = ullfs_lookahead_buf; // ullfs_lookahead_buf[32]
g_file_cfg.buffer = uclfs_file_buf;           // uclfs_file_buf[4096]
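
Since the number of flash transactions grows as read_size shrinks, one knob to experiment with (only if the flash driver supports larger accesses, and keeping cache_size a multiple of read_size and prog_size) would be a hypothetical variant like:

g_cfg.read_size = 1024;   // fewer, larger reads per metadata scan
g_cfg.prog_size = 1024;
g_cfg.cache_size = 4096;  // still a multiple of read_size/prog_size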


Johnxjj commented Mar 16, 2020

Hi @geky, I found that my question may be related to #203.
