mirror of
				https://github.com/eledio-devices/thirdparty-littlefs.git
				synced 2025-10-31 08:42:40 +01:00 
			
		
		
		
	doc: Editorial tweaks
This commit is contained in:
		
				
					committed by
					
						 Christopher Haster
						Christopher Haster
					
				
			
			
				
	
			
			
			
						parent
						
							3457252fe6
						
					
				
				
					commit
					436707c8d0
				
			
							
								
								
									
										58
									
								
								DESIGN.md
									
									
									
									
									
								
							
							
						
						
									
										58
									
								
								DESIGN.md
									
									
									
									
									
								
							| @@ -27,16 +27,17 @@ cheap, and can be very granular. For NOR flash specifically, byte-level | |||||||
| programs are quite common. Erasing, however, requires an expensive operation | programs are quite common. Erasing, however, requires an expensive operation | ||||||
| that forces the state of large blocks of memory to reset in a destructive | that forces the state of large blocks of memory to reset in a destructive | ||||||
| reaction that gives flash its name. The [Wikipedia entry](https://en.wikipedia.org/wiki/Flash_memory) | reaction that gives flash its name. The [Wikipedia entry](https://en.wikipedia.org/wiki/Flash_memory) | ||||||
| has more information if you are interesting in how this works. | has more information if you are interested in how this works. | ||||||
|  |  | ||||||
| This leaves us with an interesting set of limitations that can be simplified | This leaves us with an interesting set of limitations that can be simplified | ||||||
| to three strong requirements: | to three strong requirements: | ||||||
|  |  | ||||||
| 1. **Power-loss resilient** - This is the main goal of the littlefs and the | 1. **Power-loss resilient** - This is the main goal of the littlefs and the | ||||||
|    focus of this project. Embedded systems are usually designed without a |    focus of this project. | ||||||
|    shutdown routine and a notable lack of user interface for recovery, so |  | ||||||
|    filesystems targeting embedded systems must be prepared to lose power an |    Embedded systems are usually designed without a shutdown routine and a | ||||||
|    any given time. |    notable lack of user interface for recovery, so filesystems targeting | ||||||
|  |    embedded systems must be prepared to lose power at any given time. | ||||||
|  |  | ||||||
|    Despite this state of things, there are very few embedded filesystems that |    Despite this state of things, there are very few embedded filesystems that | ||||||
|    handle power loss in a reasonable manner, and most can become corrupted if |    handle power loss in a reasonable manner, and most can become corrupted if | ||||||
| @@ -52,7 +53,8 @@ to three strong requirements: | |||||||
|    which stores a file allocation table (FAT) at a specific offset from the |    which stores a file allocation table (FAT) at a specific offset from the | ||||||
|    beginning of disk. Every block allocation will update this table, and after |    beginning of disk. Every block allocation will update this table, and after | ||||||
|    100,000 updates, the block will likely go bad, rendering the filesystem |    100,000 updates, the block will likely go bad, rendering the filesystem | ||||||
|    unusable even if there are many more erase cycles available on the storage. |    unusable even if there are many more erase cycles available on the storage | ||||||
|  |    as a whole. | ||||||
|  |  | ||||||
| 3. **Bounded RAM/ROM** - Even with the design difficulties presented by the | 3. **Bounded RAM/ROM** - Even with the design difficulties presented by the | ||||||
|    previous two limitations, we have already seen several flash filesystems |    previous two limitations, we have already seen several flash filesystems | ||||||
| @@ -80,21 +82,21 @@ designed in the early days of spinny magnet disks. While there is a vast amount | |||||||
| of interesting technology and ideas in this area, the nature of spinny magnet | of interesting technology and ideas in this area, the nature of spinny magnet | ||||||
| disks encourage properties, such as grouping writes near each other, that don't | disks encourage properties, such as grouping writes near each other, that don't | ||||||
| make as much sense on recent storage types. For instance, on flash, write | make as much sense on recent storage types. For instance, on flash, write | ||||||
| locality is not important and can actually increase wear destructively. | locality is not important and can actually increase wear. | ||||||
|  |  | ||||||
| One of the most popular designs for flash filesystems is called the | One of the most popular designs for flash filesystems is called the | ||||||
| [logging filesystem](https://en.wikipedia.org/wiki/Log-structured_file_system). | [logging filesystem](https://en.wikipedia.org/wiki/Log-structured_file_system). | ||||||
| The flash filesystems [jffs](https://en.wikipedia.org/wiki/JFFS) | The flash filesystems [jffs](https://en.wikipedia.org/wiki/JFFS) | ||||||
| and [yaffs](https://en.wikipedia.org/wiki/YAFFS) are good examples. In | and [yaffs](https://en.wikipedia.org/wiki/YAFFS) are good examples. In a | ||||||
| logging filesystem, data is not store in a data structure on disk, but instead | logging filesystem, data is not stored in a data structure on disk, but instead | ||||||
| the changes to the files are stored on disk. This has several neat advantages, | the changes to the files are stored on disk. This has several neat advantages, | ||||||
| such as the fact that the data is written in a cyclic log format naturally | such as the fact that the data is written in a cyclic log format and naturally | ||||||
| wear levels as a side effect. And, with a bit of error detection, the entire | wear levels as a side effect. And, with a bit of error detection, the entire | ||||||
| filesystem can easily be designed to be resilient to power loss. The | filesystem can easily be designed to be resilient to power loss. The | ||||||
| journaling component of most modern day filesystems is actually a reduced | journaling component of most modern day filesystems is actually a reduced | ||||||
| form of a logging filesystem. However, logging filesystems have a difficulty | form of a logging filesystem. However, logging filesystems have a difficulty | ||||||
| scaling as the size of storage increases. And most filesystems compensate by | scaling as the size of storage increases. And most filesystems compensate by | ||||||
| caching large parts of the filesystem in RAM, a strategy that is unavailable | caching large parts of the filesystem in RAM, a strategy that is inappropriate | ||||||
| for embedded systems. | for embedded systems. | ||||||
|  |  | ||||||
| Another interesting filesystem design technique is that of [copy-on-write (COW)](https://en.wikipedia.org/wiki/Copy-on-write). | Another interesting filesystem design technique is that of [copy-on-write (COW)](https://en.wikipedia.org/wiki/Copy-on-write). | ||||||
| @@ -107,7 +109,7 @@ where the COW data structures are synchronized. | |||||||
| ## Metadata pairs | ## Metadata pairs | ||||||
|  |  | ||||||
| The core piece of technology that provides the backbone for the littlefs is | The core piece of technology that provides the backbone for the littlefs is | ||||||
| the concept of metadata pairs. The key idea here, is that any metadata that | the concept of metadata pairs. The key idea here is that any metadata that | ||||||
| needs to be updated atomically is stored on a pair of blocks tagged with | needs to be updated atomically is stored on a pair of blocks tagged with | ||||||
| a revision count and checksum. Every update alternates between these two | a revision count and checksum. Every update alternates between these two | ||||||
| pairs, so that at any time there is always a backup containing the previous | pairs, so that at any time there is always a backup containing the previous | ||||||
| @@ -130,7 +132,7 @@ what the pair of blocks may look like after each update: | |||||||
| After each update, we can find the most up to date value of data by looking | After each update, we can find the most up to date value of data by looking | ||||||
| at the revision count. | at the revision count. | ||||||
|  |  | ||||||
| Now consider what the blocks may look like if we suddenly loss power while | Now consider what the blocks may look like if we suddenly lose power while | ||||||
| changing the value of data to 5: | changing the value of data to 5: | ||||||
| ``` | ``` | ||||||
|   block 1   block 2        block 1   block 2        block 1   block 2 |   block 1   block 2        block 1   block 2        block 1   block 2 | ||||||
| @@ -161,7 +163,7 @@ requires two blocks for each block of data. I'm sure users would be very | |||||||
| unhappy if their storage was suddenly cut in half! Instead of storing | unhappy if their storage was suddenly cut in half! Instead of storing | ||||||
| everything in these metadata blocks, the littlefs uses a COW data structure | everything in these metadata blocks, the littlefs uses a COW data structure | ||||||
| for files which is in turn pointed to by a metadata block. When | for files which is in turn pointed to by a metadata block. When | ||||||
| we update a file, we create a copies of any blocks that are modified until | we update a file, we create copies of any blocks that are modified until | ||||||
| the metadata blocks are updated with the new copy. Once the metadata block | the metadata blocks are updated with the new copy. Once the metadata block | ||||||
| points to the new copy, we deallocate the old blocks that are no longer in use. | points to the new copy, we deallocate the old blocks that are no longer in use. | ||||||
|  |  | ||||||
| @@ -184,7 +186,7 @@ Here is what updating a one-block file may look like: | |||||||
|             update data in file        update metadata pair |             update data in file        update metadata pair | ||||||
| ``` | ``` | ||||||
|  |  | ||||||
| It doesn't matter if we lose power while writing block 5 with the new data, | It doesn't matter if we lose power while writing new data to block 5, | ||||||
| since the old data remains unmodified in block 4. This example also | since the old data remains unmodified in block 4. This example also | ||||||
| highlights how the atomic updates of the metadata blocks provide a | highlights how the atomic updates of the metadata blocks provide a | ||||||
| synchronization barrier for the rest of the littlefs. | synchronization barrier for the rest of the littlefs. | ||||||
| @@ -206,7 +208,7 @@ files in filesystems. Of these, the littlefs uses a rather unique [COW](https:// | |||||||
| data structure that allows the filesystem to reuse unmodified parts of the | data structure that allows the filesystem to reuse unmodified parts of the | ||||||
| file without additional metadata pairs. | file without additional metadata pairs. | ||||||
|  |  | ||||||
| First lets consider storing files in a simple linked-list. What happens when | First lets consider storing files in a simple linked-list. What happens when we | ||||||
| append a block? We have to change the last block in the linked-list to point | append a block? We have to change the last block in the linked-list to point | ||||||
| to this new block, which means we have to copy out the last block, and change | to this new block, which means we have to copy out the last block, and change | ||||||
| the second-to-last block, and then the third-to-last, and so on until we've | the second-to-last block, and then the third-to-last, and so on until we've | ||||||
| @@ -240,8 +242,8 @@ Exhibit B: A backwards linked-list | |||||||
| ``` | ``` | ||||||
|  |  | ||||||
| However, a backwards linked-list does come with a rather glaring problem. | However, a backwards linked-list does come with a rather glaring problem. | ||||||
| Iterating over a file _in order_ has a runtime of O(n^2). Gah! A quadratic | Iterating over a file _in order_ has a runtime cost of O(n^2). Gah! A quadratic | ||||||
| runtime to just _read_ a file? That's awful. Keep in mind reading files are | runtime to just _read_ a file? That's awful. Keep in mind reading files is | ||||||
| usually the most common filesystem operation. | usually the most common filesystem operation. | ||||||
|  |  | ||||||
| To avoid this problem, the littlefs uses a multilayered linked-list. For | To avoid this problem, the littlefs uses a multilayered linked-list. For | ||||||
| @@ -266,7 +268,7 @@ Exhibit C: A backwards CTZ skip-list | |||||||
| ``` | ``` | ||||||
|  |  | ||||||
| The additional pointers allow us to navigate the data-structure on disk | The additional pointers allow us to navigate the data-structure on disk | ||||||
| much more efficiently than in a single linked-list. | much more efficiently than in a singly linked-list. | ||||||
|  |  | ||||||
| Taking exhibit C for example, here is the path from data block 5 to data | Taking exhibit C for example, here is the path from data block 5 to data | ||||||
| block 1. You can see how data block 3 was completely skipped: | block 1. You can see how data block 3 was completely skipped: | ||||||
| @@ -379,8 +381,8 @@ unintuitive property: | |||||||
|  |  | ||||||
|  |  | ||||||
| where:   | where:   | ||||||
| ctz(i) = the number of trailing bits that are 0 in i   | ctz(x) = the number of trailing bits that are 0 in x   | ||||||
| popcount(i) = the number of bits that are 1 in i   | popcount(x) = the number of bits that are 1 in x   | ||||||
|  |  | ||||||
| It's a bit bewildering that these two seemingly unrelated bitwise instructions | It's a bit bewildering that these two seemingly unrelated bitwise instructions | ||||||
| are related by this property. But if we start to dissect this equation we can | are related by this property. But if we start to dissect this equation we can | ||||||
| @@ -410,8 +412,7 @@ a bit to avoid integer overflow: | |||||||
|  |  | ||||||
|  |  | ||||||
| The solution involves quite a bit of math, but computers are very good at math. | The solution involves quite a bit of math, but computers are very good at math. | ||||||
| We can now solve for the block index + offset while only needed to store the | Now we can solve for both the block index and offset from the file size in O(1). | ||||||
| file size in O(1). |  | ||||||
|  |  | ||||||
| Here is what it might look like to update a file stored with a CTZ skip-list: | Here is what it might look like to update a file stored with a CTZ skip-list: | ||||||
| ``` | ``` | ||||||
| @@ -500,6 +501,7 @@ scanned to find the most recent free list, but once the list was found the | |||||||
| state of all free blocks becomes known. | state of all free blocks becomes known. | ||||||
|  |  | ||||||
| However, this approach had several issues: | However, this approach had several issues: | ||||||
|  |  | ||||||
| - There was a lot of nuanced logic for adding blocks to the free list without | - There was a lot of nuanced logic for adding blocks to the free list without | ||||||
|   modifying the blocks, since the blocks remain active until the metadata is |   modifying the blocks, since the blocks remain active until the metadata is | ||||||
|   updated. |   updated. | ||||||
| @@ -509,7 +511,7 @@ However, this approach had several issues: | |||||||
|   out of blocks and may no longer be able to add blocks to the free list. |   out of blocks and may no longer be able to add blocks to the free list. | ||||||
| - If we used a revision count to track the most recently updated free list, | - If we used a revision count to track the most recently updated free list, | ||||||
|   metadata blocks that were left unmodified were ticking time bombs that would |   metadata blocks that were left unmodified were ticking time bombs that would | ||||||
|   cause the system to go haywire if the revision count overflowed |   cause the system to go haywire if the revision count overflowed. | ||||||
| - Every single metadata block wasted space to store these free list references. | - Every single metadata block wasted space to store these free list references. | ||||||
|  |  | ||||||
| Actually, to simplify, this approach had one massive glaring issue: complexity. | Actually, to simplify, this approach had one massive glaring issue: complexity. | ||||||
| @@ -539,7 +541,7 @@ would have an abhorrent runtime. | |||||||
| So the littlefs compromises. It doesn't store a bitmap the size of the storage, | So the littlefs compromises. It doesn't store a bitmap the size of the storage, | ||||||
| but it does store a little bit-vector that contains a fixed set lookahead | but it does store a little bit-vector that contains a fixed set lookahead | ||||||
| for block allocations. During a block allocation, the lookahead vector is | for block allocations. During a block allocation, the lookahead vector is | ||||||
| checked for any free blocks, if there are none, the lookahead region jumps | checked for any free blocks. If there are none, the lookahead region jumps | ||||||
| forward and the entire filesystem is scanned for free blocks. | forward and the entire filesystem is scanned for free blocks. | ||||||
|  |  | ||||||
| Here's what it might look like to allocate 4 blocks on a decently busy | Here's what it might look like to allocate 4 blocks on a decently busy | ||||||
| @@ -1151,7 +1153,7 @@ develops errors and needs to be moved. | |||||||
|  |  | ||||||
| ## Wear leveling | ## Wear leveling | ||||||
|  |  | ||||||
| The second concern for the littlefs, is that blocks in the filesystem may wear | The second concern for the littlefs is that blocks in the filesystem may wear | ||||||
| unevenly. In this situation, a filesystem may meet an early demise where | unevenly. In this situation, a filesystem may meet an early demise where | ||||||
| there are no more non-corrupted blocks that aren't in use. It's common to | there are no more non-corrupted blocks that aren't in use. It's common to | ||||||
| have files that were written once and left unmodified, wasting the potential | have files that were written once and left unmodified, wasting the potential | ||||||
| @@ -1171,7 +1173,7 @@ of wear leveling: | |||||||
|  |  | ||||||
| In littlefs's case, it's possible to use the revision count on metadata pairs | In littlefs's case, it's possible to use the revision count on metadata pairs | ||||||
| to approximate the wear of a metadata block. And combined with the COW nature | to approximate the wear of a metadata block. And combined with the COW nature | ||||||
| of files, littlefs could provide your usually implementation of dynamic wear | of files, littlefs could provide your usual implementation of dynamic wear | ||||||
| leveling. | leveling. | ||||||
|  |  | ||||||
| However, the littlefs does not. This is for a few reasons. Most notably, even | However, the littlefs does not. This is for a few reasons. Most notably, even | ||||||
| @@ -1212,7 +1214,7 @@ So, to summarize: | |||||||
| 5. Files are represented by copy-on-write CTZ skip-lists which support O(1) | 5. Files are represented by copy-on-write CTZ skip-lists which support O(1) | ||||||
|    append and O(n log n) reading |    append and O(n log n) reading | ||||||
| 6. Blocks are allocated by scanning the filesystem for used blocks in a | 6. Blocks are allocated by scanning the filesystem for used blocks in a | ||||||
|    fixed-size lookahead region is that stored in a bit-vector |    fixed-size lookahead region that is stored in a bit-vector | ||||||
| 7. To facilitate scanning the filesystem, all directories are part of a | 7. To facilitate scanning the filesystem, all directories are part of a | ||||||
|    linked-list that is threaded through the entire filesystem |    linked-list that is threaded through the entire filesystem | ||||||
| 8. If a block develops an error, the littlefs allocates a new block, and | 8. If a block develops an error, the littlefs allocates a new block, and | ||||||
|   | |||||||
| @@ -116,7 +116,7 @@ can be either one of those found in the `enum lfs_error` in [lfs.h](lfs.h), | |||||||
| or an error returned by the user's block device operations. | or an error returned by the user's block device operations. | ||||||
|  |  | ||||||
| It should also be noted that the current implementation of littlefs doesn't | It should also be noted that the current implementation of littlefs doesn't | ||||||
| really do anything to insure that the data written to disk is machine portable. | really do anything to ensure that the data written to disk is machine portable. | ||||||
| This is fine as long as all of the involved machines share endianness | This is fine as long as all of the involved machines share endianness | ||||||
| (little-endian) and don't have strange padding requirements. | (little-endian) and don't have strange padding requirements. | ||||||
|  |  | ||||||
| @@ -148,7 +148,7 @@ littlefs is available in Mbed OS as the [LittleFileSystem](https://os.mbed.com/d | |||||||
| class. | class. | ||||||
|  |  | ||||||
| [littlefs-fuse](https://github.com/geky/littlefs-fuse) - A [FUSE](https://github.com/libfuse/libfuse) | [littlefs-fuse](https://github.com/geky/littlefs-fuse) - A [FUSE](https://github.com/libfuse/libfuse) | ||||||
| wrapper for littlefs. The project allows you to mount littlefs directly in a | wrapper for littlefs. The project allows you to mount littlefs directly on a | ||||||
| Linux machine. Can be useful for debugging littlefs if you have an SD card | Linux machine. Can be useful for debugging littlefs if you have an SD card | ||||||
| handy. | handy. | ||||||
|  |  | ||||||
|   | |||||||
		Reference in New Issue
	
	Block a user