The point is that Ceph & friends have a lot of overhead. Example in Ceph: by default, a file at the S3 (RGW) layer is split into 4MB chunks, and each of those chunks is replicated or erasure-coded. Using the same erasure coding as Wasabi and Backblaze B2, which is 16+4=20 (or 17+3=20), each 4MB chunk is split into 20 shards of roughly 250KB each (16 or 17 data shards plus parity shards of the same size), and each of those shards ends up carrying ~512B to 4KB of metadata.
So that's 10KB to 80KB of metadata for a single 4MB chunk.
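A rough back-of-the-envelope sketch of that arithmetic in Python (the 16+4 layout and the 512B-4KB per-shard metadata range are assumptions taken from the numbers above, not values read off a live cluster):

    # Back-of-the-envelope sketch: metadata overhead for one 4MB chunk
    # erasure-coded 16+4, with an assumed 512B-4KB of metadata per shard.
    CHUNK = 4 * 1024 * 1024        # 4MB RGW-style chunk
    K, M = 16, 4                   # 16 data shards + 4 parity shards
    SHARDS = K + M                 # 20 shards per chunk

    shard_size = CHUNK / K         # ~256KB per shard (parity shards are the same size)
    meta_low = SHARDS * 512        # ~10KB if each shard carries ~512B of metadata
    meta_high = SHARDS * 4096      # ~80KB if each shard carries ~4KB of metadata

    print(f"shard size: {shard_size / 1024:.0f} KB")
    print(f"metadata per 4MB chunk: {meta_low / 1024:.0f}-{meta_high / 1024:.0f} KB "
          f"({100 * meta_low / CHUNK:.2f}%-{100 * meta_high / CHUNK:.2f}%)")

That works out to 10-80KB of metadata per chunk, i.e. roughly 0.24%-1.95% overhead on top of the data itself.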
In the video they mention storing everything in regular files on the filesystem. A regular filesystem has inode overhead as well: XFS uses 512-byte inodes by default (more if you format it with larger inodes, as you would for Ceph's FileStore backend).
For a lot of workloads, Ceph's default erasure coding scheme (and BlueStore) would still be a lot more efficient than mirroring a file on top of a regular filesystem.
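To make that concrete, here's a minimal raw-space comparison (the replica counts are illustrative assumptions; it ignores per-shard metadata and filesystem inodes, both of which are small next to the replication factor):

    # Raw-space multiplier: erasure coding vs. plain replication/mirroring.
    # Replica counts are illustrative assumptions, not pool defaults.
    LOGICAL = 4 * 1024 * 1024      # one 4MB chunk of user data

    schemes = {
        "EC 16+4": 20 / 16,        # 1.25x raw space
        "EC 17+3": 20 / 17,        # ~1.18x raw space
        "2x mirroring": 2.0,
        "3x replication": 3.0,
    }

    for name, factor in schemes.items():
        raw = LOGICAL * factor
        print(f"{name:>15}: {raw / (1024 * 1024):.2f} MB raw per 4 MB of data ({factor:.2f}x)")

Even the worst case of the metadata range above (~80KB per 4MB chunk, ~2%) is small compared to the extra 100%-200% of raw space that mirroring or triple replication costs.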
> For a lot of workloads, Ceph's default erasure coding scheme (and BlueStore) would still be a lot more efficient than mirroring a file on top of a regular filesystem.
Yes, that's correct; it's why BlueStore was created in the first place.