1
0
Fork 0
mirror of https://github.com/tldr-pages/tldr.git synced 2025-04-21 22:22:07 +02:00
tldr/pages/linux/duperemove.md
Lena 6fd816e36e
pages/*: use lowercase n for integer placeholders (#16033)
Co-authored-by: K.B.Dharun Krishna <kbdharunkrishna@gmail.com>
2025-03-27 21:23:12 +05:30

957 B

duperemove

Finds duplicate filesystem extents and optionally schedule them for deduplication. An extent is small part of a file inside the filesystem. On some filesystems one extent can be referenced multiple times, when parts of the content of the files are identical. More information: https://markfasheh.github.io/duperemove/.

  • Search for duplicate extents in a directory and show them:

duperemove -r {{path/to/directory}}

  • Deduplicate duplicate extents on a Btrfs or XFS (experimental) filesystem:

duperemove -r -d {{path/to/directory}}

  • Use a hash file to store extent hashes (less memory usage and can be reused on subsequent runs):

duperemove -r -d --hashfile={{path/to/hashfile}} {{path/to/directory}}

  • Limit I/O threads (for hashing and dedupe stage) and CPU threads (for duplicate extent finding stage):

duperemove -r -d --hashfile={{path/to/hashfile}} --io-threads={{n}} --cpu-threads={{n}} {{path/to/directory}}