Keyboard shortcuts

Press or to navigate between chapters

Press S or / to search in the book

Press ? to show this help

Press Esc to hide this help

Introduction

rsdedup is a fast, Rust-based file deduplication tool. It scans directories for duplicate files and supports multiple actions: reporting, deleting, hardlinking, and symlinking duplicates.

Key Features

  • Multiple actions — report, delete, hardlink, or symlink duplicates
  • Smart comparison — size grouping, then partial 4KB hash, then full hash
  • Multiple hash algorithms — SHA-256, xxHash, BLAKE3
  • Persistent hash cache — avoids rehashing unchanged files across runs
  • Parallel hashing — configurable thread count for fast scanning
  • Flexible filtering — include/exclude globs, min/max file size
  • Multiple output formats — human-readable text or JSON
  • Dry-run mode — preview destructive operations before executing
  • Shell completions — bash, zsh, and fish

Philosophy

rsdedup is designed to be:

  • Safe by default — read-only operations unless you explicitly ask for changes
  • Fast — multi-stage pipeline eliminates candidates early, parallel hashing
  • Incremental — persistent cache means repeated scans are nearly instant
  • Unix-friendly — composable with other tools via JSON output and meaningful exit codes