Get the latest tech news
Ptar: Replacing .tgz for petabyte-scale S3 archives
Hi, I’m Julien, co-founder of Plakar. Before we built this, I spent years as an engineer and later as a manager of infra teams.
deduplication: identical chunks stored once, even across snapshots built‑in encryption: no extra step tamper evidence: any change breaks the archive versioning: keep many snapshots easily S3 native: one command to archive a bucket partial restores and browsing: pick a file without unpacking it all fast targeted restores: grab one file in seconds In many real-world datasets, a large amount of data is actually redundant: multiple copies, backups, archives, or repeated files across folders. .ptar works differently: it automatically detects and removes duplicates, so each unique chunk is stored only once, no matter how many times it appears.
Or read this on Hacker News