
Ptar: Replacing .tgz for petabyte-scale S3 archives


Hi, I’m Julien, co-founder of Plakar. Before we built this, I spent years as an engineer and later as a manager of infra teams.

deduplication: identical chunks stored once, even across snapshots
built-in encryption: no extra step
tamper evidence: any change breaks the archive
versioning: keep many snapshots easily
S3 native: one command to archive a bucket
partial restores and browsing: pick a file without unpacking it all
fast targeted restores: grab one file in seconds

In many real-world datasets, a large amount of data is actually redundant: multiple copies, backups, archives, or repeated files across folders. .ptar works differently: it automatically detects and removes duplicates, so each unique chunk is stored only once, no matter how many times it appears.
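
To make the deduplication idea concrete, here is a minimal sketch in Python of a content-addressed chunk store. It is not Plakar's implementation: the `ChunkStore` class, its method names, the fixed-size chunking, and the tiny `CHUNK_SIZE` are all assumptions chosen for illustration, and a real tool would likely use much larger, content-defined chunks. The sketch only shows the principle: each chunk is keyed by its hash, so a chunk that appears many times is stored once.

```python
import hashlib

# Hypothetical chunk size, kept tiny so the effect is visible on short inputs.
CHUNK_SIZE = 4


class ChunkStore:
    """Illustrative content-addressed store (not the .ptar format)."""

    def __init__(self) -> None:
        # Maps SHA-256 hex digest -> chunk bytes; each unique chunk is kept once.
        self.chunks: dict[str, bytes] = {}

    def put(self, data: bytes) -> list[str]:
        """Split data into fixed-size chunks and return the list of digests."""
        recipe = []
        for offset in range(0, len(data), CHUNK_SIZE):
            chunk = data[offset:offset + CHUNK_SIZE]
            digest = hashlib.sha256(chunk).hexdigest()
            # setdefault stores the chunk only if this digest is new.
            self.chunks.setdefault(digest, chunk)
            recipe.append(digest)
        return recipe

    def get(self, recipe: list[str]) -> bytes:
        """Rebuild the original data from its list of digests."""
        return b"".join(self.chunks[digest] for digest in recipe)

    def stored_bytes(self) -> int:
        """Total size of the unique chunks actually kept."""
        return sum(len(chunk) for chunk in self.chunks.values())


store = ChunkStore()
first = store.put(b"AAAABBBBAAAA")   # 12 bytes, the "AAAA" chunk repeats
second = store.put(b"BBBBCCCC")      # 8 bytes, shares "BBBB" with the first
print(len(store.chunks), store.stored_bytes())
```

In this toy run the two inputs total 20 bytes, but only three unique chunks (12 bytes) are kept, and both inputs can still be rebuilt from their digest lists. Because each chunk is addressed by its hash, the same structure also hints at how tamper evidence can work: a modified chunk no longer matches its digest.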
