Show HN: HTML visualization of a PDF file's internal structure


A Python library to inspect and modify the internal structure of a PDF file - desgeeko/pdfsyntax

Inspecting the internal structure of a PDF file involves several processing steps (decompression, parsing, xref indexing, etc.) to make sense of the raw bytes. PDFSyntax takes care of this processing and offers a visualization approach that adds information and hyperlinks on top of a text that is mostly a pretty-print of the uncompressed PDF data. It respects the physical flow of the file while offering logical navigation between revisions (incremental updates) and between objects.
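
To give a rough idea of what two of those steps involve, here is a minimal sketch using only the Python standard library. It is not PDFSyntax's API or implementation: the function names (build_sample, find_startxref_offsets, inflate_streams) are hypothetical, the sample file is a synthetic fragment rather than a complete valid PDF, and the stream reader assumes /Length is a direct integer and the only filter is FlateDecode.

```python
import re
import zlib

# Matches a stream dictionary with a direct integer /Length, up to the start of the body.
# Assumption: /Length is not an indirect reference such as "5 0 R".
STREAM_RE = re.compile(rb"/Length\s+(\d+)[^>]*>>\s*stream\r?\n")
STARTXREF_RE = re.compile(rb"startxref\s+(\d+)\s+%%EOF")


def build_sample() -> bytes:
    """Build a tiny PDF-like byte string with one Flate-compressed stream (illustrative only)."""
    payload = zlib.compress(b"BT (Hello) Tj ET")
    head = b"%PDF-1.7\n"
    obj = (
        b"1 0 obj\n<< /Length %d /Filter /FlateDecode >>\nstream\n" % len(payload)
        + payload
        + b"\nendstream\nendobj\n"
    )
    xref_pos = len(head) + len(obj)  # byte offset where the xref section starts
    tail = (
        b"xref\n0 1\n0000000000 65535 f \ntrailer\n<< /Size 2 >>\n"
        b"startxref\n%d\n%%%%EOF\n" % xref_pos
    )
    return head + obj + tail


def find_startxref_offsets(data: bytes) -> list[int]:
    """Return the offset written after each 'startxref' keyword, in file order.

    A file with incremental updates has one such entry per revision.
    """
    return [int(m.group(1)) for m in STARTXREF_RE.finditer(data)]


def inflate_streams(data: bytes) -> list[bytes]:
    """Read each stream body using its /Length and try to decompress it with zlib."""
    out = []
    pos = 0
    while (m := STREAM_RE.search(data, pos)) is not None:
        length = int(m.group(1))
        body = data[m.end():m.end() + length]
        try:
            out.append(zlib.decompress(body))
        except zlib.error:
            out.append(body)  # not Flate-compressed; keep the raw bytes
        pos = m.end() + length  # skip past the body so binary data is never scanned
    return out


if __name__ == "__main__":
    sample = build_sample()
    print(find_startxref_offsets(sample))
    print(inflate_streams(sample))
```

A real inspector also has to handle cross-reference streams, object streams, other filters, and indirect lengths, which is the kind of work the library takes care of.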

Related news:

Linux Running in a PDF

Mutool – all purpose tool for dealing with PDF files

Developer Makes Doom Run Inside a PDF File