Get the latest tech news

Show HN: Convert HTML DOM to semantic markdown for use in LLMs


DOM to Semantic-Markdown for use in LLMs. Contribute to romansky/dom-to-semantic-markdown development by creating an account on GitHub.

Semantic Clarity: Converts web content to a format more easily "understandable" for LLMs, enhancing their processing and reasoning capabilities. HTML to Semantic Markdown AST conversion Detection and extraction of main content Semantic structure preservation (e.g.,<header>,<footer>,<nav>) Metadata capture for images, tables, and other rich media elements URL refification for token optimization Customizable conversion options Browser and Node.js support Maintains table structures and data relationships Preserves link destinations while optimizing for token efficiency

Get the Android app

Or read this on Hacker News

Read more on:

Photo of LLMs

LLMs

Photo of dom

dom

Photo of semantic markdown

semantic markdown

Related news:

News photo

Converting Codebases with LLMs

News photo

You know what spreadsheets need? LLMs, says Microsoft

News photo

FlashAttention-3 unleashes the power of H100 GPUs for LLMs