
Show HN: LLM-aided OCR – Correcting Tesseract OCR errors with LLMs


Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections. - Dicklesworthstone/llm_aided_ocr

Features:
- PDF to image conversion
- OCR using Tesseract
- Advanced error correction using LLMs (local or API-based)
- Smart text chunking for efficient processing
- Markdown formatting option
- Header and page number suppression (optional)
- Quality assessment of the final output
- Support for both local LLMs and cloud-based API providers (OpenAI, Anthropic)
- Asynchronous processing for improved performance
- Detailed logging for process tracking and debugging
- GPU acceleration for local LLM inference

Repeated-paragraph removal:
- Implemented within the markdown formatting step
- Identifies and removes exact or near-exact repeated paragraphs
- Preserves unique content and ensures text flow

LLM API calls:
- Functions: generate_completion_from_claude() and generate_completion_from_openai()
- Implements proper error handling and retry logic
- Manages token limits and adjusts request sizes dynamically
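To make the removal of exact or near-exact repeated paragraphs concrete, here is a minimal sketch. It is not the project's code: the function names, the blank-line paragraph split, and the 0.9 similarity threshold are all assumptions chosen for illustration, and it uses difflib from the Python standard library as a stand-in for whatever comparison the project actually performs.

```python
from difflib import SequenceMatcher


def normalize(paragraph: str) -> str:
    """Lowercase and collapse whitespace so trivial differences do not matter."""
    return " ".join(paragraph.lower().split())


def drop_repeated_paragraphs(text: str, threshold: float = 0.9) -> str:
    """Keep the first occurrence of each paragraph and drop later near-copies.

    `threshold` is a hypothetical similarity cutoff, not a value taken
    from the project.
    """
    kept: list[str] = []             # paragraphs as they will be emitted
    kept_normalized: list[str] = []  # normalized forms used for comparison
    for paragraph in text.split("\n\n"):
        candidate = normalize(paragraph)
        if not candidate:
            continue  # skip empty paragraphs
        is_repeat = any(
            SequenceMatcher(None, candidate, seen).ratio() >= threshold
            for seen in kept_normalized
        )
        if not is_repeat:
            kept.append(paragraph.strip())
            kept_normalized.append(candidate)
    # Original order is preserved, so the remaining text still flows.
    return "\n\n".join(kept)
```

Each paragraph is compared against every paragraph already kept, which is quadratic in the number of paragraphs. That is acceptable for a single document, but a long corpus would likely need hashing or another faster prefilter.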


