Get the latest tech news

Show HN: LLM-aided OCR – Correcting Tesseract OCR errors with LLMs

Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections. - Dicklesworthstone/llm_aided_ocr

PDF to image conversion OCR using Tesseract Advanced error correction using LLMs (local or API-based) Smart text chunking for efficient processing Markdown formatting option Header and page number suppression (optional) Quality assessment of the final output Support for both local LLMs and cloud-based API providers (OpenAI, Anthropic) Asynchronous processing for improved performance Detailed logging for process tracking and debugging GPU acceleration for local LLM inference Implemented within the markdown formatting step Identifies and removes exact or near-exact repeated paragraphs Preserves unique content and ensures text flow Functions: generate_completion_from_claude() and generate_completion_from_openai() Implements proper error handling and retry logic Manages token limits and adjusts request sizes dynamically

Get the Android app

Or read this on Hacker News