Get the latest tech news
Show HN: LLM-aided OCR – Correcting Tesseract OCR errors with LLMs
Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections. - Dicklesworthstone/llm_aided_ocr
PDF to image conversion OCR using Tesseract Advanced error correction using LLMs (local or API-based) Smart text chunking for efficient processing Markdown formatting option Header and page number suppression (optional) Quality assessment of the final output Support for both local LLMs and cloud-based API providers (OpenAI, Anthropic) Asynchronous processing for improved performance Detailed logging for process tracking and debugging GPU acceleration for local LLM inference Implemented within the markdown formatting step Identifies and removes exact or near-exact repeated paragraphs Preserves unique content and ensures text flow Functions: generate_completion_from_claude() and generate_completion_from_openai() Implements proper error handling and retry logic Manages token limits and adjusts request sizes dynamically
Or read this on Hacker News