Get the latest tech news
Small but mighty: H2O.ai’s new AI models challenge tech giants in document analysis
H2O.ai has released efficient vision-language models for document AI, challenging tech giants with superior performance in OCR and text recognition tasks.
H2O.ai, a provider of open-source AI platforms, announced today two new vision-language models designed to improve document analysis and optical character recognition (OCR) tasks. “We’ve designed H2OVL Mississippi models to be a high-performance yet cost-effective solution, bringing AI-powered OCR, visual understanding, and Document AI to businesses,” Sri Ambati, CEO and Founder of H2O.ai said in an exclusive interview with VentureBeat. A comparison of average scores on eight single image benchmarks shows H2O.ai’s new H2OVL Mississippi-2B model (in yellow) outperforming several competitors, including offerings from Microsoft and Google.
Or read this on Venture Beat