Show HN: I created a PoC for live descriptions of the surroundings for the blind
Live image description solution using ESP32-CAM + Phone + Server - o40/seesay
The idea is to take images at a set interval, have an AI model describe them, and read the descriptions back to the user using voice synthesis. Since I was going for low cost (under $30) and wanted to learn more about software development on Arduino, I bought an ESP32-CAM with built-in WiFi to capture the images. Initially I tested the solution by manually feeding an image to the OpenAI API to check that I got a decent response and that the latency was low.
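
The capture, describe, and read-back cycle could be sketched roughly as follows. This is a minimal Python illustration, not the project's actual code: the function name `run_description_loop` and all of its parameters are hypothetical, and the camera, the AI model, and the voice synthesis are passed in as plain callables rather than tied to any particular device or API.

```python
import time
from typing import Callable, Optional


def run_description_loop(
    capture: Callable[[], bytes],        # hypothetical: returns one image (e.g. JPEG bytes)
    describe: Callable[[bytes], str],    # hypothetical: returns a text description of the image
    speak: Callable[[str], None],        # hypothetical: reads the text aloud
    interval_s: float,
    max_iterations: Optional[int] = None,
    sleep: Callable[[float], None] = time.sleep,
) -> int:
    """Capture, describe, and speak images at a set interval.

    Returns the number of descriptions that were spoken.
    """
    spoken = 0
    iteration = 0
    while max_iterations is None or iteration < max_iterations:
        started = time.monotonic()

        image = capture()
        text = describe(image)
        if text:  # skip empty descriptions instead of speaking silence
            speak(text)
            spoken += 1

        iteration += 1
        # Wait out the rest of the interval; if describing took longer
        # than the interval, start the next capture immediately.
        elapsed = time.monotonic() - started
        sleep(max(0.0, interval_s - elapsed))
    return spoken
```

Passing the three stages in as callables keeps the loop independent of the hardware, so each stage can be swapped or tested with stand-ins before the camera and server are connected.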