Get the latest tech news

Show HN: I created a PoC for live descriptions of the surroundings for the blind


Live image description solution using ESP32-CAM + Phone + Server - o40/seesay

The idea is to have images taken at a set interval, which are then described using an AI model, and read back to the user using voice synthesis. Since I was going for low cost (<30$), and wanted to learn more about software development on arduino, I bought a ESP32-CAM with built-in WiFi to capture the images. Initially I tested the solution by feeding an image to the OpenAI API manually to see that I get a decent response, and that the latency is low.

Get the Android app

Or read this on Hacker News

Read more on:

Photo of surroundings

surroundings

Photo of PoC

PoC

Photo of live descriptions

live descriptions

Related news:

News photo

Show HN: Open Rewind – POC for audio and screen and video streaming to S3

News photo

Nothing’s first open-ear headphones keep you aware of your surroundings

News photo

Google's new accessibility updates help you search your surroundings faster than ever