Get the latest tech news

Images that Sound: Generating spectrograms that are also images


Images that Sound: Multimodal AI Art

Visual Anagrams, by Geng et al., which uses pretrained diffusion models and compositionality to make multi-view optical illusions. Diffusion Illusions, by Burgert et al., which produces multi-view illusions, along with other visual effects, through score distillation sampling. We adapt their code to make an SDS style baseline for generating images that sound.

Get the Android app

Or read this on Hacker News

Read more on:

Photo of images

images

Related news:

News photo

New Windows AI feature records everything you’ve done on your PC | Recall uses AI features "to take images of your active screen every few seconds."

News photo

Retrospex: Convert images to fit Commodore 64 graphic modes

News photo

Show HN: I made a Mac app to search my images and videos locally with ML