Get the latest tech news
Apple teaching an AI system to use apps; maybe for advanced Siri
An Apple research paper describes how the company has been developing Ferret-UI, a generative AI system specifically designed to be...
An Apple research paper describes how the company has been developing Ferret-UI, a generative AI system specifically designed to be able to make sense of app screens. MLLMs – or Multimodal Large Language Models – aim to extend the ability of an AI system to make sense of non-textual information also: images, video, and audio. Given that UI screens typically exhibit a more elongated aspect ratio and contain smaller objects of interest (e.g., icons, texts) than natural images, we incorporate “any resolution” on top of Ferret to magnify details and leverage enhanced visual features […]
Or read this on r/apple