Get the latest tech news

Ghostwriter – use the reMarkable2 as an interface to vision-LLMs


Use the reMarkable2 as an interface to vision-LLMs (ChatGPT, Claude, Gemini). Ghost in the machine! - awwaiid/ghostwriter

The Remarkable flips out a bit ... and when the whole screen is a giant black square it really freaks out and doesn't complete Things that worked at least once: Writing "Fill in the answer to this math problem... 3 + 7 =""Draw a picture of a chihuahua. 2024-10-23- Code shuffle Doing a bit of refactoring, grouping utilities into separate files Yesterday a new Anthropic model came out (3.5-sonnet-new) which might be better at spacial awareness on the screen, so next up is to try that out in drawing-mode In any case, next I want to set it up with tools so that it can contextually give back an SVG or text or start to trigger external scripts, like for TODO list management Need to get some automation around the evaluations The segmenter has to be explicitly enabled with--apply-segmentation and it assumes that you have either--input-png or--save-screenshot because it (dumbly) re-parses the png file OMG this is the first time that the math prompt got even close to putting the answer where I want!

Get the Android app

Or read this on Hacker News

Read more on:

Photo of LLMs

LLMs

Photo of interface

interface

Photo of ghostwriter

ghostwriter

Related news:

News photo

Bolt: Bootstrap long chain-of-thought in LLMs without distillation [pdf]

News photo

Why LLMs still have problems with OCR

News photo

LLMs Were Backdoored Years Ago