Get the latest tech news

I ask this chess puzzle to every new LLM

I ask this chess puzzle to every new LLM. GitHub Gist: instantly share code, notes, and snippets.

Though this did not stop folks from organizing an LLMs chess championship this month where bots ate their own pieces, brought back dead knights and had short term memory losses about the board positions. The ability to just lookup cached values in a real engine also removes the need to do an actual tree search like stockfish which is much to ask from an LLM, though I did try reasoning with 4o to simulate a minimax with alpha-beta pruning but it still parroted wrong stuff forgetting things like forced moves. (Removed from the LLMs snubbing chess noise, a paper that caught my attention recently was about evidence of learned look-ahead in the policy network of Lc0, and extending the argument to maybe use it for future mesa-optimization tests - https://lnkd.in/g8KuTg3C)

Get the Android app

Or read this on Hacker News