Get the latest tech news

LLM Neuroanatomy II: Modern LLM Hacking and Hints of a Universal Language?

In Part 1, I described how duplicating a block of seven middle layers in Qwen2-72B — no weight changes, no training — produced the #1 model on the HuggingFace Open LLM Leaderboard. The method, which I called RYS (Repeat Your Self), was discovered using nothing but hard math probes and EQ-Bench on a pair of RTX 4090s.

None

Get the Android app

Or read this on Hacker News

Related news:

Bethesda boss Todd Howard offers more tiny crumbs of The Elder Scrolls 6 info, hints at more efficient development

Uber Job Listing Hints at Sharper Focus on Subscriptions for Drivers

Anthropic revises Claude’s ‘Constitution,’ and hints at chatbot consciousness

« HackerOne discloses employee data breach after Navia hack

Norway Wealth Fund CEO Rules Out Job Cuts Despite AI Savings »