I compared Claude Opus 4.8 with 4.7 in a 10-round honesty test - and a legal prompt broke it
I tested Opus 4.8 against 4.7 using coding, medical, finance, and legal traps, then cross-checked the results with multiple AIs.
Or read this on ZDNet

