I Tested GPT-5.4 vs Claude Opus 4.6 on 20 Real Tasks — The #1 Model on LMSYS Isn’t What You Think
Two days ago, Claude Opus 4.6 quietly took the #1 spot on the LMSYS Chatbot Arena with an Elo score of 1504 — the highest any model has…
Like
0
Liked
Liked
