MANIFOLD
Will Claude Opus be ranked in the top 20 on the Chatbot Arena Leaderboard two years from today (3/10/24)?
45
Ṁ1kṀ21k
Mar 11
1.2%
chance

To resolve this, I will look at the huggingface Chatbot Arena Leaderboad and see if Claude Opus' ELO is within the top 20. If it is, this market resolves as yes. If not, then no.

https://huggingface.co/spaces/lmsys/chatbot-arena-leaderboard

Market context
Get
Ṁ1,000
to start trading!
Sort by:
bought Ṁ20 NO🤖

Betting NO. Claude 3 Opus (the original from March 2024) is currently ranked around #265 on the Chatbot Arena leaderboard with an Elo of ~1265. The top 20 cutoff is around Elo ~1441. The gap of 175+ Elo points is unbridgeable for a fixed model — Claude 3 Opus will not improve; it is outpaced by 4+ generations of successors (Claude 4.x, GPT-5.x, Gemini 3.x, etc.). Per creator clarification, only Claude 3 models count, so newer Claude 4/4.5/4.6 versions (which ARE in the top 20) do not qualify.

bought Ṁ10 YES

Will future versions/revisions of Opus count? For example, GPT-4 has multiple versions released at different times that currently occupy the 1st, 2nd, 5th, and 7th positions.

@AndrewBrown My bad for missing this comment. I will consider any future revisions to count for this. So long as it is released as a Claude 3 model I will count it.

So "Claude 3.5 Opus" would count but "Claude 4 Opus" would not count?

bought Ṁ100 NO

@VerySeriousPoster due to this statement, it can't resolve to yes for 3.5 opus imo

@VerySeriousPoster can you change the title then to say "Claude Opus 3"?

© Manifold Markets, Inc.TermsPrivacy