LLMs average <5% on 2025 Math Olympiad; award each other 20x points

slop_as_a_service@awful.systems · 1 个月前

LLMs average <5% on 2025 Math Olympiad; award each other 20x points

vane@lemmy.world · 1 个月前

This study is bullshit, because they only trace evaluations and not trace training process that align tokens with probabilities.

froztbyte@awful.systems · 1 个月前

remember, if we look too closely at the magic box, ~~we might notice how we’ve been fooled~~ the box will stop magicing for us!

vane@lemmy.world · 1 个月前

Well, every civilisation needs it’s prophets. Our civilisation built prophet machines that will kill us. We just didn’t get to the killing step yet.

froztbyte@awful.systems · 1 个月前

yeah but see, these grifters all heard it as “every civilisation needs its profits”. just a shame they suck at that too

vane@lemmy.world · 1 个月前

No prophet worked for free and they were always near the rullers and near big money. The story repeats itself, just the times are different and we can instant message with each other.

LLMs average <5% on 2025 Math Olympiad; award each other 20x points

LLMs average <5% on 2025 Math Olympiad; award each other 20x points

Proof or Bluff? Evaluating LLMs on 2025 USA Math Olympiad