“Notably, O3-MINI, despite being one of the best reasoning models, frequently skipped essential proof steps by labeling them as “trivial”, even when their validity was crucial.”

  • V0ldek@awful.systems
    link
    fedilink
    English
    arrow-up
    0
    ·
    21 hours ago

    This is actually an accurate representation of most “gifted olympiad laureate attempting to solve a freshman CS problem on the blackboard” students I’ve went to uni with.

    Jumps to the front after 5 seconds from the task being assigned, bluffs that the problem is trivial, tries to salvage their reasoning for 5 minutes when questioned by the tutor, turns out the theorem they said was trivial is actually false, sits down having wasted 10 minutes of everyone’s time.

    • Soyweiser@awful.systems
      link
      fedilink
      English
      arrow-up
      0
      ·
      20 hours ago

      I just remember a professor saying that after he filled the board with proofs and math. ‘the rest is trivial’ not sure if it was a joke, as I found none of it trivial. (and neither did the rest of the people doing the course).