“Notably, O3-MINI, despite being one of the best reasoning models, frequently skipped essential proof steps by labeling them as “trivial”, even when their validity was crucial.”

      • froztbyte@awful.systems
        link
        fedilink
        English
        arrow-up
        0
        ·
        22 hours ago

        pray forgive, fair poster, for the shame I have cast upon myself in the action of doubting the Most Serious Article so affine to yourself - clearly a person of taste and wit, and I deserve the ire and muck resultant

        wait… wait, no, sorry! got those the wrong way around. happens all the time - guess I tried too hard to think like you.