My go-to LLMs, in no specific order:

  1. Llama 4 Behemoth
  2. Grok 3
  3. Claude 3.7 Sonnet (extended thinking)

What’s your go-to?

When in a hurry, I just use the Gemini voice assistant or Meta AI (I have the Messenger app).

  • Sims@lemmy.ml
    24 days ago

    There seems to be a big gap between the benchmark results and users' experience with the Llama 4 models. The two smaller Llama models fail at reasoning and at several ordinary tests run by AI YouTubers. Maybe it's a configuration error, or maybe the high LMSYS results were actually from the Behemoth model, but something seems wrong to me.

    Anyway, these are the models I use at the moment (IMHO the best, and free on Groq, Cerebras, OpenRouter, and others): 80% QwQ, 15% R1, and DeepSeek V3 for non-thinking tasks. I used to use Llama 3.3 70B for most things, but then DeepSeek and reasoning models happened.