• z00s@lemmy.world
    link
    fedilink
    English
    arrow-up
    0
    ·
    edit-2
    4 months ago

    The problem is not the LLMs, but what people are trying to do with them.

    They are currently spoons, but people are desperately wishing they were katanas.

    They work really well for soup, but they can’t cut steak. But they’re being hyped as super ninja steak knives, and people are getting pissed when they can’t cut steak.

    If you give them watery, soupy tasks they can do successfully, they can lighten your workload, as long as you’re aware of what they are and aren’t good at.

    What people want LLMs to be able to do, ie. “Steak” tasks:

    • write complex documents

    • apply complex knowledge/rules to a situation

    • Write complex code and create entire programs based on vague description

    What LLMs can currently do ie. “Soup” tasks:

    • check this document and fix all spelling, punctuation and grammatical errors

    • summarise this paragraph as dot points

    • write a python program that sorts my photographs into folders based on the year they were taken

    Half of Lemmy is hyping katanas, the other half is yelling “Why won’t my spoon cut this steak?!! AI is so dumb!!!”

    • self@awful.systems
      link
      fedilink
      English
      arrow-up
      0
      ·
      4 months ago

      they don’t do any of that soup shit reliably either and reading the article might have told you that

    • istewart@awful.systems
      link
      fedilink
      English
      arrow-up
      0
      ·
      4 months ago

      Why did this immediately give me a flashback to Donald Trump yelling, “when it comes to great steaks, I’ve just raised the stakes!

    • blakestacey@awful.systems
      link
      fedilink
      English
      arrow-up
      0
      ·
      4 months ago

      I’d offer congratulations on obfuscating a bad claim with a poor analogy, but you didn’t even do that very well.

    • froztbyte@awful.systems
      link
      fedilink
      English
      arrow-up
      0
      ·
      4 months ago

      good god this entire post is the most tortured believer whataboutism I’ve encountered this month and there’s extremely strong competition here

      are currently spoons, but people are desperately wishing they were katanas

      ie. “Steak” tasks

      you should make a youtube channel, The Katana Steak-Eater. I’d watch the shit out of that at least one saturday afternoon

    • V0ldek@awful.systems
      link
      fedilink
      English
      arrow-up
      0
      ·
      4 months ago

      What LLMs can currently do summarise this paragraph as dot points

      The entire point here is that they can’t?

      • fuzzzerd@programming.dev
        link
        fedilink
        English
        arrow-up
        0
        ·
        4 months ago

        Clearly this post is about LLMs not succeeding at this task, but anecdotally I’ve seen it work OK and also fail. Just like humans, which is the benchmark but they are faster.

        • self@awful.systems
          link
          fedilink
          English
          arrow-up
          0
          ·
          4 months ago

          humans are clearly faster at generating utterly banal shit, as proven by your posts in this thread

    • FredFig@awful.systems
      link
      fedilink
      English
      arrow-up
      0
      ·
      edit-2
      4 months ago

      Food analogy

      This level of discourse wouldn’t fly on 4chan, how is it so popular with LLM fans?

      • David Gerard@awful.systemsOPM
        link
        fedilink
        English
        arrow-up
        0
        ·
        4 months ago

        needs to be a car analogy

        • What people want LLMs to do, i.e. Corvette tasks
        • What LLMs actually do, i.e. Trabant tasks
        • self@awful.systems
          link
          fedilink
          English
          arrow-up
          0
          ·
          4 months ago

          What LLMs actually do, i.e. Trabant tasks

          more of a Power Wheels Barbie Jeep whose battery got left out in the sun too long, but I’ll allow it