• stephen01king@lemmy.zip
    link
    fedilink
    English
    arrow-up
    2
    ·
    2 months ago

    What does that even mean? Neural networks have varying levels of complexity, even within the same technology. Even the same LLM model can have different number of tokens that differentiate the complexity of their operation.

    So instead of using a neural network that is designed to input and output text and making it learn to output coding, which is also text, you think it’s supposed to be easier for them to make it instead analyse various video and audio input from multiple cameras, and then output the various actions that is required for it to drive a car? Does that make sense to you?

    • Valmond@lemmy.world
      link
      fedilink
      arrow-up
      1
      arrow-down
      1
      ·
      2 months ago

      They’re all based on the same tech, text, images whatever is just raw bytes. Train it for turning left, detect cats or whatever, same battle. It’s just harder when the problem is complicated like driving, and more so when programming.