• kautau@lemmy.world
    4 hours ago

    I think it really depends on how accurate you need it to be and what language you're transcribing. https://github.com/openai/whisper has multiple variations of its model, but they all pretty much require VRAM/graphics capability (or likely NPUs, as those become more commonplace).
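
To make the accuracy vs. hardware tradeoff a bit more concrete, here's a rough sketch of picking a Whisper model size based on how much VRAM you have. The VRAM figures are the approximate ones from the Whisper README, so treat them as ballpark numbers. `pick_model` and `transcribe_file` are hypothetical helper names I made up for illustration; `whisper.load_model` and `model.transcribe` are the calls from the `openai-whisper` package.

```python
from typing import Optional

# Approximate VRAM needs in GB per model size, per the Whisper README.
# Rough guidance only; real usage varies with audio length and settings.
APPROX_VRAM_GB = {
    "tiny": 1,
    "base": 1,
    "small": 2,
    "medium": 5,
    "large": 10,
}


def pick_model(available_vram_gb: float) -> str:
    """Hypothetical helper: largest model whose approximate need fits.

    Falls back to "tiny" if nothing fits.
    """
    best = "tiny"
    for name, need in APPROX_VRAM_GB.items():  # ordered smallest to largest
        if need <= available_vram_gb:
            best = name
    return best


def transcribe_file(path: str, available_vram_gb: float,
                    language: Optional[str] = None) -> str:
    """Hypothetical wrapper around the openai-whisper package."""
    import whisper  # pip install openai-whisper (also needs ffmpeg installed)

    model = whisper.load_model(pick_model(available_vram_gb))
    # language=None lets Whisper try to detect the language itself
    result = model.transcribe(path, language=language)
    return result["text"]
```

Something like `transcribe_file("recording.mp3", available_vram_gb=4)` would load the `small` model under these assumptions. Bigger models are generally more accurate, especially for non-English audio, which is why the answer depends on both your hardware and your language.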