Perhaps the only appropriate use of AI

canihasaccount@lemmy.world · edit-2 3 hours ago

Perhaps the only appropriate use of AI

sobchak@programming.dev · 5 hours ago

According to their tech/marketing papers, it’s supposedly multi-modal, encoding audio to tokens.

prole@lemmy.blahaj.zone · 3 hours ago

Jesus, what a complety fucking useless waste of resources

Anisette [any/all]@quokk.au · 5 hours ago

yeah but that doesn’t mean anything, does it? I don’t think they just tokenize the raw audio, that wouldn’t make sense, right?

sobchak@programming.dev · 4 hours ago

I mean, you could. Just encode 100ms chunks or whatever into tokens then push them through the same model. I’m pretty sure that’s what the claim to do (though with MoE/routing now, maybe).