Perhaps the only appropriate use of AI

canihasaccount@lemmy.world · edited 3 hours ago


Anisette [any/all]@quokk.au · 8 hours ago

I wonder if they actually hooked it up to an algorithm that analyses the sound and looks for certain patterns or something, and that's why you get the “atmosphere piece” thing.

sobchak@programming.dev · 5 hours ago

According to their tech/marketing papers, it’s supposedly multi-modal, encoding audio to tokens.

prole@lemmy.blahaj.zone · 3 hours ago

Jesus, what a completely fucking useless waste of resources

Anisette [any/all]@quokk.au · 5 hours ago

yeah but that doesn’t mean anything, does it? I don’t think they just tokenize the raw audio, that wouldn’t make sense, right?

sobchak@programming.dev · 4 hours ago

I mean, you could. Just encode 100ms chunks or whatever into tokens, then push them through the same model. I’m pretty sure that’s what they claim to do (though maybe with MoE/routing now).
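To make the "chunks into tokens" idea concrete, here is a rough toy sketch. It is not how any real model does it, as far as I know. Everything in it is an assumption made up for illustration: the 16 kHz sample rate, the two hand-rolled features (loudness and zero-crossing rate), and the three-entry codebook. Real systems would presumably use a learned encoder and a much bigger learned codebook. It only shows the basic shape: slice the audio into 100 ms chunks, map each chunk to its nearest codebook entry, and use that entry's index as the token.

```python
import math

SAMPLE_RATE = 16_000  # assumed sample rate, in Hz
CHUNK_MS = 100        # chunk length, matching the "100ms chunks" idea


def chunk_audio(samples, sample_rate=SAMPLE_RATE, chunk_ms=CHUNK_MS):
    """Split samples into fixed-length chunks, dropping any trailing partial chunk."""
    size = sample_rate * chunk_ms // 1000
    return [samples[i:i + size] for i in range(0, len(samples) - size + 1, size)]


def features(chunk):
    """Toy stand-in for a learned encoder: (RMS loudness, zero-crossing rate)."""
    rms = math.sqrt(sum(x * x for x in chunk) / len(chunk))
    crossings = sum(1 for a, b in zip(chunk, chunk[1:]) if (a < 0) != (b < 0))
    return (rms, crossings / (len(chunk) - 1))


# Hypothetical hand-written codebook; a real one would be learned from data.
CODEBOOK = [
    (0.0, 0.00),  # token 0: silence
    (0.7, 0.03),  # token 1: loud, low-pitched
    (0.7, 0.25),  # token 2: loud, high-pitched
]


def tokenize(samples):
    """Map each chunk to the index of its nearest codebook entry."""
    tokens = []
    for chunk in chunk_audio(samples):
        f = features(chunk)
        nearest = min(range(len(CODEBOOK)), key=lambda i: math.dist(f, CODEBOOK[i]))
        tokens.append(nearest)
    return tokens


def sine(freq, seconds, amp=1.0):
    """Generate a sine wave as a list of floats."""
    n = int(SAMPLE_RATE * seconds)
    return [amp * math.sin(2 * math.pi * freq * t / SAMPLE_RATE) for t in range(n)]


# 100 ms of silence, 200 ms of a 220 Hz tone, 100 ms of a 2 kHz tone.
audio = [0.0] * 1600 + sine(220, 0.2) + sine(2000, 0.1)
print(tokenize(audio))  # → [0, 1, 1, 2]
```

Those integer tokens are what would get pushed through the same model alongside text tokens. The sketch stops at tokenization and says nothing about routing or MoE.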