I agree with the ethical standpoint of banning Generative AI on the grounds that it’s trained on stolen artist data, but I’m not sure how tenable “trained on stolen artist data” is as a technical definition of what is not acceptable.
For example, if a model were trained exclusively on licensed works and data, would this be permissible? Intuitively, I’d still consider that to be Generative AI (though this might be a moot point, because the one thing I agree with the tech giants on is that it’s impractical to train Generative AI systems on licensed data because of the gargantuan amounts of training data required)
Perhaps it’s foolish of me to even attempt to pin down definitions in this way, but given how tech oligarchs often use terms in slippery and misleading ways, I’ve found it useful to try pin terms down where possible
I agree with the ethical standpoint of banning Generative AI on the grounds that it’s trained on stolen artist data, but I’m not sure how tenable “trained on stolen artist data” is as a technical definition of what is not acceptable.
For example, if a model were trained exclusively on licensed works and data, would this be permissible? Intuitively, I’d still consider that to be Generative AI (though this might be a moot point, because the one thing I agree with the tech giants on is that it’s impractical to train Generative AI systems on licensed data because of the gargantuan amounts of training data required)
Perhaps it’s foolish of me to even attempt to pin down definitions in this way, but given how tech oligarchs often use terms in slippery and misleading ways, I’ve found it useful to try pin terms down where possible