That is kind of the issue - sure, there’s janky workarounds, using an outdated version of proprietary software to try to block parts of the system from working when you don’t want them to… But in the end, that’s just one problem of many, so I kinda just never came back to windows after the incident. I just responsibly regularly update my system, and probably have a better experience and lose less time just updating manually.
We probably won’t get better, but sounds like it’s still being trained on scraped data unless you explicitly opt out, including anything that may be getting mirrored by third parties that don’t opt out. Also, they can remove data from the training material retroactively… But presumably won’t be retraining the model from scratch, which means it will still have that in their weights, and the official weights will still have a potential advantage on models trained later on their training data.
From the license:
Oof, so they’re basically passing on data protection deletion requests to the users and telling them all to respectfully account for them.
They also claim “open data”, but I’m having trouble finding the actual training data, only the “Training data reconstruction scripts”…