- cross-posted to:
- [email protected]
My original, editorialized title: Ars Technica Sells Out
Linking to this because I know people here read Ars Technica, and I totally didn’t become a subscriber three days before this was announced. Nope. No sir.
I want Ars content to be part of the training data provided to the best models. How does that get done without it looking like they are being bought?
Even if their contract explicitly states that it is a data-sharing agreement only, and that the media organization's output (articles, investigations) is not grounds for breach or retaliation, readers will now assume some loss of impartiality in future reporting.
So, for all media companies, the options seem to be:
Is there a GPL or other license structure that permits data sharing for LLM training in a way that it does not get transformed into something evil?