
Nonprofit Researchers Prove AI Can Respect Copyright—No Excuses
Researchers at the nonprofit Allen Institute for AI (AllenAI) have created Dolma, a 3-trillion-token dataset that proves AI companies can respect copyright without sacrificing performance. The work challenges industry claims that copyright infringement is technically necessary for training large language models.