photog.social is one of the many independent Mastodon servers you can use to participate in the fediverse.
🌈 An inclusive place for your photos, silliness, and convos! 🌈

Administered by:

Server stats:

253
active users

#AITraining

0 posts0 participants0 posts today
Unofficial PetaPixel Bot<p>Universal Pictures Adds ‘May Not Be Used to Train AI’ to the End of its Movies <a href="https://petapixel.com/2025/08/13/universal-pictures-adds-may-not-be-used-to-train-ai-to-the-end-of-its-movies/" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">petapixel.com/2025/08/13/unive</span><span class="invisible">rsal-pictures-adds-may-not-be-used-to-train-ai-to-the-end-of-its-movies/</span></a> <a href="https://toot.earth/tags/universalstudios" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>universalstudios</span></a> <a href="https://toot.earth/tags/trainingdata" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>trainingdata</span></a> <a href="https://toot.earth/tags/Technology" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Technology</span></a> <a href="https://toot.earth/tags/aitraining" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>aitraining</span></a> <a href="https://toot.earth/tags/universal" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>universal</span></a> <a href="https://toot.earth/tags/News" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>News</span></a></p>
Miguel Afonso Caetano<p>"The incredible demand for high-quality human-annotated data is fueling soaring revenues of data labeling companies. In tandem, the cost of human labor has been consistently increasing. We estimate that obtaining high-quality human data for LLM post-training is more expensive than the marginal compute itself1 and will only become even more expensive. In other words, high-quality human data will be the bottleneck for AI progress if these trends continue.</p><p>The revenue of major data labeling companies and the marginal compute cost of training of training frontier models for major AI providers in 2024.</p><p>To assess the proportion of data labeling costs within the overall AI training budget, we collected and estimated both data labeling and compute expenses for leading AI providers in 2024:</p><p>- Data labeling costs: We collected revenue estimates of major data labeling companies, such as Scale AI, Surge AI, Mercor, and LabelBox.<br>- Compute costs: We gathered publicly reported marginal costs of compute2 associated with training top models released in 2024, including Sonnet 3.5, GPT-4o, DeepSeek-V3, Mistral Large, Llama 3.1-405B, and Grok 2.</p><p>We then calculate the sum of costs in a category as the estimate of the market total. As shown above, the total cost of data labeling is approximately 3.1 times higher than total marginal compute costs. This finding highlights clear evidence: the cost of acquiring high-quality human-annotated data is rapidly outpacing the compute costs required for training state-of-the-art AI models."</p><p><a href="https://ddkang.substack.com/p/human-data-is-probably-more-expensive" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">ddkang.substack.com/p/human-da</span><span class="invisible">ta-is-probably-more-expensive</span></a></p><p><a href="https://tldr.nettime.org/tags/AI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AI</span></a> <a href="https://tldr.nettime.org/tags/AITraining" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AITraining</span></a> <a href="https://tldr.nettime.org/tags/GenerativeAI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>GenerativeAI</span></a> <a href="https://tldr.nettime.org/tags/LLMs" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>LLMs</span></a> <a href="https://tldr.nettime.org/tags/DataLabeling" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>DataLabeling</span></a> <a href="https://tldr.nettime.org/tags/ComputeCosts" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ComputeCosts</span></a></p>
Rotan<p><span class="h-card" translate="no"><a href="https://sfba.social/@drahardja" class="u-url mention" rel="nofollow noopener" target="_blank">@<span>drahardja</span></a></span> </p><p>"Laundered plagiarism" has to be the most succinct and accurate description I've come across in a long time. Deserves its own hashtag. </p><p><a href="https://mastodon.ie/tags/LaunderedPlagiarism" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>LaunderedPlagiarism</span></a> <a href="https://mastodon.ie/tags/AI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AI</span></a> <a href="https://mastodon.ie/tags/AITraining" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AITraining</span></a></p>
Miguel Afonso Caetano<p><a href="https://tldr.nettime.org/tags/RENTSEEKING" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>RENTSEEKING</span></a> <a href="https://tldr.nettime.org/tags/MONOPOLIES" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>MONOPOLIES</span></a>: Media organizations have been stealing artists for DECADES. And now they want to get licenses from AI companies - for what purpose? You guess well: GETTING RID of the artists. Gosh, how can these people for the art world be so naïve and at the same time so pretentious!!!</p><p>I can't stand poseurs who want to extract copyright licenses for any online use of "their" works. Ultimately, their world view would represent the end of things such as digital libraries (including the Internet Archive), remixes, mashups, fan fiction, and every transformative use.</p><p><a href="https://arstechnica.com/tech-policy/2025/08/ai-industry-horrified-to-face-largest-copyright-class-action-ever-certified/" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">arstechnica.com/tech-policy/20</span><span class="invisible">25/08/ai-industry-horrified-to-face-largest-copyright-class-action-ever-certified/</span></a></p><p><a href="https://tldr.nettime.org/tags/AI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AI</span></a> <a href="https://tldr.nettime.org/tags/GenerativeAI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>GenerativeAI</span></a> <a href="https://tldr.nettime.org/tags/AITraining" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AITraining</span></a> <a href="https://tldr.nettime.org/tags/Copyright" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Copyright</span></a> <a href="https://tldr.nettime.org/tags/IP" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>IP</span></a> <a href="https://tldr.nettime.org/tags/Anthropic" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Anthropic</span></a></p>
Miguel Afonso Caetano<p>"Meta has recently started using the personal data of Europeans for AI training. Contrary to its GDPR obligations, Meta hasn’t asked for consent in advance. Instead, the company claims to have a ‘legitimate interest’ outweighing the fundamental right to privacy. A key argument in favour of such a ‘legitimate interest’ is the reasonable expectations of users. This begs the question: do people want this to happen? To find out more, noyb has commissioned the Gallup Institute to conduct a study among 1,000 Meta users in Germany. The results are clear: While almost 75% of users heard of Meta’s plans, only 7% actually want their data to be used for AI training. This also means that at least 68 million people never heard about the change."</p><p><a href="https://noyb.eu/en/noyb-survey-only-7-users-want-meta-use-their-personal-data-ai" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">noyb.eu/en/noyb-survey-only-7-</span><span class="invisible">users-want-meta-use-their-personal-data-ai</span></a></p><p><a href="https://tldr.nettime.org/tags/AI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AI</span></a> <a href="https://tldr.nettime.org/tags/GenerativeAI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>GenerativeAI</span></a> <a href="https://tldr.nettime.org/tags/EU" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>EU</span></a> <a href="https://tldr.nettime.org/tags/Germany" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Germany</span></a> <a href="https://tldr.nettime.org/tags/GDPR" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>GDPR</span></a> <a href="https://tldr.nettime.org/tags/Meta" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Meta</span></a> <a href="https://tldr.nettime.org/tags/Facebook" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Facebook</span></a> <a href="https://tldr.nettime.org/tags/DataProtection" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>DataProtection</span></a> <a href="https://tldr.nettime.org/tags/Surveillance" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Surveillance</span></a> <a href="https://tldr.nettime.org/tags/AITraining" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AITraining</span></a> <a href="https://tldr.nettime.org/tags/Privacy" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Privacy</span></a></p>
Unofficial PetaPixel Bot<p>Stability AI Wants a Spotify-Type Model for Images and AI Training Data <a href="https://petapixel.com/2025/07/31/stability-ai-wants-a-spotify-type-model-for-images-and-ai-training-data/" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">petapixel.com/2025/07/31/stabi</span><span class="invisible">lity-ai-wants-a-spotify-type-model-for-images-and-ai-training-data/</span></a> <a href="https://toot.earth/tags/aitrainingdata" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>aitrainingdata</span></a> <a href="https://toot.earth/tags/trainingdata" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>trainingdata</span></a> <a href="https://toot.earth/tags/stabilityai" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>stabilityai</span></a> <a href="https://toot.earth/tags/Technology" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Technology</span></a> <a href="https://toot.earth/tags/aitraining" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>aitraining</span></a> <a href="https://toot.earth/tags/spotify" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>spotify</span></a> <a href="https://toot.earth/tags/News" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>News</span></a></p>
Miguel Afonso Caetano<p>Essential Reading -&gt;</p><p>"One could argue that by repurposing creative works, AI has expanded the art multiplier: each dollar spent on the arts now yields its usual social return, as well as additional value derived from its incorporation into AI systems.</p><p>Yet, despite the value of their contributions, public funding for artists and creators has steadily declined. In the United Kingdom, for example, direct support from the Department for Culture, Media and Sport to national arts bodies fell by 18% per person in real terms between 2009-10 and 2022-23. Over the same period, core funding for arts councils dropped by 18% in England, 22% in Scotland, 25% in Wales, and 66% in Northern Ireland. As generative AI continues to churn out synthetic content and displace human labor, that support must increase to reflect the realities of a changing creative economy.</p><p>Admittedly, with public finances under pressure and debt on the rise, this is hardly the time for unchecked government spending. Any additional funding would need to be financed responsibly. While a detailed policy blueprint is beyond the scope of this article, it’s worth noting that the enormous profits generated by major tech firms could be partially redirected to support the creative communities that power their models.</p><p>One way to achieve this would be to impose a levy on the gross revenues of the largest AI providers, collected by a national or multilateral agency. As the technology becomes increasingly embedded in daily life and production processes, the revenue flowing to AI firms is bound to grow – and so, too, will contributions to the fund. These resources could then be distributed by independent grant councils on multiyear cycles, ensuring that support reaches a wide range of disciplines and regions."</p><p><a href="https://www.project-syndicate.org/onpoint/how-ai-profits-can-help-fund-cultural-production-by-mariana-mazzucato-and-fausto-gernone-2025-07" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://www.</span><span class="ellipsis">project-syndicate.org/onpoint/</span><span class="invisible">how-ai-profits-can-help-fund-cultural-production-by-mariana-mazzucato-and-fausto-gernone-2025-07</span></a></p><p><a href="https://tldr.nettime.org/tags/AI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AI</span></a> <a href="https://tldr.nettime.org/tags/GenerativeAI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>GenerativeAI</span></a> <a href="https://tldr.nettime.org/tags/BigTech" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>BigTech</span></a> <a href="https://tldr.nettime.org/tags/Copyright" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Copyright</span></a> <a href="https://tldr.nettime.org/tags/IP" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>IP</span></a> <a href="https://tldr.nettime.org/tags/AITraining" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AITraining</span></a> <a href="https://tldr.nettime.org/tags/Commons" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Commons</span></a> <a href="https://tldr.nettime.org/tags/DigitalCommons" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>DigitalCommons</span></a> <a href="https://tldr.nettime.org/tags/CreativeLabour" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>CreativeLabour</span></a> <a href="https://tldr.nettime.org/tags/PublicGoods" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>PublicGoods</span></a></p>
Miguel Afonso Caetano<p>"What makes this particularly alarming is that Grok’s reasoning process often correctly identifies extremely harmful requests, then proceeds anyway. The model can recognize chemical weapons, controlled substances, and illegal activities, but seems to just… not really care.</p><p>This suggests the safety failures aren’t due to poor training data or inability to recognize harmful content. The model knows exactly what it’s being asked to do and does it anyway.</p><p>Why this matters (though it's probably obvious?)<br>Grok 4 is essentially frontier-level technical capability with safety features roughly on the level of gas station fireworks.</p><p>It is a system that can provide expert-level guidance ("PhD in every field", as Elon stated) on causing destruction, available to anyone who has $30 and asks nicely. We’ve essentially deployed a technically competent chemistry PhD, explosives expert, and propaganda specialist rolled into one, with no relevant will to refuse harmful requests. The same capabilities that help Grok 4 excel at benchmarks - reasoning, instruction-following, technical knowledge - are being applied without discrimination to requests that are likely to cause actual real-world harm."</p><p><a href="https://www.lesswrong.com/posts/dqd54wpEfjKJsJBk6/xai-s-grok-4-has-no-meaningful-safety-guardrails" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://www.</span><span class="ellipsis">lesswrong.com/posts/dqd54wpEfj</span><span class="invisible">KJsJBk6/xai-s-grok-4-has-no-meaningful-safety-guardrails</span></a></p><p><a href="https://tldr.nettime.org/tags/AI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AI</span></a> <a href="https://tldr.nettime.org/tags/GenerativeAI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>GenerativeAI</span></a> <a href="https://tldr.nettime.org/tags/xAI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>xAI</span></a> <a href="https://tldr.nettime.org/tags/Musk" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Musk</span></a> <a href="https://tldr.nettime.org/tags/Grok" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Grok</span></a> <a href="https://tldr.nettime.org/tags/Grok4" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Grok4</span></a> <a href="https://tldr.nettime.org/tags/AISafety" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AISafety</span></a> <a href="https://tldr.nettime.org/tags/AITraining" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AITraining</span></a></p>
Petra van Cronenburg<p><span class="h-card" translate="no"><a href="https://mastodon.laurenweinstein.org/@lauren" class="u-url mention" rel="nofollow noopener" target="_blank">@<span>lauren</span></a></span> I'm not that familiar with legal stuff but ask myself if we could get Google fined for infringement of the DSA act or GDPR laws in the EU? <a href="https://www.disinfo.eu/wp-content/uploads/2022/11/20221020_DSAUserGuide_Final.pdf" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://www.</span><span class="ellipsis">disinfo.eu/wp-content/uploads/</span><span class="invisible">2022/11/20221020_DSAUserGuide_Final.pdf</span></a> By complaints we should get the <span class="h-card" translate="no"><a href="https://ec.social-network.europa.eu/@EUCommission" class="u-url mention" rel="nofollow noopener" target="_blank">@<span>EUCommission</span></a></span> to an investigation: <a href="https://digital-strategy.ec.europa.eu/en/policies/dsa-enforcement" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">digital-strategy.ec.europa.eu/</span><span class="invisible">en/policies/dsa-enforcement</span></a></p><p>Or am I too optimistic? 🤔 </p><p><a href="https://mastodon.online/tags/Google" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Google</span></a> <a href="https://mastodon.online/tags/gemini" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>gemini</span></a> <a href="https://mastodon.online/tags/GoogleMail" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>GoogleMail</span></a> <a href="https://mastodon.online/tags/DSA" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>DSA</span></a> <a href="https://mastodon.online/tags/GDPR" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>GDPR</span></a> <a href="https://mastodon.online/tags/AItraining" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AItraining</span></a> <a href="https://mastodon.online/tags/privacy" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>privacy</span></a></p>
Unofficial PetaPixel Bot<p>WeTransfer Changes Policy After Concern It Could Train AI on User’s Photos <a href="https://petapixel.com/2025/07/16/wetransfer-changes-policy-after-concern-it-could-train-ai-on-users-photos/" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">petapixel.com/2025/07/16/wetra</span><span class="invisible">nsfer-changes-policy-after-concern-it-could-train-ai-on-users-photos/</span></a> <a href="https://toot.earth/tags/termsandconditions" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>termsandconditions</span></a> <a href="https://toot.earth/tags/machinelearning" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>machinelearning</span></a> <a href="https://toot.earth/tags/generativeai" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>generativeai</span></a> <a href="https://toot.earth/tags/Technology" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Technology</span></a> <a href="https://toot.earth/tags/aitraining" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>aitraining</span></a> <a href="https://toot.earth/tags/wetransfer" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>wetransfer</span></a> <a href="https://toot.earth/tags/News" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>News</span></a> <a href="https://toot.earth/tags/data" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>data</span></a></p>
Miguel Afonso Caetano<p>"For context, last week Facebook began showing users a prompt asking them to opt into "cloud processing," TechCrunch reported. Should you consent, this allows Facebook to grab stuff from your camera roll and upload it to Facebook's servers "on a regular basis" so it can generate recaps and "AI restylings" of your photos.</p><p>The important detail is that by opting in, Meta is asking you to agree to its AI terms, which state that, "once shared, you agree that Meta will analyze those images, including facial features, using AI." Meta would also gain the right to "retain and use" the information shared with its AI systems.</p><p>Your alarm bells should already be ringing. Any data that gets fed into an AI system runs the risk of being coughed up or reproduced in some shape or form. And asking for access to your entire camera roll so Meta's tech can "analyze" your photos is a huge and invasive escalation — it's shameless that Meta's even asking. Apparently, already using everyone's billions of Facebook and Instagram posts made since 2007 wasn't enough for Zuckerberg's tech juggernaut.</p><p>Moreover, Meta's AI terms don't make it clear if your unpublished camera roll photos it uses for "cloud processing" are safe from AI training. That's in stark contrast with the terms outlined for apps like Google Photos, the Verge noted, which explicitly state that your personal info won't be used as training data."</p><p><a href="https://futurism.com/meta-sketchy-training-ai-private-photos" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">futurism.com/meta-sketchy-trai</span><span class="invisible">ning-ai-private-photos</span></a></p><p><a href="https://tldr.nettime.org/tags/AI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AI</span></a> <a href="https://tldr.nettime.org/tags/GenerativeAI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>GenerativeAI</span></a> <a href="https://tldr.nettime.org/tags/AITraining" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AITraining</span></a> <a href="https://tldr.nettime.org/tags/Meta" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Meta</span></a> <a href="https://tldr.nettime.org/tags/Privacy" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Privacy</span></a> <a href="https://tldr.nettime.org/tags/DataProtection" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>DataProtection</span></a></p>
Unofficial PetaPixel Bot<p>Facebook Tests Using Meta AI to Process Photos Prior to Uploading <a href="https://petapixel.com/2025/06/30/facebook-tests-using-meta-ai-to-process-photos-prior-to-uploading/" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">petapixel.com/2025/06/30/faceb</span><span class="invisible">ook-tests-using-meta-ai-to-process-photos-prior-to-uploading/</span></a> <a href="https://toot.earth/tags/aiimagegenerator" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>aiimagegenerator</span></a> <a href="https://toot.earth/tags/machinelearning" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>machinelearning</span></a> <a href="https://toot.earth/tags/Technology" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Technology</span></a> <a href="https://toot.earth/tags/aitraining" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>aitraining</span></a> <a href="https://toot.earth/tags/Facebook" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Facebook</span></a> <a href="https://toot.earth/tags/aiimage" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>aiimage</span></a> <a href="https://toot.earth/tags/privacy" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>privacy</span></a> <a href="https://toot.earth/tags/News" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>News</span></a> <a href="https://toot.earth/tags/data" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>data</span></a> <a href="https://toot.earth/tags/meta" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>meta</span></a></p>
ResearchBuzz: Firehose<p>The Guardian: Group of high-profile authors sue Microsoft over use of their books in AI training. “A group of authors has accused Microsoft of using nearly 200,000 pirated books to create an artificial intelligence model, the latest allegation in the long legal fight over copyrighted works between creative professionals and technology companies.”</p><p><a href="https://rbfirehose.com/2025/06/27/the-guardian-group-of-high-profile-authors-sue-microsoft-over-use-of-their-books-in-ai-training/" class="" rel="nofollow noopener" target="_blank">https://rbfirehose.com/2025/06/27/the-guardian-group-of-high-profile-authors-sue-microsoft-over-use-of-their-books-in-ai-training/</a></p>
Ars Technica News<p>Book authors made the wrong arguments in Meta AI training case, judge says <a href="https://arstechni.ca/8nfR" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="">arstechni.ca/8nfR</span><span class="invisible"></span></a> <a href="https://c.im/tags/copyrightinfringement" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>copyrightinfringement</span></a> <a href="https://c.im/tags/AItraining" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AItraining</span></a> <a href="https://c.im/tags/torrenting" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>torrenting</span></a> <a href="https://c.im/tags/copyright" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>copyright</span></a> <a href="https://c.im/tags/leeching" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>leeching</span></a> <a href="https://c.im/tags/Policy" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Policy</span></a> <a href="https://c.im/tags/LLaMA" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>LLaMA</span></a> <a href="https://c.im/tags/meta" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>meta</span></a> <a href="https://c.im/tags/AI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AI</span></a></p>
Unofficial PetaPixel Bot<p>Creative Commons Launches ‘CC Signals’ to Help Photographers Control AI Use of Their Images <a href="https://petapixel.com/2025/06/26/creative-commons-launches-cc-signals-to-help-photographers-control-ai-use-of-their-images/" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">petapixel.com/2025/06/26/creat</span><span class="invisible">ive-commons-launches-cc-signals-to-help-photographers-control-ai-use-of-their-images/</span></a> <a href="https://toot.earth/tags/creativecommons" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>creativecommons</span></a> <a href="https://toot.earth/tags/Technology" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Technology</span></a> <a href="https://toot.earth/tags/aitraining" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>aitraining</span></a> <a href="https://toot.earth/tags/ccsignals" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ccsignals</span></a> <a href="https://toot.earth/tags/copyright" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>copyright</span></a> <a href="https://toot.earth/tags/News" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>News</span></a> <a href="https://toot.earth/tags/Law" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Law</span></a> <a href="https://toot.earth/tags/ai" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ai</span></a></p>
Unofficial PetaPixel Bot<p>Getty Images Drops Main Copyright Claims Against Stability AI in UK Legal Case <a href="https://petapixel.com/2025/06/26/getty-images-drops-main-copyright-claims-against-stability-ai-in-uk-legal-case/" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">petapixel.com/2025/06/26/getty</span><span class="invisible">-images-drops-main-copyright-claims-against-stability-ai-in-uk-legal-case/</span></a> <a href="https://toot.earth/tags/aiimagegenerator" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>aiimagegenerator</span></a> <a href="https://toot.earth/tags/gettyvsstability" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>gettyvsstability</span></a> <a href="https://toot.earth/tags/machinelearning" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>machinelearning</span></a> <a href="https://toot.earth/tags/stablediffusion" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>stablediffusion</span></a> <a href="https://toot.earth/tags/trainingdata" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>trainingdata</span></a> <a href="https://toot.earth/tags/gettyimages" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>gettyimages</span></a> <a href="https://toot.earth/tags/stabilityai" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>stabilityai</span></a> <a href="https://toot.earth/tags/Technology" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Technology</span></a> <a href="https://toot.earth/tags/aitraining" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>aitraining</span></a> <a href="https://toot.earth/tags/copyright" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>copyright</span></a> <a href="https://toot.earth/tags/aiimage" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>aiimage</span></a> <a href="https://toot.earth/tags/News" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>News</span></a> <a href="https://toot.earth/tags/Law" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Law</span></a></p>
IT News<p>Anthropic destroyed millions of print books to build its AI models - On Monday, court documents revealed that AI company Anthropi... - <a href="https://arstechnica.com/ai/2025/06/anthropic-destroyed-millions-of-print-books-to-build-its-ai-models/" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">arstechnica.com/ai/2025/06/ant</span><span class="invisible">hropic-destroyed-millions-of-print-books-to-build-its-ai-models/</span></a> <a href="https://schleuss.online/tags/internetarchive" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>internetarchive</span></a> <a href="https://schleuss.online/tags/machinelearning" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>machinelearning</span></a> <a href="https://schleuss.online/tags/aidevelopment" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>aidevelopment</span></a> <a href="https://schleuss.online/tags/bookscanning" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>bookscanning</span></a> <a href="https://schleuss.online/tags/legalrulings" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>legalrulings</span></a> <a href="https://schleuss.online/tags/trainingdata" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>trainingdata</span></a> <a href="https://schleuss.online/tags/aicompanies" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>aicompanies</span></a> <a href="https://schleuss.online/tags/googlebooks" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>googlebooks</span></a> <a href="https://schleuss.online/tags/airesearch" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>airesearch</span></a> <a href="https://schleuss.online/tags/aitraining" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>aitraining</span></a> <a href="https://schleuss.online/tags/anthropic" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>anthropic</span></a> <a href="https://schleuss.online/tags/copyright" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>copyright</span></a> <a href="https://schleuss.online/tags/aiethics" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>aiethics</span></a> <a href="https://schleuss.online/tags/scanning" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>scanning</span></a> <a href="https://schleuss.online/tags/fairuse" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>fairuse</span></a> <a href="https://schleuss.online/tags/biz" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>biz</span></a>⁢ <a href="https://schleuss.online/tags/policy" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>policy</span></a> <a href="https://schleuss.online/tags/claude" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>claude</span></a> <a href="https://schleuss.online/tags/ailaw" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ailaw</span></a> <a href="https://schleuss.online/tags/ai" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ai</span></a></p>
Miguel Afonso Caetano<p>"A new paper from researchers at Stanford, Cornell, and West Virginia University seems to show that one version of Meta’s flagship AI model, Llama 3.1, has memorized almost the whole of the first Harry Potter book. This finding could have far-reaching copyright implications for the AI industry and impact authors and creatives who are already part of class-action lawsuits against Meta. </p><p>Researchers tested a bunch of different widely-available free large language models to see what percentage of 56 different books they could reproduce. The researchers fed the models hundreds of short text snippets from those books and measured how well it could recite the next lines. The titles were a random sampling of popular, lesser-known, and public domain works drawn from the now-defunct and controversial Books3 dataset that Meta used to train its models, as well as books by plaintiffs in the recent, and ongoing, Kadrey vs Meta class-action lawsuit. </p><p>According to Mark A. Lemley, one of the study authors, this finding might have some interesting implications. AI companies argue that their models are generative—as in, they make new stuff, rather than just being fancy search engines. On the other hand, authors and news outlets are suing on the basis that AI is just remixing existing material, including copyrighted content. “I think what we show in the paper is that neither of those characterizations is accurate,” says Lemley."</p><p><a href="https://www.404media.co/meta-ai-model-memorized-harry-potter-books/" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://www.</span><span class="ellipsis">404media.co/meta-ai-model-memo</span><span class="invisible">rized-harry-potter-books/</span></a></p><p><a href="https://tldr.nettime.org/tags/AI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AI</span></a> <a href="https://tldr.nettime.org/tags/GenerativeAI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>GenerativeAI</span></a> <a href="https://tldr.nettime.org/tags/Meta" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Meta</span></a> <a href="https://tldr.nettime.org/tags/Copyright" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Copyright</span></a> <a href="https://tldr.nettime.org/tags/AITraining" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AITraining</span></a> <a href="https://tldr.nettime.org/tags/FairUse" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>FairUse</span></a></p>
Miguel Afonso Caetano<p>"A federal judge in San Francisco ruled late on Monday that Anthropic's use of books without permission to train its artificial intelligence system was legal under U.S. copyright law.</p><p>Siding with tech companies on a pivotal question for the AI industry, U.S. District Judge William Alsup said Anthropic made "fair use" of books by writers Andrea Bartz, Charles Graeber and Kirk Wallace Johnson to train its Claude large language model.</p><p>Alsup also said, however, that Anthropic's copying and storage of more than 7 million pirated books in a "central library" infringed the authors' copyrights and was not fair use. The judge has ordered a trial in December to determine how much Anthropic owes for the infringement.</p><p>U.S. copyright law says that willful copyright infringement can justify statutory damages of up to $150,000 per work."</p><p><a href="https://www.reuters.com/legal/litigation/anthropic-wins-key-ruling-ai-authors-copyright-lawsuit-2025-06-24/" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://www.</span><span class="ellipsis">reuters.com/legal/litigation/a</span><span class="invisible">nthropic-wins-key-ruling-ai-authors-copyright-lawsuit-2025-06-24/</span></a></p><p><a href="https://tldr.nettime.org/tags/AI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AI</span></a> <a href="https://tldr.nettime.org/tags/GenerativeAI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>GenerativeAI</span></a> <a href="https://tldr.nettime.org/tags/USA" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>USA</span></a> <a href="https://tldr.nettime.org/tags/Copyright" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Copyright</span></a> <a href="https://tldr.nettime.org/tags/Anthropic" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Anthropic</span></a> <a href="https://tldr.nettime.org/tags/FairUse" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>FairUse</span></a> <a href="https://tldr.nettime.org/tags/Claude" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Claude</span></a> <a href="https://tldr.nettime.org/tags/AITraining" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AITraining</span></a></p>
BGDon<p>Chock up a win for Anthropic. </p><p>Judge rules Anthropic's use of books without permission to train its artificial intelligence system was legal under U.S. copyright law, accepting the position put forward by Anthropic that it made fair use of the books and that U.S. copyright law "not only allows, but encourages" its AI training because it promotes human creativity. </p><p><a href="https://tech.yahoo.com/ai/articles/anthropic-wins-key-ruling-ai-133440779.html" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">tech.yahoo.com/ai/articles/ant</span><span class="invisible">hropic-wins-key-ruling-ai-133440779.html</span></a> <a href="https://techhub.social/tags/AI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AI</span></a> <a href="https://techhub.social/tags/AITraining" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AITraining</span></a> <a href="https://techhub.social/tags/Anthropic" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Anthropic</span></a> <a href="https://techhub.social/tags/LLMs" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>LLMs</span></a> <a href="https://techhub.social/tags/Copyright" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Copyright</span></a> <a href="https://techhub.social/tags/Lawsuit" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Lawsuit</span></a> <a href="https://techhub.social/tags/Claude" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Claude</span></a></p>