Deep search
Search
Copilot
Images
Videos
Maps
News
Shopping
More
Flights
Travel
Hotels
Real Estate
Notebook
Top stories
Sports
U.S.
2024 Election
Local
World
Science
Technology
Entertainment
Business
More
Politics
Any time
Past hour
Past 24 hours
Past 7 days
Past 30 days
Best match
Most recent
Bluesky, Hugging Face
Bluesky dataset for AI training removed from Hugging Face
On 26 November, Daniel van Strien, a machine learning librarian at Hugging Face, uploaded a dataset of 1m public posts and accompanying metadata taken from Bluesky’s firehose API. The dataset card explained it was “intended for machine learning research and experimentation with social media data”.
Hugging Face’s Dataset Release Exposes 1M Bluesky Posts for Research
A Hugging Face librarian released and later removed a 1 million Bluesky posts dataset, sparking concerns over data transparency and consent. Daniel van
Bluesky’s Open API Makes User Data Vulnerable to Scraping
Bluesky is facing its first major controversy over data scraping after a dataset containing one million public posts appeared on the AI platform Hugging Face, 404Media reports. Compiled by Daniel van Strien,
Bluesky Open API: Data Scrapers May Access Firehouse for AI Training, as Demoed by Hugging Face
Bluesky's Firehouse is known for being an open API, but it is also its flaw as anyone can scrape its data for the likes of AI training.
One million public Bluesky posts scraped for AI training
Bluesky is already facing its first major AI scrape, despite the stance of its owners that it will never train generative AI on user data.
Bluesky Faces Backlash Over User Data Scraping for AI Training
Bluesky, the social media platform often seen as a rival to Twitter, is at the center of a controversy after one million of its public posts..|News Track
Twitter rival Bluesky’s user posts scraped for training AI
Bluesky user posts and user information was scraped by an AI researcher and built into a dataset and published on open platform, Hugging Face.
Bluesky’s open API means anyone can scrape your data for AI training
Bluesky might not be training AI systems on user content as other social networks are doing, but there’s little stopping third-parties from doing so. Per a report by 404 Media, a machine learning librarian at AI firm Hugging Face pulled 1 million public posts from Bluesky via its Firehose API for machine learning research,
Werd I/O on MSN
3d
Bluesky, AI, and the battle for consent on the open web
Daniel van Strien, a machine learning librarian at Hugging Face, took a million Bluesky posts and turned them into a dataset ...
Interesting Engineering on MSN
1d
Bluesky 101: Everything you should know about X and Meta’s latest headache
Learn what makes Bluesky different by taking an in-depth look at its technological framework, privacy commitments, and ...
8d
on MSN
Conservatives Join Bluesky, Face Abuse and Censorship
Conservative leaning media outlets now on Bluesky include satirical website The Babylon Bee and news site Newsmax, an early ...
Daily Tech News Show
3d
Bluesky Scraper – DTNS 4905
President of SCE Worldwide Studios, Shuhei Yoshida, steps down from his 38 years. Scott shares his thoughts on Yoshida’s ...
Results that may be inaccessible to you are currently showing.
Hide inaccessible results
Related topics
Artificial intelligence
AI
Scholastic Corporation
Feedback