Bluesky’s open API means anyone can scrape your data for AI training

Bluesky won’t be training AI systems on user content as other social networks are doing, but there’s little stopping third parties from doing so.

Per a report by 404 Media, Daniel van Strien, a machine learning librarian at AI company Hugging Face, pulled 1 million public posts from Bluesky through its Firehose API for machine learning research, pushing the dataset to a public repository. Van Strien later removed the data because of the controversy that ensued, but it serves as a timely reminder that everything you post publicly to Bluesky is, well, public.

Bluesky said that it is looking at ways to enable users to communicate their consent preferences externally, though it’s up to those parties whether or not they respect those preferences.

The company posted: “Bluesky won’t be able to enforce this consent outside of our systems. It will be up to outside developers to respect these settings. We’re having ongoing conversations with engineers & lawyers and we hope to have more updates to share on this shortly!”

What’s clear here is that while Bluesky is surging in popularity, its rapid rise to the forefront of the global consciousness will mean it’s subject to the same levels of scrutiny as other major social platforms.
