WordPress and Tumblr Selling User Data to AI Companies

In a controversial move, Automattic, the parent company of WordPress and Tumblr, has reportedly entered into agreements to sell user data to AI companies like OpenAI and Midjourney. This decision has raised significant concerns among users regarding the privacy and ownership of their content.

Key Takeaways

Automattic is selling user data from WordPress and Tumblr to AI companies for training purposes.
The data includes potentially sensitive information, including private posts and content from deleted accounts.
Users will have the option to opt-out of data sharing, but the default settings remain unclear.

The Controversial Agreements

Reports indicate that Automattic is finalizing deals to provide user-generated content to AI firms. This content is intended to train machine learning models, which can generate text, images, and other media. However, the implications of this data sharing have sparked outrage among users who believed their content was secure.

What Data Is Being Shared?

The types of data that may be included in these agreements are concerning:

Private Posts: Content from public accounts that users believed was private.
Deleted Accounts: Posts from accounts that have been suspended or deleted.
Unanswered Asks: Private messages and interactions that were not intended for public consumption.
Explicit Content: Posts that may contain sensitive or explicit material.
Advertising Content: Data from advertising campaigns that Automattic does not own, such as those from Apple.

User Reactions and Concerns

The news has led to a backlash from the community, with many users feeling betrayed by the platforms they trusted. Concerns include:

Lack of Transparency: Users are frustrated by the unclear details surrounding what data is being shared and how it will be used.
Opt-Out Confusion: While Automattic has stated that users can opt-out, it remains uncertain how this will be implemented and whether it will be the default setting.
Legal Implications: The company has acknowledged that no laws currently require AI crawlers to respect user preferences, raising questions about the ethical implications of their actions.

Automattic’s Response

In response to the backlash, Automattic has attempted to reassure users:

They claim to block major AI platform crawlers by default and have features to prevent search engine indexing.
A new setting will be introduced to allow users to opt-out of data sharing, although the specifics of this feature are still being finalized.
The company emphasizes its commitment to user privacy and states that it will only share public content from sites that have not opted out.

The Future of User Data

As AI technology continues to evolve, the demand for large datasets for training purposes will likely increase. This situation raises critical questions about the ownership of digital content and the responsibilities of platforms that host user-generated material. Users are encouraged to consider their options and take proactive steps to protect their content.

In conclusion, the decision by Automattic to sell user data to AI companies has ignited a significant debate about privacy, consent, and the future of content ownership in the digital age. Users must remain vigilant and informed about how their data is being used and take advantage of any available opt-out options to safeguard their content.

Sources

Tumblr And WordPress Strike Deals For Sale Of User Public Data To AI Giants OpenAI and Midjourney, Digital Information World.
WordPress and Tumblr Plan to Sell User Content to AI Companies, Gizmodo.
Tumblr and WordPress Are Selling Your Data to AI Companies | Lifehacker, Lifehacker.