Key Takeaways

  • Encyclopaedia Britannica and Merriam-Webster have sued OpenAI for allegedly copying their content to train AI models.
  • The lawsuit claims that nearly 100,000 articles were used without permission, violating copyright law.
  • Publishers assert that AI responses often replicate their content, which could harm traffic to their websites.
  • They argue that reliance on AI-generated answers may impact their digital subscription and advertising revenue.
  • This case highlights a growing trend of legal actions against AI companies regarding training data practices.

The Encyclopedia Britannica sues OpenAI lawsuit was filed in a federal court in Manhattan. Encyclopaedia Britannica and Merriam-Webster accuse OpenAI of using copyrighted materials to train artificial intelligence models. The publishers claim their reference works were copied without permission. The complaint focuses on how OpenAI allegedly used their articles to develop systems such as ChatGPT. The case highlights growing legal disputes between content publishers and AI developers. Britannica and Merriam-Webster argue that their intellectual property was incorporated into training datasets without licensing agreements. The lawsuit also raises questions about how AI models process and reproduce copyrighted information.

Encyclopaedia Britannica and Merriam-Webster are long-standing publishers of reference materials. Britannica produces the well-known English-language encyclopedia. Merriam-Webster publishes dictionaries and language resources.

In the lawsuit, the publishers claim OpenAI copied large amounts of their content. The complaint states that nearly 100,000 articles were used. The companies say the materials were included in datasets that trained OpenAI’s language models.

According to the filing, the data allegedly helped train systems including GPT-4. The publishers claim this occurred without permission or compensation. They argue that the use of these materials violated copyright law.


Claims That AI Responses Replicate Britannica Content

The complaint states that AI models can produce responses similar to original encyclopedia entries. The publishers say this suggests portions of their content were memorized during training.

Examples are included in the court filing. These examples compare ChatGPT responses with Britannica text. In several cases, passages appear highly similar.

The publishers argue that this reproduction creates unauthorized copies of copyrighted works. They claim that users can obtain detailed information from the AI system instead of consulting the original publications.


Impact on Publishers and Digital Reference Platforms

Britannica and Merriam-Webster state that AI-generated answers could reduce traffic to their websites. They argue that users may rely on chatbots rather than visiting official sources.

The companies say this shift could affect their digital subscription and advertising models. They claim that AI responses may compete with their reference products.

The lawsuit seeks legal remedies for alleged copyright infringement. The case also forms part of a broader wave of legal actions involving generative AI systems. Several publishers, authors, and media organizations have filed similar lawsuits against AI companies over training data practices.

Source: https://www.reuters.com/legal/litigation/encyclopedia-britannica-sues-openai-over-ai-training-2026-03-16/