Close Menu
Nabka News
  • Home
  • News
  • Business
  • China
  • India
  • Pakistan
  • Political
  • Tech
  • Trend
  • USA
  • Sports

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

What's Hot

Gul Plaza fire under control after 36 hours; 14 dead as Karachi shopping centre gutted

January 19, 2026

China to contribute most to global economic growth in 2026, says WEF president-Xinhua

January 19, 2026

Hainan FTP’s 1st month of island-wide special customs boosts passenger convenience-Xinhua

January 19, 2026
Facebook X (Twitter) Instagram
  • Home
  • About NabkaNews
  • Advertise with NabkaNews
  • DMCA Policy
  • Privacy Policy
  • Terms of Use
  • Contact us
Facebook X (Twitter) Instagram Pinterest Vimeo
Nabka News
  • Home
  • News
  • Business
  • China
  • India
  • Pakistan
  • Political
  • Tech
  • Trend
  • USA
  • Sports
Nabka News
Home » OpenAI launches anthropological ignore rules to stop bots from scraping web content
Business

OpenAI launches anthropological ignore rules to stop bots from scraping web content

i2wtcBy i2wtcJune 21, 2024No Comments3 Mins Read
Share Facebook Twitter Pinterest LinkedIn Tumblr Email WhatsApp Copy Link
Follow Us
Google News Flipboard Threads
Share
Facebook Twitter LinkedIn Pinterest Email Copy Link


Two of the world’s top AI startups are ignoring requests from media publishers to stop scraping web content for free model training data, Business Insider has learned.

OpenAI and Anthropic were found to ignore or circumvent established web rules, known as robots.txt, that prevent the automated scraping of websites.

TollBit, a startup that aims to broker paid licensing deals between publishers and AI companies, found several AI companies engaging in this behavior and notified some major publishers in a letter on Friday that was previously reported by Reuters. The letter did not name the AI ​​companies that allegedly circumvented the rules.

OpenAI and Anthropic have publicly stated that they respect robots.txt and block two specific web crawlers: GPTBot and ClaudeBot.

However, TollBit’s findings show that such blocks are not being respected as claimed, with AI companies such as OpenAI and Anthropic simply choosing to “bypass” robots.txt in order to retrieve or scrape all content from a given website or page.

An OpenAI spokesperson declined to comment beyond pointing BI to a May company blog post in which the company said it takes web crawler permissions into account each time it trains a new model. An Anthropic spokesperson did not respond to an email seeking comment.

Robots.txt is a single piece of code that has been used since the late 1990s as a way for websites to tell bot crawlers that they don’t want their data scraped or collected. It was widely accepted as one of the informal rules that support the web.

The rise of generative AI has startups and tech companies racing to build the most powerful AI models. A key ingredient is high-quality data. This thirst for training data is undermining robots.txt and the informal agreements that support the use of this code.

OpenAI develops the popular chatbot ChatGPT. The company’s largest investor is Microsoft. Anthropic develops another relatively popular chatbot Claude. The company’s largest investor is Amazon.

Both chatbots respond to users’ questions in a human-like manner, and can do so because the AI ​​models they are based on contain large amounts of text and data collected from the web, much of which is copyrighted or owned by its creators.

Last year, several technology companies filed a lawsuit with the U.S. Copyright Office arguing that nothing on the web should be considered copyrightable when it comes to AI training data.

OpenAI has signed deals with several publishers, including BI-owner Axel Springer, for access to their content, and the U.S. Copyright Office is expected to update its guidelines on AI and copyright later this year.

Are you a tech employee or someone with tips and insights to share? Contact Kali Hays. email address Or in a secure messaging appsignal +1-949-280-0267. Please contact us using a non-work device.



Source link

Share. Facebook Twitter Pinterest LinkedIn Tumblr Email WhatsApp Copy Link
i2wtc
  • Website

Related Posts

Business

Disney dominated 2025 box office. Can it keep the crown in 2026?

January 17, 2026
Business

White House econ advisor Hassett floats ‘Trump cards’ amid credit card battle

January 16, 2026
Business

Novo Nordisk shares rise after Wegovy obesity pill launch

January 16, 2026
Business

Family offices could be hit in Trump ban on investors buying homes

January 16, 2026
Business

College students, teens could be fueling the boom

January 15, 2026
Business

How to give away $150 billion

January 15, 2026
Add A Comment
Leave A Reply Cancel Reply

Top Posts

House Republicans unveil aid bill for Israel, Ukraine ahead of weekend House vote

April 17, 2024

Prime Minister Johnson presses forward with Ukraine aid bill despite pressure from hardliners

April 17, 2024

Justin Verlander makes season debut against Nationals

April 17, 2024

Tesla lays off 285 employees in Buffalo, New York as part of major restructuring

April 17, 2024
Don't Miss

Trump says China’s Xi ‘hard to make a deal with’ amid trade dispute | Donald Trump News

By i2wtcJune 4, 20250

Growing strains in US-China relations over implementation of agreement to roll back tariffs and trade…

Donald Trump’s 50% steel and aluminium tariffs take effect | Business and Economy News

June 4, 2025

The Take: Why is Trump cracking down on Chinese students? | Education News

June 4, 2025

Chinese couple charged with smuggling toxic fungus into US | Science and Technology News

June 4, 2025

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

About Us
About Us

Welcome to NabkaNews, your go-to source for the latest updates and insights on technology, business, and news from around the world, with a focus on the USA, Pakistan, and India.

At NabkaNews, we understand the importance of staying informed in today’s fast-paced world. Our mission is to provide you with accurate, relevant, and engaging content that keeps you up-to-date with the latest developments in technology, business trends, and news events.

Facebook X (Twitter) Pinterest YouTube WhatsApp
Our Picks

Gul Plaza fire under control after 36 hours; 14 dead as Karachi shopping centre gutted

January 19, 2026

China to contribute most to global economic growth in 2026, says WEF president-Xinhua

January 19, 2026

Hainan FTP’s 1st month of island-wide special customs boosts passenger convenience-Xinhua

January 19, 2026
Most Popular

WWII bond lives on as U.S. students pay tribute to Flying Tiger rescued by Chinese people-Xinhua

July 30, 2025

EU conducts ‘dawn raids’ on Chinese security equipment suppliers

April 24, 2024

China’s bubble tea boom brews rural growth, international presence-Xinhua

August 5, 2025
© 2026 nabkanews. Designed by nabkanews.
  • Home
  • About NabkaNews
  • Advertise with NabkaNews
  • DMCA Policy
  • Privacy Policy
  • Terms of Use
  • Contact us

Type above and press Enter to search. Press Esc to cancel.