{"id":2376,"date":"2024-12-13T10:43:06","date_gmt":"2024-12-13T10:43:06","guid":{"rendered":"https:\/\/yolohive.com\/?p=2376"},"modified":"2024-12-13T10:43:16","modified_gmt":"2024-12-13T10:43:16","slug":"how-to-block-ai-crawler-bots-using-robots-txt-file","status":"publish","type":"post","link":"https:\/\/yolohive.com\/how-to-block-ai-crawler-bots-using-robots-txt-file\/","title":{"rendered":"How to block AI Crawler Bots using robots.txt file"},"content":{"rendered":"\n
As a content creator or blog author, your unique, high-quality content is your asset. But have you noticed that some generative AI platforms, such as OpenAI and CCBot, might be using your work to train their algorithms\u2014without your consent?<\/p>\n\n\n\n
You don\u2019t have to worry! By using a simple file called robots.txt<\/code>, you can block these AI crawlers from accessing your website or blog.<\/p>\n\n\n\n
What is a robots.txt file?<\/h2>\n\n\n\n
The robots.txt<\/code> file is a tool that allows website owners to manage how search engine crawlers interact with their content. It gives you the power to disallow specific bots from crawling your site, ensuring greater control over your content.<\/p>\n\n\n\n
The syntax below shows how to block a single bot using a user-agent:<\/p>\n\n\n\n
user-agent: {BOT-NAME-HERE}\ndisallow: \/<\/code><\/pre><\/div>\n\n\n\n
The following shows how to allow specific bots to crawl your website using a user-agent:<\/p>\n\n\n\n
User-agent: {BOT-NAME-HERE}\nAllow: \/<\/code><\/pre><\/div>\n\n\n\n
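Before deploying rules like these, you can sanity-check them locally. The sketch below is only an illustration (the bot names and URL are examples, not prescriptions); it uses Python's standard urllib.robotparser module to confirm that a disallowed bot is refused while everyone else is still allowed:

```python
from urllib.robotparser import RobotFileParser

# Example robots.txt: block CCBot everywhere, allow all other bots.
rules = """\
User-agent: CCBot
Disallow: /

User-agent: *
Allow: /
"""

parser = RobotFileParser()
parser.parse(rules.splitlines())

# CCBot is blocked from every path; other agents fall through to "*".
print(parser.can_fetch("CCBot", "https://example.com/post/1"))      # False
print(parser.can_fetch("Googlebot", "https://example.com/post/1"))  # True
```

This mirrors how compliant crawlers interpret your file, so it catches typos in directives before they reach production.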
Where to place your robots.txt file?<\/h2>\n\n\n\n
Upload the file to your website\u2019s root folder:<\/p>\n\n\n\n
https:\/\/example.com\/robots.txt\nhttps:\/\/blog.example.com\/robots.txt<\/code><\/pre><\/div>\n\n\n\n
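Note that each host, including each subdomain, serves its own robots.txt from its root, as the two URLs above show. As a small illustration (the helper function name is hypothetical), you can derive the expected location from any page URL with Python's standard urllib.parse:

```python
from urllib.parse import urlsplit, urlunsplit

def robots_txt_url(page_url):
    """robots.txt always lives at the root of the host serving the page."""
    parts = urlsplit(page_url)
    return urlunsplit((parts.scheme, parts.netloc, "/robots.txt", "", ""))

print(robots_txt_url("https://blog.example.com/post/123"))
# https://blog.example.com/robots.txt
```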
Learn More About robots.txt<\/code><\/h2>\n\n\n\n
If you\u2019re ready to take control of your website\u2019s accessibility, dive deeper into the details of robots.txt<\/code> with these helpful resources:<\/p>\n\n\n\n
\n
Learn how robots.txt<\/code> works and how to configure it effectively for your site.<\/li>\n\n\n\n
Understand the role of robots.txt<\/code> in managing web crawler access.<\/li>\n<\/ul>\n\n\n\n
How to block AI crawler bots using the robots.txt file<\/h2>\n\n\n\n
The syntax:<\/p>\n\n\n\n
user-agent: {AI-Crawlers-Bot-Name-Here}\ndisallow: \/<\/code><\/pre><\/div>\n\n\n\n
Blocking Google AI (Bard and Vertex AI generative APIs)<\/h2>\n\n\n\n
Add the following two lines to your robots.txt:<\/p>\n\n\n\n
User-agent: Google-Extended\nDisallow: \/<\/code><\/pre><\/div>\n\n\n\n
For more information about managing crawlers, you can review the list of user agents<\/a><\/strong> used by Google crawlers and fetchers. This can help you identify legitimate Google bots accessing your site.<\/p>\n\n\n\n
Additional Information on User Agents and AI Bots<\/h3>\n\n\n\n
However, it\u2019s important to note that:<\/p>\n\n\n\n
\n
The robots.txt<\/code> file remains one of the most effective methods to guide compliant crawlers and restrict access to your content.<\/li>\n<\/ul>\n\n\n\n
For advanced control, monitor your server logs for unusual activity and configure additional security measures, such as rate limiting or IP blocking, to complement your robots.txt<\/code> directives.<\/p>\n\n\n\n
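As a starting point for that kind of log monitoring, here is a minimal, illustrative sketch. The bot list and sample log lines are made up for the example; real access-log paths and formats vary by server. It counts requests whose User-Agent field mentions a known AI crawler:

```python
from collections import Counter

# User-agent substrings of AI crawlers discussed in this article.
AI_BOTS = ("GPTBot", "ChatGPT-User", "CCBot", "Google-Extended")

def count_ai_bot_hits(log_lines):
    """Count requests per AI bot based on the User-Agent field."""
    hits = Counter()
    for line in log_lines:
        for bot in AI_BOTS:
            if bot in line:
                hits[bot] += 1
    return hits

# Hypothetical combined-format access-log lines.
sample = [
    '1.2.3.4 - - [13/Dec/2024] "GET / HTTP/1.1" 200 "-" "Mozilla/5.0 GPTBot/1.0"',
    '5.6.7.8 - - [13/Dec/2024] "GET /post HTTP/1.1" 200 "-" "CCBot/2.0"',
    '9.9.9.9 - - [13/Dec/2024] "GET /about HTTP/1.1" 200 "-" "Mozilla/5.0"',
]
print(count_ai_bot_hits(sample))  # e.g. Counter({'GPTBot': 1, 'CCBot': 1})
```

A spike in hits from one of these agents after you have disallowed it in robots.txt is a sign the bot is not complying, and a cue to escalate to rate limiting or IP blocking.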
Blocking OpenAI using the robots.txt file<\/h2>\n\n\n\n
Add the following four lines to your robots.txt:<\/p>\n\n\n\n
User-agent: GPTBot\nDisallow: \/\nUser-agent: ChatGPT-User\nDisallow: \/<\/code><\/pre><\/div>\n\n\n\n
OpenAI utilizes two distinct user agents for its operations: one for web crawling and another for browsing, each associated with unique CIDR and IP address ranges. Configuring firewall rules to block these requires an advanced understanding of networking concepts and root-level access to a Linux server.<\/p>\n\n\n\n
If you\u2019re not familiar with these technical aspects, such as managing CIDR ranges or configuring firewalls, it\u2019s advisable to seek the assistance of a Linux system administrator. Keep in mind that OpenAI\u2019s IP address ranges are subject to change, which can turn this process into an ongoing effort to keep up with updates\u2014a game of cat and mouse.<\/p>\n\n\n\n
Below is a list<\/a><\/strong> of user agents used by OpenAI\u2019s crawlers and fetchers, along with their associated CIDR or IP address ranges.<\/p>\n\n\n\n
1: The\u00a0ChatGPT-User is used by\u00a0plugins<\/strong>\u00a0in ChatGPT<\/h3>\n\n\n\n
To block OpenAI\u2019s plugin AI bot, you can configure your web server firewall to restrict access from specific IP ranges, such as 23.98.142.176\/28<\/code>.<\/p>\n\n\n\n
Here\u2019s an example of how to block a CIDR or IP range using the ufw<\/code> command or iptables<\/code> on your server:<\/p>\n\n\n\n
Using UFW:<\/strong><\/p>\n\n\n\n
sudo ufw deny from 23.98.142.176\/28 <\/code><\/pre><\/div>\n\n\n\n
Using iptables:<\/strong><\/p>\n\n\n\n
sudo iptables -A INPUT -s 23.98.142.176\/28 -j DROP<\/code><\/pre><\/div>\n\n\n\n
These commands prevent any traffic originating from the specified IP range from accessing your server. Make sure to review and update your firewall rules periodically to account for changes in OpenAI\u2019s IP ranges. If you\u2019re unfamiliar with configuring firewalls, consider enlisting the help of a Linux system administrator.<\/p>\n\n\n\n
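If you want to see exactly which addresses a CIDR rule covers before applying it, Python's standard ipaddress module can expand the range. A quick sketch using the 23.98.142.176/28 range from the firewall examples above (the probe addresses are arbitrary):

```python
import ipaddress

# A /28 covers 16 addresses: here, 23.98.142.176 through 23.98.142.191.
block = ipaddress.ip_network("23.98.142.176/28")

print(block.num_addresses)                             # 16
print(ipaddress.ip_address("23.98.142.180") in block)  # True
print(ipaddress.ip_address("23.98.143.1") in block)    # False
```

Checking a range this way helps you confirm a deny rule is neither narrower nor broader than you intend before it goes into ufw or iptables.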
2: The\u00a0GPTBot\u00a0is used by ChatGPT<\/h3>\n\n\n\n