Skip to main content

Cloudflare's Pay-Per-Crawl Deadline: What September 15 Means for AI Citation Traffic

Cloudflare will block AI training and agent crawlers by default on ad-monetized pages starting September 15, 2026. Here is what the policy says, who feels it first, and how brands stay visible inside AI answers.

Mauricio Valdivia

Mauricio Valdivia

·11 min

Cloudflare's Pay-Per-Crawl Deadline: What September 15 Means for AI Citation Traffic

AI Crawlers Just Got a Deadline: September 15

The date is set. On July 1, 2026, Cloudflare, the company whose network sits in front of more than 20% of the web's domains, told the AI industry to split its crawlers into transparent categories or start losing default access to a large slice of the internet. Starting September 15, 2026, Cloudflare's default settings will block crawlers classified as Training and Agent on pages that display ads, while classic Search crawling stays allowed.

If your business depends on web traffic, as a publisher, an ecommerce brand, or an advertiser, this is not an infrastructure footnote. It is the clearest signal yet that the deal that funded the open web for 30 years, crawl me and send me visitors, is being renegotiated in public. And the new currency being negotiated is not the click. It is the citation.

This post walks through what the policy actually says, why Cloudflare says the old bargain broke, who feels the change first, and the practical moves for a brand that wants to stay visible while more of the buying journey happens inside AI answers.

What Cloudflare actually announced

The announcement, published on Cloudflare's blog under the banner "Your site, your rules," does three things at once: it reclassifies bots by behavior, it flips the network's defaults, and it ends the benefit of the doubt for crawlers that mix jobs.

Three kinds of bots, not one AI bucket

Cloudflare's argument is that "AI bot" stopped being a useful category, because AI is now inside everything, including ordinary search. So instead of asking whether a bot is AI, the new taxonomy asks what the bot does with your content:

CategoryWhat the bot doesDefault on ad pages after Sept 15
Searchindexes your content to answer queries laterallowed
Agentacts in real time on a user's behalfblocked
Trainingabsorbs content to train or fine-tune modelsblocked

The logic behind the split is value flow. Search crawling has historically funneled visitors back to the site. Training absorbs your work into a model permanently. An agent completes a task for a user, often without the visit a human would have made. Cloudflare wants site owners to price each behavior separately, and it wants companies that do all three to run three separate, honestly labeled crawlers.

The September 15 default flip

The enforcement mechanism is defaults, which is where the real power is, because most site owners never touch these settings. Cloudflare's own post states that for all new domains onboarding to the network, Training and Agent will be blocked by default on pages that display ads, while Search remains allowed. TechCrunch reports the new defaults will also apply to new Cloudflare customers, new sites set up by existing customers, and all existing free customers.

Why ad pages specifically? An ad is a signal that the page was built for human attention, the thing that pays the site's bills. On those pages, Cloudflare is siding with the human visit by default and keeping out the bot behaviors that replace it.

Mixed-use crawlers lose the benefit of the doubt

The sharpest edge of the policy is aimed at crawlers that blend purposes. From September 15, a multi-purpose crawler that combines Search with Training gets allowed or blocked according to all of its behaviors, and the most restrictive applicable rule wins. Cloudflare names names: crawlers such as Googlebot, Applebot, and BingBot will be blocked by site owners who have chosen to block Training.

Customer choice still rules. A site owner who wants no change can confirm that in their settings any time before the deadline, and Cloudflare says it will keep notifying customers as the date approaches. But the default pressure is unmistakable: separate your crawlers, or inherit the strictest treatment your mixed behavior deserves.

UGC creators each holding a different product up to the camera
Novoads · UGC video ads with AI, ready in minutes.
Try now

Why the 30-year crawl bargain broke

Cloudflare is not framing this as a punishment. It is framing it as accounting: the referral deal stopped paying out, and the numbers behind that claim are worth reading closely.

Bots now outnumber people

More than 50% of online traffic is now non-human, a milestone Cloudflare says arrived earlier than expected. Co-founder and CEO Matthew Prince put the stakes plainly: "Now that the majority of traffic on the Internet is non-human, we must go further and act faster so that a sustainable ecosystem can emerge."

There is waste inside that bot traffic, too. Cloudflare's data suggests more than half of crawl traffic from well-behaved bots goes to re-fetching pages that have not changed. Site owners pay for that serving; nobody gets any value from it.

The AI-summary click collapse

The second number explains publisher desperation better than any op-ed. Citing a 2025 Pew Research Center study, Cloudflare notes that when Google shows an AI summary, users click a traditional search result link just 8% of the time, about half as often as when there is no summary, and click a link inside the summary only 1% of the time.

Run that math on real volume, for 1,000 searches in your category:

  • With an AI summary on the page: roughly 80 clicks reach traditional results, and about 10 more follow a link inside the summary itself.
  • Without a summary: the same 1,000 searches produce roughly 150 clicks to traditional results.
  • Net effect: around 60 site visits per 1,000 searches simply stop existing. The answer engine kept the difference.

That is not a traffic dip; it is a structural rerouting of attention, and it is already visible on paid surfaces as well, where Google is testing AI summaries directly under search ads.

From Pay Per Crawl to Pay Per Use

Blocking is only half the announcement. The other half is a business model. In July 2025 Cloudflare launched Pay Per Crawl, a marketplace that lets websites charge AI bots for scraping. Now it is reshaping that into Pay Per Use: publishers get paid when their content creates value, not just when it is fetched.

The first experiments are running with Ceramic.ai and You.com. When a publisher opts in, they are paid when their content appears in Ceramic's AI search results or when You.com accesses a piece of their premium content. Ceramic calls its model pay-per-query. The design goal is a rate card for the answer economy: if your page powers the answer, the answer engine pays.

Who feels this first

A default change on one network sounds abstract until you map it to the parties who wake up on September 15 with different incentives.

Publishers whose pages carry ads

Ad-monetized publishers get the protection without lifting a finger, which is the point of a default. Their pages stop feeding training sets and agent workflows for free, and Pay Per Use gives them a path to revenue when AI products use their work. The trade-off is discoverability inside AI assistants that refuse to pay or to separate their crawlers, which is why the choice stays reversible in settings.

For a publisher, the September 15 decision reduces to three questions:

  • How much of your revenue depends on the human pageview an agent would replace?
  • Which AI surfaces currently send you meaningful referral or citation traffic you would rather keep?
  • Is your content distinctive enough that answer engines will pay for it under a per-use model?

AI companies with blended crawlers

For AI companies, the message is: transparency buys access. A crawler with one clear purpose can be allowed by category. A crawler that blends search, agents, and training inherits the harshest rule its behavior triggers. The strategic fork looks like this:

  • Separate and declare. Split crawling into single-purpose bots, label them honestly, and keep default access for the categories site owners still welcome.
  • Pay for what you use. Join Pay Per Use style arrangements and turn content acquisition into a licensed, scalable supply channel instead of a legal risk.
  • Scrape evasively. Keep blending purposes and face a network that fronts a fifth of the web's domains and specializes in bot detection.

The first two paths compound; the third gets more expensive every quarter.

Google's special seat at the table

The subtext of the whole announcement is the incumbent advantage. Cloudflare argues that using the same bot for search and AI unfairly advantages incumbent search providers, and TechCrunch reports Cloudflare specifically calling out the world's largest search engine as having access to about "2x more information" than other AI companies, because sites cannot stay discoverable without also feeding AI.

Google's position, per its own crawler documentation, is that Google-Extended is a standalone product token that publishers can use to manage whether crawled content trains future Gemini models, and that using it does not affect a site's inclusion in Google Search. That is real, documented separation for training. It is also precisely the granularity Cloudflare now wants from every crawler operator, enforced by defaults rather than goodwill. Platforms are converging on the same direction from different angles, the way Meta is rebuilding its ad stack end-to-end around AI while regulators push labeling rules for AI-generated ads.

Real UGC creators talking to camera in a row of video cards
Novoads · UGC video ads with AI, ready in minutes.
Try now

The GEO angle: citations get scarcer, and more valuable

Here is the part that matters for marketers who never touch a Cloudflare dashboard. Generative engine optimization, GEO, is the practice of making your brand the source AI assistants cite. This policy changes the supply side of that game.

Fewer open sources concentrate AI answers

Assistants can only cite what they can read. If large parts of the ad-funded web move behind default blocks and paywalls-per-query, the pool of freely crawlable sources shrinks. The sources that remain open, accurate, and well-structured get weighted more heavily in answers, because the engine has fewer alternatives for that information.

That cuts both ways. Publishers gain pricing power over their journalism. But brands gain something quieter: when third-party roundups and review sites become more expensive for engines to read, the brand's own documentation, comparison pages, and guides become the cheap, reliable source an assistant reaches for. Concretely, expect three second-order effects:

  • Answer engines lean harder on sources that stay open, so open plus specific beats authoritative but blocked.
  • Licensed content gets citation priority inside the engines that pay for it, fragmenting which assistant knows what.
  • Category questions with no paying publisher behind them get answered from whatever remains: often vendor content. That vendor can be you.

Your own domain becomes the sure source

Nobody blocks an AI assistant from reading their own product pages. Your domain is the one source whose crawl access you fully control, which makes it the safest citation asset you own. The work is making it worth citing:

  • Answer specific buyer questions in the first paragraph, not after a wind-up.
  • Publish real numbers you can defend: prices, timelines, spec comparisons.
  • Keep FAQ and comparison content structured so a passage can be lifted cleanly.
  • Stay open to search-class crawlers, since Search remains the allowed category that answer engines rely on for discovery.

This is the same shift we have written about across AI advertising: when machines mediate discovery, generic content blends into the summary, and only specific, extractable claims survive.

Referrals were the old currency; citations are the new one

The unhedged version: the referral era is not coming back. Waiting for AI platforms to restore the click-through economy is waiting for a bargain nobody is renewing. Pew's 8% and 1% click rates are what an answer-first interface looks like at scale, and Cloudflare's policy is the infrastructure layer accepting that reality and repricing it.

The metric to watch is no longer just organic sessions. It is how often your brand is named, quoted, and linked inside AI answers, and what share of your signups arrive from assistant surfaces. Start logging referrals from AI user agents now, so you have a baseline before September 15 reshuffles which sources assistants can reach.

A UGC creator filming a product review at home
Novoads · UGC video ads with AI, ready in minutes.
Try now

What to do before September 15

The deadline is Cloudflare's, but the homework is everyone's. Three checklists, depending on which seat you sit in.

If you publish content on Cloudflare

  • Open your dashboard and look at the new AI traffic controls for Search, Agent, and Training. The options are live now.
  • Decide your defaults deliberately instead of inheriting them. Blocking Training is reversible; a training set that already absorbed your content is not.
  • If your content earns money from attention, evaluate Pay Per Use as a second revenue line, the way early partners are testing pay-per-query pricing.

If you buy ads

  • Expect organic referral volume from AI surfaces to keep thinning, and treat paid creative as the channel you control end to end.
  • Shift budget toward creative testing volume: when discovery compresses into answers and feeds, the winning variant matters more than the winning keyword. That is the core argument for AI-generated UGC as a testing engine.
  • Watch how answer engines describe your category and make sure your ads' claims match what the machines are saying about you.

If you want AI assistants to cite you

  • Audit whether your key pages answer questions directly enough to be quoted. A page that takes 400 words to reach the point will not be excerpted.
  • Add sources, dates, and concrete figures to your money pages. Engines prefer passages that look verifiable.
  • Build UGC-style creative and case-study content around real product outcomes, because specificity is what separates a citable claim from filler. The broader playbook lives in our guide to AI for advertising.

How Novoads solves the creative side of AI-era discovery

Cloudflare is repricing how machines read the web. What it cannot reprice is the other half of the marketer's job: producing the ad creative that converts attention once you have it. Novoads generates UGC-style video ads from a script, with AI actors, native-sounding voiceover in 30+ languages, and captions, at roughly $2 to $11 per clip depending on the model. That cost curve is what makes testing ten angles a Tuesday task instead of a quarterly production, and testing volume is the one advantage that survives every change in how discovery works. You can try it on Novoads and judge the output against your current creative pipeline. Start for $1: $1 for 3 days of access, cancel anytime.

A UGC creator filming a skincare product review on a phone
Novoads · UGC video ads with AI, ready in minutes.
Try now

September 15 is one network's deadline, but the direction is the whole industry's. Crawlers are being forced to declare what they take. Publishers are learning to charge for value instead of visits. And AI answers are becoming the front page of the web, with a shrinking list of sources they can freely read.

For brands, the strategy that ages well is the one that never depended on the old bargain: own content specific enough to cite, keep it open to the crawlers that send value back, and produce enough creative to win wherever attention actually lands. The web is not closing. It is invoicing.

Frequently Asked Questions

What did Cloudflare announce on July 1, 2026?

Cloudflare announced a new classification system that separates crawlers into Search, Agent, and Training categories, plus new default settings that take effect on September 15, 2026. From that date, Training and Agent crawlers will be blocked by default on pages that display ads, while Search crawling remains allowed. The company also said its Pay Per Crawl marketplace is evolving into Pay Per Use, which pays publishers when their content creates value in AI products.

What exactly changes on September 15, 2026?

Cloudflare flips its defaults for new domains onboarding to its network: Training and Agent bots get blocked on ad-monetized pages unless the site owner chooses otherwise, and multi-purpose crawlers that combine search with training are evaluated by all of their behaviors, not just the friendliest one. TechCrunch reports the new defaults also apply to new Cloudflare customers, new sites set up by existing customers, and all existing free customers. Site owners can adjust their settings any time before the date.

Will Google stop crawling sites behind Cloudflare?

Not automatically. Search crawling stays allowed by default. The pressure point is that crawlers mixing Search with Training, and Cloudflare names Googlebot, Applebot, and BingBot as examples, will be blocked by site owners who choose to block Training. Google's own documentation says its Google-Extended token controls training use and does not affect a site's inclusion in Google Search, which is exactly the kind of separation Cloudflare is pushing the rest of the industry toward.

What are Pay Per Crawl and Pay Per Use?

Pay Per Crawl is the marketplace Cloudflare launched in July 2025 that lets websites charge AI companies for scraping their content. Pay Per Use is its next stage: instead of charging per fetch, publishers get paid when their content actually creates value, for example when it appears in an AI search answer. Cloudflare is running the first experiments with Ceramic.ai and You.com, and says other AI companies can bring their own payment models.

Does this policy affect my ad campaigns directly?

Not your ad accounts, no. Nothing changes inside Google Ads or Meta Ads because of this announcement. What changes is the discovery layer around your campaigns: if AI assistants lose default crawl access to large parts of the ad-funded web, the sources they can still read and cite carry more weight. That makes your owned content, product pages, FAQs, and comparison pages more valuable as citation targets, and it makes creative testing volume more important as organic referrals keep thinning.

How do I keep showing up inside AI answers after the change?

Keep your own domain open to search-class crawlers, since Search remains the allowed category and is how answer engines discover you. Publish content that answers specific buyer questions directly, with real numbers and clear structure, because assistants cite extractable passages. And measure referrals from AI surfaces now, so you have a baseline before the September 15 defaults start reshaping which sources assistants can reach.

Key Takeaways

  • Starting September 15, 2026, Cloudflare's new defaults will block Training and Agent crawlers on pages that display ads, while Search crawlers stay allowed. The change lands on new domains onboarding to Cloudflare, and TechCrunch reports it also covers new sites from existing customers and all existing free-tier customers.
  • Mixed-use crawlers that blend search with training will be judged by all of their behaviors, which puts named bots like Googlebot, Applebot, and BingBot under pressure to separate their jobs.
  • The economics behind the move are stark: more than 50% of online traffic is now non-human, and Pew found that an AI summary cuts clicks on traditional results to 8%, with only 1% clicking a link inside the summary.
  • Pay Per Crawl is evolving into Pay Per Use, with Ceramic.ai and You.com as first partners, so publishers get paid when their content creates value in AI answers, not just when it is fetched.
  • For brands, the practical shift is from chasing referral clicks to earning citations: publish direct-answer content on your own domain, keep it open to search-class crawlers, and scale ad creative as its own channel.
Mauricio Valdivia

Mauricio Valdivia

Founder of Novoads

Mauricio is the founder of Novoads, where he works to democratize video advertising with AI for brands in Latin America.