Amazonbot respects robots.txt for AI training opt-outs
Amazon is updating its web crawler behavior to strictly follow robots.txt directives and adopting the "noarchive" meta tag so webmasters can opt out of AI training. The change, effective June 15, 2026, provides more granular control over how website data is consumed by Amazon's generative AI models like Amazon Nova, while maintaining indexing for search services like Alexa and Rufus.
Amazon's shift to standard robots.txt compliance is a strategic concession to webmasters who are increasingly wary of aggressive AI data harvesting.
- Standardizing crawler management eliminates the need for manual support requests and custom scraping mitigations.
- The distinction between Amazonbot (training) and Amzn-SearchBot (retrieval) allows for more efficient crawl budget allocation.
- The "noarchive" tag provides a vital middle ground for publishers who want search traffic but don't want to feed Amazon's LLMs.
- Aligning with Google and Cloudflare's bot management standards reduces fragmentation in web crawler configuration.
- The one-month implementation window gives developers a tight deadline to audit their server logs and update exclusion rules.
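For sites that want search visibility without contributing to model training, the split described above suggests a configuration like the following. This is a sketch based on the bot names mentioned in this article; the exact user-agent tokens and directive support should be verified against Amazon's crawler documentation before the June 15, 2026 deadline.

```
# robots.txt — block the AI-training crawler, allow the search/retrieval crawler
User-agent: Amazonbot
Disallow: /

User-agent: Amzn-SearchBot
Allow: /
```

Alternatively, a page-level "noarchive" robots meta tag keeps a page indexable while signaling that its content should not be retained:

```html
<meta name="robots" content="noarchive">
```

The robots.txt route is all-or-nothing per crawler, while the meta tag allows per-page opt-outs; publishers may combine both depending on how much of the site should remain available for retrieval.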
Discovered: 2026-05-15
Published: 2026-05-14
Author: xena