How to Track Which AI Agents Visit Your Website and What They Do

To track which AI agents visit your website, you need agent analytics: a system that identifies AI crawlers (GPTBot, ClaudeBot, PerplexityBot, Google-Extended) and AI referral traffic (human visitors from ChatGPT, Perplexity, Copilot) using layered data sources like Cloudflare edge logs, GA4, and a JavaScript tracker. Reaudit provides this unified platform, showing you which agents crawl your pages, what they cite, where they fail, and how to optimize for them.
What Is Agent Analytics?
Agent analytics is the discipline of tracking and analyzing non-human AI agents, LLM crawlers, retrieval bots, and AI-driven browsers so you can see:
Which AI agents visit your site (e.g., GPTBot, ChatGPT-User, ClaudeBot, PerplexityBot, Google-Extended)
What they crawl, query, and cite from your pages
Where they fail, bounce, or time out
Which human visitors arrive via AI referrals from ChatGPT, Perplexity, Copilot, or other AI surfaces
How to optimize your site so agents understand, recommend, and send you more high-intent traffic
It extends classic web analytics (sessions, conversions) into the agent web, where autonomous AI systems are now the primary user crawling your content and deciding whether humans ever see it.
Why Agent Analytics Matters Now
Salesforce estimated that AI- and agent-referred traffic drove 21% of all global holiday orders in the 2023-24 season, representing $263 billion in sales. Shoppers arriving from generative AI platforms are twice as likely to purchase as visitors from other sources. For brands especially in SaaS, e-commerce, and digital agencies ignoring agent traffic means leaving revenue on the table.
Traditional analytics tools like Google Analytics treat AI crawlers as noise. They filter bot traffic out, so you never see the 50+ LLM bots that visit your site daily. Meanwhile, AI referrals (human clicks from AI answers) often appear as direct traffic, hiding their true source. Agent analytics solves this by making both visible.
AI Crawl vs. AI Referral: The Critical Difference
Understanding the distinction between an AI crawl and an AI referral is essential for optimization.
AI Crawl (Agent Visit)
A bot or agent GPTBot, ClaudeBot, PerplexityBot hits your site to index, retrieve, or validate information. It may never appear as a human session in GA4 because it is non-human traffic. The outcome: your content becomes part of the knowledge base that LLMs can cite in future answers.
AI Referral (Human Visit from AI)
A human user asks an LLM a question, the AI shows or cites your site, and the user clicks through. This appears as a human session, but the source of intent is the AI, not Google or direct. These visitors convert at higher rates because the AI has pre-qualified them.
Both matter. If you only track crawls, you know you are being seen but not whether that visibility translates into traffic or revenue. If you only track referrals, you miss why certain pages perform well with agents. Agent analytics ties these together: which agents crawled which pages, which pages get cited, and which citations drive real human visits and conversions.
Major AI Crawler User-Agents and What They Do
AI Crawler / Agent | Typical User-Agent Pattern | What It Does on Your Site |
|---|---|---|
OpenAI GPTBot | GPTBot | Crawls pages so OpenAI models (ChatGPT) can learn from and reference your content in answers. |
OpenAI ChatGPT-User | ChatGPT-User | Fetches specific pages in real time when a ChatGPT user opens a link or a plugin accesses your site. |
Anthropic ClaudeBot | ClaudeBot | Indexes content for Claude models to use in retrieval and QA; respects robots.txt. |
PerplexityBot | PerplexityBot | Crawls and snapshots pages that Perplexity AI uses to answer questions with live web citations. |
Google-Extended | Google-Extended | Opt-out mechanism for sites to control content use for AI training and generative features. |
Reaudit detects and classifies these user-agents correctly in your logs, shows volume trends over time, and highlights which crawlers are driving downstream citations and referrals.
How to Start Tracking AI Agents with Reaudit
Reaudit operationalizes agent analytics through a step-by-step integration that takes minutes to set up.
1. Connect Your Edge Data (Cloudflare)
Grant Reaudit access to Cloudflare logs so it can see all hits at the edge, including traffic that never surfaces in GA4 because it was blocked, cached, or filtered as a bot. This gives you authoritative counts of AI crawls and their user-agents.
2. Sync Your GA4 Property
Connect your Google Analytics 4 property so Reaudit can attribute sessions and conversions to AI referral sources where detectable, compare agent traffic vs. human traffic, and benchmark conversion rates for AI-referred visitors against other channels.
3. Install the JavaScript Tracker
Add Reaudit's JS snippet to your site template. This captures client-side metadata about visits that appear as direct but originate from AI surfaces, and records front-end events that edge logs miss.
4. Enable AI Crawler & Agent Detection
Reaudit parses user-agent strings and known IP patterns to classify LLM crawlers (GPTBot, ClaudeBot, PerplexityBot, Google-Extended) and AI-driven browsers. It normalizes these into a clean Agent dimension for analysis.
5. Activate AI Referral Tracking
Configure source rules and UTM patterns so Reaudit can label traffic as ChatGPT/OpenAI, Perplexity, Bing Copilot, and other AI assistants. This enables reports on visits, conversions, and revenue by AI referral source.
6. Turn On Page Citation Tracking
Reaudit tracks which of your pages appear as citations or references in AI answers across major surfaces (ChatGPT, Perplexity, Claude, Gemini). You get a page-level view: crawled → cited → clicked → converted.
7. Monitor Agent Failures and Optimization Opportunities
Use Reaudit dashboards to see pages with high AI crawl volume but low citation/referral volume (agents see them but don't trust or use them) and pages that are often crawled before agents bounce (they can't find structured answers). Iterate content and technical SEO to close these gaps.
To see which AI agents are already visiting your site and how much revenue they drive, connect your stack at reaudit.io.
Real Data: What You Can Expect to See
In a case study with 3dplotter.xyz, Reaudit tracked 726 AI mentions and 11,204 citations across major AI engines. The platform identified that PerplexityBot crawled the site 4x more than GPTBot, yet GPTBot citations drove 60% of AI referral traffic. This kind of insight lets you prioritize optimization efforts: fix what agents actually use, not just what crawls most.
Beyond Detection: Optimizing for Agents
Once you identify AI bot traffic on your site, the next step is optimization. AI agent detection for websites is only useful if you act on the data. Reaudit's full-stack platform goes beyond monitoring to help you generate citation-winning content, publish it with structured data, and measure the impact on visibility.
Use the AI Crawler URL Inspector to see exactly what ChatGPT, Claude, and Perplexity extract from your pages. Then apply those insights to improve your content's AI-readability.
Conclusion: The Agent Web Is Here
AI agents are already a major discovery and conversion layer. With 39% of consumers using AI to find products, and AI-referred shoppers converting at twice the rate of other visitors, monitoring AI crawlers visiting your website is no longer optional. Reaudit gives you the agent analytics platform to track, understand, and optimize for this new traffic source.
Frequently Asked Questions
How do I know if AI agents are scraping my site?
You can check your server logs or edge logs (Cloudflare) for user-agent strings like GPTBot, ClaudeBot, PerplexityBot, and Google-Extended. Alternatively, use a tool like Reaudit that automatically detects and classifies 50+ AI agents and shows you which pages they crawl and how often.
What is the difference between an AI crawl and an AI referral?
An AI crawl is a bot visiting your site to index content for an LLM. An AI referral is a human user who clicks a link to your site from an AI answer. Both are important: crawls build your knowledge base presence, referrals drive high-intent traffic.
Can Google Analytics detect AI agents?
GA4 filters out bot traffic by default, so it does not show AI crawler visits. It also often misattributes AI referral traffic as direct. For accurate AI agent detection, you need a tool that analyzes raw logs or edge data, like Reaudit.
What are the most common AI crawlers?
The most common AI crawlers are GPTBot (OpenAI), ChatGPT-User (OpenAI), ClaudeBot (Anthropic), PerplexityBot (Perplexity), and Google-Extended. Each has a specific user-agent string and purpose, from indexing to real-time retrieval.
How do I block AI crawlers from my site?
You can block AI crawlers using robots.txt directives, such as Disallow for GPTBot or PerplexityBot. However, blocking them means your content won't appear in AI answers. Most brands prefer to optimize for AI visibility instead of blocking.
What is agent analytics?
Agent analytics is the practice of tracking and analyzing non-human AI agents — crawlers, bots, and AI-driven browsers — to understand how they interact with your website, what they cite, and how they drive human traffic and conversions.
How can I track AI referral traffic?
Use a tool like Reaudit that detects AI referral sources by analyzing UTM parameters, referrer patterns, and client-side data. It can label traffic from ChatGPT, Perplexity, Copilot, and other AI platforms, giving you clear reports on visits and conversions.
Why is AI referral traffic more valuable?
Salesforce found that shoppers arriving from generative AI platforms are twice as likely to purchase as visitors from other sources. AI pre-qualifies users, so traffic from AI answers has higher intent and conversion rates.
Does Reaudit work with Cloudflare and GA4?
Yes, Reaudit integrates with Cloudflare edge logs for authoritative crawl volume data and with GA4 for session and conversion attribution. It also uses a JavaScript tracker to capture client-side behavior and AI referrals that appear as direct traffic.
What should I do after identifying AI agents on my site?
Optimize your content for AI readability: use structured data, clear headings, accurate facts, and FAQ schemas. Monitor which pages agents cite and which they ignore, then iterate. Reaudit's platform helps you generate, publish, and measure AI-optimized content.