How to Check if ChatGPT Can See Your Website
Updated — 9 min read
You can find out whether ChatGPT can see your website in just a few minutes by testing three things in order: can AI crawlers reach your pages, can they read your content, and can they quote you in an answer. These map to the three failure points that hide most sites from AI engines: a blocked robots.txt, JavaScript-only rendering, and thin or buried content. This guide walks you through six concrete methods — from a one-line curl test to asking the AI engines directly — so you can stop guessing about your AI visibility and start fixing the exact thing that's wrong.
If you've ever wondered "is my website visible to ChatGPT?" or "can ChatGPT read my site at all?", the good news is that AI visibility is diagnosable. Unlike traditional SEO, where rankings shift slowly, you can directly inspect what an AI bot receives when it requests your page and confirm in real time whether your content is reachable, parseable, and citable.
Why ChatGPT might not see your site
Before you run any tests, it helps to understand the three most common reasons a site stays invisible to AI engines. Almost every case of "why doesn't ChatGPT mention my website" traces back to one of these:
- Your robots.txt blocks AI crawlers. Many sites — often by default in a CMS or via a security plugin — disallow bots like
GPTBot. If the crawler is told not to fetch your pages, your content never enters the training or retrieval pipeline. - Your content only renders with JavaScript. If your page ships an empty HTML shell and builds the real content in the browser using client-side rendering, a crawler that reads raw HTML may see nothing but an empty
<div id="root">. - Your content is thin, buried, or unstructured. Even when a bot can reach and read your page, it needs clear, self-contained, quotable passages. A wall of marketing fluff with no direct answers gives the model nothing concrete to cite.
Method 1: Check your robots.txt
Your robots.txt file is the first gate. It lives at the root of your domain, so you can view it instantly by going to https://yoursite.com/robots.txt in any browser. This plain-text file tells crawlers which paths they may or may not fetch — and that includes the AI user-agents.
Open the file and scan for any Disallow rules tied to AI bots, or a blanket block that catches everything. Here are the user-agents that matter most for AI visibility in 2026:
Key AI crawler user-agents to look for in robots.txt
| User-agent | Operator | What it does |
|---|---|---|
| GPTBot | OpenAI | Crawls pages to train and improve OpenAI's models |
| ChatGPT-User | OpenAI | Fetches pages live when a user's prompt triggers a browse action |
| OAI-SearchBot | OpenAI | Indexes pages for ChatGPT search results and citations |
| PerplexityBot | Perplexity | Crawls and indexes pages for Perplexity answers |
| Google-Extended | Controls whether content is used for Gemini and AI features |
A line like User-agent: GPTBot followed by Disallow: / means you are explicitly telling ChatGPT's crawler to stay out of your entire site. Likewise, a global User-agent: * with Disallow: / blocks everyone. If you want AI to see your website, neither of these should be blocking the content you care about.
Method 2: Test what a bot actually receives
Viewing your site in a browser tells you what a human sees. To know what a crawler sees, you have to request the page as that crawler. The fastest way to do a curl user-agent test is from your terminal, spoofing the bot's user-agent string so the server responds exactly as it would to GPTBot.
# Request your homepage as GPTBot and read the raw HTML
curl -A "GPTBot" https://yoursite.com
# Save it to a file so you can search the response
curl -A "GPTBot" https://yoursite.com -o gptbot-view.html
# Follow redirects and show response headers too
curl -A "GPTBot" -IL https://yoursite.comRead the output. If your headline, body copy, and key facts appear in that raw HTML, a crawler can read them — this is what server-side rendering gives you. If the response is a near-empty shell with a few script tags and no visible text, the bot is receiving nothing useful, and you've found your problem.
View Source vs. the rendered DOM
You can run the same test inside a browser without the terminal. In Chrome or Firefox, right-click your page and choose View Page Source (or press Ctrl+U / Cmd+Option+U). This shows the raw HTML the server sent — the closest approximation to what a simple crawler ingests. Crucially, this is different from the rendered DOM you see in DevTools' Elements panel, which shows the page after JavaScript has run. AI crawlers that don't execute JavaScript only get the View Source version, so that's the one that matters for testing ChatGPT visibility.
Method 3: Check if your content needs JavaScript
This is the single most common reason a modern site is invisible to AI. Many React, Vue, and Angular apps use client-side rendering (CSR): the server sends a minimal HTML document, and the browser downloads JavaScript that fetches and builds the content afterward. Humans never notice. Crawlers that read raw HTML can be left with nothing.
To detect CSR, use View Source (Method 2) and look for these tell-tale signs:
- An almost-empty
<body>containing little more than<div id="root"></div>or<div id="app"></div>. - Large blocks of
<script>tags but no visible paragraph text, headings, or article content in the source. - A
curlresponse that's a fraction of the size of what you see in the browser. - Your main content appearing in DevTools' Elements panel but missing entirely from View Page Source.
If you see these patterns, your answer to "can ChatGPT read my site" is probably no. The fix is to render content on the server. Options include server-side rendering (SSR), static site generation (SSG), or prerendering, all of which deliver real HTML in the initial response. Frameworks like Next.js, Nuxt, Astro, and SvelteKit do this out of the box.
Method 4: Look for structured data
Structured data won't make an unreadable page readable, but it dramatically improves how confidently AI engines understand and quote a page they can already access. The dominant format is JSON-LD, embedded in a script tag. To check if AI can see your website's structured data, open View Source and search (Ctrl+F) for:
application/ld+jsonIf you find one or more <script type="application/ld+json"> blocks, your page is declaring machine-readable facts — things like article author and date, product price, FAQ pairs, organization details, or breadcrumbs. AI engines and Google's AI Overviews use these signals to disambiguate entities and pull precise answers. If you find nothing, adding relevant schema (Article, FAQPage, Organization, Product) is a high-leverage improvement. Validate it with Google's Rich Results Test or the Schema Markup Validator.
Method 5: Ask the AI engines directly
The most direct way to answer "is my website visible to ChatGPT" is to ask the engines themselves. This tests the final link: can they actually quote you? Run targeted prompts in each major engine and watch whether your site appears as a source or gets named in the answer.
- ChatGPT (with search enabled): Ask a question your page should answer, like "What does [your brand] offer?" or "Best [your topic] tools — include sources." Check whether your domain appears as a cited link.
- Perplexity: Perplexity always shows numbered citations, making it the easiest place to confirm visibility. Search your brand name plus a topic and look for your URL in the source list.
- Google Gemini / AI Overviews: Run an informational query in your niche and see whether your content is summarized or linked in the AI-generated answer.
Interpret the results carefully. If an engine cites you, it can clearly reach, read, and quote your content — you're in good shape. If it describes your topic well but never mentions you, your content is likely readable but not distinctive or quotable enough. If it gets basic facts about your brand wrong or can't find you at all, that points back to a crawling or rendering block you should diagnose with Methods 1–3.
Method 6: Run an automated GEO check
Running all five methods by hand is thorough but slow, and it's easy to miss something. An automated tool can fetch your page as a bot, parse the HTML, inspect your robots.txt, detect client-side rendering, and check for structured data and llms.txt in one pass — then score the result so you know where to focus.
This is exactly what a free GEO score does. Check GEO Score fetches your page the way an AI crawler would and grades it from 0–100 across four pillars of generative engine optimization: whether crawlers can access your content, whether it's structured for machine reading, whether it's genuinely answerable and quotable, and whether your site sends clear trust signals. It's the fastest way to turn the manual checks above into a prioritized to-do list.
What to do if ChatGPT can't see your site
If your tests revealed a problem, work through this checklist in order. It's sequenced from hard blockers to refinements, so fix the top items first.
- 01Unblock AI crawlers in robots.txt. Remove any
Disallowrules targetingGPTBot,ChatGPT-User,OAI-SearchBot, orPerplexityBot, unless you have a deliberate reason to exclude them. - 02Serve real HTML. If you detected client-side rendering, switch the pages that matter to server-side rendering, static generation, or prerendering so content appears in the initial response.
- 03Confirm the fix with curl. Re-run
curl -A "GPTBot" https://yoursite.comand verify your actual content now appears in the raw HTML. - 04Add structured data. Implement JSON-LD for your page type (Article, FAQPage, Organization, Product) and validate it.
- 05Write quotable, self-contained answers. Lead sections with direct answers, use clear headings as questions, and include facts, definitions, and lists a model can lift cleanly.
- 06Publish an llms.txt file. This emerging standard offers AI engines a clean, curated map of your most important content in Markdown at
/llms.txt. - 07Ensure a clean technical baseline. Use HTTPS, a valid sitemap, fast load times, and correct canonical tags so crawlers can navigate efficiently.
- 08Re-test with the AI engines. After changes propagate, repeat Method 5 to confirm you're now being read and cited.
Crawling and indexing take time, so don't expect instant citations after a fix. The point of these tests is to remove every technical obstacle between your content and the AI engines — once a page is reachable, readable, and quotable, your odds of being surfaced rise sharply.
Frequently asked questions
How do I check if ChatGPT can see my website?+
Test three things in order. First, open your robots.txt and confirm it doesn't block GPTBot, ChatGPT-User, or OAI-SearchBot. Second, run curl -A "GPTBot" https://yoursite.com and confirm your real content appears in the raw HTML. Third, ask ChatGPT (with search on) a question your page should answer and see if it cites your domain. If all three pass, ChatGPT can see your site.
How do I know if ChatGPT indexed my site?+
ChatGPT doesn't expose a public index the way Google Search Console does, so there's no exact "indexed" status to look up. The practical test is to enable search in ChatGPT and prompt it with brand and topic questions. If your URL shows up as a cited source, your content has been crawled and is retrievable. Perplexity, which always lists sources, is an even clearer place to confirm this.
Why doesn't ChatGPT mention my website?+
Usually one of three reasons: your robots.txt blocks the AI crawler, your content only renders with JavaScript so the bot receives an empty shell, or your content is readable but too thin or generic to be worth quoting. Work through Methods 1 through 3 to find the blocker, then improve the quotability of your content.
Does ChatGPT crawl my site?+
OpenAI operates several crawlers: GPTBot for model training, OAI-SearchBot for search indexing, and ChatGPT-User for live fetches triggered by a user's prompt. Whether they crawl you depends on your robots.txt permissions and whether your pages are reachable. You can verify access by checking your server logs for those user-agents or by running the curl test.
How do I test if AI can read my page?+
Use View Page Source (Ctrl+U or Cmd+Option+U) in your browser, or run a curl user-agent test from the terminal. If your headlines and body text appear in that raw HTML, AI crawlers can read them. If you only see an empty <div id="root"> and script tags, your content depends on JavaScript and many AI bots won't see it.
Does blocking GPTBot remove my site from ChatGPT?+
No. Blocking GPTBot in robots.txt prevents future crawling for training, but it doesn't erase knowledge the model already has, and it doesn't necessarily stop ChatGPT-User from fetching a page during a live browse. If you want maximum AI visibility, don't block any of OpenAI's user-agents.
What is a GEO score and why does it matter?+
GEO stands for generative engine optimization — the practice of making your content visible to AI answer engines like ChatGPT, Perplexity, and Gemini. A GEO score grades how well a page does this, typically across access, structure, answerability, and trust. It matters because it consolidates the manual checks in this guide into one number and a prioritized fix list.
Will server-side rendering help AI see my site?+
Yes, significantly, if your content currently relies on client-side rendering. Server-side rendering, static generation, and prerendering all deliver complete HTML in the initial response, so a crawler that doesn't execute JavaScript still receives your full content. After switching, confirm the change with the curl test.
What is llms.txt and do I need it?+
llms.txt is an emerging standard: a Markdown file at the root of your domain that gives AI engines a clean, curated overview of your most important pages. It's not yet universally supported and won't fix a blocked or unreadable site, but it's a low-cost signal that complements solid technical foundations and structured data.
How long until ChatGPT sees my site after I fix it?+
There's no fixed timeline. Live-fetch tools like ChatGPT-User and Perplexity can pick up changes almost immediately when a user's query triggers a fetch, while training-based knowledge updates on much slower cycles. The reliable approach is to fix the technical blockers, then re-test with the engines periodically rather than waiting for a guaranteed date.