Glossary

What is PerplexityBot?

PerplexityBot is Perplexity's web crawler. It fetches and indexes pages so they can be cited as the numbered sources under a Perplexity answer. Perplexity runs 2 main crawlers: PerplexityBot for indexing, and Perplexity-User for fetching a page a person acts on. The work that earns citations is the same work the the GEO study (Princeton, KDD 2024) measured. The rule is plain: “a source you can't crawl is a source you can't cite.”

What PerplexityBot actually does

When Perplexity answers a question, it does not invent the citations at the bottom. It pulls them from pages it has already fetched and stored. PerplexityBot is the program that does that fetching. It requests your page over HTTP, reads the HTML your server returns, and indexes the text it finds so that text can later be quoted and linked as a numbered source.

If PerplexityBot never reaches a page, that page cannot become one of those numbered sources. It is that direct. The crawler is the gate. Everything else about ranking and quotability comes after the page has been read.

PerplexityBot vs Perplexity-User

Perplexity runs 2 main crawlers, and they do different jobs. Treating them as one is the mistake that quietly removes a site from the answers:

Both identify themselves with a user-agent token that shows up in your server logs, so you can confirm which one visited and when. If you want to appear in Perplexity's answers, the crawler that has to be allowed is PerplexityBot.

Why it matters for your site

People increasingly ask Perplexity a question instead of scrolling a results page. The sources it cites get the click and the credit. If your page is not in that short list of numbered sources, you are invisible to that reader, even if you have the best answer on the web. Being readable by PerplexityBot is the price of admission.

The common failure is not a bad page, it is an unreachable one. A site looks perfect in a browser and is blank to a crawler because the answer only appears after JavaScript runs, or because a blanket robots.txt rule meant to "keep the AI out" also blocked the crawler that produces citations. Either way the result is the same: nothing to cite, so nothing cited. You can see exactly which checks a page passes or fails in our fix library.

How to allow PerplexityBot and check it

Start with two things, in order. First, make sure your robots.txt does not block the crawler:

# Let Perplexity index you for citations
User-agent: PerplexityBot
Allow: /

User-agent: Perplexity-User
Allow: /

Second, confirm your actual answer is in the raw HTML the server sends, not painted in later by JavaScript. Load the page with JavaScript disabled, or view source, and check that your real text is present. If it is missing there, the crawler sees an empty shell. A full walkthrough lives in our guide on how to fix robots.txt blocking AI crawlers.

After that, the work that pays off is the same work that has always paid off. Perplexity favors fresh, specific, well-structured sources, so lead with a self-contained answer near the top, include a real number or named entity, and use headings that match the question a person would actually ask. The peer-reviewed GEO study found that adding statistics, direct quotations, and cited sources measurably increased how often a page was surfaced in generative answers.

You can verify all of this in seconds. Paste your link into our GEO audit and we will tell you whether PerplexityBot can read you, whether your answer survives without JavaScript, and how quotable your top passage is. The same checks power the live dashboard in the app, which prints every failure in fix order.

A source you can't crawl is a source you can't cite.

See also

PerplexityBot is one entry in a small family of retrieval crawlers, each tied to a different answer engine. To see how the same pattern plays out at OpenAI, read What is OAI-SearchBot?. When you are ready to act on all of this, our guide on how to get cited by Perplexity turns the diagnosis into a fix list.

See if PerplexityBot can read you.

Paste your link. We check crawler access, JavaScript dependence, and how quotable your page is. The preview is free.