How the test works
The tool follows the rules defined in RFC 9309 and Google's published behaviour:
- Fetch /robots.txt from the exact host you entered.
- Parse all user-agent groups and their Allow/Disallow rules.
- Pick the group matching your chosen user-agent, or the * group as a fallback.
- Find the most specific (longest) rule that matches the path. If an Allow and a Disallow rule are of equal length, Allow wins.
- Return the verdict: the URL is allowed if no rule matches, otherwise the winning rule decides, as shown in the sketch below.
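The following Python sketch makes the longest-match and tie-breaking steps concrete. It is a simplified illustration, not the tool's actual code: the `is_allowed` helper and its plain prefix matching are assumptions, and a real parser also has to handle wildcards, comments, and percent-encoding.

```python
def is_allowed(path: str, rules: list[tuple[str, str]]) -> bool:
    """Return True if `path` may be crawled under `rules`.

    `rules` holds (directive, pattern) pairs from the user-agent group
    that applies, e.g. [("disallow", "/admin/"), ("allow", "/admin/public/")].
    Wildcards are ignored in this sketch; patterns act as plain prefixes.
    """
    best_len = -1
    best_allow = True  # no matching rule at all -> allowed
    for directive, pattern in rules:
        if not pattern or not path.startswith(pattern):
            continue
        allow = directive.lower() == "allow"
        # Most specific (longest) pattern wins; on a length tie, Allow wins.
        if len(pattern) > best_len or (len(pattern) == best_len and allow):
            best_len = len(pattern)
            best_allow = allow
    return best_allow

rules = [("disallow", "/admin/"), ("allow", "/admin/public/")]
print(is_allowed("/admin/public/page.html", rules))  # True: the longer Allow wins
print(is_allowed("/admin/settings", rules))          # False: only the Disallow matches
print(is_allowed("/blog/post", rules))               # True: no rule matches
```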
Important limits
- Allowed ≠ indexable. robots.txt only controls crawling. A URL blocked in robots.txt can still appear in Google's index if other sites link to it. Use a noindex meta tag to keep pages out of the index.
- Each host has its own robots.txt. www.example.com/robots.txt does not apply to shop.example.com. Test each subdomain separately.
- Path patterns are case-sensitive. /Admin/ and /admin/ are different.
- Wildcards are supported. * matches any sequence of characters, and $ anchors the pattern to the end of the URL. Not every crawler supports both.
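To illustrate the wildcard and case-sensitivity points above, here is a hedged sketch that turns a robots.txt path pattern into a case-sensitive regular expression. The `pattern_to_regex` name and its simplified handling (for example, it does not check that $ appears only at the end of a pattern) are assumptions for illustration, not the tool's implementation.

```python
import re

def pattern_to_regex(pattern: str) -> re.Pattern:
    """Convert a robots.txt path pattern into an anchored, case-sensitive regex.

    '*' matches any run of characters, '$' anchors the match to the end of
    the URL, and every other character is matched literally.
    """
    regex = "^"  # patterns are matched from the start of the path
    for ch in pattern:
        if ch == "*":
            regex += ".*"
        elif ch == "$":
            regex += "$"
        else:
            regex += re.escape(ch)  # literal character, case preserved
    return re.compile(regex)

# /Admin/ and /admin/ really are different paths:
print(bool(pattern_to_regex("/admin/").match("/Admin/login")))           # False
print(bool(pattern_to_regex("/*.pdf$").match("/files/report.pdf")))      # True
print(bool(pattern_to_regex("/*.pdf$").match("/files/report.pdf?x=1")))  # False
```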
Related pages
Read the full primer in the robots.txt glossary entry. For specific crawl-related issues, such as a missing sitemap, blocked CSS, or blocking Googlebot entirely, see the crawl & links issue library.