URLs contain more than 3 query parameters, which typically produce many near-duplicate variants of the same content and waste crawl budget.
By Seoxpert Editorial · Published
Each combination of query parameters is a unique URL in Google's eyes. A product listing with four filters and three sort orders generates dozens of crawlable variants per page of results — many carrying nearly identical content. Without canonicalisation, Google spends crawl budget on variants and struggles to identify which one to rank.
Direct: crawl budget exhausted on near-duplicate URLs means important pages are re-crawled less often. Indirect: ranking signals (links, behaviour) are split across variants instead of consolidated, lowering the ranking potential of the true canonical.
Scanner parses each URL's query string. URLs with more than 3 parameters are flagged for review — the threshold where parameter explosion typically starts to cause real crawl-budget issues.
Canonical tag on a filter-variant URL
<!-- URL: /shoes?color=red&size=10&sort=price_asc&utm_source=email -->
<link rel="canonical" href="https://example.com/shoes" />
<!-- The base category page becomes the canonical. Filter variants consolidate to it. -->Strip tracking params on internal requests
# Redirect utm_* params to a clean URL on the same host
if ($args ~* "utm_") {
return 301 $uri?$args_without_utm;
}When the parameter genuinely changes the main content and the result deserves to rank on its own — e.g. a search results page that returns unique matches for a specific keyword, or a product variant that has unique SEO value (colour-specific landing pages targeting "red running shoes").
Canonical is usually better. Blocking in robots.txt prevents Google from seeing the canonical tag that would consolidate the URL. Use robots.txt only for variants that have no ranking value at all, like sort-order pagination.
Run a scan to see if URLs with Excessive Query Parameters affects your pages.
Scan my website →