TLDFYI

Domain indexation describes how many pages of a domain Googlebot has crawled and added to its search index. A healthy domain has most of its important pages indexed with minimal low-quality or duplicate content competing for crawl budget. Crawl budget — the number of pages Google crawls per day — is partly allocated at the domain level, so a large domain with many thin pages can cause important pages to be crawled infrequently. [[canonical-url|Canonical tags]], [[robots-txt|robots.txt]], and [[sitemap-domain|XML sitemaps]] are the primary tools for guiding indexation.

Example

Running 'site:example.com' in Google and finding 50,000 indexed pages for a 10,000-page site suggests duplicate content is consuming crawl budget that should be allocated to canonical pages.

Domain Indexation

Example