How to Export a Complete List of Pages From Any Website
You probably don’t know how many pages your website actually has.
Your CMS might show one number. Your sitemap might show another. Analytics will show something else entirely. None of them guarantee that every live, crawlable page is accounted for.
If you need a complete, structured list of every reachable page on a website, you need a proper crawl-based inventory.
Quick truth: The only reliable way to know what exists on a website is to crawl it and record what is actually reachable.
Why CMS counts and sitemaps aren’t enough
Most site owners rely on their content management system to tell them how many pages they have. But CMS dashboards don’t always reflect:
- Old pages that are still accessible via direct URLs
- Pages not included in navigation
- Duplicate URLs created by parameters
- Pages excluded from sitemaps
- Orphaned content that still exists
Sitemaps are helpful — but they’re curated. They don’t always represent what’s actually live and accessible.
What a crawl-based page inventory does differently
A crawl-based inventory works like a search engine bot. It starts at your homepage and follows internal links, recording each reachable page along the way.
The result is a structured list of:
- Every discovered URL
- HTTP status codes
- Page titles
- Meta descriptions
- Basic structural data
Instead of trusting dashboards, you get a concrete export of what the site actually exposes.
Why exporting to CSV matters
A proper CSV export turns a crawl into something usable.
With a full page list exported, you can:
- Sort by URL structure
- Identify outdated sections
- Plan content pruning
- Prepare for migrations
- Compare before-and-after site states
- Share a clean inventory with developers or clients
It becomes a working document — not just a visual report.
Tip: Before redesigning or migrating a website, export a full page inventory first. It becomes your baseline reference point.
Why I built this into Tom’s Site Auditor
When rebuilding my own website, I needed a way to know exactly what existed before making structural changes. I didn’t want a cloud dashboard or ranking estimates — I wanted a deterministic list of pages I could inspect and archive.
That’s why I built Tom’s SEO Site Auditor as a standalone Windows desktop app.
It crawls your site locally and exports structured data — including a complete list of discovered pages — into clean CSV files.
No sampling. No API limits. No cloud dependency.
Practical use cases
Exporting a full page list is useful for more than SEO:
- Website migration planning
- Pre-launch validation
- Content audits
- Client handover documentation
- Regression checks after updates
- Verifying site structure changes over time
Run a scan today. Run it again next month. Compare the results. You’ll know exactly what changed.
Built for clarity, not guesswork
Tom’s Site Auditor is designed to produce deterministic results. The same site, with the same settings, produces the same structured outputs.
The exported CSV files can be archived, diffed, filtered, and shared — turning a website crawl into a tangible asset.
Try the full app free for 7 days
If you want to generate a complete, exportable list of every page on your website, you can try the full version for 7 days.
Download Tom’s SEO Site Auditor here.
If you decide to upgrade, additional export features and automation tools are included with the paid version. See pricing and what’s included.
Where this is headed
Understanding what exists on your website is the foundation for improving it. Before rankings, before redesigns, before optimisation — you need a clear inventory.
This tool was built to provide exactly that.