XML Sitemap Extractor Documentation
Overview: Extract and analyze URLs from XML sitemaps for competitive analysis, content audits, and technical SEO planning.
Features
- Parse XML sitemaps and sitemap indexes
- Extract up to 50,000 URLs
- Filter URLs by pattern or date
- Analyze sitemap metadata (lastmod, priority, changefreq)
- Export to CSV, JSON, or plain text
- Validate sitemap structure and format
How to Use
- Enter sitemap URL - Input the full URL to the XML sitemap
- Choose extraction options - Select metadata to include
- Apply filters - Filter by date ranges or URL patterns
- Extract and analyze - View results and export as needed
Common Use Cases
Competitive Analysis
- Discover competitor page structure
- Identify content gaps and opportunities
- Analyze update frequencies
- Compare sitemap sizes
Technical SEO
- Audit your own sitemaps
- Validate sitemap compliance
- Check for orphaned pages
- Monitor sitemap changes
Sitemap Types Supported
Type | Description | Max URLs |
---|---|---|
Standard XML Sitemap | Basic URL list with metadata | 50,000 |
Sitemap Index | Contains links to multiple sitemaps | 50,000 per sitemap |
News Sitemap | Google News specific format | 1,000 |
Image Sitemap | Contains image-specific metadata | 50,000 |
Video Sitemap | Contains video-specific metadata | 50,000 |
Filtering Options
URL Patterns
Filter URLs containing specific paths, extensions, or patterns.
Date Ranges
Extract URLs modified within specific date ranges.
Priority Levels
Filter by sitemap priority values (0.0 to 1.0).
Export Formats
- CSV (Excel compatible)
- JSON (API integration)
- Plain text (one URL per line)