XML Sitemap Extractor Documentation

Overview: Extract and analyze URLs from XML sitemaps for competitive analysis, content audits, and technical SEO planning.

Features

  • Parse XML sitemaps and sitemap indexes
  • Extract up to 50,000 URLs per sitemap
  • Filter URLs by pattern or date
  • Analyze sitemap metadata (lastmod, priority, changefreq)
  • Export to CSV, JSON, or plain text
  • Validate sitemap structure and format
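
As an illustration of what parsing involves, here is a minimal sketch using Python's standard library. It is not the tool's own implementation; the function name parse_sitemap, the SAMPLE document, and the dictionary keys are assumptions chosen for this example. The namespace is the one defined by the sitemaps.org 0.9 protocol.

```python
import xml.etree.ElementTree as ET

# Namespace used by the sitemaps.org 0.9 protocol.
NS = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}


def parse_sitemap(xml_text):
    """Hypothetical helper: return (kind, entries) for a sitemap document."""
    root = ET.fromstring(xml_text)
    # <sitemapindex> lists child sitemaps; <urlset> lists page URLs.
    is_index = root.tag.endswith("}sitemapindex")
    child_path = "sm:sitemap" if is_index else "sm:url"
    entries = []
    for node in root.findall(child_path, NS):
        entries.append({
            "loc": node.findtext("sm:loc", default="", namespaces=NS).strip(),
            # Optional metadata; None when the element is absent.
            "lastmod": node.findtext("sm:lastmod", namespaces=NS),
            "changefreq": node.findtext("sm:changefreq", namespaces=NS),
            "priority": node.findtext("sm:priority", namespaces=NS),
        })
    return ("index" if is_index else "urlset"), entries


# Illustrative input only; example.com is a placeholder domain.
SAMPLE = """<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <url>
    <loc>https://example.com/</loc>
    <lastmod>2024-01-15</lastmod>
    <changefreq>weekly</changefreq>
    <priority>1.0</priority>
  </url>
  <url>
    <loc>https://example.com/blog/post-1</loc>
  </url>
</urlset>"""

kind, entries = parse_sitemap(SAMPLE)
```

For a sitemap index, the returned entries are the child sitemap locations, which would each need to be fetched and parsed in turn.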

How to Use

  1. Enter sitemap URL - Provide the full URL of the XML sitemap
  2. Choose extraction options - Select metadata to include
  3. Apply filters - Filter by date ranges or URL patterns
  4. Extract and analyze - View results and export as needed
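
For readers who want to reproduce step 1 in a script, the following is a rough sketch of downloading a sitemap with Python's standard library. The names fetch_sitemap and decode_sitemap_bytes and the User-Agent string are assumptions made for this example. Gzip handling is included because sitemaps are often served as .xml.gz files; whether this tool does the same is not stated here.

```python
import gzip
import urllib.request

# Gzip streams start with these two bytes.
GZIP_MAGIC = b"\x1f\x8b"


def decode_sitemap_bytes(data):
    """Hypothetical helper: decompress the payload if it is gzip-compressed."""
    if data[:2] == GZIP_MAGIC:
        return gzip.decompress(data)
    return data


def fetch_sitemap(url, timeout=30):
    """Hypothetical helper: download a sitemap and return its raw XML bytes."""
    request = urllib.request.Request(
        url,
        # Placeholder User-Agent, not the tool's actual value.
        headers={"User-Agent": "sitemap-extractor-example"},
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return decode_sitemap_bytes(response.read())
```

The returned bytes can be handed directly to an XML parser such as xml.etree.ElementTree.fromstring.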

Common Use Cases

Competitive Analysis

  • Discover competitor page structure
  • Identify content gaps and opportunities
  • Analyze update frequencies
  • Compare sitemap sizes

Technical SEO

  • Audit your own sitemaps
  • Validate sitemap compliance
  • Check for orphaned pages
  • Monitor sitemap changes

Sitemap Types Supported

Type                  Description                          Max URLs
Standard XML Sitemap  Basic URL list with metadata         50,000
Sitemap Index         Contains links to multiple sitemaps  50,000 per sitemap
News Sitemap          Google News specific format          1,000
Image Sitemap         Contains image-specific metadata     50,000
Video Sitemap         Contains video-specific metadata     50,000
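
One way to tell these types apart is to inspect the root element and the XML namespaces of the child elements. The sketch below assumes that approach. The function name detect_sitemap_type and the returned labels are made up for this example, and the tool may classify sitemaps differently. The namespace URIs are the ones Google publishes for its news, image, and video extensions.

```python
import xml.etree.ElementTree as ET

# Extension namespaces published by Google for specialised sitemaps.
EXTENSION_NAMESPACES = {
    "http://www.google.com/schemas/sitemap-news/0.9": "news",
    "http://www.google.com/schemas/sitemap-image/1.1": "image",
    "http://www.google.com/schemas/sitemap-video/1.1": "video",
}


def detect_sitemap_type(xml_text):
    """Hypothetical helper: classify a sitemap document by its markup."""
    root = ET.fromstring(xml_text)
    # Tags look like "{namespace}localname"; keep only the local name.
    local_root = root.tag.rsplit("}", 1)[-1]
    if local_root == "sitemapindex":
        return "index"
    if local_root != "urlset":
        return "unknown"
    for element in root.iter():
        tag = element.tag
        if isinstance(tag, str) and tag.startswith("{"):
            namespace = tag[1:].split("}", 1)[0]
            if namespace in EXTENSION_NAMESPACES:
                # The first extension element found decides the type.
                return EXTENSION_NAMESPACES[namespace]
    return "standard"
```

A sitemap that mixes several extensions is reported under whichever extension element appears first, which is a simplification of this sketch.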

Filtering Options

URL Patterns

Filter URLs containing specific paths, extensions, or patterns.
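
A pattern filter of this kind could look roughly like the sketch below. The function name filter_urls and its parameters (path_prefix, extension, regex) are assumptions for illustration and do not describe the tool's actual options.

```python
import re
from urllib.parse import urlparse


def filter_urls(urls, path_prefix=None, extension=None, regex=None):
    """Hypothetical helper: keep URLs that satisfy every filter supplied."""
    pattern = re.compile(regex) if regex else None
    kept = []
    for url in urls:
        path = urlparse(url).path
        if path_prefix and not path.startswith(path_prefix):
            continue
        # Extension check is case-insensitive and ignores the query string.
        if extension and not path.lower().endswith(extension.lower()):
            continue
        # The regex is matched against the full URL, not only the path.
        if pattern and not pattern.search(url):
            continue
        kept.append(url)
    return kept
```

For example, filter_urls(urls, path_prefix="/blog/") keeps only URLs whose path starts with /blog/, and combining arguments narrows the result further.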

Date Ranges

Extract URLs modified within specific date ranges.
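
Date filtering relies on the lastmod value, which the sitemap protocol expresses in W3C Datetime format (for example 2024-01-15 or 2024-03-01T10:30:00+00:00). A minimal sketch, assuming entries are dictionaries with a "lastmod" key; the helper names are hypothetical, and entries without a usable date are dropped here, which may not match the tool's behaviour.

```python
from datetime import date


def lastmod_date(value):
    """Hypothetical helper: parse the YYYY-MM-DD part of a lastmod value."""
    if not value:
        return None
    try:
        # Only the calendar date is used; time and UTC offset are ignored.
        return date.fromisoformat(value.strip()[:10])
    except ValueError:
        return None


def filter_by_date(entries, start=None, end=None):
    """Keep entries whose lastmod falls within [start, end], inclusive."""
    kept = []
    for entry in entries:
        modified = lastmod_date(entry.get("lastmod"))
        if modified is None:
            continue  # Assumption: undated entries are excluded.
        if start and modified < start:
            continue
        if end and modified > end:
            continue
        kept.append(entry)
    return kept
```

Reduced-precision values such as a bare year are treated as undated in this sketch.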

Priority Levels

Filter by sitemap priority values (0.0 to 1.0).
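
A priority filter can be sketched as follows. The sitemap protocol gives pages without a priority element a default of 0.5, and this example applies that default. The function name filter_by_priority and the entry layout are assumptions for illustration.

```python
# Default defined by the sitemap protocol for URLs with no <priority>.
DEFAULT_PRIORITY = 0.5


def filter_by_priority(entries, minimum=0.0, maximum=1.0):
    """Hypothetical helper: keep entries with minimum <= priority <= maximum."""
    kept = []
    for entry in entries:
        raw = entry.get("priority")
        try:
            priority = float(raw) if raw is not None else DEFAULT_PRIORITY
        except ValueError:
            continue  # Assumption: unparseable priorities are skipped.
        if minimum <= priority <= maximum:
            kept.append(entry)
    return kept
```

Calling filter_by_priority(entries, minimum=0.8) would keep only the pages a site has marked as most important relative to its other pages.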

Export Formats

  • CSV (Excel compatible)
  • JSON (API integration)
  • Plain text (one URL per line)
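
The three formats can be approximated with Python's standard library as shown below. This is a sketch under assumptions: the column order in FIELDS, the function names, and the entry layout are illustrative, and the tool's actual export layout may differ.

```python
import csv
import io
import json

# Assumed column order for this example.
FIELDS = ["loc", "lastmod", "changefreq", "priority"]


def to_csv(entries):
    """Return entries as CSV text with a header row."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=FIELDS, extrasaction="ignore")
    writer.writeheader()
    # Missing or None values are written as empty cells.
    writer.writerows(entries)
    return buffer.getvalue()


def to_json(entries):
    """Return entries as a JSON array, indented for readability."""
    return json.dumps(entries, indent=2)


def to_text(entries):
    """Return one URL per line."""
    return "".join(entry["loc"] + "\n" for entry in entries)
```

When saving the CSV text to disk, opening the file with newline="" avoids doubled line endings on Windows.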