# Discover Website Sitemaps

Website URL discovery is slow when sitemap locations are unknown. This skill uses SitemapKit to find sitemap files, extract URLs, and summarize crawl coverage.

## Install

```bash
npx skillstore add 0nl1n1n/sitemapkit
```

## Metadata

- - Slug: 0nl1n1n-sitemapkit
- - Version: 1.0.0
- - Author: 0nl1n1n
- - GitHub username: 0nl1n1n
- - License: MIT
- - Repository: https://github.com/0nl1n1n/sitemapkit-mcp/tree/main/
- - Ref: main
- - Supported tools: Claude, Codex, Claude Code
- - Risk level: medium
- - Risk factors: network, env\_access
- - Quality score: 50
- - Quality tier: warning
- - Public page: https://skillstore.pages.dev/skills/0nl1n1n-sitemapkit
- - Manifest: https://skillstore.pages.dev/api/skills/0nl1n1n-sitemapkit/manifest

## Capabilities

- Finds sitemap files for a supplied website domain.
- Extracts URL entries from a direct sitemap URL.
- Runs a full sitemap crawl across discovered sitemap files.
- Returns URL counts, processed sitemap counts, and truncation status.
- Reports remaining SitemapKit quota when the API response includes quota metadata.
- Provides setup guidance for Claude Desktop, Cursor, and Windsurf MCP clients.

## Use Cases

- SEO site inventory: Collect sitemap URLs and page counts before checking index coverage or migration readiness.
- Competitive content mapping: Extract public sitemap URLs from a competitor domain to understand site sections and publishing scale.
- Developer migration checks: Compare expected pages against sitemap output before or after a site launch.

## Prompt Templates

### Find sitemap files

```
Find all sitemap files for https://example.com and tell me where each sitemap was discovered.
```

### Extract one sitemap

```
Extract the URLs from https://example.com/sitemap.xml and include last modified dates when available.
```

### Audit site URL count

```
Run a full sitemap crawl for https://example.com with a limit of 5000 URLs. Summarize total URLs, processed sitemaps, and truncation status.
```

### Compare launch coverage

```
Discover all sitemaps for https://example.com, group the returned URLs by site section, and flag sections with unexpectedly low page counts.
```

## Limitations

- Requires a valid SITEMAPKIT\_API\_KEY environment variable.
- Sends target URLs and domains to the SitemapKit API.
- Maximum returned URLs depend on the user plan and max\_urls setting.
- It discovers URLs from sitemaps, not from a full browser crawl of every page.

## Best Practices

- Confirm the target domain is public and appropriate to send to SitemapKit.
- Start with sitemap discovery before running a full crawl on large sites.
- Set max\_urls deliberately to control result size and quota usage.

## Anti Patterns

- Do not use it for private URLs or sensitive internal domains.
- Do not assume a sitemap contains every live page on a site.
- Do not request very high URL limits without checking quota and truncation status.

## Security Audit

- - Safe to publish: true
- - Audited at: 2026-06-27T11:22:26.456\+00:00
- - Summary: Static analysis reported command execution, weak crypto, filesystem, network, and credential patterns. Review found the command and weak-crypto alerts are false positives from Markdown backticks, TypeScript template strings, and documentation text, while network access and SITEMAPKIT\_API\_KEY usage are intentional for the SitemapKit API. The skill is publishable with a warning because user-supplied URLs and an API key are sent to a third-party service.

## Stats

- - Views: 149
- - Downloads: 40
- - Favorites: 1
- - Popularity score: 0
