extract
Extract Web Content from URLs
This skill extracts clean Markdown or plain-text content from specific URLs using Tavily's extraction API. It suits research, documentation retrieval, and content aggregation, with no custom scraping code to write.
Download the skill ZIP
Upload in Claude
Go to Settings → Capabilities → Skills → Upload skill
Toggle on and start using
Test it
Using "extract". Extract content from https://example.com/about
Expected outcome:
## About Example
Welcome to Example.com...
Our Mission
We strive to provide...
Using "extract". Extract information about pricing from https://example.com/pricing and https://example.com/plans
Expected outcome:
## Pricing Information
### Basic Plan - $9/month
- Feature A
- Feature B
### Pro Plan - $29/month
- All Basic features
- Priority support...
Security Audit
Low Risk
Static analysis detected 137 potential issues across the external_commands, network, filesystem, and env_access categories. After semantic evaluation, all findings are FALSE POSITIVES: these patterns represent legitimate API extraction functionality. The skill uses standard shell commands (curl, jq) to communicate with Tavily's official API, accesses environment variables for API key authentication, and reads OAuth tokens from the standard MCP auth directory. No malicious behavior, data exfiltration, or command injection vulnerabilities were identified.
Low Risk Issues (4)
Risk Factors
⚙️ External commands (62)
🌐 Network access (33)
📁 Filesystem access (17)
🔑 Env variables (16)
What You Can Build
Research Documentation Gathering
Extract documentation content from multiple API reference pages to build a local knowledge base
Competitive Analysis
Extract content from competitor websites, product pages, and blog posts for market research
Content Aggregation
Pull articles and content from multiple news sources or blogs into a single markdown format
Try These Prompts
Extract the content from this URL: https://example.com/article
Extract content from these URLs: https://docs.example.com/api, https://docs.example.com/auth
Extract information about authentication from these URLs: https://example.com/docs, https://example.com/api-reference. Focus on API keys and OAuth.
Extract all content from this JavaScript-heavy page using advanced extraction: https://app.example.com/dashboard
Best Practices
- Use the query parameter to filter content to exactly what you need, especially when extracting from large pages
- Start with basic extraction and only use advanced mode if content is missing or incomplete
- Batch URLs by topic or category to keep results organized and relevant
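The basic-first practice above can be sketched in Python. This is a minimal illustration, not the skill's actual implementation: `send` is a hypothetical callable that posts a payload to the extraction API and returns the parsed JSON, and the field names `urls`, `extract_depth`, `results`, and `raw_content` are assumptions about the request and response shape. Only `query` is named on this page.

```python
from typing import Callable, Optional


def build_payload(urls: list[str], query: Optional[str] = None,
                  depth: str = "basic") -> dict:
    """Assemble a request body. Field names other than `query` are assumed."""
    payload = {"urls": urls, "extract_depth": depth}
    if query:
        # A query narrows large pages down to the relevant content.
        payload["query"] = query
    return payload


def extract_basic_first(send: Callable[[dict], dict], urls: list[str],
                        query: Optional[str] = None) -> dict[str, str]:
    """Run basic extraction, then retry only the empty URLs in advanced mode."""
    first = send(build_payload(urls, query, "basic"))
    content = {r["url"]: (r.get("raw_content") or "")
               for r in first.get("results", [])}

    # URLs with no usable content are the only ones worth the slower mode.
    missing = [u for u in urls if not content.get(u, "").strip()]
    if missing:
        second = send(build_payload(missing, query, "advanced"))
        for r in second.get("results", []):
            content[r["url"]] = r.get("raw_content") or ""
    return content
```

Passing `send` in as a parameter keeps the retry logic independent of how the request is made (curl, an HTTP library, or a test double).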
Avoid
- Requesting more than 20 URLs in a single call; the request will fail
- Using chunks_per_source without a query parameter; this returns an error
- Skipping the failed_results field in the response; extraction failures can go unnoticed
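The three pitfalls in the list above can be guarded against before and after a request. The Python sketch below is illustrative only: the helper names are hypothetical, and the assumption that each `failed_results` entry carries a `url` key is not confirmed by this page.

```python
MAX_URLS_PER_REQUEST = 20  # limit stated in the list above


def batch_urls(urls, size=MAX_URLS_PER_REQUEST):
    """Split a URL list into chunks that each fit in one request."""
    return [urls[i:i + size] for i in range(0, len(urls), size)]


def check_options(query, chunks_per_source):
    """Reject chunks_per_source when no query accompanies it."""
    if chunks_per_source is not None and not query:
        raise ValueError("chunks_per_source requires a query")


def split_results(response):
    """Return (successful results, failed URLs) from a parsed response.

    Assumes failed entries are dicts with a "url" key.
    """
    ok = response.get("results") or []
    failed = [item.get("url", "<unknown>")
              for item in response.get("failed_results") or []]
    return ok, failed
```

A caller would typically loop over `batch_urls(all_urls)`, send one request per batch, and log or retry whatever `split_results` reports as failed.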