Skills alphafold-database

🧬

alphafold-database

Name: alphafold-database
Author: K-Dense-AI

Safe 🌐 Network access⚙️ External commands📁 Filesystem access

Access AlphaFold protein structures by UniProt ID

Also available from: davila7

Researchers need efficient access to computational protein structure predictions for drug discovery and structural biology studies. This skill provides direct access to AlphaFold DB's 200M+ AI-predicted protein structures, enabling retrieval by UniProt ID, download of coordinate files, and analysis of confidence metrics.

Supports: Claude Codex Code(CC)

📊 69 Adequate

Download the skill ZIP

Upload in Claude

Go to Settings → Capabilities → Skills → Upload skill

Toggle on and start using

Test it

Using "alphafold-database". Download the AlphaFold structure for P00520 and analyze confidence

Expected outcome:

AlphaFold ID: AF-P00520-F1
Protein: Tyrosine-protein kinase ABL1 (Human)
Sequence length: 1130 residues
pLDDT Analysis:
- Very high confidence (>90): 67% of residues
- High confidence (70-90): 18% of residues
Structure saved to: ./structures/AF-P00520-F1-model_v4.cif

Using "alphafold-database". Download E. coli proteome using Google Cloud

Expected outcome:

taxonomy ID: 83333
Downloading from: gs://public-datasets-deepmind-alphafold-v4/proteomes/
Files matched: 4123
Downloading proteome-tax_id-83333-*.tar (45 GB total)
Progress: 45.2 GB / 45.2 GB (100%)
Extracted 4123 structure archives to ./proteomes/

Security Audit

Safe

v4 • 1/17/2026

This is a legitimate scientific skill for accessing the AlphaFold protein structure database. All 244 static findings are false positives. The analyzer misinterpreted markdown code formatting (backticks), standard Python HTTP library usage, and documented public API endpoints as security threats. The skill uses safe Biopython library calls, standard requests to authorized EBI APIs, and subprocess with list-form arguments for Google Cloud access.

Files scanned

1,160

Lines analyzed

findings

Total audits

Risk Factors

🌐 Network access (2)

SKILL.md:57-127 references/api_reference.md:38-105

⚙️ External commands (1)

SKILL.md:244-255

📁 Filesystem access (1)

SKILL.md:115-136

Audited by: claude View Audit History →

Quality Score

Architecture

100

Maintainability

Content

Community

100

Security

Spec Compliance

What You Can Build

Retrieve protein structures for docking

Download target protein structures for computational docking studies and analyze binding site conformations.

Analyze prediction confidence

Evaluate pLDDT and PAE metrics to identify reliable structural regions for downstream analysis.

Build automated pipelines

Integrate AlphaFold access into computational workflows for large-scale protein analysis.

Try These Prompts

Get single protein structure

Download the AlphaFold structure for UniProt ID P00520 in mmCIF format and show the pLDDT confidence scores.

Compare multiple proteins

Download structures for P00520, P12931, and P04637. Compare their average pLDDT scores and identify high-confidence regions.

Batch download by species

Download all AlphaFold predictions for E. coli (taxonomy ID 83333) using Google Cloud bulk access.

Integrate with analysis pipeline

Create a Python script that takes a list of UniProt IDs, downloads their structures, extracts CA coordinates, and calculates inter-residue distances.

Best Practices

Use Biopython for simple single-protein access (cleaner API than direct HTTP calls)
Cache downloaded files locally to avoid repeated API requests and rate limits
For bulk downloads over 100 proteins, use Google Cloud Storage instead of REST API

Avoid

Avoid using shell=True with subprocess when calling gsutil (use list form instead)
Do not ignore pLDDT scores when interpreting structures (low confidence regions may be unreliable)
Avoid downloading individual files for whole proteomes (use tar archives from Google Cloud)

Frequently Asked Questions

What is the difference between PDB and mmCIF formats?

PDB is legacy format with 99,999 atom limit. mmCIF is modern standard supporting larger structures with full metadata.

How reliable are AlphaFold predictions?

Predictions with pLDDT >90 are very reliable. Regions below 50 may be disordered. Always check confidence metrics.

Can I use AlphaFold structures for drug docking?

Yes, but validate high-confidence regions. Low confidence areas may not reflect true structure. Consider multiple models.

What is the rate limit for the AlphaFold API?

Official limits are not published. Use 10 concurrent requests max with 100-200ms delays between calls.

How do I download an entire species proteome?

Use Google Cloud: gsutil cp gs://public-datasets-deepmind-alphafold-v4/proteomes/proteome-tax_id-*.tar .

Does this skill support multi-chain protein complexes?

No. AlphaFold DB provides single-chain predictions only. For complexes, model each chain separately.

Developer Details

Author

K-Dense-AI

License

MIT

Repository

https://github.com/K-Dense-AI/claude-scientific-skills/tree/main/scientific-skills/alphafold-database

Ref

main

File structure

📁 references/

📄 api_reference.md

📄 SKILL.md