biopython
Analyze biological sequences with Biopython
Also available from: davila7
Process DNA, RNA, and protein sequences efficiently using Biopython. Access NCBI databases programmatically, perform sequence alignments, and automate complex bioinformatics pipelines for research applications.
Download the skill ZIP
Upload in Claude
Go to Settings → Capabilities → Skills → Upload skill
Toggle on and start using
Test it
Using "biopython". Read a GenBank file and print information about each record
Expected outcome:
- Record ID: NC_001416
- Description: Escherichia coli bacteriophage lambda, complete genome
- Sequence length: 48502 bp
- GC content: 49.23%
- Features: 73 total (genes, promoters, operators)
Using "biopython". Search PubMed for CRISPR and get the first 5 results
Expected outcome:
- Found 15432 results for 'CRISPR'
- PMIDs: 37253479, 37253478, 37253477, 37253476, 37253475
- First result: Title, Authors, Journal info
Security Audit
SafeAll 546 static findings are FALSE POSITIVES. This skill contains only markdown documentation files with Python code examples for Biopython. The scanner misidentified markdown code block delimiters as shell commands, Biopython module names (Bio.Align, Bio.Phylo) as cryptographic algorithms, and documentation placeholders as real secrets. No executable code exists. This is a legitimate scientific documentation skill.
Risk Factors
🔑 Env variables (7)
⚙️ External commands (415)
📁 Filesystem access (6)
Quality Score
What You Can Build
Batch sequence processing
Process large FASTA files, calculate GC content, translate sequences, and prepare data for downstream analysis.
Build analysis pipelines
Create automated workflows for sequence retrieval, alignment, phylogenetics, and results parsing.
Learn bioinformatics
Explore sequence operations, database access, and structural bioinformatics through practical examples.
Try These Prompts
Show me how to read a FASTA file and extract sequence records using Bio.SeqIO.
Write code to search PubMed using Bio.Entrez and fetch the top 10 results for a query about protein kinase.
Perform a pairwise global alignment between two DNA sequences using Bio.Align and show me the alignment score.
Load a PDB file and calculate the distance between two alpha carbon atoms in a protein chain.
Best Practices
- Always set Entrez.email when accessing NCBI databases to comply with their usage policy
- Use iterators like SeqIO.parse() for large files to avoid loading everything into memory
- Cache downloaded sequences to avoid redundant network requests and respect rate limits
Avoid
- Do not hardcode API keys in scripts; use environment variables instead
- Do not skip error handling for network operations which can fail due to connectivity issues
- Do not process large alignments without checking memory requirements first
Frequently Asked Questions
What file formats does Biopython support?
How do I access NCBI databases?
Can Biopython run local BLAST searches?
How do I calculate sequence properties?
What is the difference between parse and read?
How do I handle rate limits with NCBI?
Developer Details
Author
K-Dense-AILicense
BSD-3-Clause
Repository
https://github.com/K-Dense-AI/claude-scientific-skills/tree/main/scientific-skills/biopythonRef
main
File structure