gget
Query genomic databases for gene info and sequences
๋ํ ๋ค์์์ ์ฌ์ฉํ ์ ์์ต๋๋ค: davila7
gget provides unified access to 20+ genomic databases through a simple CLI or Python interface. Query gene information, retrieve sequences, perform BLAST searches, and run enrichment analysis without managing multiple API connections.
์คํฌ ZIP ๋ค์ด๋ก๋
Claude์์ ์ ๋ก๋
์ค์ โ ๊ธฐ๋ฅ โ ์คํฌ โ ์คํฌ ์ ๋ก๋๋ก ์ด๋
ํ ๊ธ์ ์ผ๊ณ ์ฌ์ฉ ์์
ํ ์คํธํด ๋ณด๊ธฐ
"gget" ์ฌ์ฉ ์ค์ ๋๋ค. Search for BRCA1 gene in human
์์ ๊ฒฐ๊ณผ:
- Found: ENSG00000012048 (BRCA1)
- Description: BRCA1 DNA repair associated
- UniProt ID: P38398
- Chromosome: 17q21.31
- Biotype: protein coding
"gget" ์ฌ์ฉ ์ค์ ๋๋ค. Get tissue expression for ACE2
์์ ๊ฒฐ๊ณผ:
- Kidney: median expression 245.3
- Testis: median expression 189.7
- Small intestine: median expression 156.2
- Bladder: median expression 98.4
๋ณด์ ๊ฐ์ฌ
์์ This is legitimate bioinformatics software. All 614 static findings are false positives: markdown code fences were misidentified as Ruby shell execution, hardcoded URLs are public genomic databases (Ensembl, UniProt, NCBI), cryptographic patterns are data integrity checksums, and the critical heuristic is standard bioinformatics behavior (network queries to public APIs + local file operations for results).
์ํ ์์ธ
๐ ๋คํธ์ํฌ ์ ๊ทผ (2)
๐ ํ์ผ ์์คํ ์ก์ธ์ค (3)
ํ์ง ์ ์
๋ง๋ค ์ ์๋ ๊ฒ
Quick gene lookups
Rapidly retrieve gene information, sequences, and annotations from Ensembl, UniProt, and NCBI during research exploration.
Enrichment analysis
Run gene ontology and pathway enrichment on gene lists to identify biological patterns and functional associations.
Sequence comparison
Perform BLAST searches and retrieve protein structures for sequence analysis and comparison tasks.
์ด ํ๋กฌํํธ๋ฅผ ์ฌ์ฉํด ๋ณด์ธ์
Use gget to search for the gene TP53 in human and return its Ensembl ID, description, and UniProt ID.
Retrieve the nucleotide and protein sequences for Ensembl gene ENSG00000139618 in FASTA format using gget.
Run a BLAST search using gget to find similar sequences to 'MSEQWKAVLFPLLLAAATSL...'. Use the nr database and return top 5 hits.
Run gget enrichr on the gene list ['TP53', 'BRCA1', 'MDM2', 'CDKN1A'] against KEGG Pathways database and show top 10 enriched terms.
๋ชจ๋ฒ ์ฌ๋ก
- Keep gget updated to match changing database structures
- Use the -csv flag for CLI output to simplify parsing
- Limit large queries to avoid server timeout errors
ํผํ๊ธฐ
- Running thousands of queries without rate limiting
- Using unreleased database versions in reproducible workflows
- Skipping result validation when downstream analysis depends on data quality
์์ฃผ ๋ฌป๋ ์ง๋ฌธ
What databases does gget query?
Does gget require authentication?
Can I use gget in Python scripts?
How does gget handle rate limits?
What species are supported?
How accurate are enrichment results?
๊ฐ๋ฐ์ ์ธ๋ถ ์ ๋ณด
์์ฑ์
K-Dense-AI๋ผ์ด์ ์ค
BSD-2-Clause license
๋ฆฌํฌ์งํ ๋ฆฌ
https://github.com/K-Dense-AI/claude-scientific-skills/tree/main/scientific-skills/gget์ฐธ์กฐ
main
ํ์ผ ๊ตฌ์กฐ
๐ references/
๐ database_info.md
๐ module_reference.md
๐ workflows.md
๐ scripts/
๐ batch_sequence_analysis.py
๐ gene_analysis.py
๐ SKILL.md