esm
Generate and design proteins with ESM
Also available from: davila7
Protein engineering requires analyzing and designing novel protein sequences. ESM provides state-of-the-art protein language models for generating sequences, predicting structures, and creating embeddings for downstream analysis. Use ESM3 for generative tasks across sequence, structure, and function, or ESM C for efficient representation learning.
Download the skill ZIP
Upload in Claude
Go to Settings → Capabilities → Skills → Upload skill
Toggle on and start using
Test it
Using "esm". Generate a complete GFP variant using ESM3. The sequence should have the chromophore motif 'SYG' at positions 65-67.
Expected outcome:
Generated GFP variant sequence (238 amino acids) with chromophore motif preserved. The model produced a sequence with characteristic GFP barrel structure. Sequence can be validated through structure prediction.
Using "esm". Extract embeddings for these three protein sequences and compute pairwise similarity.
Expected outcome:
Embedding vectors (1280 dimensions for ESM C-300M). Pairwise cosine similarities: 0.72 between sequences A and B, 0.45 between A and C, 0.68 between B and C.
Security Audit
SafeAll 368 static findings are false positives. The scanner incorrectly flagged markdown documentation patterns. The skill provides documentation for legitimate protein language models from EvolutionaryScale. All code examples are standard scientific workflows for protein engineering. Python f-strings with underscores (protein masks), MD5 for cache keys, and ML terminology were misclassified as security issues.
Risk Factors
⚡ Contains scripts (5)
🌐 Network access (21)
⚙️ External commands (188)
📁 Filesystem access (13)
Quality Score
What You Can Build
Design novel fluorescent proteins
Generate GFP variants with desired properties using ESM3 function conditioning to target specific functional domains while exploring sequence space.
Optimize enzyme sequences
Use inverse folding to redesign enzyme sequences that maintain structural integrity while improving stability or activity for industrial applications.
Screen protein variants with embeddings
Generate embeddings for protein libraries and use clustering to identify promising candidates before experimental testing.
Try These Prompts
Generate a complete protein sequence by filling in the masked positions. The input sequence uses '_' to represent masked residues that need to be generated. Use ESM3 with the sequence track to complete the protein: {partial_sequence}Use ESM3 to predict the 3D structure of this protein sequence. Generate structure coordinates using the structure track. Return the PDB-formatted coordinates: {protein_sequence}Perform inverse folding to design a protein sequence that folds into the target structure provided as PDB. Use ESM3 to generate a sequence that is compatible with the 3D coordinates: {pdb_content}Generate high-quality embeddings for this protein sequence using ESM C. Return the embedding vector for downstream analysis or similarity comparison: {protein_sequence}Best Practices
- Start with smaller open models for prototyping before scaling to larger Forge API models for production quality
- Use chain-of-thought generation for complex protein designs by iterating through sequence, structure, and function tracks
- Validate generated sequences with structure prediction and follow Responsible Biodesign Framework guidelines
Avoid
- Do not skip experimental validation of computationally designed proteins
- Do not use generated proteins directly in clinical applications without proper testing
- Do not ignore biosafety implications when designing proteins with novel functions