Ultimate Alphafold Database
All-in-one skill covering access, alphafold, predicted, protein. Includes structured workflows, validation checks, and reusable patterns for scientific.
Ultimate AlphaFold Database
A scientific computing skill for querying and working with the AlphaFold Protein Structure Database — the public repository of over 200 million AI-predicted 3D protein structures maintained by DeepMind and EMBL-EBI. Ultimate AlphaFold Database helps you retrieve predicted structures, assess prediction confidence, and integrate structural data into research workflows.
When to Use This Skill
Choose Ultimate AlphaFold Database when:
- Retrieving predicted protein structures by UniProt accession
- Assessing prediction confidence (pLDDT scores) for structural regions
- Downloading structure files (PDB, mmCIF) for downstream analysis
- Integrating AlphaFold predictions into structural biology pipelines
Consider alternatives when:
- You need experimental (not predicted) structures (use PDB/RCSB)
- You want to run AlphaFold predictions yourself (use ColabFold or AlphaFold software)
- You need protein-protein complex predictions (use AlphaFold-Multimer)
- You're doing sequence analysis without structural context (use UniProt directly)
Quick Start
claude "Retrieve the AlphaFold predicted structure for human insulin (P01308)"
import requests # Fetch structure metadata uniprot_id = "P01308" # Human insulin url = f"https://alphafold.ebi.ac.uk/api/prediction/{uniprot_id}" response = requests.get(url) data = response.json()[0] print(f"Protein: {data['uniprotDescription']}") print(f"Organism: {data['organismScientificName']}") print(f"PDB URL: {data['pdbUrl']}") print(f"CIF URL: {data['cifUrl']}") print(f"PAE URL: {data['paeImageUrl']}") # Download the PDB file pdb_response = requests.get(data['pdbUrl']) with open(f"AF-{uniprot_id}-F1-model_v4.pdb", 'w') as f: f.write(pdb_response.text)
Core Concepts
AlphaFold API Endpoints
| Endpoint | Purpose | Example |
|---|---|---|
/api/prediction/{id} | Get structure by UniProt ID | /api/prediction/P01308 |
/api/prediction/{id}?qualifier=F1 | Specific fragment | Fragment 1 of multi-fragment |
| PDB download | Structure coordinates | alphafold.ebi.ac.uk/files/AF-{id}-F1-model_v4.pdb |
| mmCIF download | Structure in mmCIF format | alphafold.ebi.ac.uk/files/AF-{id}-F1-model_v4.cif |
| PAE download | Predicted Aligned Error | alphafold.ebi.ac.uk/files/AF-{id}-F1-predicted_aligned_error_v4.json |
Confidence Metrics
from Bio.PDB import PDBParser import numpy as np # Parse AlphaFold PDB — B-factor column contains pLDDT scores parser = PDBParser(QUIET=True) structure = parser.get_structure("protein", f"AF-{uniprot_id}-F1-model_v4.pdb") plddt_scores = [] for atom in structure.get_atoms(): if atom.name == "CA": # Alpha carbons only plddt_scores.append(atom.bfactor) plddt = np.array(plddt_scores) print(f"Mean pLDDT: {plddt.mean():.1f}") print(f"High confidence (>90): {(plddt > 90).sum()} residues") print(f"Confident (70-90): {((plddt >= 70) & (plddt <= 90)).sum()} residues") print(f"Low confidence (50-70): {((plddt >= 50) & (plddt < 70)).sum()} residues") print(f"Very low (<50): {(plddt < 50).sum()} residues")
Confidence Interpretation
| pLDDT Score | Confidence | Interpretation |
|---|---|---|
| > 90 | Very high | Backbone and side chains reliable |
| 70 – 90 | Confident | Backbone reliable, some side chain uncertainty |
| 50 – 70 | Low | Treat with caution, likely disordered |
| < 50 | Very low | Likely intrinsically disordered region |
Configuration
| Parameter | Description | Default |
|---|---|---|
api_base_url | AlphaFold API base URL | https://alphafold.ebi.ac.uk |
model_version | AlphaFold model version | 4 |
output_format | PDB or mmCIF | pdb |
include_pae | Download Predicted Aligned Error | false |
confidence_threshold | Minimum pLDDT for analysis | 70 |
Best Practices
-
Check pLDDT scores before using a structure. Not all regions of an AlphaFold prediction are equally reliable. Regions with pLDDT < 70 (often loops and termini) should not be used for detailed structural analysis or drug design without experimental validation.
-
Use the PAE (Predicted Aligned Error) for multi-domain proteins. pLDDT tells you about local confidence, but PAE tells you about the relative positioning of domains. High PAE between two domains means their relative orientation is uncertain — don't trust the inter-domain geometry.
-
Cross-reference with experimental structures. If an experimental structure exists in the PDB for your protein, compare it with the AlphaFold prediction. Discrepancies highlight regions where the prediction may be less reliable or where the protein adopts different conformations.
-
Batch downloads for large-scale analysis. For proteome-scale studies, use the bulk download files from the AlphaFold FTP server rather than making individual API calls. EMBL-EBI provides pre-packaged tar files for entire organisms.
-
Cite AlphaFold appropriately. When using AlphaFold predictions in publications, cite both the AlphaFold method paper and the AlphaFold database paper. Include the model version (v4) and the date of access, as predictions may be updated.
Common Issues
No structure found for a UniProt ID. AlphaFold DB doesn't cover all proteins — it focuses on UniProt reference proteomes. Check that your ID is a valid UniProt accession (not PDB, RefSeq, or GenBank). For proteins not in the database, run AlphaFold locally or use ColabFold.
Structure looks unrealistic in visualization. Regions with pLDDT < 50 often appear as extended, unstructured loops. This doesn't mean the protein is actually extended — these regions are likely intrinsically disordered, and AlphaFold is correctly indicating uncertainty. Don't interpret low-confidence regions as real structural features.
Large protein produces multiple fragment files. AlphaFold splits proteins longer than ~2700 residues into overlapping fragments (F1, F2, etc.). You'll need to stitch fragments together or analyze them separately. The overlap regions can help you align fragments, but inter-fragment domain positioning should be treated with caution.
Reviews
No reviews yet. Be the first to review this template!
Similar Templates
Full-Stack Code Reviewer
Comprehensive code review skill that checks for security vulnerabilities, performance issues, accessibility, and best practices across frontend and backend code.
Test Suite Generator
Generates comprehensive test suites with unit tests, integration tests, and edge cases. Supports Jest, Vitest, Pytest, and Go testing.
Pro Architecture Workspace
Battle-tested skill for architectural, decision, making, framework. Includes structured workflows, validation checks, and reusable patterns for development.