Biological Datasets

Manage training data for AI models: sequences, structures, and properties

Therapeutic Antibody Sequences

active
Antibody Library | 45,892 entries | 2.3 GB | Updated: 2024-01-15
Source: OAS, IMGT, TherapeuticDB

PDB Protein Structures

active
Structure Database | 215,847 entries | 58.7 GB | Updated: 2024-01-20
Source: RCSB PDB

Protein-Protein Interaction Data

active
PPI Affinity | 12,453 entries | 890 MB | Updated: 2024-01-18
Source: SKEMPI, PDBbind

AlphaFold Predictions

syncing
Structure Predictions | 567,234 entries | 125 GB | Updated: 2024-01-10
Source: AlphaFold DB

Immunogenicity Training Set

active
Epitope Data | 8,923 entries | 345 MB | Updated: 2024-01-12
Source: IEDB, NetMHC

Developability Profiles

active
Biophysical Properties | 3,456 entries | 156 MB | Updated: 2024-01-14
Source: Internal, Literature

Total Entries

853.8k

Across all datasets

Total Storage

187.4 GB

Managed datasets

Active Datasets

5/6

Ready for training
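
The summary figures above can be recomputed from the per-dataset rows. The following is a minimal sketch, not the dashboard's actual implementation: the `Dataset` class and `summarize` function are hypothetical names, sizes given in MB are converted assuming 1 GB = 1000 MB, and a dataset counts as ready only when its status is "active". The dashboard may count entries differently, so a recomputed entry total need not match the displayed figure exactly.

```python
from dataclasses import dataclass


@dataclass
class Dataset:
    """One dataset card (hypothetical structure, for illustration only)."""
    name: str
    status: str      # "active" or "syncing", as shown on each card
    entries: int
    size_gb: float   # MB values converted assuming 1 GB = 1000 MB


# Values copied from the dataset cards listed above.
DATASETS = [
    Dataset("Therapeutic Antibody Sequences", "active", 45_892, 2.3),
    Dataset("PDB Protein Structures", "active", 215_847, 58.7),
    Dataset("Protein-Protein Interaction Data", "active", 12_453, 0.890),
    Dataset("AlphaFold Predictions", "syncing", 567_234, 125.0),
    Dataset("Immunogenicity Training Set", "active", 8_923, 0.345),
    Dataset("Developability Profiles", "active", 3_456, 0.156),
]


def summarize(datasets):
    """Aggregate entry count, storage, and active-dataset count."""
    return {
        "total_entries": sum(d.entries for d in datasets),
        "total_storage_gb": round(sum(d.size_gb for d in datasets), 1),
        "active": sum(1 for d in datasets if d.status == "active"),
        "count": len(datasets),
    }


if __name__ == "__main__":
    summary = summarize(DATASETS)
    print(f"Total entries:   {summary['total_entries']:,}")
    print(f"Total storage:   {summary['total_storage_gb']} GB")
    print(f"Active datasets: {summary['active']}/{summary['count']}")
```

Keeping the raw entry counts as integers and rounding only at display time avoids accumulating rounding error in the "k" and "GB" figures.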