AI + Medicine

What Is a Biological Foundation Model?

A biological foundation model is built on the idea that biology contains patterns that can be learned at scale. Instead of training a model for one narrow task, researchers train it on broad biological data, such as protein sequences, structures, gene expression, single-cell data, images, or multimodal lab measurements. The model can then be adapted for tasks like variant interpretation, protein design, target discovery, or disease modeling.

AIBiology

Video coming soon

This page is ready for the published video once it is added to the content feed.

Plain-English explanation

Why it matters

Rare diseases often suffer from small datasets. A foundation model may help by transferring patterns learned from broader biology into rare contexts. That does not replace disease-specific validation, but it may help researchers generate better hypotheses and connect a rare disorder to pathways or mechanisms studied elsewhere.

How it works

The model learns statistical relationships in biological data, such as how protein sequences relate to structure or how gene-expression patterns relate to cell state. Researchers can then prompt, fine-tune, or evaluate the model for a specific question. Any useful prediction still needs experimental confirmation, especially when the disease is rare or poorly represented in training data.

Infographic

Key terms

Foundation ModelProtein SequenceSingle-Cell DataEmbeddingFine-Tuning