Jun 16, 2025

Designer Spotlight: Can a language model reason about protein design?

In this designer spotlight, we take a look at Michael Hla’s recent Pro-1. It is a protein reasoning and optimization model capable to explain why it proposes a mutation. He tested FGF-1 designs in our lab at Adaptyv and managed to show significant melting temperature improvements, while still maintaining binding to their targets. One design even reached a melting temperature comparable to the most optimized FGF-1 variants in the literature!

TL;DR

Pro-1: the protein reasoning model

In March, Michael Hla took the protein design community by storm when he released Pro-1 - the first protein reasoning language model. His proposal is simple: let a language model distil biochemical intuition so it can do all the protein optimization you want. What this means more precisely: use a complex training scheme, with recent innovations in large language model reasoning used by AI labs like OpenAI and DeepSeek, to have a model that both proposes mutations and explains why it chose them.

Michael makes several good points for why protein design should be delegated to language models in his blog post. Some of the more interesting ones we found are interpretability - the model argues for each mutation it makes, pointing to relevant (or hallucinated) biochemical motives and paper references; and flexibility — he mentions Pro-1 can be prompted with sequences, PDB structures, even experimental results.

How to train your protein thinker

We found his training scheme incredibly unique. Michael combines biochemical intuition with synthetic data generation from specialized protein language models, a training framework from reinforcement learning, and a physics-based representation of protein stability. We will briefly describe it, but you should check out Michael’s blog post and thread for more info!

  1. Fine-tuning on synthetic reasoning traces

  2. Reinforcement learning with the Rosetta energy function

  3. Creativity rewards

We were all impressed by Pro-1, so we wanted to put it to the ultimate test: lab validation. We gave Michael some free binding affinity and thermostability assays for any designs he wanted. He chose to optimize the fibroblast growth factor 1 (FGF-1).

Why FGF-1?

As a growth factor, FGF-1 is one of the most versatile proteins. It regulates the fate of bone marrow cells and may promote bone repair, the development of lung epithelial cells with a therapeutic effect on pulmonary fibrosis, and is highly expressed in inflammatory cells. Its role in type 2 diabetes is becoming better understood, with experiments showing FGF-1 injections reduced the levels of glucose and increased the sensitivity to insulin in mice.

It binds to plenty of targets, including the fibroblast growth factor receptor 1 (FGFR1) and FGFR2. FGFR1 aberrations occur in several types of cancer and there are already FGFR1-inhibiting drugs like Pemigatinib for bile duct cancer treatment. FGF-like binders to FGFR1, especially when conjugated with cytotoxic drugs, could be a potent cancer therapeutic.

Michael mentioned another interesting fact about FGF-1: it has a pretty low denaturation temperature. Maintaining its binding while also increasing the melting temperature is a worthwhile task for Pro-1.

How we are measuring melting temperatures and binding affinities

We ran our standard automated assay for affinity characterization and thermostability. Proteins were expressed with a cell-free system, followed by affinity characterization via bio-layer interferometry (BLI) with the FGFR-1 target, FGF-1 wild-type control, and the designs Michael uploaded on our Foundry Portal. BLI measures the binder association and dissociation kinetics via the interference pattern of light reflected from a sensor surface. With these measurements and our in-house post-processing and curve-fitting software, we can calculate the binding affinity ($K_D$) of a protein to its target.

To measure the melting temperature of Michael’s designs, we used our newly-developed thermostability assay. The melting temperature (or $T_m$) represents the temperature at which 50% of a protein is in its unfolded state. This is around 49 °C for the wild-type FGF-1. We are using an automated nanoDSF (nano differential scanning fluorimetry) protocol: proteins are heated up and we measure the fluorescence shift of the tryptophan and tyrosine amino acids as they get more exposed from the protein’s core. We normalize these values and quantify the melting temperatures in our post-processing pipeline.

Pro-1 yields more stable binders

Most of the variants tested were expressed. Out of these, only 6 maintained binding to their target in the same range as the wild-type. In the figure above, we have highlighted the 3 variants with a $T_m$ higher than the measured control - the wild-type FGF-1 with 50.8 °C - that also bind to FGFR1.

What is more impressive is that a single-point mutant (K116E) reached a melting temperature improvement of 24 °C over the wild-type, and that Pro-1 even suggested this variant. When we consider it was trained on synthetic “perturbed” data and reasoning traces and an in silico objective (the Rosetta energy function), these are spectacular results! Most other Rosetta-based thermostability optimization studies also reach an improvement of 20 °C, yet none of them have a model able to explain in writing why it chose those mutations.

Michael showed an example of Pro-1’s reasoning trace for v37 variant (the 7-mutant in our bar plot). We found it interesting how it knows FGF-1’s binding partners and some plausible biochemical interpretations (e.g., mutations that reduce flexibility should increase stability, targeting hydrophobic patches to reduce the chance of aggregation). However do not attempt to fact-check its references: we tried that and we could not find any “Wu et al., 2017” that suggested the K127E mutation could increase stability of the FGF2 heterodimer, nor any “Kim et al., 2015). But this should not diminish Pro-1’s success - who knows, maybe the Pro-2 will align its reasoning with verified references. If OpenAI’s Deep Research can do it, so could Pro-1.

Resources and links

  • You can find all thermostability data here and the binding affinity data here

  • Try out Pro-1 here

  • Say hi to Michael Hla: Website, X, LinkedIn

  • Have some novel proteins you want to test in the lab? Come talk to us — we’d like to run many more of those protein designer spotlights, so if you have a cool new hypothesis or model to test we’d love to hear from you!