After a busy few weeks with over 680 protein designers submitting their best binder designs against the surface glycoprotein of the Nipah virus, the submissions and community vote doors have closed and we are waiting in anticipation for the experimental results to arrive. In the meantime, we want to highlight some interesting new insights and cool protein structures that we have gathered during the course of this competition.
Let’s start with a quick recap: Nipah is one of the deadliest viruses in the world and considered one of the top future pandemic risks. It’s a virus that’s found in fruit bats and was first discovered near the Nipah river in Malaysia during an outbreak in 1999. The virus is asymptomatic in bats but can infect livestock like pigs if they eat fruit contaminated by bat feces. From there it can infect humans if they are in contact with sick animals. In humans, it causes severe respiratory and neurological disease (Ganguly et al.), with mortality rates of up to 70%. In comparison, SARS-CoV-2 had mortality rates of ˜1-3%. Because of this high mortality rate, Nipah is considered a top-priority virus for vaccine development. No approved treatments or vaccines currently exist (Chan et al.). So the goal of this competition is as simple as it is difficult: Designing the best protein binders capable of neutralizing the Nipah virus.
Specifically, we asked participants to create binders that can disrupt the interaction between the viral glycoprotein G and its human receptor, ephrin-B2/B3 - an essential step the virus uses to enter host cells and initiate infection. Blocking this interaction has shown to reduce viral infection.
Out of the thousands of designs submitted we wanted to select the 1200 most promising designs for experimental validation:
So in total, more than 1 000 binders are being tested in our lab right now for binding to Niv-G. Stay tuned for the results of the competition, which we will release on January 16th on Proteinbase!
Mirroring the trend we observed in our previous binder design competition, participants relied heavily on RFDiffusion and BindCraft to design their binders. However, the design toolkit is evolving and in this latest competition, the newly launched BoltzGen model (Stark et al.) emerged as the most popular design tool. We’ll explain the top design choices in more detail below.
BoltzGen is an all-atom generative diffusion model that unifies structure design and prediction into a single framework. By embedding structural reasoning directly into the generative process, the model achieves state-of-the-art accuracy in both design and folding. The model was validated across 26 targets in eight experimental campaigns, reaching a 66% success rate for designing binders with affinities in the nanomolar range. BoltzGen unifies the traditionally separate stages of design, inverse folding and filtering into one streamlined pipeline. This ease of use might also explain why the tool was very popular among designers.
The 2nd most popular design tool was design-a-protein.com, a platform we developed specifically for this competition to run a protein design workflow in less than 2 min! The goal here was not to run the most powerful model but allow non-experts and beginners to explore protein design in a playful way with a very fast design model. Users can generate Nipah virus binders by selecting structural hotspots on the viral protein, which serve as input to the design pipeline. Protpardelle-1c (Lu et al.) generates 3D binder backbones around these hotspots, and ProteinMPNN (Dauparas et al.) is used to designs sequences that fold into these structures. Additionally, users can control parameters such as chain length, number of designs, and temperature, and inspect predicted ipSAE scores in a dashboard to easily select high-scoring candidates.
The animation below shows the cumulative growth of submissions throughout the competition. Many participants submitted multiple entries over the course of several weeks, and momentum clearly built over time, with a sharp increase in submissions as the deadline approached.
As in the previous competition, we selected designs for validation using an in silico metric, Boltz-2 ipSAE (more details on that metric later). This naturally fostered competition for the top spots on the leaderboard. The leading ipSAE score changed repeatedly over the course of the competition, with participants actively pushing the limits of this metric. This led to a steady upward trend in the highest-scoring ipSAE designs submitted over time. Competition intensity was high, with some participants even opting to experimentally test their top in silico candidates experimentally before submission, aiming to ensure that only the most promising designs were entered (here).
When comparing the molecule types between all submissions with the top 100 designs ranked by ipSAE score, miniproteins are approximately twice as prevalent in the top-ranked set, accounting for 55% of the top 100 submissions. Miniproteins are defined as designs with a molecular weight below 10.5 kDa and less than 35% loop content. Miniprotein-like designs meet the same loop-content criterion but have molecular weights between 10.5 and 15 kDa.
This enrichment is consistent with the tendency of Boltz-2, when used without a template multiple sequence alignment (MSA), to favor de novo miniprotein designs rather than antibody derivatives such as nanobodies and scFvs (Yin et al.).
Interestingly, the top 100 designs are enriched for proteins dominated by α-helical structures compared to the full set of submissions. Correspondingly, designs containing mixed α-helical and β-sheet architectures, as well as predominantly β-sheet structures, are underrepresented among the top-ranked entries. This shift may reflect the greater fold stability and lower structural ambiguity of α-helical proteins, which can lead to more confident structure predictions and higher ipSAE scores.
Previous competitions have shown that choosing an appropriate computational score to prioritize the most promising designs for experimental validation is in itself a challenge. Previously, we had used either the interface PAE (iPAE) metric from AlphaFold2 or a combination of iPAE, AlphaFold2’s interface pTM (iPTM) score and ESM2’s log-likelihood score (ESM-PLL). Both turned out to have their shortcomings. In the last competition, our use of unnormalized ESM log-likelihoods created a bias toward shorter designs.
For this competition, we opted to use the Boltz-2 ipSAE score for filtering and selected the 600 most promising designs based on this metric. You can find our reasoning for choosing it in the FAQ. Already during the course of the competition, some interesting questions and concerns were raised regarding the reliance of this metric.
Many participants evaluated their designs locally prior to submission and encountered reproducibility issues of the ipSAE score. These differences arise in part because the model can yield slightly different results depending on hardware configuration and random seeds. A certain degree of unpredictibility was intentional, as it makes direct optimization for the metric more difficult. However, as it turned out over the course of the competition, the ipSAE score variance was not uniform across designs and therefore problematic. Importantly, variability in ipSAE reproducibility may also have conferred an unintended advantage to designs with more stable scores, as these could be more easily optimized directly for the metric.
Given the difficulty of choosing a universally robust scoring function, we wanted to complement the computational filtering with community input and expert knowledge when deciding which designs would ultimately be selected for experimental testing. We’d like to sincerely thank the community for enthusiastically participating in the community vote, championing their favorite designs, and generally making this process far more interesting than a simple ranking table. We also owe a big thank-you to a group of protein design experts who volunteered to dive into the submissions and hand-pick proteins they found particularly promising.
This combination of computational scoring, community engagement, and expert review was intended to mitigate the limitations of any single selection strategy and to arrive at a more balanced and informed set of designs for experimental validation. Additionally, to mitigate the impact of ipSAE reproducibility issues, we added 200 additional expert-selected sequences to the pool of designs designated for experimental validation.
While we’re waiting for the experimental results to come out of the lab, we thought it would be fun to create some prediction markets for people to bet on the hit rate for this competition, which model(s) will perform best and which designers will have binders.
Check out the Proteinbase questions on Manifold: https://manifold.markets/Proteinbase
Manifold uses a play money currency called Mana that you can bet with. You can just sign up for free and you’ll get 1000 mana to play with.
Welcome to our mini “Nipah Gallery” — a highlight reel of protein structures we found particularly interesting from this competition. Each image links directly to its Proteinbase entry. Think of it like “Spotify Wrapped” for our Nipah design competition. The featured designs were selected based on how unique and compelling the structures were, the creativity of the design approaches used, and our collective intuition as the Adaptyv team about which candidates might express well and bind effectively.
Some of the most interesting competition submissions are those that offer insight into the design process, including the decisions, motivations, and reasoning behind them. We’d like to thank everyone who took the time to talk about the strategies. We believe this contributes to the collective understanding of protein design tools quite a lot! In the following, we highlight several of these submissions, along with blog posts and social media discussions that reflect participants’ experiences during the competition.
We’d like to thank all participants the Nipah competition, the people who voted for their favorite designs, our panel of experts, and everyone else involved who made this such a cool experience. We’re super excited to release all the experimental results and reveal the final rankings soon!