Helixgate

Helixgate

Uncategorized

Bio-IT World Keynote Highlights Collaborative Intelligence in AI-Driven Drug Discovery

Published

on

BOSTONA critical part of the conversation around the use of artificial intelligence (AI) in drug discovery focuses on the development of the foundation models that underpin AI-based applications and workflows. There are also discussions about federated learning and how it provides a secure path to accessing critical training data for AI models. These two themes underpinned the keynote panel that kicked off the second day of Bio-IT World Conference 2026, which took place last month in Boston. 

Through presentations and a group discussion, the six-person panel painted a picture of the different types of foundation models and federated learning approaches, as well as ways to optimize AI’s performance for specific projects. Importantly, they discussed the AI Structural Biology (AISB) initiative, which provides a platform for pooling proprietary protein-ligand structure data to train OpenFold3, an AI model designed to precisely predict molecular interactions. In fact, several members of the panel were either directly or indirectly involved in the OpenFold consortium.   

Woody Sherman, PhD Founder and Chief Innovation Officer, PsiThera [Uduak Thomas]
Woody Sherman, PhD, founder and chief innovation officer, PsiThera [Uduak Thomas]

That group included Woody Sherman, PhD, founder and chief innovation officer at PsiThera, who serves as chair of the OpenFold executive committee. Sherman reiterated the benefits of open-source platforms and how they are making inroads into the drug discovery space.   

“We’re going to need these open platforms that we can all build on,” he said. “We can’t all be building our own foundation models from scratch. It just doesn’t make sense as an ecosystem. It is important to have these open platforms so that we can interact precompetitively, build the best foundation models, and then “we can get into federated learning.”    

From AlphaFold to OpenFold 

The non-profit OpenFold Consortium consists of scientists from over 40 technology companies, startups, pharma companies, and academic institutions. It builds on a lot of the progress made in the 2010s in terms of predicting protein structures from sequences, “a foundational problem in biochemistry.” That progress was quantified at least in part by efforts like the Critical Assessment of Structure Prediction (CAS) competitions, said Mohammed AlQuraishi, PhD, an assistant professor of systems biology at Columbia University, during his presentation.  

The image shows Mohammed AlQuraishi, PhD Assistant Professor, Systems Biology Columbia University, one of the keynote speakers at the recent Bio-IT World Conference
Mohammed AlQuraishi, PhD,
assistant professor, systems biology,
Columbia University [Uduak Thomas]

AlphaFold and later iterations of the platform “compressed decades of progress in about four years,” he said. Besides reliably predicting protein structures, AlphaFold provided “calibrated predictions” that gave biologists a sense of the accuracy of its predictions. But there were limitations.   

“It did not really have an understanding of anything other than just protein structure,” meaning it missed things like ions that were also part of the structure, he said. It also struggled to handle things like protein complexes, ligands, and cofactors. Another challenge was that although the computational models could predict targets that were closer to the training dataset used, their ability to make viable predictions dropped the further away the targets were from the training dataset. Additionally, “these models have limited ability to capture conformational changes,” AlQuraishi noted. “This becomes a major bottleneck in being able to reliably model allosteric modulators or cryptic pockets or similar types of systems.” Besides the technical limitations, there were also licensing limitations to consider.  

AlQuraishi positioned OpenFold as an open source, high-performance, and reproducible alternative to AlphaFold that serves as a common platform for innovation for the community. “Partly it’s a code base, essentially a set of tools that allow [scientists] to build these types of models and extend them and apply them,” AlQuraishi explained. “It’s also an academic-industry consortium that provides a steerable mechanism for industry to support science that is open source and that’s broadly useful, but it’s also in tune with the needs of industry.”

Federated learning and foundation models 

The next set of presentations made the argument for using federated learning to leverage proprietary biopharma datasets to train AI models. The presentation from Jonathan Gilbert, PhD, senior director, ecosystem growth and contributor partnerships at Eli Lilly, offered an example of how the pharma company has used federated learning to improve model predictions in different contexts.  

“It’s not surprising that companies are very sensitive to the proprietary data that they’ve spent incredible investments generating,” he said. With federated learning, models are trained in the environment where the data is housed, making it possible to “improve model performance while maintaining the privacy of the individual training sets.”   

Eli Lilly launched the TuneLab platform in 2025, through which it provides access to its own AI and machine learning models to biotech companies at no costalthough those that choose to use the models are expected to contribute datasets to help improve them. “These are the same models that we use every day,” he said. “These models have been trained on decades of internal data sets. That’s maybe over a billion dollars in data that have been brought into models by Lilly.” 

Jonathan Gilbert, PhD Senior Director, Ecosystem Growth and Contributor Partnerships, Eli Lilly and Company. [Uduak Thomas]
Jonathan Gilbert, PhD, senior director, Ecosystem Growth and Contributor Partnerships, Eli Lilly and Company. [Uduak Thomas]

Gilbert noted that since its launch, the appetite for TuneLab has been quite strong. At the time of the presentation, there were more than 75 partners in TuneLab, and it was being used in dozens of countries across three continents. Furthermore, during the meeting, Eli Lilly and Collaborative Drug Discovery (CDD), a provider of data management solutions for pharma and biotech, announced an agreement to integrate TuneLab into both the core and AI modules within the CDD Vault platform.  

For now, TuneLab is focused on models for small molecules and antibody development, but there are plans to release additional models in the near future. Lilly is also working on additional partnerships similar to the one with Collaborative Drug Discovery. “This is an active work in progress and [we are] thinking [about] how we can scale this,” Gilbert said. And how can “[we] build a community to improve those models such that we can create medicines faster for more people.” 

The presentation from José-Tomás Prieto, PhD, director of AI programs at Apheris, built on Gilbert’s presentation but focused on the complexities of implementing industrial federated learning setups. The key takeaway from his talk was that successfully implementing federated learning at an industrial scale is not a plug-and-play capability but rather a process that requires engineering rigor, data preparation without centralization, and enterprise-level deployment strategies. His company, Apheris, has experience with this process as they provide solutions that power federated networks for drug discovery. 

José-Tomás Prieto, PhD Director of AI Programs Apheris. [Uduak Thomas]
José-Tomás Prieto, PhD, director of AI programs, Apheris [Uduak Thomas]

One of the networks that they support is the AI Structural Biology Network, a collaboration that brings together several of the top 20 biopharma companies. Its intent is to allow AI models designed to predict the 3D structure of molecule complexes to be trained on proprietary protein structure data. The common denominator for these and other networks that Apheris supports is that the models are trained on proprietary data in a secure way, so the data never leaves the environments of any of the nodes in the network.  

“It’s obvious that there’s a lot of public data, but the public data skews toward well-characterized targets,” Prieto said. “The industry data complements that view, with more diverse data and sometimes higher quality data. And if there is something to learn about the AI world today is that you cannot necessarily model your way out of a data problem, and you can’t buy this data either.” Federated learning provides a solution to that problem. “It’s quite remarkable that a couple of years ago … it was mostly IT people making the decision of whether to use federated learning products,” he noted. “Today, we have business leaders trying to get ahold of this technology and leverage the power.” 

Prieto also discussed some important considerations for building federated networks. To provide a sense of the complexity involved, “each one of these companies have their own network constraints, their own firewall rules, their own compute window that they have to negotiate with the cloud providers to make sure that the compute comes online at the right time.”  

It is also important to consider “that data preparation without centralization is a new paradigm,” he continued. “Each company has [their] own ways of organizing the data or harmonizing your data,” as well as their own standards, but at the same time you have to have comparable training setups so that the foundation models can actually learn from this.” Furthermore, “your federated learning partner has to be able to work with your processes, has to be able to understand how to streamline the reviews, the security [and] the privacy requirements” among other things before projects can move forward. 

A key point that both Prieto and Arman Zaribafiyan, PhD, head of strategic alliances, AI simulation at SandboxAQ, emphasized was that while federated models provide broad generalizations, fine-tuning them on specific, project-level data is crucial for translating model performance into practical impact for drug programs.  

SandboxAQ has worked with the OpenFold consortium on its models and co-folding models, among other projects. “We are really living in exciting times when it comes to ML-accelerated drug discovery,” Zaribafiyan said. “There’s really an explosion of new models we see every day. And what we see at Sandbox with our partners in large pharma and biotech companies is that it’s getting a little bit overwhelming and harder to put these models into good use.” Furthermore, “a lot of these models are amazing in achieving great results on benchmarks, and it’s fascinating for publishing papers, but they fail to generalize to real drug discovery use cases.”

Arman Zaribafiyan, PhD, Head of Strategic Alliances, AI Simulation SandboxAQ. [Uduak Thomas]
Arman Zaribafiyan, PhD, head of strategic alliances, AI simulation, SandboxAQ [Uduak Thomas]

Commenting on some of the lessons SandboxAQ has learned through its partnerships, Zaribafiyan noted that “fine-tuning could help a lot to bridge this gap.” SandboxAQ and others have published data showing that “even a small fine-tuning effort can dramatically change the predictive accuracy of these models.” Over the next few years, he believes these are going to become the norm: “We’re going to see more and more of these federated platforms we use for both pooling data but also for fine-tuning these models on project-specific data.”   

Zaribafiyan closed his presentation with an announcement of a new platform from SandboxAq that connects quantitative models for drug discovery to large language models, allowing scientists to launch and run simulations and workflows using plain English, much like prompts written for ChatGPT. “No code required,” he said. 

Foundational models at work in crop science and drug development 

Christina Taylor, PhD, senior science fellow and computational molecular design lead at Bayer, focused on how her company has leveraged foundational models and AI to drive decisions in crop science and pharma.

Christina Taylor, PhD Senior Science Fellow and Computational Molecular Design Lead Bayer. [Uduak Thomas]
Christina Taylor, PhD, senior science fellow and computational molecular design lead, Bayer [Uduak Thomas]

“I think that this community-driven software has really allowed faster innovation in the field overall,” and “sharing some of these foundational architectures allows everyone to be able to drive biomolecular AI work,” and “ has driven some of the very quick advancements we’ve seen in the field over the past few years.” Community projects like this also save time and are more sustainable since “everybody doesn’t need to be training their own foundational models.”   

To date, these models have helped Taylor and her team better solve crystal structures. “One of the big problems with solving crystal structures is actually determining the phase, and by doing protein, we’re able to actually solve these structures faster and more efficiently,” she said. “Another thing is taking these foundational models and fine-tuning them … we’re using that quite regularly to improve our development of biomolecular pharmaceuticals as well as some of our crop science traits.” Other applications that Taylor and her team have used the models for include studying protein-protein interaction as well as for modeling enzyme catalysis.

The post Bio-IT World Keynote Highlights Collaborative Intelligence in AI-Driven Drug Discovery appeared first on GEN – Genetic Engineering and Biotechnology News.

Continue Reading
Click to comment

Leave a Reply

Your email address will not be published. Required fields are marked *

Uncategorized

Beyond sunshine: Iberia’s biotech moment has arrived with developing capital networks

Published

on

Strong science, lower costs and growing capital networks are putting Spain and Portugal on the biotech investment map, even as structural bottlenecks persist, according to two investors.

Continue Reading

Uncategorized

Laser‑Driven Phase Contrast Enhances Cryo‑EM Resolution of Small Proteins

Published

on

You know when you are at the eye doctor getting an updated prescription, and suddenly the world snaps into sharper focus? Physicists at the University of California (UC), Berkeley, have now done something similar for electron microscopy. By introducing phase contrast into a cryo‑electron microscope, they have delivered dramatically sharper images of some of biology’s smallest and most elusive proteins.

The advance comes from a new laser phase plate (LPP), described in the paper “Laser phase plate improves structure determination of small proteins by cryo‑EM,” which was published recently in Science. Led by physicist Holger Mueller, PhD, of UC Berkeley and Lawrence Berkeley National Laboratory, the team demonstrated that a laser‑driven phase plate can overcome one of cryo‑EM’s most persistent limitations: poor contrast for small proteins.

Cryo-EM images of two proteins, apoferritin and hemoglobin, taken without and with a laser phase plate. The images are analyzed in a computer to produce detailed 3D structures of the proteins. [Holger Müller, Jessie Zhang/UC Berkeley]

Cryo‑EM has transformed structural biology over the past decade, earning a Nobel Prize in 2017 for enabling high‑resolution structures without crystallization. But despite its impact, the technique still struggles with proteins below ~70 kilodaltons—a size range that includes about 90% of the human proteome. “Because of signal-to-noise limitations, the majority of human and animal proteins are too small to be analyzed by these methods [cryo-EM and cryoelectron tomography]. The increase in signal-to-noise ratio provided by this laser phase plate is expected to overcome these important limitations.”

The new LPP begins to address that problem. The LPP uses an intense, continuous‑wave laser to shift the phase of the electron beam itself. This produces true phase contrast without dimming or destabilizing the beam. Mueller described the laser focus as “75 kilowatts focused to a few microns… That’s more powerful than what you use for welding. It has more power than a military laser. It builds up the brightest continuous laser focus ever.”

Installed in a custom Thermo Fisher Titan Krios, the LPP immediately improved the clarity and resolvability of small proteins, including hemoglobin, which sits at the lower limit of what today’s cryo‑EM instruments can handle. As the authors wrote in the abstract: “Here, we show that the laser phase plate (LPP)… enhances the resolution in single-particle reconstruction of small proteins by improving specimen-motion correction, recovery of information from the early frames, as well as particle visualization, 3D classification, and alignment.”

phase plate cover Cryo-EM
A laser (purple) is powerfully amplified by highly polished mirrors and focused on the electron beam (blue) to shift its phase and increase the cryo-EM microscope’s contrast, allowing biologists to image smaller proteins and the crowded structures inside cells. [Sayo Studio]

These improvements were achieved using standard defocus ranges and reconstruction workflows. “For the most challenging cases—small particles, bad specimens—the laser produces a very considerable advantage,” Mueller said.

 

The impact extends beyond single‑particle analysis. Cryo‑electron tomography (cryo‑ET), which assembles multiple angular views of a molecule or protein into a three-dimensional image, stands to benefit even more. “With cryo-ET, we’re looking at small, very complicated cellular material that’s incredibly crowded inside the cell,” said Bridget Carragher, PhD, founding technical director of imaging at Biohub. “It’s like a forest of trees, and you’re trying to find one leaf on one tree in there. Cryo-ET needs a dramatic step forward in contrast, so we can start to see what’s going on inside the cell. That’s what the laser phase plate promises to give us.”

Biohub is developing a dual‑laser version of the system, designed to reduce component wear and minimize aberrations. Meanwhile, Mueller’s team is pushing toward imaging proteins as small as 17 kilodaltons, a threshold that would open access to vast regions of the human proteome previously invisible to cryo‑EM.

“This technology is a step function change for biology,” said Stephani Otte, PhD, Biohub’s vice president of imaging science. “What was once invisible will become visible—and that changes everything about how we understand disease.”

“The bottom line is, if you have a large protein and a really good sample—a fresh one or one frozen without bubbles, for example—you may not need the phase plate to get a single, high-quality image. But for a small protein and a bad sample, laser-on is best,” Mueller said. “This could fill an enormous gap in our knowledge of protein structures that can’t be crystallized or are too small for today’s cryo-EM. And it will be revolutionary for cryo-ET.”

The post Laser‑Driven Phase Contrast Enhances Cryo‑EM Resolution of Small Proteins appeared first on GEN – Genetic Engineering and Biotechnology News.

Continue Reading

Uncategorized

STAT+: Updated: Tracking RFK Jr.’s promises to remake health in America

Updated June 11, 2026

WASHINGTON — A pledge to “Make America Healthy Again” earned Robert F. Kennedy Jr. his job atop U.S. health agencies a year and some change ago. He’s now had the opportunity to turn his words into action, with mixed results.  

“All one needs” to prove the health secretary’s attentiveness is to “review my unprecedented list of accomplishments on a wide range of issues, all of which I drove,” Kennedy posted on X on Wednesday in response to a journalist.

Continue to STAT+ to read the full story…

Read More

Published

on

Updated June 11, 2026

WASHINGTON — A pledge to “Make America Healthy Again” earned Robert F. Kennedy Jr. his job atop U.S. health agencies a year and some change ago. He’s now had the opportunity to turn his words into action, with mixed results.  

“All one needs” to prove the health secretary’s attentiveness is to “review my unprecedented list of accomplishments on a wide range of issues, all of which I drove,” Kennedy posted on X on Wednesday in response to a journalist.

Continue to STAT+ to read the full story…

Read More

Continue Reading
Advertisement

Trending