February 26, 2026
Tech company NVIDIA powered the development of an AI tool that will enable scientists to efficiently design bespoke proteins for pharmaceuticals and materials.

Mizzou is a leader in pioneering new ways to leverage artificial intelligence (AI) to enable advances in chemistry, energy, health care and materials.
Now an interdisciplinary team of researchers has developed technology that helps AI systems learn from and work with protein structures and could ultimately help reduce the time and cost required to develop new drugs.
The team鈥檚 method, called the Geometry鈥慍omplete Vector鈥慟uantized Variational Autoencoder (GCP鈥慥QVAE), creates a language for protein geometry that preserves true shape and orientation and can decode that language back into detailed 3D structures with near-atomic accuracy.
Scientists have long dreamed of harnessing and directing the creative power of proteins, the molecular machines that build cells, carry signals and power chemical reactions. But working directly with 3D structure is difficult for many AI systems.
GCP鈥慥QVAE translates complex protein 3D structure into a form that generative models can learn from and use in protein engineering and drug discovery.
鈥淧urpose-built proteins could create new medicines, clean up pollution or build sustainable materials,鈥 said Mahdi Pourmirzaei, a computer science PhD student and research assistant at , where he applies computer science to bioinformatics and protein design.
Overcoming challenges
What a protein does is determined by the shape, or fold, of its long strings of amino acids. To engineer proteins with specific functions, scientists first need to understand how different folds relate to different activities.
鈥淚n theory, you could start building proteins blindly, but the design space is astronomically large,鈥 said Alex Morehead (PhD 2025), now a Hopper Postdoctoral Fellow at the Lawrence Berkeley National Laboratory. 鈥淥f the billions or trillions of possible amino acid sequences, only a tiny fraction will fold into a useful shape.
To avoid countless iterations and enormous time and cost, scientists in recent years developed a series of computational tools, including AlphaFold-class methods, that were able to predict a protein鈥檚 3D structure, turn that structure into a string of symbols, and turn that string into a set of distinct, separate pieces.
But raw 3D coordinates are not a natural fit for many AI systems, because a protein鈥檚 coordinates can move or rotate even when the protein itself has not changed.
鈥淎ny system that tries to read a protein must recognize the molecule when it rotates or moves across the room,鈥 Morehead said. 鈥淓ven a shift of a few angstroms or a flipped chirality can mean the difference between a life-saving drug and a useless molecule.鈥
Fast and accurate
Now the team can translate protein structure into a form AI can learn from without losing critical 3D detail.
By translating complex protein 3D structure into a compact, physically faithful representation that can be decoded back into detailed structures, GCP鈥慥QVAE opens the door to multiple downstream applications across protein science鈥攆rom accelerating early-stage discovery to making structure easier for AI systems to learn from and work with.
鈥淕CP鈥慥QVAE respects the physics of 3D space,鈥 Pourmirzaei said, emphasizing that the representation stays meaningful in real 3D settings rather than getting distorted by how a structure is positioned.
That matters in practical pipelines such as high鈥憈hroughput drug discovery, where thousands of candidate designs need to be generated, reconstructed, and screened quickly 鈥 and where GCP鈥慥QVAE鈥檚 fast reconstruction makes that scale more realistic.
Just as importantly, it helps make structural information 鈥渘ative鈥 to the new generation of generative AI models, so they can learn from protein structure and propose new designs that can be decoded back into usable 3D candidates.
But protein design is still a numbers game.
鈥淩esearchers currently fail in about 90% of their attempts,鈥 Pourmirzaei said. 鈥淕CP鈥慥QVAE, combined with the new wave of generative AI, helps us generate, reconstruct and triage many more candidate designs computationally, so we can dramatically reduce the time and cost compared to traditional lab鈥憃nly approaches.鈥
Modern AI projects require a combination of cloud computing expertise, distributed training, strong software engineering skills and domain knowledge. To translate ideas into working systems, universities like Mizzou will have to build these capabilities and foster collaboration.
Pourmirzaei said the project success relied on computing resources from the National AI Research Resource (NAIRR) Pilot with support from NVIDIA DGX Cloud and the NVIDIA AI Enterprise Software Platform.
鈥淲ithout NVIDIA鈥檚 continuous support, mentorship and engineering guidance, it would have been impossible to bring a project like this to completion and reach such an important research goal,鈥 he said.
The research team plans to iterate on the model and eventually enable chatbots capable of reading, understanding and analyzing protein structures or even designing new proteins in the future. They have released their and .
The principal investigator on the study was Pourmirzaei鈥檚 advisor, Dong Xu, former Curators’ Distinguished Professor at Mizzou 糖心Vlog传媒. Other researchers on the project were Farzaneh Esmaili (University of Missouri), Jarett Ren (Carnegie Mellon University) and Mohammadreza Pourmirzaei (Politecnico di Milano).
鈥淢aking modern AI to understand and design proteins opens the door to new molecules, new therapies and entirely new possibilities,鈥 Pourmirzaei said. 鈥淧recisely the kind of practical, impactful innovation Mizzou is known for.鈥
Discover more ways Mizzou 糖心Vlog传媒 is creating meaningful change.