cv | Alex C. Morehead

Basics

Name	Alex Morehead
Label	Hopper Postdoctoral Fellow
Email	alex.morehead@gmail.com
Url	https://amorehead.github.io/
Summary	Machine learning and computational biology researcher.

Work

2025.06 - 2027.06
Hopper Postdoctoral Fellow

NERSC, Lawrence Berkeley National Laboratory

Research and develop deep generative models for science.
2020.08 - 2025.06
Graduate Research Assistant

Bioinformatics & Machine Learning Lab, University of Missouri-Columbia

Researched geometric deep learning and generative modeling methods for bioinformatics, yielding 20+ academic works.
- Geometric Transformers
- GCPNet
- GCDM
- FlowDock
- PoseBench
- MULTICOM_ligand

Volunteer

2022.05 - 2023.08

Columbia, Missouri
Teaching Assistant

Encircle Technologies

Crafted and taught special education computer science curriculum for neurodivergent youth.
- Implemented reinforcement learning algorithms for Minecraft in a collaborative virtual classroom environment

Education

2020.08 - 2024.06

Columbia, Missouri
MS

University of Missouri-Columbia

Computer Science
- Machine Learning
- Design & Analysis of Algorithms
- Computational Systems Biology
2020.08 - 2025.06

Columbia, Missouri
PhD

University of Missouri-Columbia

Computer Science
- Machine Learning
- Design & Analysis of Algorithms
- Computational Systems Biology
2016.08 - 2020.06

Saint Joseph, Missouri
BS

Missouri Western State University

Computer Science
- Data Structures
- Operating Systems
- Linear Algebra

Awards

2025.06.01

Admiral Grace Hopper Fellowship in Computing Sciences

Lawrence Berkeley National Laboratory

Awarded a competitive postdoctoral fellowship for research in computing sciences.
2025.05.01

Outstanding EECS PhD Student Award

University of Missouri-Columbia

Ranked as a top EECS PhD student in the spring of 2025.
2025.03.01

Outstanding Computer Science PhD Student Award

University of Missouri-Columbia

Ranked as a top computer science PhD student in the spring of 2025.
2020.08.01

Dean’s Engineering Excellence Fellowship

University of Missouri-Columbia

Earned a competitive graduate fellowship for ﬁrst-year PhD students.
2020.08.01

O'Neill Graduate Fellowship

University of Missouri-Columbia

Won a competitive graduate fellowship for ﬁrst-year PhD students.

Publications

2025.05.01

Protein-ligand structure and affinity prediction in CASP16 using a geometric deep learning ensemble and flow matching

Proteins: Structure, Function, and Bioinformatics

Predicting the structure of ligands bound to proteins is a foundational problem in modern biotechnology and drug discovery, yet little is known about how to combine the predictions of protein-ligand structure (poses) produced by the latest deep learning methods to identify the best poses and how to accurately estimate the binding affinity between a protein target and a list of ligand candidates. Further, a blind benchmarking and assessment of protein-ligand structure and binding affinity prediction is necessary to ensure it generalizes well to new settings. Towards this end, we introduce MULTICOM_ligand, a deep learning-based protein-ligand structure and binding affinity prediction ensemble featuring structural consensus ranking for unsupervised pose ranking and a new deep generative flow matching model for joint structure and binding affinity prediction. Notably, MULTICOM_ligand ranked among the top-5 ligand prediction methods in both protein-ligand structure prediction and binding affinity prediction in the 16th Critical Assessment of Techniques for Structure Prediction (CASP16), demonstrating its efficacy and utility for real-world drug discovery efforts. The source code for MULTICOM_ligand is freely available at https://github.com/BioinfoMachineLearning/MULTICOM_ligand.
2025.04.01

FlowDock: Geometric flow matching for generative protein-ligand docking and affinity prediction

Intelligent Systems for Molecular Biology

Powerful generative AI models of protein-ligand structure have recently been proposed, but few of these methods support both flexible protein-ligand docking and affinity estimation. Of those that do, none can directly model multiple binding ligands concurrently or have been rigorously benchmarked on pharmacologically relevant drug targets, hindering their widespread adoption in drug discovery efforts. In this work, we propose FlowDock, the first deep geometric generative model based on conditional flow matching that learns to directly map unbound (apo) structures to their bound (holo) counterparts for an arbitrary number of binding ligands. Furthermore, FlowDock provides predicted structural confidence scores and binding affinity values with each of its generated protein-ligand complex structures, enabling fast virtual screening of new (multi-ligand) drug targets. For the well-known PoseBusters Benchmark dataset, FlowDock outperforms single-sequence AlphaFold 3 with a 51% blind docking success rate using unbound (apo) protein input structures and without any information derived from multiple sequence alignments, and for the challenging new DockGen-E dataset, FlowDock outperforms single-sequence AlphaFold 3 and matches single-sequence Chai-1 for binding pocket generalization. Additionally, in the ligand category of the 16th community-wide Critical Assessment of Techniques for Structure Prediction (CASP16), FlowDock ranked among the top-5 methods for pharmacological binding affinity estimation across 140 protein-ligand complexes, demonstrating the efficacy of its learned representations in virtual screening. Source code, data, and pre-trained models are available at https://github.com/BioinfoMachineLearning/FlowDock.
2024.07.01

Geometry-complete diffusion for 3D molecule generation and optimization

Communications Chemistry

Generative deep learning methods have recently been proposed for generating 3D molecules using equivariant graph neural networks (GNNs) within a denoising diffusion framework. However, such methods are unable to learn important geometric properties of 3D molecules, as they adopt molecule-agnostic and non-geometric GNNs as their 3D graph denoising networks, which notably hinders their ability to generate valid large 3D molecules. In this work, we address these gaps by introducing the Geometry-Complete Diffusion Model (GCDM) for 3D molecule generation, which outperforms existing 3D molecular diffusion models by significant margins across conditional and unconditional settings for the QM9 dataset and the larger GEOM-Drugs dataset, respectively. Importantly, we demonstrate that GCDM’s generative denoising process enables the model to generate a significant proportion of valid and energetically-stable large molecules at the scale of GEOM-Drugs, whereas previous methods fail to do so with the features they learn. Additionally, we show that extensions of GCDM can not only effectively design 3D molecules for specific protein pockets but can be repurposed to consistently optimize the geometry and chemical composition of existing 3D molecules for molecular stability and property specificity, demonstrating new versatility of molecular diffusion models. Code and data are freely available at https://github.com/BioinfoMachineLearning/bio-diffusion.
2024.05.01

Deep learning for protein-ligand docking: are we there yet?

ICML 2024 AI4Science Spotlight

The effects of ligand binding on protein structures and their in vivo functions carry numerous implications for modern biomedical research and biotechnology development efforts such as drug discovery. Although several deep learning (DL) methods and benchmarks designed for protein-ligand docking have recently been introduced, to date no prior works have systematically studied the behavior of the latest docking and structure prediction methods within the broadly applicable context of (1) using predicted (apo) protein structures for docking (e.g., for applicability to new proteins); (2) binding multiple (cofactor) ligands concurrently to a given target protein (e.g., for enzyme design); and (3) having no prior knowledge of binding pockets (e.g., for generalization to unknown pockets). To enable a deeper understanding of docking methods' real-world utility, we introduce PoseBench, the first comprehensive benchmark for broadly applicable protein-ligand docking. PoseBench enables researchers to rigorously and systematically evaluate DL methods for apo-to-holo protein-ligand docking and protein-ligand structure prediction using both primary ligand and multi-ligand benchmark datasets, the latter of which we introduce for the first time to the DL community. Empirically, using PoseBench, we find that (1) DL co-folding methods generally outperform comparable conventional and DL docking baselines, yet popular methods such as AlphaFold 3 are still challenged by prediction targets with novel protein sequences; (2) certain DL co-folding methods are highly sensitive to their input multiple sequence alignments, while others are not; and (3) DL methods struggle to strike a balance between structural accuracy and chemical specificity when predicting novel or multi-ligand protein targets. Code, data, tutorials, and benchmark results are available at https://github.com/BioinfoMachineLearning/PoseBench.
2024.02.01

Geometry-complete perceptron networks for 3d molecular graphs

Bioinformatics

The field of geometric deep learning has recently had a profound impact on several scientific domains such as protein structure prediction and design, leading to methodological advancements within and outside of the realm of traditional machine learning. Within this spirit, in this work, we introduce GCPNet, a new chirality-aware SE(3)-equivariant graph neural network designed for representation learning of 3D biomolecular graphs. We show that GCPNet, unlike previous representation learning methods for 3D biomolecules, is widely applicable to a variety of invariant or equivariant node-level, edge-level, and graph-level tasks on biomolecular structures while being able to (1) learn important chiral properties of 3D molecules and (2) detect external force fields. Across four distinct molecular-geometric tasks, we demonstrate that GCPNet’s predictions (1) for protein–ligand binding affinity achieve a statistically significant correlation of 0.608, more than 5%, greater than current state-of-the-art methods; (2) for protein structure ranking achieve statistically significant target-local and dataset-global correlations of 0.616 and 0.871, respectively; (3) for Newtownian many-body systems modeling achieve a task-averaged mean squared error less than 0.01, more than 15% better than current methods; and (4) for molecular chirality recognition achieve a state-of-the-art prediction accuracy of 98.7%, better than any other machine learning method to date. The source code, data, and instructions to train new models or reproduce our results are freely available at https://github.com/BioinfoMachineLearning/GCPNet.
2022.01.01

Geometric transformers for protein interface contact prediction

International Conference on Learning Representations

Computational methods for predicting the interface contacts between proteins come highly sought after for drug discovery as they can significantly advance the accuracy of alternative approaches, such as protein-protein docking, protein function analysis tools, and other computational methods for protein bioinformatics. In this work, we present the Geometric Transformer, a novel geometry-evolving graph transformer for rotation and translation-invariant protein interface contact prediction, packaged within DeepInteract, an end-to-end prediction pipeline. DeepInteract predicts partner-specific protein interface contacts (i.e., inter-protein residue-residue contacts) given the 3D tertiary structures of two proteins as input. In rigorous benchmarks, DeepInteract, on challenging protein complex targets from the 13th and 14th CASP-CAPRI experiments as well as Docking Benchmark 5, achieves 14% and 1.1% top L/5 precision (L: length of a protein unit in a complex), respectively. In doing so, DeepInteract, with the Geometric Transformer as its graph-based backbone, outperforms existing methods for interface contact prediction in addition to other graph-based neural network backbones compatible with DeepInteract, thereby validating the effectiveness of the Geometric Transformer for learning rich relational-geometric features for downstream tasks on 3D protein structures.

Skills

	Python
	Software Development
	Machine Learning
	Data Analysis
	Data Visualization

	PyTorch
	Geometric Deep Learning
	Generative Modeling
	Computer Vision
	Natural Language Processing

Languages

	English
	Native speaker

Interests

	Science
	Computer Science
	Biology
	Chemistry
	Physics

References

	Dr. Jianlin Cheng
	chengji@missouri.edu

	Dr. Dong Xu
	xudong@missouri.edu

	Dr. Xiaoqin Zou
	zoux@missouri.edu

Basics

Work

NERSC, Lawrence Berkeley National Laboratory

Research and develop deep generative models for science.

Bioinformatics & Machine Learning Lab, University of Missouri-Columbia

Researched geometric deep learning and generative modeling methods for bioinformatics, yielding 20+ academic works.

Volunteer

Encircle Technologies

Crafted and taught special education computer science curriculum for neurodivergent youth.

Education

University of Missouri-Columbia

Computer Science

University of Missouri-Columbia

Computer Science

Missouri Western State University

Computer Science

Awards

Lawrence Berkeley National Laboratory

Awarded a competitive postdoctoral fellowship for research in computing sciences.

University of Missouri-Columbia

Ranked as a top EECS PhD student in the spring of 2025.

University of Missouri-Columbia

Ranked as a top computer science PhD student in the spring of 2025.

University of Missouri-Columbia

Earned a competitive graduate fellowship for ﬁrst-year PhD students.

University of Missouri-Columbia

Won a competitive graduate fellowship for ﬁrst-year PhD students.

Publications

Proteins: Structure, Function, and Bioinformatics

Intelligent Systems for Molecular Biology

Communications Chemistry

ICML 2024 AI4Science Spotlight

Bioinformatics

International Conference on Learning Representations

Skills

Languages

Interests

References