Geometric Multi-color Message Passing Graph Neural Networks for Blood-brain Barrier Permeability Prediction

Abstract

Accurate prediction of blood-brain barrier permeability (BBBP) is essential for central nervous system (CNS) drug development. While graph neural networks (GNNs) have advanced molecular property prediction, they often rely on molecular topology and neglect the three-dimensional geometric information crucial for modeling transport mechanisms. This paper introduces the geometric multi-color message-passing graph neural network (GMC-MPNN), a novel framework that enhances standard message-passing architectures by explicitly incorporating atomic-level geometric features and long-range interactions. Our model constructs weighted colored subgraphs based on atom types to capture the spatial relationships and chemical context that govern BBB permeability. We evaluated GMC-MPNN on three benchmark datasets for both classification and regression tasks, using rigorous scaffold-based splitting to ensure a robust assessment of generalization. The results demonstrate that GMC-MPNN consistently outperforms existing state-of-the-art models, achieving superior performance in both classifying compounds as permeable/non-permeable (AUC-ROC of 0.9704 and 0.9685) and in regressing continuous permeability values (RMSE of 0.4609, Pearson correlation of 0.7759). An ablation study further quantified the impact of specific atom-pair interactions, revealing that the model’s predictive power derives from its ability to learn from both common and rare, but chemically significant, functional motifs. By integrating spatial geometry into the graph representation, GMC-MPNN sets a new performance benchmark and offers a more accurate and generalizable tool for drug discovery pipelines.
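To make the idea of weighted colored subgraphs concrete, the following is a minimal sketch and not the GMC-MPNN implementation. It groups atom pairs by their element types (the "colors") and weights each edge by a function of the 3D distance. The function name `colored_subgraphs`, the 5 Å cutoff, the Gaussian-style weight, and the scale parameter `eta` are all assumptions made for illustration; the abstract does not specify them.

```python
import math


def colored_subgraphs(atoms, cutoff=5.0, eta=2.0):
    """Group atom pairs into element-pair ("colored") subgraphs.

    atoms  : list of (element_symbol, (x, y, z)) tuples, coordinates in angstroms
    cutoff : hypothetical distance cutoff; pairs farther apart are skipped
    eta    : hypothetical length scale of the assumed Gaussian-style weight

    Returns a dict mapping a sorted element pair, e.g. ("C", "O"),
    to a list of weighted edges (i, j, weight).
    """
    subgraphs = {}
    for i in range(len(atoms)):
        for j in range(i + 1, len(atoms)):
            elem_i, pos_i = atoms[i]
            elem_j, pos_j = atoms[j]
            d = math.dist(pos_i, pos_j)  # Euclidean distance in 3D
            if d > cutoff:
                continue  # ignore pairs beyond the assumed cutoff
            color = tuple(sorted((elem_i, elem_j)))
            weight = math.exp(-((d / eta) ** 2))  # assumed kernel, for illustration
            subgraphs.setdefault(color, []).append((i, j, weight))
    return subgraphs


# Toy input: a C-O pair at bonding distance and a distant N atom.
toy = [("C", (0.0, 0.0, 0.0)), ("O", (1.2, 0.0, 0.0)), ("N", (10.0, 0.0, 0.0))]
graphs = colored_subgraphs(toy)
```

In this toy input only the C-O pair falls inside the cutoff, so `graphs` holds a single colored subgraph keyed by `("C", "O")`. In a message-passing setting, each such subgraph could carry its own messages, which is one way to read the "multi-color" idea described above.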

Trung Nguyen
PhD Student

Trung Nguyen is a PhD student at the Bredesen Center.

Masud Rana
Assistant Professor of Mathematics, Kennesaw State University (former Nguyen Lab postdoc)

Masud Rana is an assistant professor of mathematics at Kennesaw State University and a former postdoctoral scholar in the Nguyen Lab. He works on graph-theoretic and geometric methods for AI-driven drug discovery.

Farjana Mukta
Lecturer of Mathematics, Kennesaw State University (former Nguyen Lab PhD student)

Farjana Tasnim Mukta is a Lecturer of Mathematics at Kennesaw State University. She received her PhD in Mathematics from the University of Kentucky in 2024, advised by Dr. Duc Nguyen. Her research focuses on advanced mathematical graph-based machine learning and deep learning models for drug design.

Duc Nguyen
Associate Professor of Mathematics

Duc Nguyen develops mathematical and AI frameworks for molecular bioscience, drug discovery, and scientific computing. His group blends differential geometry, graph theory, and machine learning to build high-fidelity models for biomolecular systems, with notable wins in the D3R Grand Challenges and collaborations with Pfizer and Bristol Myers Squibb. Supported by multiple NSF awards, he has advised students and postdocs across theory and applications of AI-driven drug design.