Imperiali Lab News Brief: combining bioinformatics and biochemistry

Imperiali Lab News Brief: combining bioinformatics and biochemistry

Parsing endless possibilities

Lillian Eden | Department of Biology
December 11, 2024

New research from the Imperiali Lab in the Department of Biology at MIT combines bioinformatics and biochemistry to reveal critical players in assembling glycans, the large sugar molecules on bacterial cell surfaces responsible for behaviors such as evading immune responses and causing infections.

In most cases, single-celled organisms such as bacteria interact with their environment through complex chains of sugars known as glycans bound to lipids on their outer membranes. Glycans orchestrate biological responses and interactions, such as evading immune responses and causing infections. 

The first step in assembling most bacterial glycans is the addition of a sugar-phosphate group onto a lipid, which is catalyzed by phosphoglycosyl transferases (PGTs) on the inner membrane. This first sugar is then further built upon by other enzymes in subsequent steps in an assembly-line-like pathway. These critical biochemical processes are challenging to explore because the proteins involved in these processes are embedded in membranes, which makes them difficult to isolate and study. 

Although glycans are found in all living organisms, the sugar molecules that compose glycans are especially diverse in bacteria. There are over 30,000 known bacterial PGTs, and hundreds of sugars for them to act upon. 

Research recently published in PNAS from the Imperiali Lab in the Department of Biology at MIT uses a combination of bioinformatics and biochemistry to predict clusters of “like-minded” PGTs and verify which sugars they will use in the first step of glycan assembly. 

Defining the biochemical machinery for these assembly pathways could reveal new strategies for tackling antibiotic-resistant strains of bacteria. This comprehensive approach could also be used to develop and test inhibitors, halting the assembly pathway at this critical first step. 

Exploring Sequence Similarity

First author Theo Durand, an undergraduate student from Imperial College London who studied at MIT for a year, worked in the Imperiali Lab as part of a research placement. Durand was first tasked with determining which sugars some PGTs would use in the first step of glycan assembly, known as the sugar substrates of the PGTs. When initially those substrate-testing experiments didn’t work, Durand turned to the power of bioinformatics to develop predictive tools. 

Strategically exploring the sugar substrates for PGTs is challenging due to the sheer number of PGTs and the diversity of bacteria, each with its own assorted set of glycans and glycoconjugates. To tackle this problem, Durand deployed a tool called a Sequence Similarity Network (SSN), part of a computational toolkit developed by the Enzyme Function Initiative. 

According to senior author Barbara Imperiali, Class of 1922 Professor of Biology and Chemistry, an SSN provides a powerful way to analyze protein sequences through comparisons of the sequences of tens of thousands of proteins. In an optimized SSN, similar proteins cluster together, and, in the case of PGTs, proteins in the same cluster are likely to share the same sugar substrate. 

For example, a previously uncharacterized PGT that appears in a cluster of PGTs whose first sugar substrate is FucNAc4N would also be predicted to use FucNAc4N. The researchers could then test that prediction to verify the accuracy of the SSN. 

FucNAc4N is the sugar substrate for the PGT of Fusobacterium nucleatum (F. nucleatum), a bacterium that is normally only present in the oral cavity but is correlated with certain cancers and endometriosis, and Streptococcus pneumoniae, a bacterium that causes pneumonia. 

Adjusting the assay

The critical biochemical process of assembling glycans has historically been challenging to define, mainly because assembly is anchored to the interior side of the inner membrane of the bacterium. The purification process itself can be difficult, and the purified proteins don’t necessarily behave in the same manner once outside their native membrane environment.

To address this, the researchers modified a commercially available test to work with proteins still embedded in the membrane of the bacterium, thus saving them weeks of work to purify the proteins. They could then determine the substrate for the PGT by measuring whether there was activity. This first step in glycan assembly is chemically unique, and the test measures one of the reaction products. 

For PGTs whose substrate was unknown, Durand did a deep dive into the literature to find new substrates to test. FucNAc4N, the first sugar substrate for F. nucleatum, was, in fact, Durand’s favorite sugar – he found it in the literature and reached out to a former Imperiali Lab postdoc for the instructions and materials to make it. 

“I ended up down a rabbit hole where I was excited every time I found a new, weird sugar,” Durand recalls with a laugh. “These bacteria are doing a bunch of really complicated things and any tools to help us understand what is actually happening is useful.” 

Exploring inhibitors

Imperiali noted that this research both represents a huge step forward in our understanding of bacterial PGTs and their substrates and presents a pipeline for further exploration. She’s hoping to create a searchable database where other researchers can seed their own sequences into the SSN for their organisms of interest. 

This pipeline could also reveal antibiotic targets in bacteria. For example, she says, the team is using this approach to explore inhibitor development. 

The Imperiali lab worked with Karen Allen, a professor of Chemistry at Boston University, and graduate student Roxanne Siuda to test inhibitors, including ones for F. nucleatum, the bacterium correlated with certain cancers and endometriosis whose first sugar substrate is FucNAc4N. They are also hoping to obtain structures of inhibitors bound to the PGT to enable structure-guided optimization.

“We were able to, using the network, discover the substrate for a PGT, verify the substrate, use it in a screen, and test an inhibitor,” Imperiali says. “This is bioinformatics, biochemistry, and probe development all bundled together, and represents the best of functional genomics.”