Biologists build proteins that avoid crosstalk with existing molecules

Engineered signaling pathways could offer a new way to build synthetic biology circuits.

Anne Trafton | MIT News Office
October 23, 2019

Inside a living cell, many important messages are communicated via interactions between proteins. For these signals to be accurately relayed, each protein must interact only with its specific partner, avoiding unwanted crosstalk with any similar proteins.

A new MIT study sheds light on how cells are able to prevent crosstalk between these proteins, and also shows that there remains a huge number of possible protein interactions that cells have not used for signaling. This means that synthetic biologists could generate new pairs of proteins that can act as artificial circuits for applications such as diagnosing disease, without interfering with cells’ existing signaling pathways.

“Using our high-throughput approach, you can generate many orthogonal versions of a particular interaction, allowing you to see how many different insulated versions of that protein complex can be built,” says Conor McClune, an MIT graduate student and the lead author of the study.

In the new paper, which appears today in Nature, the researchers produced novel pairs of signaling proteins and demonstrated how they can be used to link new signals to new outputs by engineering E. coli cells that produce yellow fluorescence after encountering a specific plant hormone.

Michael Laub, an MIT professor of biology, is the senior author of the study. Other authors are recent MIT graduate Aurora Alvarez-Buylla and Christopher Voigt, the Daniel I.C. Wang Professor of Advanced Biotechnology.

New combinations

In this study, the researchers focused on a type of signaling pathway called two-component signaling, which is found in bacteria and some other organisms. A wide variety of two-component pathways has evolved through a process in which cells duplicate genes for signaling proteins they already have, and then mutate them, creating families of similar proteins.

“It’s intrinsically advantageous for organisms to be able to expand this small number of signaling families quite dramatically, but it runs the risk that you’re going to have crosstalk between these systems that are all very similar,” Laub says. “It then becomes an interesting challenge for cells: How do you maintain the fidelity of information flow, and how do you couple specific inputs to specific outputs?”

Most of these signaling pairs consist of an enzyme called a kinase and its substrate, which is activated by the kinase. Bacteria can have dozens or even hundreds of these protein pairs relaying different signals.

About 10 years ago, Laub showed that the specificity between bacterial kinases and their substrates is determined by only five amino acids in each of the partner proteins. This raised the question of whether cells have already used up, or are coming close to using up, all of the possible unique combinations that won’t interfere with existing pathways.

Some previous studies from other labs had suggested that the possible number of interactions that would not interfere with each other might be running out, but the evidence was not definitive. The MIT researchers decided to take a systematic approach in which they began with one pair of existing E. coli signaling proteins, known as PhoQ and PhoP, and then introduced mutations in the regions that determine their specificity.

This yielded more than 10,000 pairs of proteins. The researchers tested each kinase to see if they would activate any of the substrates, and identified about 200 pairs that interact with each other but not the parent proteins, the other novel pairs, or any other type of kinase-substrate family found in E. coli.

“What we found is that it’s pretty easy to find combinations that will work, where two proteins interact to transduce a signal and they don’t talk to anything else inside the cell,” Laub says.

He now plans to try to reconstruct the evolutionary history that has led to certain protein pairs being used by cells while many other possible combinations have not naturally evolved.

Synthetic circuits

This study also offers a new strategy for creating new synthetic biology circuits based on protein pairs that don’t crosstalk with other cellular proteins, the researchers say. To demonstrate that possibility, they took one of their new protein pairs and modified the kinase so that it would be activated by a plant hormone called trans-zeatin, and engineered the substrate so that it would glow yellow when the kinase activated it.

“This shows that we can overcome one of the challenges of putting a synthetic circuit in a cell, which is that the cell is already filled with signaling proteins,” Voigt says. “When we try to move a sensor or circuit between species, one of the biggest problems is that it interferes with the pathways already there.”

One possible application for this new approach is designing circuits that detect the presence of other microbes. Such circuits could be useful for creating probiotic bacteria that could help diagnose infectious diseases.

“Bacteria can be engineered to sense and respond to their environment, with widespread applications such as ‘smart’ gut bacteria that could diagnose and treat inflammation, diabetes, or cancer, or soil microbes that maintain proper nitrogen levels and eliminate the need for fertilizer. To build such bacteria, synthetic biologists require genetically encoded ‘sensors,’” says Jeffrey Tabor, an associate professor of bioengineering and biosciences at Rice University.

“One of the major limitations of synthetic biology has been our genetic parts failing in new organisms for reasons that we don’t understand (like cross-talk). What this paper shows is that there is a lot of space available to re-engineer circuits so that this doesn’t happen,” says Tabor, who was not involved in the research.

If adapted for use in human cells, this approach could also help researchers design new ways to program human T cells to destroy cancer cells. This type of therapy, known as CAR-T cell therapy, has been approved to treat some blood cancers and is being developed for other cancers as well.

Although the signaling proteins involved would be different from those in this study, “the same principle applies in that the therapeutic relies on our ability to take sets of engineered proteins and put them into a novel genomic context, and hope that they don’t interfere with pathways already in the cells,” McClune says.

The research was funded by the Howard Hughes Medical Institute, the Office of Naval Research, and the National Institutes of Health Pre-Doctoral Training Grant.

Researchers discover new source of drug resistance in pancreatic cancer
Lucy Jackub
October 17, 2019

The best available treatments for pancreatic cancer are highly toxic, and, as chemotherapies go, not very effective. The drug gemcitabine has been used for decades to extend the life of patients, but very high doses are required to combat the tumor, which grows in the pancreas surrounded by stiff, fibrous, noncancerous tissue called stroma. This hallmark of pancreatic cancer makes it unusually difficult to treat: the more stromal tissue accumulates, the less the drug works, while patients still endure brutal side effects. Only 8.5 percent of pancreatic cancer patients survive five years beyond their diagnosis, so there’s an urgent need to figure out why existing treatments are failing.

Scientists have known for a long time that gemcitabine fights cancer by killing cells during replication, though why it works for pancreatic cancer in particular is a bit of a mystery. The drug is a small molecule that masquerades as the nucleoside deoxycytidine, one unit in the nucleic acids that make up DNA. Once gemcitabine is integrated into a replicating strand of DNA, additional nucleosides can’t be joined to it. The new DNA strand can’t be completed, and the cell dies. Now, researchers from MIT have discovered that non-cancer cells in the pancreatic stromal tissue secrete astonishing quantities of deoxycytidine. They found that competition with deoxycytidine makes its imposter, gemcitabine, less effective, explaining why higher doses of the drug are needed as more stromal tissue grows around the tumor.

“That was an answer we were looking for — what is making pancreatic tumors resistant to gemcitabine?” says Michael Hemann, associate professor of biology, a member of MIT’s Koch Institute for Integrative Cancer Research, and co-senior author of the study. “Understanding the basic mechanisms of these drugs allows us to return to the clinic with improved strategies to treat patients with cancer.”

Douglas Lauffenburger, a professor of biological engineering, is also a co-senior author of the study, which represents a collaboration between the Hemann lab, the Lauffenburger lab, and the Vander Heiden lab, and appeared online in Cancer Research on September 4. Hemann lab graduate student Simona Dalin is the lead author.

The mystery ingredient

For years, researchers at MIT have been investigating different sources of chemotherapy resistance in stromal tissue. When Dalin took up the study two years ago, she was building on the findings of a former postdoc in the Hemann lab, Emanuel Kreidl. Kreidl had found that stellate cells, one type of cell in the pancreatic stromal tissue surrounding the tumor, were releasing something into the microenvironment of the pancreas that disrupted the function of gemcitabine.

Cells secrete all sorts of things — micro RNAs, fatty acids, proteins — that may be taken up and used by neighboring cells. Biologists call these ambient materials around the cell its “media.”  Kreidl had tried boiling, digesting, and filtering the stellate cell media, but nothing he did made gemcitabine any more effective against the cancer cells. The usual suspects commonly implicated in drug resistance caused by neighboring cells, like proteins, would break down under such tests. “That’s when we knew there was something new here,” says Dalin. Her challenge was to figure out what that mystery ingredient was.

Mark Sullivan PhD ‘19, then a graduate student and biochemist in Vander Heiden lab, was enlisted to help separate the stellate cell media into its molecular components and identify them. After doing so, Dalin says, “it was fairly obvious that deoxycytidine was the thing that we were looking for.” Because gemcitabine works by taking deoxycytidine’s place in DNA replication, it made sense that the presence of a lot of deoxycytidine could make it difficult for gemcitabine to fulfill its function.

Molecules pass in and out of cells through gates in the cell membrane, called transporters. Using a drug that blocks certain transporters, Dalin was able to shut the gate in the stellate cells through which deoxycytidine is released. With less deoxycytidine around, the gemcitabine was effective at lower doses, confirming her hypothesis. Now, the researchers just needed to figure out how and where deoxycytidine was getting in the way of the drug.

Once inside the cell, a nucleoside must have one or more phosphate groups added to it by several enzymes in order to become a nucleotide that can be used to build DNA. Gemcitabine goes through the same process. The researchers determined that gemcitabine was competing with deoxycytidine for the first of those enzymes, deoxycytidine kinase. When they flooded the cell with that enzyme, gemcitabine didn’t have to wait in line for its phosphate groups — and could get into the DNA to work its fatal subterfuge.

Upending Assumptions

Going forward, the Hemann lab aims to identify drugs that could inhibit the production of deoxycytidine and restore the tumor’s sensitivity to gemcitabine. Senthil Muthuswamy, an associate professor of medicine at Beth Israel Deaconess Medical Center who was not involved in the research, says this study provides “new and important insights” into how and why tumors develop resistance to gemcitabine. The findings, he adds, are “likely to have important implications for developing ways to overcome gemcitabine resistance in pancreatic cancer.”

The study’s findings may shed light on other cancer treatments that work similarly to gemcitabine. For every nucleoside, there are look-alike molecules, or analogs, that are used in cancer therapies. For example, the purine analog fludarabine is used to treat acute myeloid leukemia, another tenacious carcinoma. These generic drugs have been adopted through trial and error in the clinic, but scientists don’t fully understand why they are effective at the molecular level.

In theory, nucleoside analog drugs should work interchangeably; every nucleoside is necessary in either the replication of DNA or RNA. In practice, though, these drugs are only effective for certain cancers. The MIT researchers speculate that the sheer amount of deoxycytidine being produced in the pancreas could suggest that pancreatic cells have a particular need for deoxycytidine that also makes them more responsive to its analogs — perhaps explaining why gemcitabine targets pancreatic cancer cells effectively.

“Understanding more about nucleoside biology, and more about which organs have high levels of which nucleosides, might help us understand when to use which chemotherapies,” Dalin says.

This study leaves the researchers with many questions about how and why nucleosides are produced in the body, a realm of basic biology that is still poorly understood. It’s generally assumed that cells only make nucleosides for their own internal use in DNA replication. But pancreatic stellate cells produce a lot of deoxycytidine, far more than they need for themselves, suggesting the excess nucleosides may serve some unknown purpose in neighboring cells. Although more experiments are needed to determine this mysterious purpose, the MIT researchers have some ideas.

“These extra nucleosides introduce a possibility that perhaps making deoxycytidine is a normal function of stellate cells in the pancreas, in order to provide building blocks for the cells around them,” says Hemann. “And that’s a real surprise.”

This work was funded in part by a David H. Koch Fellowship and the MIT Center for Precision Cancer Medicine.

Image: Deoxycytidine and gemcitabine, its look-alike molecule, enter a cancer cell through the same gate in the cell membrane and are altered by the same enzyme (dCK) before they are integrated into DNA. Credit: Courtesy of the researchers.

Citation:
“Deoxycytidine Release from Pancreatic Stellate Cells Promotes Gemcitabine Resistance.”
Cancer Research, online Sept. 4, 2019, DOI: 10.1158/0008-5472.CAN-19-0960.
Dalin, S., Sullivan, M.R., Lau, A.N., Grauman-Boss, B., Mueller, H.S., Kreidl, E., Fenoglio, S., Luengo, A., Lees, J.A., Vander Heiden, M.G. and Lauffenburger, D.A.

Signaling factor seeking gene
Greta Friar | Whitehead Institute
September 25, 2019

Cambridge, MA — During embryonic development, stem cells begin to take on specific identities, becoming distinct cell types with specialized characteristics and functions, in order to form the diverse organs and systems in our bodies. Cells rely on two main classes of regulators to define and maintain their identities; the first of these are master transcription factors, keystone proteins in each cell’s regulatory network, which keep the DNA sequences associated with crucial cell identity genes accessible for transcription — the process by which DNA is “read” into RNA. The other main regulators are signaling factors, which transmit information from the environment to the nucleus through a chain of proteins like a game of cellular telephone. Signaling factors can prompt changes in gene transcription as the cells react to that information.

One long-standing conundrum of how cell identity is determined is that many species, including humans, use the same core signaling pathways, with the same signaling factors, in all of their cells, yet this uniform machinery can cue a diverse array of cell-type specific gene activity, like an identical line of code being entered in many computers and causing each to start running a completely different program. New research from Whitehead Institute Member Richard Young, who is also a professor of biology at the Massachusetts Institute of Technology, published online in Molecular Cell on September 25, sheds light on how the same signaling factor can lead to so many distinct responses — with the help of a mechanism called phase separation.

Co-senior author on the paper Jurian Schuijers, previously a postdoctoral researcher in Young’s lab and now a professor at the Center for Molecular Medicine at the University Medical Center Utrecht, was drawn to this puzzle after previously working in a signaling lab: “Two cells of different types that are right next to each other in the body can receive the exact same signal and have different reactions, and there was not a satisfying explanation for how that happens,” Schuijers says.

Young’s lab had previously found that signaling factors in pathways important for development tend to concentrate at super enhancers, clusters of DNA sequences that increase transcription of crucial cell identity genes. Because super enhancers are established at the genes important to identity in each cell, their activity is cell-type specific, so this co-localization provided a partial explanation of the puzzle, but it raised the question of how signaling factors are recruited to super enhancers. Young found the vague explanations that had been put forward, such as super enhancers being the most accessible DNA to signaling factors and their co-factors, unconvincing.

Young and his team suspected that an explanation for signaling factor recruitment might lie in their research on transcriptional condensates — droplets that form at super enhancers and concentrate transcriptional machinery there using phase separation, meaning the molecules separate out of their surroundings to form a distinct liquid compartment, like a drop of vinegar in a pool of oil. The proteins in condensates can do this because they contain intrinsically disordered regions (IDRs), stretches of amino acids that remain flexible, like wet spaghetti, and do not become fixed into a single shape the way most protein structures do. This property allows them to mesh together to form a condensate. The researchers reasoned that if signaling factors were joining transcriptional condensates, that could explain their concentration at super enhancers.

Young’s team confirmed that signaling factors in several of the most important pathways for embryonic development in mammals — WNT, TGF-β and JAK/STAT — contained IDRs. They further found that these factors were able to use their IDRs to form and join condensates, and that, in mouse cells, they appeared to join the condensates at super enhancers upon activation of their respective pathways.

The researchers then decided to focus on beta catenin, the signaling factor at the end of the Wnt signaling pathway, a pathway essential for development; it helps to coordinate things like body axis patterning and cell fate specification, proliferation and migration. When Wnt signaling goes awry in embryos, they fail to develop, and when it goes awry in adults it is implicated in diseases including cancer. The beta catenin protein has IDRs on both of its ends and a structured middle section, called the Armadillo repeat domain, where it binds to other transcription factors. Typically, beta catenin binds to transcription factors in the TCF/LEF family, which in turn bind to DNA—beta catenin cannot bind to DNA on its own — anchoring the signaling factor at the right site and prompting gene transcription. However, the researchers found that beta catenin could concentrate at super enhancers even when it could not bind to its usual partners, suggesting that transcriptional condensates were a sufficient recruitment mechanism. The researchers then created two abridged beta catenin molecules: one version that only contained the IDRs and one that only contained the Armadillo repeat domain. Both partial factors were able to concentrate at super enhancers, but neither was as effective as the combined whole.

“If you ask most people how these factors find their target locations in the genome, they would say it’s through their DNA binding domains,” says first author Alicia Zamudio, a graduate student in Young’s lab. “This research suggests that factors use both their structured DNA binding domains and their unstructured domains to find the right locations to bind in the genome and to activate target genes.”

One advantage for cells of using IDRs, versus DNA binding alone, might be reducing the time it takes for signaling factors to concentrate near the right genes, the researchers say. Speed is of the essence for some signaling pathways in order for cells to be able to respond quickly to environmental stimuli. Transcriptional condensates are larger in size and much fewer in number than DNA binding sites or DNA-binding co-factors, and so they shrink the space that a signaling factor entering the nucleus must search.

This research could provide new opportunities for drug discovery. Signaling pathways and super enhancers are both co-opted by oncogenes to drive the spread of cancer, so transcriptional condensates could be a promising target to disrupt both oncogenic signaling and oncogene transcription. Young also hopes that this research, which adds to his lab’s growing body of work on transcriptional condensates, will lead to a new appreciation of the disordered regions of proteins.

“For a long time, researchers have mostly ignored the intrinsically disordered regions of proteins — we literally cut them off when identifying the crystal structures — much in the same way that researchers used to study genes and ignore ‘junk DNA,’” Young says. “But, just as with junk DNA, we are discovering that the overlooked, less obviously functional regions of these molecules are very important after all.”

 

This work is supported by NIH grant GM123511 and NSF grant PHY1743900 (R.A.Y.), NIH grant GM117370 (D.J.T.), NSF Graduate Research Fellowship (A.V.Z.), NIH grant T32CA009172 (I.A.K.), and DFG Research Fellowship DE 3069/1-1 (T.M.D.).

 

Written by Greta Friar

***

Richard Young’s primary affiliation is with Whitehead Institute for Biomedical Research, where his laboratory is located and all his research is conducted. He is also a professor of biology at the Massachusetts Institute of Technology.

***

Full citation:

“Mediator condensates localize signaling factors to key cell identity genes”

Molecular Cell, published online September 25, 2019. DOI: 10.1016/j.molcel.2019.08.016

Alicia V. Zamudio (1, 2), Alessandra Dall’Agnese (1), Jonathan E. Henninger (1), John C. Manteiga(1, 2), Lena K. Afeyan (1, 2), Nancy M. Hannett (1), Eliot L. Coffey (1, 2), Charles H. Li (1, 2), Ozgur Oksuz (1), Benjamin R. Sabari (1), Ann Boija (1), Isaac A. Klein (1,3), Susana W. Hawken (4), Jan-Hendrik Spille (5), Tim-Michael Decker (6), Ibrahim I. Cisse (5), Brian J. Abraham (1,7), Tong I. Lee (1), Dylan J. Taatjes (6), Jurian Schuijers (1,8,9), and Richard A. Young (1, 2, 9).

1. Whitehead Institute for Biomedical Research, Cambridge, MA 02142, USA

2. Department of Biology, Massachusetts Institute of Technology, Cambridge, MA, 02139, USA 3. Department of Medical Oncology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA, 02215, USA

4. Program in Computational and Systems Biology, Massachusetts Institute of Technology, Cambridge, MA, 02139 USA

5. Department of Physics, Massachusetts Institute of Technology, Cambridge, MA, 02139, USA 6. Department of Biochemistry, University of Colorado, Boulder, CO 80303, USA

7. St. Jude Children’s Research Hospital, Memphis, TN, 038105, USA

8. Present address: Center for Molecular Medicine, University Medical Center Utrecht, Utrecht, 3584 CX, The Netherlands.

9. Equal contribution

Understanding genetic circuits and genome organization

Assistant professors Pulin Li and Seychelle Vos are investigating how cells become tissues and the proteins that organize DNA.

Raleigh McElvery | Department of Biology
September 12, 2019

MIT’s Department of Biology welcomed two new assistant professors in recent months: Pulin Li began at the Whitehead Institute in May, and Seychelle Vos arrived at Building 68 in September. Their respective expertise in genetic circuits and genome organization will augment the department’s efforts to explore cell biology at all levels — from intricate molecular structures to the basis for human disease.

“Pulin and Seychelle bring new perspectives and exciting ideas to our research community,” says Alan Grossman, department head. “I’m excited to see them start their independent research programs and look forward to the impact that they will have.”

From cells to tissues

Growing up in Yingkou, China, Li was exposed to science at a young age. Her dad worked for a pharmaceutical company researching traditional Chinese medicine, and Li would spend hours playing with his lab tools and beakers. “I can still vividly remember the smell of his Chinese herbs,” she says. “Maybe that’s part of the reason why I’ve always been interested in biology as it relates to medical sciences.”

She earned her BS in life sciences from Peking University, and went on to pursue a PhD in chemical biology at Harvard University studying hematopoietic stem cells. Li performed chemical screens to find drugs that would make stem cell transplantation in animal models more efficient, and eventually help patients with leukemia. In doing so, she became captivated by the molecular mechanisms that control cell-to-cell communication.

“I would like to eventually go back to developing new therapies and medicines,” she says, “but that translational research requires a basic understanding of how things work at a molecular level.”

As a result, her postdoc at Caltech was firmly rooted in basic biology. She investigated the genetic circuits that underlie cell-cell communication in developing and regenerating tissues, and now aims to develop new methods to study these same processes here at MIT.

Traditional genetic approaches involve breaking components of a system one at a time to investigate the role they play. However, Li’s lab will adopt a “bottom-up” approach that involves building these systems from the ground up, adding the components back into the cell one by one to pinpoint which genetic circuits are sufficient for programming tissue function. “Building up a system, rather than tearing it down, allows you to test different circuit designs, tune important parameters, and understand why a circuit has evolved to perform a specific function,” she explains.

She is most interested in determining which aspects of cellular communication are critical for tissue formation, in hopes of understanding the diversity of life forms in nature, as well as inspiring new methods to engineer or regenerate different tissues.

“My dream would be to put a bunch of genetic circuits into cells in such a way that they could enable the cells to self-organize into certain patterns and shapes, and replace damaged tissues in a patient,” she says.

Proteins that organize DNA

Although Vos was born in South Africa, her family moved so frequently for her father’s job that she doesn’t call any one place home. “If I had to pick, I’d say it would be the middle of the Atlantic Ocean,” she says.

Both of her grandparents on her mother’s side were researchers, and encouraged various scientific escapades, like bringing wolf spiders to kindergarten for show-and-tell. Her grandmother on her father’s side found her early passions “mildly disturbing,” but dutifully fulfilled her requests for high-resolution insect microscopy books nonetheless.

“I really wanted to know how plants and animals worked starting from a young age, thanks to my grandparents,” Vos says.

In high school she was already conducting research on the side at Clemson University, South Carolina, and went on to earn her BS in genetics from the University of Georgia. She began her PhD in molecular cell biology at the University of California at Berkeley intending to study immunology, but surprised herself by becoming taken with structural biology instead.

Purifying proteins and solving structures required a much different skill set than performing screens and manipulating genomes, but she very much enjoyed her work on topoisomerase, the enzyme that modifies DNA so it doesn’t become too coiled.

She continued conducting biochemical and structural research during her postdoc at the Max Planck Institute for Biophysical Chemistry in Germany. There, she used cryogenic electron microscopy to probe how different RNA polymerase II complexes are regulated during transcription in eukaryotes.

Today, she’s a molecular biologist at her core, but she’s prepared to use “whatever technique gets the answer.” As she explains: “You need biochemistry to solve structures and genetics to understand how they’re working within the whole organism, so it’s all related.”

In her new lab in Building 68, she will continue investigating gene expression, but this time in the context of genome organization. DNA must be compacted in order to fit into a cell, and Vos will study the proteins that organize DNA so it can be compressed without interfering with gene expression. She also wants to know how those same proteins are affected by gene expression.

“How gene regulation impacts compaction is a really critical question to address because different cell types are organized in different ways, and that impacts which genes are ultimately expressed,” she says. “We still don’t really understand how these processes work at an atomic level, so that’s where my expertise in biochemistry and structural biology can be useful.”

When asked what they are most excited about as the school year begins, both Li and Vos say the same thing: the diverse skills and expertise of the students and faculty.

“It’s not just about solving one structure, people here want to understand the entire process,” Vos says. “Biology is a conglomeration of many different fields, and if we can have engineers, mathematicians, physicists, chemists, biologists, and others work together, we can begin to tackle pressing questions.”

An emerging view of RNA transcription and splicing

Whitehead Institute scientists find chemical modification contributes to trafficking between non-membrane-bound compartments that control gene expression.

Nicole Davis | Whitehead Institute
August 9, 2019

Cells often create compartments to control important biological functions. The nucleus is a prime example; surrounded by a membrane, it houses the genome. Yet cells also harbor enclosures that are not membrane-bound and more transient, like oil droplets in water. Over the past two years, these droplets (called “condensates”) have become increasingly recognized as major players in controlling genes. Now, a team led by Whitehead Institute scientists helps expand this emerging picture with the discovery that condensates play a role in splicing, an essential activity that ensures the genetic code is prepared to be translated into protein. The researchers also reveal how a critical piece of cellular machinery moves between different condensates. The team’s findings appear in the Aug. 7 online issue of Nature.

“Condensates represent a real paradigm shift in the way molecular biologists think about gene control,” says senior author Richard Young, a member of the Whitehead Institute and professor of biology at MIT. “Now, we’ve added a critical new layer to this thinking that enhances our understanding of splicing as well as the major transcriptional apparatus RNA polymerase II.”

Young’s lab has been at the forefront of studying how and when condensates form as well as their functions in gene regulation. In the current study, Young and his colleagues, including first authors Eric Guo and John Manteiga, focused their efforts on a key transition that happens when genes undergo transcription — an early step in gene activation whereby an RNA copy is created from the genes’ DNA template. First, all of the molecular machinery needed to make RNA, including a large protein complex known as RNA polymerase II, assembles at a given gene. Then, specific chemical modifications to RNA polymerase II allow it to begin transcribing DNA into RNA. This shift from so-called transcription initiation to active transcription also involves another important molecular transition: As RNA molecules begin to grow, the splicing apparatus must also move in and carry out its job.

“We wanted to step back and ask, ‘Do condensates play an important role in this switch, and if so, what mechanism might be responsible?’” explains Young.

For roughly three decades, it has been recognized that the factors required for splicing are stored in compartments called speckles. Yet whether these speckles play an active role in splicing, or are simply storage vessels, has remained unclear.

Using confocal microscopy, the Whitehead team discovered condensates filled with components of the splicing machinery in the vicinity of highly active genes. Notably, these structures exhibited similar liquid-like characteristics to those condensates described in prior studies from Young’s lab that are involved in transcription initiation.

“These findings signaled to us that there are two types of condensates at work here: one involved in transcription initiation and the other in splicing and transcriptional elongation,” said Manteiga, a graduate student in Young’s lab.

With two different condensates at play, the researchers wondered: How does the critical transcriptional machinery, specifically RNA polymerase II, move from one condensate to the other?

Guo, Manteiga, and their colleagues found that chemical modification, specifically the addition of phosphate groups, serves as a kind of molecular switch that alters the protein complex’s affinity for a particular condensate. With fewer phosphate groups, it associates with the condensates for transcription initiation; when more phosphates are added, it enters the splicing condensates. Such phosphorylation occurs on one end of the protein complex, which contains a specialized region known as the C-terminal domain (CTD). Importantly, the CTD lacks a specific three-dimensional structure, and previous work has shown that such intrinsically disordered regions can influence how and when certain proteins are incorporated into condensates.

“It is well-documented that phosphorylation acts as a signal to help regulate the activity of RNA polymerase II,” says Guo, a postdoc in Young’s lab. “Now, we’ve shown that it also acts as a switch to alter the protein’s preference for different condensates.”

In light of their discoveries, the researchers propose a new view of splicing compartments, where speckles serve primarily as warehouses, storing the thousands of molecules required to support the splicing apparatus when they are not needed. But when splicing is active, the phosphorylated CTD of RNA Pol II serves as an attractant, drawing the necessary splicing materials toward the gene where they are needed and into the splicing condensate.

According to Young, this new outlook on gene control has emerged in part through a multidisciplinary approach, bringing together perspectives from biology and physics to learn how properties of matter predict some of the molecular behaviors he and his team have observed experimentally. “Working at the interface of these two fields is incredibly exciting,” says Young. “It is giving us a whole new way of looking at the world of regulatory biology.”

Support for this work was provided by the U.S. National Institutes of Health, National Science Foundation, Cancer Research Institute, Damon Runyon Cancer Research Foundation, Hope Funds for Cancer Research, Swedish Research Council, and German Research Foundation DFG.

Study furthers radically new view of gene control

Along the genome, proteins form liquid-like droplets that appear to boost the expression of particular genes.

Anne Trafton | MIT News Office
August 8, 2019

In recent years, MIT scientists have developed a new model for how key genes are controlled that suggests the cellular machinery that transcribes DNA into RNA forms specialized droplets called condensates. These droplets occur only at certain sites on the genome, helping to determine which genes are expressed in different types of cells.

In a new study that supports that model, researchers at MIT and the Whitehead Institute for Biomedical Research have discovered physical interactions between proteins and with DNA that help explain why these droplets, which stimulate the transcription of nearby genes, tend to cluster along specific stretches of DNA known as super enhancers. These enhancer regions do not encode proteins but instead regulate other genes.

“This study provides a fundamentally important new approach to deciphering how the ‘dark matter’ in our genome functions in gene control,” says Richard Young, an MIT professor of biology and member of the Whitehead Institute.

Young is one of the senior authors of the paper, along with Phillip Sharp, an MIT Institute Professor and member of MIT’s Koch Institute for Integrative Cancer Research; and Arup K. Chakraborty, the Robert T. Haslam Professor in Chemical Engineering, a professor of physics and chemistry, and a member of MIT’s Institute for Medical Engineering and Science and the Ragon Institute of MGH, MIT, and Harvard.

Graduate student Krishna Shrinivas and postdoc Benjamin Sabari are the lead authors of the paper, which appears in Molecular Cell on Aug. 8.

“A biochemical factory”

Every cell in an organism has an identical genome, but cells such as neurons or heart cells express different subsets of those genes, allowing them to carry out their specialized functions. Previous research has shown that many of these genes are located near super enhancers, which bind to proteins called transcription factors that stimulate the copying of nearby genes into RNA.

About three years ago, Sharp, Young, and Chakraborty joined forces to try to model the interactions that occur at enhancers. In a 2017 Cell paper, based on computational studies, they hypothesized that in these regions, transcription factors form droplets called phase-separated condensates. Similar to droplets of oil suspended in salad dressing, these condensates are collections of molecules that form distinct cellular compartments but have no membrane separating them from the rest of the cell.

In a 2018 Science paper, the researchers showed that these dynamic droplets do form at super enhancer locations. Made of clusters of transcription factors and other molecules, these droplets attract enzymes such as RNA polymerases that are needed to copy DNA into messenger RNA, keeping gene transcription active at specific sites.

“We had demonstrated that the transcription machinery forms liquid-like droplets at certain regulatory regions on our genome, however we didn’t fully understand how or why these dewdrops of biological molecules only seemed to condense around specific points on our genome,” Shrinivas says.

As one possible explanation for that site specificity, the research team hypothesized that weak interactions between intrinsically disordered regions of transcription factors and other transcriptional molecules, along with specific interactions between transcription factors and particular DNA elements, might determine whether a condensate forms at a particular stretch of DNA. Biologists have traditionally focused on “lock-and-key” style interactions between rigidly structured protein segments to explain most cellular processes, but more recent evidence suggests that weak interactions between floppy protein regions also play an important role in cell activities.

In this study, computational modeling and experimentation revealed that the cumulative force of these weak interactions conspire together with transcription factor-DNA interactions to determine whether a condensate of transcription factors will form at a particular site on the genome. Different cell types produce different transcription factors, which bind to different enhancers. When many transcription factors cluster around the same enhancers, weak interactions between the proteins are more likely to occur. Once a critical threshold concentration is reached, condensates form.

“Creating these local high concentrations within the crowded environment of the cell enables the right material to be in the right place at the right time to carry out the multiple steps required to activate a gene,” Sabari says. “Our current study begins to tease apart how certain regions of the genome are capable of pulling off this trick.”

These droplets form on a timescale of seconds to minutes, and they blink in and out of existence depending on a cell’s needs.

“It’s an on-demand biochemical factory that cells can form and dissolve, as and when they need it,” Chakraborty says. “When certain signals happen at the right locus on a gene, the condensates form, which concentrates all of the transcription molecules. Transcription happens, and when the cells are done with that task, they get rid of them.”

A new view

Weak cooperative interactions between proteins may also play an important role in evolution, the researchers proposed in a 2018 Proceedings of the National Academy of Sciences paper. The sequences of intrinsically disordered regions of transcription factors need to change only a little to evolve new types of specific functionality. In contrast, evolving new specific functions via “lock-and-key” interactions requires much more significant changes.

“If you think about how biological systems have evolved, they have been able to respond to different conditions without creating new genes. We don’t have any more genes that a fruit fly, yet we’re much more complex in many of our functions,” Sharp says. “The incremental expanding and contracting of these intrinsically disordered domains could explain a large part of how that evolution happens.”

Similar condensates appear to play a variety of other roles in biological systems, offering a new way to look at how the interior of a cell is organized. Instead of floating through the cytoplasm and randomly bumping into other molecules, proteins involved in processes such as relaying molecular signals may transiently form droplets that help them interact with the right partners.

“This is a very exciting turn in the field of cell biology,” Sharp says. “It is a whole new way of looking at biological systems that is richer and more meaningful.”

Some of the MIT researchers, led by Young, have helped form a company called Dewpoint Therapeutics to develop potential treatments for a wide variety of diseases by exploiting cellular condensates. There is emerging evidence that cancer cells use condensates to control sets of genes that promote cancer, and condensates have also been linked to neurodegenerative disorders such as amyotrophic lateral sclerosis (ALS) and Huntington’s disease.

The research was funded by the National Science Foundation, the National Institutes of Health, and the Koch Institute Support (core) Grant from the National Cancer Institute.