RNA molecules are masters of their own destiny
Eva Frederick | Whitehead Institute
December 16, 2020

At any given moment in the human body, in about 30 trillion cells, DNA is being “read” into molecules of messenger RNA, the intermediary step between DNA and proteins, in a process called transcription.

Scientists have a pretty good idea of how transcription gets started: proteins called RNA polymerases are recruited to specific regions of the DNA molecules and begin skimming their way down the strand, synthesizing mRNA molecules as they go. But part of this process is less well understood: how does the cell know when to stop transcribing?

Now, new work from the labs of Whitehead Institute Member Richard Young, also a professor of biology at Massachusetts Institute of Technology (MIT), and Arup K. Chakraborty, professor of chemical engineering, physics and chemistry at MIT, suggests that RNA molecules themselves are responsible for regulating their formation through a feedback loop. Too few RNA molecules, and the cell initiates transcription to create more. Then, at a certain threshold, too many RNA molecules cause transcription to draw to a halt.

The research, published in Cell on December 16, represents a collaboration between biologists and physicists, and provides some insight into the potential roles of the thousands of RNAs that are not translated into any proteins, called noncoding RNAs, which are common in mammals and have mystified scientists for decades.

A question of condensates

Previous work in Young’s lab has focused on transcriptional condensates, small cellular droplets that bring together the molecules needed to transcribe DNA to RNA. Scientists in the lab discovered the transcriptional droplets in 2018, noticing that they typically formed when transcription began and dissolved a few seconds or minutes later when the process was finished.

The researchers wondered if the force that governed the dissolution of the transcriptional condensates could be related to the chemical properties of the RNA they produced — specifically, its highly negative charge. If this were the case, it would be the latest example of cellular processes being regulated via a feedback mechanism — an elegant, efficient system used in the cell to control biological functions such as red blood cell production and DNA repair.

As an initial test, the researchers used an in vitro experiment to test whether the amount of RNA had an effect on condensate formation. They found that within the range of physiological levels observed in cells, low levels of RNA encouraged droplet formation and high levels of RNA discouraged it.

Thinking outside the biology box 

With these results in mind, Young Lab postdocs and co-first authors Ozgur Oksuz and Jon Henninger teamed up with physicist and co-first author Krishna Shrinivas, a graduate student in Arup Chakraborty’s lab, to investigate what physical forces were at play.

Shrinivas proposed that the team build a computational model to study the physical and chemical interactions between actively transcribed RNA and condensates formed by transcriptional proteins. The goal of the model was not to simply reproduce existing results, but to create a platform with which to test a variety of situations.

“The way most people study these kinds of problems is to take mixtures of molecules in a test tube, shake it and see what happens,” Shrinivas said. “That is as far away from what happens in a cell as one can imagine. Our thought was, ‘Can we try to study this problem in its biological context, which is this out-of-equilibrium, complex process?’”

Studying the problem from a physics perspective allowed the researchers to take a step back from traditional biology methods. “As a biologist, it’s difficult to come up with new hypotheses, new approaches to understanding how things work from available data,” Henninger said. “You can do screens, you can identify new players, new proteins, new RNAs that may be involved in a process, but you’re still limited by our classical understanding of how all these things interact. Whereas when talking with a physicist, you’re in this theoretical space extending beyond what the data can currently give you. Physicists love to think about how something would behave, given certain parameters.”

Once the model was complete, the researchers could ask it questions about situations that may arise in cells — for instance, what happens to condensates when RNAs of different lengths are produced at different rates as time ensues? — and then follow it up with an experiment at the lab bench. “We ended up with a very nice convergence of model and experiment,” Henninger said. “To me, it’s like the model helps distill the simplest features of this type of system, and then you can do more predictive experiments in cells to see if it fits that model.”

The charge is in charge

Through a series of modeling and experiments at the lab bench, the researchers were able to confirm their hypothesis that the effect of RNA on transcription is due to RNAs molecules’ highly negative charge. Furthermore, it was predicted that initial low levels of RNA enhance and subsequent higher levels dissolve condensates formed by transcriptional proteins. Because the charge is carried by the RNAs’ phosphate backbone, the effective charge of a given RNA molecule is directly proportional to its length.

In order to test this finding in a living cell, the researchers engineered mouse embryonic stem cells to have glowing condensates, then treated them with a chemical to disrupt the elongation phase of transcription. Consistent with the model’s predictions, the resulting dearth of condensate-dissolving RNA molecules increased the size and lifetime of condensates in the cell. Conversely, when the researchers engineered cells to induce the production of extra RNAs, transcriptional condensates at these sites dissolved. “These results highlight the importance of understanding how non-equilibrium feedback mechanisms regulate the functions of the biomolecular condensates present in cells,” said Chakraborty.

Confirmation of this feedback mechanism might help answer a long-standing mystery of the mammalian genome: the purpose of non-coding RNAs, which make up a large portion of genetic material. “While we know a lot about how proteins work, there are tens of thousands of noncoding RNA species, and we don’t know the functions of most of these molecules,” said Young. “The finding that RNA molecules can regulate transcriptional condensates makes us wonder if many of the noncoding species just function locally to tune gene expression throughout the genome. Then this giant mystery of what all these RNAs do has a potential solution.”

The researchers are optimistic that understanding this new role for RNA in the cell could inform therapies for a wide range of diseases. “Some diseases are actually caused by increased or decreased expression of a single gene,” said Oksuz, a co-first author. “We now know that if you modulate the levels of RNA, you have a predictable effect on condensates. So you could hypothetically tune up or down the expression of a disease gene to restore the expression — and possibly restore the phenotype — that you want, in order to treat a disease.”

Young added that a deeper understanding of RNA behavior could inform therapeutics more generally. In the last 10 years, a variety of drugs have been developed that directly target RNA successfully. “RNA is an important target,” Young said. “Understanding mechanistically how RNA molecules regulate gene expression bridges the gap between gene dysregulation in disease and new therapeutic approaches that target RNA.”

The untidy experiment that catalyzed recombinant DNA technology

Salvador Luria is known for his research on phage genetics, but his lab’s contribution to the discovery of restriction enzymes also sparked important technological advances.

Saima Sidik
December 15, 2020

In the early 1950s, a woman named Mary Human found the first evidence of a group of proteins called restriction enzymes — a discovery that would reverberate throughout the research community for decades. But many important discoveries, from penicillin to medical X-rays, are inspired by a messy fluke rather than carefully reasoned logic, and Human’s discovery was no different.

Fortunately, Human’s boss was a jovial scientist named Salvador Luria, who appreciated that life’s quirks often yield the most valuable results — so much so that he wrote a 1955 Scientific American article in which he praised Human’s approach. “It often pays to do somewhat untidy experiments, provided one is aware of the element of untidiness,” he wrote.

Indeed, Luria’s life was far from being a tidy package. This Italian native fled Europe to escape Nazis, was briefly blacklisted by the NIH presumably because of his vocal opposition to American foreign policy, and suffered from depression despite his outwardly cheery appearance. But Luria’s life was also extraordinary. He earned a medical degree in Torino, Italy, but decided he preferred performing research over practicing medicine. After leaving Europe in the 1940s to escape the persecution of Jews like himself, he held professorships at three American institutions, including MIT. He was known as an insightful scientist, a kind colleague, and a thoughtful mentor, right up until his death in 1991.

A Surprising Observation in the Midwest

For much of his career, Luria applied his keen insight to phages — viruses that invade and kill bacteria. He and two collaborators won the Nobel Prize after realizing that genetic mutations in bacteria can protect them from deadly phages. But the untidy experiment Luria referred to in his Scientific American article related to a lesser-known aspect of his lab’s phage work: restriction enzymes, which cut DNA at specific places. Luria was the first person to find evidence of these critical tools, which opened a whole new field of genetic manipulation. A cascade of research spanning two decades eventually led a scientist supervised by Luria’s former research associate to win a Nobel prize for characterizing these enzymes, which catalyzed modern molecular biology.

The restriction enzyme story starts in the late 1940s, when Luria was a professor at Indiana University. He noticed that a phage called T2 didn’t seem to grow inside and kill certain mutant strains of Escherichia coli. T2 always killed the first batch of mutant E. coli, but when he tested whether a new batch of the same type of bacteria would catch the virus from the dead bacteria, the new batch didn’t succumb to the virus.

In 1950, Luria moved to the University of Illinois, Urbana, where one of his employees, a woman named Mary Human, continued to work on the T2 mystery. One day, in the midst of an experiment, Human realized she’d run out of the strain of E. coli she usually used, and this is where the experiment got a little untidy. Instead of waiting to do the experiment on another day with a healthy batch of E. coli, Human mixed phage-killed E. coli with a different type of bacteria called Shigella. T2 always seemed to act the same in Shigella as it did in E. coli, so she didn’t expect the switch to matter. But the next morning, the Shigella were dead! It seemed that T2 could only reproduce once in the particular mutant strain of E. coli that Human was studying, but when she moved T2 from these mutant E. coli to Shigella, it restored the virus’ ability to reproduce. Human and Luria concluded that something about the mutant E. coli changed the T2, and limited the kinds of bacteria in which it could grow.

At the time, Human and Luria couldn’t explain what was happening to T2 in these mutant bacteria. Luria went about his career, still carrying this mystery with him.

An explanation in Cambridge, Massachusetts

In 1958, Luria came to MIT Biology for a sabbatical. The structure of DNA had been discovered just five years earlier, and MIT needed someone who understood its implications to usher the Institute into the genomics era. Luria was renowned for his ability to predict which direction biology would move, so the Institute wanted him to fill this role. At the end of his sabbatical, Luria accepted a permanent position in MIT Biology, where he stayed for the rest of his career.

“I asked Luria if he thought it was possible to do molecular biology with animal viruses, and he said, ‘I don’t know, why don’t you find out and tell me?’” Baltimore says.In addition to being a skilled scientist, Luria was a thoughtful mentor. David Baltimore, professor at the California Institute of Technology, was one of Luria’s early mentees at MIT. At the time, most research into viruses focused on the phages that Luria studied, but Baltimore wanted to break new ground by studying viruses that infect animals. He credits Luria for encouraging him to go down this path — one that led him to become a Nobel Laureate himself.

In addition to being a skilled scientist, Luria was deeply opposed to McCarthyism and the Vietnam War, and he devoted a lot of time to political activism like writing letters, to newspaper editors as well as to other scientists, trying to gather support for his views.

Fortunately, Luria had a deputy to help him run his lab while he was revamping MIT Biology and trying to stop the war. “If you wanted to know something on a daily basis, you went to Helen Revel,” recalls Costa Georgopoulos, a professor at the University of Utah who earned his PhD in Luria’s lab in the 1960s.

Revel earned her PhD with MIT Biology’s Boris Magasanik before becoming Luria’s research associate. “Those days, women were not readily made professors, so she worked on Luria’s grants,” Georgopoulos says.

Georgopoulos describes Revel as reserved and meticulous. She didn’t advertise her skill as a scientist; she just got to work. With this attitude, she led the scientists who figured out the mystery of the mutant bacteria that changed the T2 phage.

Since Human’s fortuitously messy experiment, a lineage of phage researchers that originated in Luria’s lab had learned a lot about how bacteria and phages interact. First, Luria’s former research associate, Guiseppe Bertani, showed that phages other than T2 also behave differently in different types of bacteria. Later, Bertani’s own research associate, Werner Arber, went on to discover that bacteria can mark the DNA of phages that replicate within them. When marked phages try to enter new bacteria, the marks can signal that the phages are foreign invaders, allowing the new bacteria to kill the phages. Arber and two of his colleagues, Daniel Nathans and Hamilton O. Smith, eventually won their own Nobel prize for their work on restriction enzymes.

Certain bacteria mark phage DNA by replacing one of the bases that make up the genetic code, called cytosine, with a modified version called 5-hydroxymethylcytosine. Revel, with help from Luria, Georgopoulos, and others, found that the T2 phage takes this system one step farther by using a bacterial enzyme to attach sugars to modified cytosines. Some mutant bacteria are unable to transfer sugars to phage cytosines, and so the phages grown in these bacteria come out “sour” instead of “sweet,” as Luria wrote. Restriction enzymes recognize these sweet-natured phages as foreign, and destroy them.

As researchers learned more about restriction enzymes, they realized that they can work in all sorts of ways. Bacteria can also mark their own DNA to prevent restriction enzymes from cutting it, allowing certain kinds of restriction enzymes to cut naked DNA sequences in the genomes of invading phages. Soon, biologists realized that restriction enzymes would let them cut any kind of DNA, not just phage genomes. This discovery had many consequences, one of which was that scientists could paste snipped DNA back together in new combinations. Many people were initially wary that combining DNA from different organisms could have unintended consequences. But by the 1980s, scientists had harnessed restriction enzymes for a whole host of safe purposes, and technologies centered around these enzymes continue to evolve.

Today, after decades of work, scientists have used restriction enzymes to study genetic variations in humans, find sequences that cause disease, identify relationships between people, and solve crimes. Scientists have used restriction enzymes to make proteins glow like jellyfish, to study the structure of DNA, and to make bacteria produce insulin.

T2 phages and their relationship to restriction enzymes are just one area of biology where Luria and his lab made profound contributions. Among his biggest achievements was recruiting and employing many forward-thinking scientists who built MIT Biology into the department it is today. In fact, as the first director of the Center for Cancer Research, Luria recruited Phillip Sharp, who would go on to win a Nobel Prize for discovering RNA splicing. Sharp joined a center that already included David Baltimore, as well as current MIT Biology professors Nancy Hopkins and Robert Weinberg, all of whom have made huge contributions to cancer research.

Scientists had just begun to elucidate the link between genetics, viruses, and cancer in the early 1970s, but Baltimore says that Luria was often the first person to jump on new applications for the techniques and thinking underlying molecular biology.

“Luria’s genius was understanding where biology was going,” says Baltimore. “At every stage, he was wondering what the next step would be.” But even geniuses need a messy fluke like Human’s now and then.

The bull Y chromosome has evolved to bully its way into gametes
Whitehead Institute
November 18, 2020

In a new study, published Nov. 18 in the journal Genome Research, scientists in the lab of Whitehead Institute Member David Page present the first ever full, high-resolution sequence of the Y chromosome of a Hereford bull. The research, more than a decade in the making, suggests that bulls’ Y chromosomes have evolved dozens of copies of the same genes in a selfish attempt to make more males — a move that is countered in the female-determining X chromosome.

“When you have an X and a Y chromosome, it’s a setup for conflict,” said Page, who is also a professor of biology at the Massachusetts Institute of Technology and investigator with the Howard Hughes Medical Institute. “Seeing this full blown competition between the cattle X and Y means we have to think more deeply about this conflict as a constant and general feature of sex chromosomes in mammals.”

This insight into the forces that govern sex chromosome behavior and evolution will help scientists in Page’s lab study genetic differences between males and females and how they play out in health and disease across every part of the body, Page added.

Of mice, men and cattle 

Sex chromosomes — the X and the Y — evolved from a regular pair of symmetrical chromosomes some 200 million years ago. Those born biologically female have two X chromosomes. Those born biologically male have one X and one Y.

Page’s lab successfully sequenced the human Y chromosome in 2003, and afterwards the researchers wanted to be able to compare the sequence to its counterparts in other animals in order to help understand how they have evolved and diverged over time.

To make these comparisons, researchers in Page’s lab laid out a list of several mammals — including chimps, opossums, and mice — that occupied different branches of the mammalian family tree. One after another, the scientists began sequencing these creatures’ Ys, using a high-resolution sequencing method called SHIMS — short for Single-Haplotype Iterative Mapping and Sequencing — to obtain a level of detail that other techniques, like shotgun sequencing, can’t.

This powerful sequencing technology allowed the researchers to observe a strange peculiarity of Y chromosomes: in some species, nearly all of the genetic material on the Y is made up of sequences of DNA that have been amplified dozens or hundreds of times over — “like a hall of mirrors,” Page said.

In mice, for example, repeats of just a few testis-specific genes make up nearly 98 percent of the Y chromosome. In humans, however, repeats make up only about 45 percent. “We wanted to know if this was just a peculiarity of rodents, or if other Y chromosomes might come close,” Page said.

That’s where the bull came in. “Outside of primates and rodents, the next branch off the mammalian tree includes bull,” said Jennifer Hughes, a researcher in Page’s lab and the first author of the paper. “We didn’t know if the bull’s Y chromosome would look like a mouse Y or a human Y or something else entirely.”

The running of the bull’s (sequencing data) 

It took the Page Lab and collaborators at Baylor College of Medicine’s Human Genome Sequencing Center, the McDonnell Genome Institute at Washington University, Texas A&M University, and other institutions more than a decade to tease apart the complexities of the bull Y chromosome. In fact, it turned out to be the most gene-dense of any Y chromosome ever mapped — largely due to the fact that 96 percent of its genetic material was made up of repetitive sequences.

As in the mouse, most of the bull’s “hall of mirrors” repeats appeared to be expressed in the testis. But the question remained: Why? “What drives it can’t just be purely making more sperm, because that’s just overkill, right?” Hughes said. “You don’t really need hundreds of copies of a gene to accomplish that task.”

The researchers found a clue when they took a closer look at the bovine X chromosome: the female-determining sex chromosome also had a few copies of these testis-specific genes. “We don’t really know the mechanism in the bull, but the thought is that somehow the amplification of these genes in the Y has to do with helping the Y get passed on — and the X copies are amplified to compete against that tendency and help the X,” Hughes said.

A selfish pursuit

This X-Y arms race has been proven to happen in mice: somehow, repetitive genes on the Y chromosome give it an extra edge when it comes to ending up in the sperm during gamete formation. In a 2012 study, researchers knocked out the Y-chromosome repeats. Without the extra genes, more X chromosomes than Ys ended up in sperm cells, and the sex ratio of offspring skewed female. Over years of evolution, the X has developed repeats as well — its own way to get a leg up in the race.

Competition between X and Y chromosomes is selfish, Hughes said, because it’s not a good thing for the species to have a skewed sex ratio. Thus, these alterations benefit only the lucky chromosome that ends up in the fertilized egg. The fact that a selfish — and even detrimental — mechanism would continue for millions of years in disparate branches of the evolutionary tree suggests that these conflicts may be an inevitable side effect of having a pair of asymmetrical sex chromosomes. “These X-Y arms races have probably been around for as long as mammals have been around,” Page said.

Evolutionary theory aside, knowing the mechanisms controlling the sex ratios of cattle could be of practical use in the coming years. “It could be of great interest to breeders, because they would love to be able to manipulate the sex of cattle offspring,” Hughes said. “For example, dairy farmers would prefer more females and meat farmers would prefer more males.”

Right now, the lab is working on leafing out the branches of their Y chromosome evolutionary tree. The bull’s is the seventh sex chromosome to be completely sequenced using the SHIMS method. Hughes, Page and the lab are also eyeing members of other animal groups, including reptiles.

“Our lab is focused on sex differences across the human body, and all of that work really is inspired by lessons that we’ve learned by comparing the Y chromosomes of different animals with our own,” Page said. “It’s like when you go to an art gallery and just sit on a bench and look and feel inspired — these sequences are an infinite source of inspiration in the work we are doing. And we can now add the bull to our gallery.”

***

Hughes, J. et al. “Sequence analysis in Bos taurus reveals pervasiveness of X-Y arms races in mammalian lineages.” Genome Research, Nov. 18, 2020. DOI: 10.1101/gr.269902.120

Regulating the regulators
Whitehead Institute
November 12, 2020

MicroRNAs are short RNA sequences that maintain a tight control on which genes are expressed and when. They do this by regulating which messenger RNA (mRNA) transcripts — the single-stranded templates for proteins — are actually read by the cell. But what controls these cellular controllers?

In a new study published Nov. 12 in Science, researchers in David Bartel’s lab at Whitehead Institute show that mRNAs and other RNAs often turn the tables on their microRNA regulators — and show that the path to microRNA degradation is not what scientists expected it to be.

 “A lot of people know that microRNAs repress mRNAs — that’s textbook,” said Charlie Shi, a graduate student in Bartel’s lab and first author on the paper. “But in certain cases, this logic is reversed. And I think that’s really interesting and weird, this idea that often the tables are turned.”

When transcripts attack

MicroRNAs typically control gene expression by binding to mRNA transcripts, and then working together with a protein called Argonaute to “silence” those transcripts by causing them to be more rapidly degraded. Because microRNAs are held cozily inside of the Argonaute protein, they are shielded from destructive enzymes in the cell, and are thus fairly long-lived by cellular standards. They can persist for up to a week, causing the destruction of many mRNA molecules over that time.

Sometimes, however, a microRNA binds to a special target site on an mRNA transcript that leads to premature destruction of the microRNA. This phenomenon — called target-directed microRNA degradation, or TDMD — happens naturally in cells, and is a way to control how much of certain microRNAs are allowed to persist at any given time.

Bartel’s lab began studying this form of degradation after researchers in the lab discovered that an RNA called CYRANO, which doesn’t code for any proteins, leads to the degradation of a specific microRNA called miR-7. This interaction was interesting to the researchers because the mechanism did not seem to line up with the current theories about TDMD.

Previous models of TDMD suggested that special target sites, like the one in CYRANO, cause one end of the microRNA to stick out of Argonaute and become vulnerable to the addition and subtraction of nucleotides by cytoplasmic enzymes.  This process, called tailing and trimming, was thought to be a key step in the path to degradation of the microRNA.

“But when you knock out the enzyme that causes tailing of miR-7, it has no impact on the degradation,” Shi said. “So that’s curious, right? So how can we really perturb this supposedly responsible system and have no impact?”

A new model

In order to further probe the mechanism of TDMD, the researchers focused in on this interaction between the CYRANO noncoding RNA and miR-7. Shi designed a CRISPR screen to identify genes essential for the microRNA’s degradation when it encountered a CYRANO transcript.

The screen yielded one gene that was essential to the microRNA’s degradation, called ZSWIM8. When they looked up the gene’s function, the researchers found that it codes for a component of a ubiquitin ligase. Ubiquitin — so named because it is found in virtually all types of cells — serves as a flag to mark proteins for degradation in a cellular garbage disposal called the proteasome.

The finding of the ZSWIM8 ubiquitin ligase implied that CYRANO-mediated microRNA degradation involves destruction of the Argonaute protein. In this new molecular model for TDMD, the regulating RNA, CYRANO, binds to the microRNA, mir-7, encased in its protective Argonaute protein, and then recruits the ZSWIM8 ubiquitin ligase.  This ligase then sticks a few ubiquitin molecules onto the microRNA’s Argonaute, leading Argonaute to be degraded, and thereby exposing its microRNA cargo to be destroyed by enzymes in the cell.  Importantly, this process does not require any trimming and tailing of the microRNA.

“The discovery of this unanticipated pathway for TDMD illustrates the power of CRISPR screens, which can simultaneously query essentially every protein in the cell, including those that you never dreamed would be involved,” said Bartel, who is also an investigator of the Howard Hughes Medical Institute and a professor of biology at Massachusetts Institute of Technology.

A multitude of microRNAs

When the researchers looked at other known examples of TDMD, they found the ZSWIM8 was essential in all of them. Having identified this key part of the degradation pathway allowed them to seek out more microRNAs that are subject to this regulation.

“When we started this project, there were only around four examples in nature of endogenous RNAs that are encoded by the cell that can perform TDMD,” Shi said. “We had a feeling that there would be many more, and so by finding a factor that was required for TDMD in a general way — ZSWIM8 –we were then able to ask and answer the question, ‘how widespread this phenomenon?’”

As it turns out, TDMD is fairly common in multicellular organisms. The researchers looked for evidence of the microRNA degradation mechanism in different cell types — two from mice, and one from fruit flies — and found that in any given cell, up to 20 different microRNAs were regulated by TDMD out of a couple hundred total microRNAs in the cell.

The researchers also observed this mechanism in human cells and nematodes, suggesting that TDMD as a method for regulating microRNAs dates back to the common ancestor of these disparate species. That definitely creates a lot of questions for us,” Shi said. “Each one of these microRNAs is a story.”

“Runaway” Transcription

Researchers discover new rules governing bacterial gene expression that overturn fundamental assumptions about basic biological pathways.

Raleigh McElvery
August 26, 2020

On the evolutionary tree, humans diverged from yeast roughly one billion years ago. By comparison, two seemingly similar species of bacteria, Escherichia coli and Bacillus subtilis, have been evolving apart for roughly twice as long. In other words: walking, talking bipeds are closer on the tree of life to single-celled fungus than these two bacteria are to one another. In fact, it’s becoming increasingly clear that what is true of one bacterial type may not be true of another — even when it comes down to life’s most basic biological pathways.

E. coli has served as a model organism in scientific research for over a century, and helped researchers parse many fundamental processes, including gene expression. In these bacteria, as one molecular machine, the RNA polymerase, moves along the DNA transcribing it into RNA, it is followed in close pursuit by a second molecular machine, the ribosome, which translates the RNA into proteins. This “coupled” transcription-translation helps monitor and tune RNA output, and is considered a hallmark of bacteria.

However, an interdisciplinary team of biologists and physicists recently showed that the B. subtilis bacterium employs a different set of rules. Rather than working in tandem with the ribosome, the polymerase in B. subtilis speeds ahead. This system of “runaway” transcription creates alternative rules for RNA quality control, and provides insights into the sheer diversity of bacterial species.

“Generations of researchers, including myself, were taught that coupled transcription-translation is fundamental to bacterial gene expression,” says Gene-Wei Li, an associate professor of biology and senior author of the study. “But our very precise, quantitative measurements have overturned that long-held view, and this study could be just the tip of the iceberg.”

Grace Johnson, a graduate student in the Department of Biology, and Jean-Benoît Lalanne, a graduate student in the Department of Physics, are the lead authors on the paper, which appeared in Nature on Aug. 26.

A curious clue

In 2018, Lalanne developed an experimental technique to measure the boundaries of RNA transcripts. When DNA is transcribed into RNA, the resulting transcripts are generally longer than the DNA coding sequence because they also have to include an extra bit at the end to signal the polymerase to stop. In B. subtilis, Lalanne noticed there simply wasn’t enough space between the ends of the coding sequences and the ends of the RNA transcripts — the extra code was too short for both the polymerase and the ribosome to fit at the same time. In this bacterium, coupled transcription-translation didn’t seem possible.

“It was a pretty weird observation,” Lalanne recalls. “It just didn’t square up with the accepted dogma.”

To delve further into these puzzling observations, Johnson measured the speeds of the RNA polymerase and ribosome in B. subtilis. She was surprised to find that they were moving at very different rates: the polymerase was going roughly twice as fast as the ribosome.

During coupled transcription-translation in E. coli, the ribosome is so closely associated with the RNA polymerase that it can control when transcription terminates. If the RNA encodes a “premature” signal for the polymerase to stop transcribing, the nearby ribosome can mask it and spur the polymerase on. However, if something goes awry and the ribosome is halted too far behind the polymerase, a protein called Rho can intervene to terminate transcription at these premature sites, halting the production of these presumably non-functional transcripts.

However, in B. subtilis, the ribosome is always too far behind the polymerase to exert its masking effect. Instead, Johnson found that Rho recognizes signals encoded in the RNA itself. This allows Rho to prevent production of select RNAs while ensuring it doesn’t suppress all RNAs. However, these specific signals also mean Rho likely has a more limited role in B. subtilis than it does in E. coli.

A family trait

To gauge how common runaway transcription is, Lalanne created algorithms that sifted through genomes from over 1,000 bacterial species to identify the ends of transcripts. In many cases, there was not enough space at the end of the transcripts for both the RNA polymerase and the ribosome to fit, indicating that more than 200 additional bacteria also rely on runaway transcription.

“It was striking to see just how widespread this phenomenon is,” Li says. “It raises the question: How much do we really know about these model organisms we’ve been studying for so many years?”

Carol Gross, a professor in the Department of Microbiology and Immunology at University of California San Francisco who was not involved with the study, refers to the work as a “tour de force.”

“Gene-Wei Li and colleagues show transcription-translation coupling, thought to be a foundational feature of bacterial gene regulation, is not universal,” she says. “Instead, runaway transcription leads to a host of alternative regulatory strategies, thereby opening a new frontier in our study of bacterial strategies to thrive in varied environments.”

As researchers widen their experimental radius to include more types of bacteria, they are learning more about the basic biological processes underlying these microorganisms — with implications for those that take up residence in the human body, from helpful gut microbes to noxious pathogens.

“We’re beginning to realize that bacteria can have distinct ways of regulating gene expression and responding to environmental stress,” Johnson says. “It just shows how much interesting biology is left to uncover as we study increasingly diverse bacteria.”

Citation:
“Functionally uncoupled transcription–translation in Bacillus subtilis
Nature, online August 26, 2020, DOI: 10.1038/s41586-020-2638-5
Grace E. Johnson, Jean-Benoît Lalanne, Michelle L. Peters, and Gene-Wei Li

Top illustration: Researchers discovered a new system of transcription and translation in bacteria, where the polymerase (pink) in B. subtilis “runs away” from the ribosome (blue). Credit: Grace Johnson
Posted: 8.26.20
Bringing RNA into genomics

ENCODE consortium identifies RNA sequences that are involved in regulating gene expression.

Anne Trafton | MIT News Office
July 29, 2020

The human genome contains about 20,000 protein-coding genes, but the coding parts of our genes account for only about 2 percent of the entire genome. For the past two decades, scientists have been trying to find out what the other 98 percent is doing.

A research consortium known as ENCODE (Encyclopedia of DNA Elements) has made significant progress toward that goal, identifying many genome locations that bind to regulatory proteins, helping to control which genes get turned on or off. In a new study that is also part of ENCODE, researchers have now identified many additional sites that code for RNA molecules that are likely to influence gene expression.

These RNA sequences do not get translated into proteins, but act in a variety of ways to control how much protein is made from protein-coding genes. The research team, which includes scientists from MIT and several other institutions, made use of RNA-binding proteins to help them locate and assign possible functions to tens of thousands of sequences of the genome.

“This is the first large-scale functional genomic analysis of RNA-binding proteins with multiple different techniques,” says Christopher Burge, an MIT professor of biology. “With the technologies for studying RNA-binding proteins now approaching the level of those that have been available for studying DNA-binding proteins, we hope to bring RNA function more fully into the genomic world.”

Burge is one of the senior authors of the study, along with Xiang-Dong Fu and Gene Yeo of the University of California at San Diego, Eric Lecuyer of the University of Montreal, and Brenton Graveley of UConn Health.

The lead authors of the study, which appears today in Nature, are Peter Freese, a recent MIT PhD recipient in Computational and Systems Biology; Eric Van Nostrand, Gabriel Pratt, and Rui Xiao of UCSD; Xiaofeng Wang of the University of Montreal; and Xintao Wei of UConn Health.

RNA regulation

Much of the ENCODE project has thus far relied on detecting regulatory sequences of DNA using a technique called ChIP-seq. This technique allows researchers to identify DNA sites that are bound to DNA-binding proteins such as transcription factors, helping to determine the functions of those DNA sequences.

However, Burge points out, this technique won’t detect genomic elements that must be copied into RNA before getting involved in gene regulation. Instead, the RNA team relied on a technique known as eCLIP, which uses ultraviolet light to cross-link RNA molecules with RNA-binding proteins (RBPs) inside cells. Researchers then isolate specific RBPs using antibodies and sequence the RNAs they were bound to.

RBPs have many different functions — some are splicing factors, which help to cut out sections of protein-coding messenger RNA, while others terminate transcription, enhance protein translation, break down RNA after translation, or guide RNA to a specific location in the cell. Determining the RNA sequences that are bound to RBPs can help to reveal information about the function of those RNA molecules.

“RBP binding sites are candidate functional elements in the transcriptome,” Burge says. “However, not all sites of binding have a function, so then you need to complement that with other types of assays to assess function.”

The researchers performed eCLIP on about 150 RBPs and integrated those results with data from another set of experiments in which they knocked down the expression of about 260 RBPs, one at a time, in human cells. They then measured the effects of this knockdown on the RNA molecules that interact with the protein.

Using a technique developed by Burge’s lab, the researchers were also able to narrow down more precisely where the RBPs bind to RNA. This technique, known as RNA Bind-N-Seq, reveals very short sequences, sometimes containing structural motifs such as bulges or hairpins, that RBPs bind to.

Overall, the researchers were able to study about 350 of the 1,500 known human RBPs, using one or more of these techniques per protein. RNA splicing factors often have different activity depending on where they bind in a transcript, for example activating splicing when they bind at one end of an intron and repressing it when they bind the other end. Combining the data from these techniques allowed the researchers to produce an “atlas” of maps describing how each RBP’s activity depends on its binding location.

“Why they activate in one location and repress when they bind to another location is a longstanding puzzle,” Burge says. “But having this set of maps may help researchers to figure out what protein features are associated with each pattern of activity.”

Additionally, Lecuyer’s group at the University of Montreal used green fluorescent protein to tag more than 300 RBPs and pinpoint their locations within cells, such as the nucleus, the cytoplasm, or the mitochondria. This location information can also help scientists to learn more about the functions of each RBP and the RNA it binds to.

“The strength of this manuscript is in the generation of a comprehensive and multilayered dataset that can be used by the biomedical community to develop therapies targeted to specific sites on the genome using genome-editing strategies, or on the transcriptome using antisense oligonucleotides or agents that mediate RNA interference,” says Gil Ast, a professor of human molecular genetics and biochemistry at Tel Aviv University, who was not involved in the research.

Linking RNA and disease

Many research labs around the world are now using these data in an effort to uncover links between some of the RNA sequences identified and human diseases. For many diseases, researchers have identified genetic variants called single nucleotide polymorphisms (SNPs) that are more common in people with a particular disease.

“If those occur in a protein-coding region, you can predict the effects on protein structure and function, which is done all the time. But if they occur in a noncoding region, it’s harder to figure out what they may be doing,” Burge says. “If they hit a noncoding region that we identified as binding to an RBP, and disrupt the RBP’s motif, then we could predict that the SNP may alter the splicing or stability of the gene.”

Burge and his colleagues now plan to use their RNA-based techniques to generate data on additional RNA-binding proteins.

“This work provides a resource that the human genetics community can use to help identify genetic variants that function at the RNA level,” he says.

The research was funded by the National Human Genome Research Institute ENCODE Project, as well as a grant from the Fonds de Recherche de Québec-Santé.

Yukiko Yamashita

Education

  • PhD, 1999, Kyoto University
  • BS, Biology, 1994, Kyoto University

Research Summary

Two remarkable feats of multicellular organisms are generation of many distinct cell types via asymmetric cell division and transmission of the germline genome to the next generation, essentially in eternity. Studying these processes using the Drosophila male germline as a model system has led us to venture into new areas of study, such as functions of satellite DNA, a ‘genomic junk,’ and how they might be involved in speciation.

Awards

  • Tsuneko and Reiji Okazaki Award, 2016
  • Howard Hughes Medical Institute, Investigator, 2014
  • MacArthur Fellow, 2011
  • Women in Cell Biology Early Career Award, American Society for Cell Biology, 2009
  • Searle Scholar, 2008
Ruth Lehmann

Education

  • Dr. rer. nat., 1985, University of Tübingen
  • MS, 1981, Biology, University of Freiburg

Research Summary

We study germ cells, the only cells in the body naturally able to generate completely new organisms. In addition to the nuclear genome, cytoplasmic information is passed though the egg cell to the next generation. We analyze the organization and regulation of germ line specific RNA-protein condensates, and explore mechanisms used by endosymbionts such as mitochondria and the intracellular bacterium, Wolbachia, to propagate through the cytoplasm of the female germ line.

Awards

  • Vanderbilt Prize in Biomedical Science, 2022
  • Gruber Genetics Prize, 2022
  • Thomas Hunt Morgan Medal, Genetics Society of America, 2021
  • Francis Amory Prize in Reproductive Medicine and Reproductive Physiology, American Academy of Arts and Sciences, 2020
  • Vilcek Prize in Biomedical Science, 2020
  • Keith R. Porter Award, American Society for Cell Biology, 2018
  • Inaugural Klaus Sander Prize, German Society for Developmental Biology, 2017
  • European Molecular Biology Organization, Foreign Associate, 2012
  • Conklin Medal of the Society of Developmental Biology, 2011
  • National Academy of Sciences, Foreign Associate, 2005; Member, 2008
  • American Academy of Arts and Sciences, Member, 1998
  • Howard Hughes Medical Institute, Investigator, 1990 and 1997
These muscle cells are guideposts to help regenerative flatworms grow back their eyes
Eva Frederick | Whitehead Institute
June 25, 2020

If anything happens to the eyes of the tiny, freshwater-dwelling planarian Schmidtea mediterranea, they can grow them back within just a few days. How they do this is a scientific conundrum — one that Peter Reddien’s lab at Whitehead Institute has been studying for years.

The lab’s latest project offers some insight: in a paper published in Science June 25, researchers in Reddien’s lab have identified a new type of cell that likely serves as a guidepost to help route axons from the eyes to the brain as the worms complete the difficult task of regrowing their neural circuitry.

Schmidtea mediterranea’s eyes are composed of light-capturing photoreceptor neurons connected to the brain with long, spindly processes called axons. They use their eyes to respond to light to help navigate their environment.

The worms, which are popular models for research into regeneration, can regrow pretty much any part of their body; eyes are an interesting part to study because regenerating the visual system requires the worms rewire their neurons to connect them to the brain.

When neural systems develop in embryos, the first nerve fibers, called pioneer axons, snake their way through tissue to form the circuitry needed to perceive and interpret external stimuli. The axons are helped along their way by specialized cells called guidepost cells. These special cells are positioned at choice points — places where the axon’s path could fork in different directions.

In many organisms, these guidepost cells aren’t a priority anymore once development is finished, and typically are not renewed through adulthood. That’s one reason why, when humans experience brain or nerve damage, the injury is usually permanent.

“This is a fundamental mystery of regeneration that we hadn’t even been thinking about,” says Reddien, the senior author of the paper who is also a professor of biology at Massachusetts Institute of Technology and an investigator with the Howard Hughes Medical Institute. “How can an adult animal regenerate a functional nervous system when the original development of the nervous system typically involves a number of cues that are thought to be transient?”

Then, in 2018, Reddien Lab scientist Lucila Scimone found something surprising in adult planarians: groups of mysterious cells that looked like they might play a role in guiding growing axons. She’d noticed this group of cells because they co-expressed two genes not often seen together and some were conspicuously close to the eyes.

“I was captivated by these cells,” she says. They appeared in very small numbers (a normal worm might have around 5; a large one might have up to 10) in every planarian she examined. They were divided into two distinct groups: some around the flatworms’ eyes, and others spaced out along the path to the brain center. When she traced the path of existing axons leading from the planarians’ eyes to their brain, they coincided with the positions of these cells without exception.

When the researchers characterized the cells, they found that they did not express any of the genes that are hallmarks of photoreceptor neurons; instead, they had markers often found in muscle tissue. “That was very striking, because muscle cells — that’s not what they do in most animals,” Scimone says.

In other organisms, guidepost cells are often neurons or glia. It would be unusual for muscle cells to serve as guideposts; but past work in the Reddien Lab had shown that planarian muscle cells played other special roles, such as secreting the extracellular matrix. The researchers now wondered whether they could add the role of guidepost to the long list of planarian muscle cell functions.

To test their hypothesis, the researchers designed a series of experiments. “We developed an eye transplantation method where you can take an eye from an animal and transplant it into another animal,” says Reddien Lab postdoc Kutay Deniz Atabay. “When you do this, the axonal projections from that eye will basically, if positioned appropriately, correctly wire themselves into the brain, producing a functional state.”

The researchers also created genetically engineered planarians that had the muscle cells, but no eyes, and then transplanted eyes onto their eyeless heads. Sure enough, the neurons grew as normal, snaking towards the cells and then adjusting their trajectories after encountering them.

Without the cells, it was a different story. When the researchers transplanted eyes to distant parts of planarians’ bodies without a population of these muscle cells, the photoreceptor neurons did not connect to the brain center. Likewise, when they transplanted eyes into planarians that had been modified to not have these muscle cells, their photoreceptor neurons still grew — but they did not wire properly to reach the brain.

These findings combined suggested that the cells were fully independent of the visual system — they did not form because of eyes or photoreceptor neurons, but likely established themselves before the neurons grew — which provided more evidence for the guidepost role.

The guidepost-like activity of these cells then begged the question: how do the cells themselves know where to be? “We found that there’s a pattern of signaling molecules in muscle that is setting where these cells should be,” Reddien says. “If we perturb the global positional information of the system, these cells get placed in the wrong positions, and then axons go to the wrong positions — so we think there’s a positional information framework that places the cells during regeneration, and that allows them to work as guideposts in the correct locations.”

At this point, the researchers don’t know exactly how the cells are able to communicate with growing axons to serve as guideposts. They could be releasing some sort of signaling molecule that attracts the axons, or they could be communicating by using trans-membrane proteins.

“That will be an exciting direction for the future,” Reddien says. “We have now identified the transcriptome for the cells, which means we know all the genes that these cells express. That provides us with an intriguing list of genes that can be probed functionally, to try to see which ones are mediating the functions of these cells.”

This study is a step forward in a body of work that aims to expand the capabilities of regenerative medicine. “Imagine a scenario where someone experiences a spinal cord injury or an eye injury or stroke that leads to the loss of a neural circuit,” says Atabay. “The reason we can’t fully cure these cases today is that we lack fundamental information regarding how these systems can regenerate. Looking at regenerative organisms provides a lot of insights. From this case, we see that regenerating the lost system may not be enough; you may also need to regenerate systems that are properly patterning that system.”

***

Written by Eva Frederick

***

Scimone, M. L. et al. “Muscle and neuronal guidepost-like cells facilitate planarian visual system regeneration.” Science, June 25, 2020.

3 Questions with Seychelle Vos

An unconventional geneticist uses cryogenic electron microscopy and crystallography to understand gene expression and cell fate.

Lucy Jakub
June 1, 2020

Seychelle Vos arrived in September 2019 as the Department of Biology’s newest assistant professor. Her lab in Building 68 uses cryogenic electron microscopy (cryo-EM), X-ray crystallography, biochemistry, and genetics to study how DNA and its associated proteins are organized inside the cell. Vos received her PhD from the University of California at Berkeley and completed her postdoctoral research at the Max Planck Institute for Biophysical Chemistry in Germany. She sat down to discuss her structural biology research, and why it’s so important to understand DNA as a physical structure.

Q: Your research is on the proteins that compress DNA so it can fit inside a cellular organelle called the nucleus. How does the genome organize itself in different shapes to perform different functions in the cell, and why is this an important process for us to understand?

A: If we take all the DNA inside of one human cell and stretch it out end to end, it extends 2 meters in length. But it needs to fit into the nucleus, which is only a few microns wide. It’s essentially like stringing a fishing line from here to New Haven and trying to put it in a soccer ball. That’s not an easy thing to do. There are lots of proteins that compact the genome either by wrapping the DNA around themselves or by forming loops in the DNA.

In order to replicate DNA or transcribe it to make a protein, the cell’s molecular machinery needs to be able to access and read it. Depending on how the DNA is wrapped and organized, different genes will be more accessible than others. In a stem cell, essentially any gene can be turned on. But as cells begin to differentiate into kidney cells, liver cells, and so on, only the genes specific to those functions can be turned on. Every cell has its own set of proteins that make it special, and most of that regulation happens at the level of RNA expression.

Our lab wants to understand how DNA organization impacts gene expression at the atomic level. This gets to the crux of how a stem cell becomes a specific cell type, and what happens when those programs go wrong. Without the right kind of compaction you can have cancer phenotypes, because things get turned on that shouldn’t be, or a cell thinks it’s a stem cell again and divides really fast. Many of the proteins we study are involved either in developmental disorders or cancers. If we don’t understand their basic biology, it’s very hard to come up with reasonable ways of treating these diseases.

Q: What was it about structural biology that hooked you during your early career?

A: When I started my PhD at UC Berkeley, I didn’t have much of an interest in structural biology. I thought that I wanted to study the immunology of nucleic acids, and I did my first lab rotation with Jennifer Doudna, one of the biochemists who was instrumental in developing CRISPR-Cas9 as a gene-editing tool. She might seem like a funny first person to do a rotation with if you were doing immunology, but CRISPR is essentially a bacterial immune system, and I went to her lab just to see a completely different way of viewing immunology. During that rotation, I fell in love with crystallography. What’s so beautiful about this technique is that it shows us how different atoms are communicating with each other, and how one molecule might be engaging with another molecule.

For the rest of my rotations as a graduate student, I did research in biochemistry and structural biology labs, and ended up joining James Berger’s lab, which did a combination of both. I worked on a class of enzymes called topoisomerases that bind to DNA and uncoil the DNA when it gets tangled. I was able to solve a number of very interesting structures, and do biochemistry and genetics all at the same time.

During my postdoc I studied RNA polymerase II, the enzyme that makes all the RNAs that turn into proteins in the cell and determine the cell’s identity. I wanted to know how it is regulated after the initiation stage of transcription. One of the proteins I was working with wouldn’t crystallize, and we had to come up with some other ways of seeing it. So we turned to cryo-EM, which had just become a very high-resolution technology — we could actually see the atoms touching each other! That was a game-changer for me. If you told me at the beginning of my PhD that these technologies could become central to my research, I would have told you there’s no way that would happen. But life has surprises.

Q: How does your expertise in genetics and biochemistry help you solve structural problems?

A: I’m definitely not your average structural biologist — I use structural tools to advance the genetics I want to do. My lab uses genetics to inform which protein complexes we want to look at, and then we use cryo-EM and X-ray crystallography to understand how those proteins actually affect RNA polymerase II. With what we learn about the structure, we can go back and use targeted genetic approaches to remove those proteins from the genome and see what happens to gene expression in particular cells. I also have projects where we’ll do a genetic screen first, and then use structural biology and chemistry techniques to get more information. The research is like a giant feedback loop. You need all of those perspectives to really understand the whole system.