Balance between proteins keeps sperm swimming swiftly

Developing sperm cells swap out histones for proteins called protamines to coil DNA tightly enough to fit inside the hydrodynamic shape ideal for the task of swimming swiftly to an egg in order to fertilize it. If the balance of protamines in the sperm is wrong, however, the sperm may become misshapen and die, making the animal infertile. Whitehead Institute Member Yukiko Yamashita and former graduate student Jun Park have discovered why this imbalance causes infertility in the fruit fly.

Greta Friar | Whitehead Institute
April 10, 2023

Sperm must swim swiftly to an egg in order to fertilize it, and so they have evolved hydrodynamic shapes. Most of the space in the head of sperm cells is taken up by the DNA they carry, so the cells coil up their DNA super tightly to stay small and streamlined. In most cell types, DNA is coiled around proteins called histones. These do not package DNA tightly enough for sperm, so when a sperm cell is developing, it swaps out histones for another type of protein called protamines that coil DNA very tightly.

Many species, including humans, mice, and flies, have multiple types of protamines. If the balance between the different types is wrong, then the sperm’s DNA may not be packaged correctly and it may become misshapen and die, making the animal infertile. Whitehead Institute Member Yukiko Yamashita and former graduate student Jun Park have discovered why this imbalance causes infertility in the fruit fly (Drosophila melanogaster). The finding, published in the Proceedings of the National Academy of Sciences on April 10, showed a mechanism that balances different types of protamines to ensure male fertility.

Mst77F is a major fruit fly protamine. Yamashita and Park determined that the fruit fly protamine Mst77Y, which is related to Mst77F, can interfere with the function of Mst77F. Fruit flies usually make a lot of Mst77F and a little of Mst77Y. The researchers found that when expression of the Mst77Y gene is too high, especially when expression of Mst77F is low, it disrupts the process of DNA packaging, leading to infertility.

How does Mst77Y interfere with Mst77F? The researchers discovered that this is because the Mst77Y gene makes faulty protamines. There are multiple copies of Mst77Y on the fly’s Y chromosome. They likely evolved from a copy of Mst77F, which is not on a sex chromosome. However, the different versions of Mst77Y have lost or altered parts that they need in order to function, so unlike the Mst77F protamine, Mst77Y protamines likely cannot coil DNA tightly around themselves. In spite of the fact that the Mst77Y protamines do not work correctly, they are dominant: when they are present, the sperm cell will use them over the functional Mst77F protamines.

“Mst77Y is a half-broken tool,” says Yamashita, who is also a professor of biology at the Massachusetts Institute of Technology and a Howard Hughes Medical Institute investigator. “It is able to take the place of the working tool, Mst77F, but not to do its job, so when too much Mst77Y is present, the sperm cell does not have enough working tools in place to compact its DNA.”

The researchers also figured out how sperm cells keep expression of Mst77F high and Mst77Y low: with the help of a protein called Modulo. In order for an RNA read from a gene to be made into a protein, it needs to have a tail added to it made up of a string of adenines—one of the four building blocks that make up RNA. Modulo makes sure that the cell preferentially adds this tail to the RNA coding for Mst77F. Although Yamashita and Park did not determine the exact mechanism by which Modulo ensures this preferential treatment, they did observe that Modulo and the Mst77F RNA group together in the same part of the cell, the nucleolus, whereas Mst77Y does not.

Altogether, these findings explain why and how fruit fly sperm cells carefully balance the levels of these two protamines. However, the research raises the question, what are sperm cells using the non-functional Mst77Y protamines for? Yamashita and Park can only speculate, but the answer may have to do with their observation that high levels of Mst77Y killed off more X-chromosome bearing sperm than Y-chromosome bearing sperm. Past research has suggested that protamines may be involved in a process called meiotic drive, which animals can use to skew the sex ratio of their offspring. This new work is not only consistent with that hypothesis, but provides a possible mechanism to explain how protamines contribute. The researchers note that they did not see a strong effect on the sex ratio of offspring in this experiment, but hope that this work could set the stage to understand the role of non-functional protamines in meiotic drive.

“At the cell level, we were able to show that there’s some basis for this protamine to be involved in biasing whether X or Y chromosome bearing sperm survive,” Park says. “An interesting next question would be to see if there are certain conditions in which this mechanism more clearly acts as a driver at the level of offspring’s sex ratio.”

Notes

Park, Jun I., George W. Bell, and Yukiko M. Yamashita. 2023. “Derepression of Y-linked multicopy protamine-like genes interferes with sperm nuclear compaction in D. melanogaster,” PNAS 120 (16). https://www.pnas.org/doi/10.1073/pnas.2220576120

Not so inactive X chromosome

Whitehead Institute Member David Page has spent his career understanding how the differences between X and Y contribute to these sex differences, but a recent project is taking his lab in a new direction: understanding how the differences between X chromosomes contribute to sex differences.

Greta Friar | Whitehead Institute
February 7, 2023

Nearly every cell in our body contains pairs of each of our chromosomes, and these pairs are identical in all but one case: that of our sex chromosomes. Males typically have one X and one Y sex chromosome, while females typically have two X chromosomes. In recent years, research has suggested that these different chromosomes can influence far more than sex determination. Gene expression from the sex chromosomes appears to contribute to sex differences in health and disease, which males and females experience in everything from the incidence of getting certain diseases, to the symptoms of diseases, to responses to drugs, and more. For example, women are more likely to develop autoimmune disorders, while men are more likely to develop heart conditions.

Whitehead Institute Member David Page has spent his career understanding how the differences between X and Y contribute to these sex differences, but a recent project is taking his lab in a new direction: understanding how the differences between X chromosomes contribute to sex differences. Although females’ pair of X chromosomes contain the same genes, they have different patterns of gene expression. New research from Page and postdoc Adrianna San Roman reveals just how different the two types of X chromosomes are. The findings, published in the journal Cell Genomics on February 8, show that one type of X chromosome, known as the inactive X chromosome, can modulate the gene expression of the other type of X chromosome, known as the active X chromosome. Their work indicates that inactive X chromosomes have underappreciated roles in gene regulation and, most likely, in sex differences in health and disease.

Difference rooted in history

Females’ two X chromosomes have different gene expression activity because of the sex chromosomes’ evolutionary history. The X and Y sex chromosomes evolved from a pair of identical non-sex chromosomes. Because of this ancestry, the sex chromosomes still contain genes that are important outside of regulating sex differences, such as genes that contribute to our immune system or regulate gene expression throughout the body. However, over time the Y chromosome shrank and lost most of its genes. Researchers think that in order to make up for the loss of necessary genes on the Y, expression of the corresponding genes on the X chromosome increased. This ensured that males still had the necessary levels of gene expression from their sex chromosomes, but now females, with two copies of X both working overtime, had levels of gene expression that were too high. To solve this problem, our bodies developed a process called X chromosome inactivation, by which the majority of genes on all but one copy of the X chromosome in each cell are silenced, or turned off. This means that everyone, male and female alike, has one copy of the X chromosome working at full strength–the active X chromosome. In males, the active X chromosome is paired with a Y chromosome, and in females, it is paired with a so-called inactive X chromosome, on which most of the genes are turned off.

In spite of the evolution of X chromosome inactivation, some percentage of genes on the inactive X chromosome are still expressed, such as genes that have an active counterpart on the Y chromosome. Previous research has indicated that about a quarter of the genes on the inactive X are, in fact, active, so researchers have long been aware that the chromosome is not completely silent. However, it’s still often painted as a passive copy playing backup for its more active partner. San Roman’s work shows that the inactive X chromosome’s gene expression is much more potent and complex than that.

A spectrum of sex chromosomes

In order to understand the inactive X chromosome’s contributions to gene expression, San Roman and colleagues in the Page lab collected blood and skin samples from people born with unusual combinations of sex chromosomes—everything from X0 (one X chromosome) to XXXXY. People with these different sets of chromosomes often have health issues; for example, X0 females have Turner syndrome, which can cause heart defects, hearing impairment, and more; and XXY males have Klinefelter syndrome, which can cause infertility, weak muscles, and more. Page and San Roman hope their research could provide useful insights into these health issues as well as into sex differences between XY males and XX females.

In people with more than one X chromosome, every X but one is an inactive X. The researchers graphed sex chromosome gene expression, measuring the change in expression level of each gene with the addition of each inactive X, for people with anything from zero to three inactive X chromosomes, as well as different numbers of Y chromosomes. They also looked at the relative contribution to overall expression from the active versus inactive X chromosomes. One might expect the graphs they made to be relatively straightforward: for genes that are turned off on the inactive X chromosome, the gene expression level would not change at all as the number of copies of the inactive X increased. For genes that are turned on, the gene expression level would double with two X chromosomes, triple with three X chromosomes, and so on. When the researchers looked at chromosomes other than X with extra copies—namely, Y and chromosome 21—this is essentially the pattern they observed. Gene expression from additional X chromosomes, however, was not so straightforward.

Each additional inactive X chromosome changes gene expression by the same amount. However, the researchers found a surprising diversity in expression levels across X chromosome genes. The presence of each additional inactive X might increase one gene’s expression by 20 percent and another’s by 70 percent. Then the results grew more surprising: for some genes, the addition of an inactive X decreased their expression. For some genes that are only expressed on the active X chromosome, and completely silent on the inactive X, additional inactive X chromosomes nonetheless changed their expression level.

These discrepancies led the researchers to a startling finding. The X chromosomes do not function independently of each other. Instead, the inactive X chromosome can modulate expression of genes on the active X chromosome. In other words, some genes on the inactive X chromosome regulate genes on the active X chromosome, dialing their expression up or down. Altogether, the researchers found that 38% of the X chromosome genes in the two cell types that they tested are affected by the presence of inactive X chromosomes, either because the genes are expressed on the inactive X, or because the inactive X regulates their expression on the active X, or through some combination of the two mechanisms.

These findings show that the inactive X plays a much more active role in gene expression and regulation than was previously appreciated. Rather than just playing second fiddle to the active X chromosome, the inactive X is sometimes harmonizing with and sometimes even conducting its partner.

Rethinking the role of the inactive X in health and disease

Page and San Roman hope that their findings will help refocus research into sex differences. Previous research into the mechanisms behind these differences has focused on the effects of having X versus Y chromosomes. Page and San Roman’s work show that researchers also need to consider how the presence (in females) or absence (in males) of an inactive X chromosome contributes to sex differences.

“Everybody on the planet carries one active X chromosome, so that first X chromosome really does not contribute, we think, to differences between males and females,” says Page, who is also a professor of biology at the Massachusetts Institute of Technology and Investigator with the Howard Hughes Medical Institute. “If we transition from saying that females are XX and males are XY, to saying that females are Xi [have an inactive X] and males are Y, that really focuses the question.”

Page lab researchers have already begun using their findings to identify X chromosome genes that are likely to be important for sex differences in health and disease. From their list of genes that change in expression based on the presence of an inactive X, the researchers narrowed in on a top ten list of genes that need to maintain a specific expression level or else there will be severe negative consequences. These genes are also likely to be responsible for causing the health issues associated with different atypical sex chromosome compositions, because changes in their expression level are most likely to have strong effects on cells.

“This is a new way of thinking about how the X chromosome is expressed and how it might be impacting our biology,” San Roman says. “This top ten list will be really interesting to consider in the future in terms of how the level of expression of these genes affects cells and tissues in very fundamental ways.”

Notes

Citation:

Adrianna K. San Roman, Alexander K. Godfrey, Helen Skaletsky, Daniel W. Bellott, Abigail F. Groff, Hannah L. Harris, Laura V. Blanton, Jennifer F. Hughes, Laura Brown, Sidaly Phou, Ashley Buscetta, Paul Kruszka, Nicole Banks, Amalia Dutra, Evgenia Pak, Patricia C. Lasutschinkow, Colleen Keen, Shanlee M. Davis, Nicole R. Tartaglia, Carole Samango-Sprouse, Maximilian Muenke, and David C. Page. (2023). The human inactive X chromosome modulates expression of the active X chromosome. Cell Genomics. https://doi.org/10.1016/j.xgen.2023.100259

Daniel Lew

Education

  • Graduate: PhD, 1990, Rockefeller University
  • Undergraduate: BA, 1984, Genetics, Cambridge University

Research Summary

Different cells take on an astonishing variety of shapes, which are often critical to be able to perform specialized cell functions like absorbing nutrients or contracting muscles. We study how different cell shapes arise and how cells control the spatial distribution of their internal constituents. We take advantage of the tractability of fungal model systems, and address these questions using approaches from cell biology, genetics, and computational biology to understand molecular mechanisms. 

Honors and Awards

  • Fellow, American Academy of Microbiology, 2008
  • Fellow, American Association for the Advancement of Science, 2010
  • Duke Equity, Diversity, and Inclusion Award, 2019
Enzyme “atlas” helps researchers decipher cellular pathways

Biologists have mapped out more than 300 protein kinases and their targets, which they hope could yield new leads for cancer drugs.

Anne Trafton | MIT News Office
January 11, 2023

One of the most important classes of human enzymes are protein kinases — signaling molecules that regulate nearly all cellular activities, including growth, cell division, and metabolism. Dysfunction in these cellular pathways can lead to a variety of diseases, particularly cancer.

Identifying the protein kinases involved in cellular dysfunction and cancer development could yield many new drug targets, but for the vast majority of these kinases, scientists don’t have a clear picture of which cellular pathways they are involved in, or what their substrates are.

“We have a lot of sequencing data for cancer genomes, but what we’re missing is the large-scale study of signaling pathway and protein kinase activation states in cancer. If we had that information, we would have a much better idea of how to drug particular tumors,” says Michael Yaffe, who is a David H. Koch Professor of Science at MIT, the director of the MIT Center for Precision Cancer Medicine, a member of MIT’s Koch Institute for Integrative Cancer Research, and one of the senior authors of the new study.

Yaffe and other researchers have now created a comprehensive atlas of more than 300 of the protein kinases found in human cells, and identified which proteins they likely target and control. This information could help scientists decipher many cellular signaling pathways, and help them to discover what happens to those pathways when cells become cancerous or are treated with specific drugs.

Lewis Cantley, a professor of cell biology at Harvard Medical School and Dana Farber Cancer Institute, and Benjamin Turk, an associate professor of pharmacology at Yale School of Medicine, are also senior authors of the paper, which appears today in Nature. The paper’s lead authors are Jared Johnson, an instructor in pharmacology at Weill Cornell Medical College, and Tomer Yaron, a graduate student at Weill Cornell Medical College.

“A Rosetta stone”

The human genome includes more than 500 protein kinases, which activate or deactivate other proteins by tagging them with a chemical modification known as a phosphate group. For most of these kinases, the proteins they target are unknown, although research into kinases such as MEK and RAF, which are both involved in cellular pathways that control growth, has led to new cancer drugs that inhibit those kinases.

To identify additional pathways that are dysregulated in cancer cells, researchers rely on phosphoproteomics using mass spectrometry — a technique that separates molecules based on their mass and charge — to discover proteins that are more highly phosphorylated in cancer cells or healthy cells. However, until now, there has been no easy way to interrogate the mass spectrometry data to determine which protein kinases are responsible for phosphorylating those proteins. Because of that, it has remained unknown how those proteins are regulated or misregulated in disease.

“For most of the phosphopeptides that are measured, we don’t know where they fit in a signaling pathway. We don’t have a Rosetta stone that you could use to look at these peptides and say, this is the pathway that the data is telling us about,” Yaffe says. “The reason for this is that for most protein kinases, we don’t know what their substrates are.”

Twenty-five years ago, while a postdoc in Cantley’s lab, Yaffe began studying the role of protein kinases in signaling pathways. Turk joined the lab shortly after, and the three have since spent decades studying these enzymes in their own research groups.

“This is a collaboration that began when Ben and I were in Lew’s lab 25 years ago, and now it’s all finally really coming together, driven in large part by what the lead authors, Jared and Tomer, did,” Yaffe says.

In this study, the researchers analyzed two classes of kinases — serine kinases and threonine kinases, which make up about 85 percent of the protein kinases in the human body — based on what type of structural motif they put phosphate groups onto.

Working with a library of peptides that Cantley and Turk had previously created to search for motifs that kinases interact with, the researchers measured how the peptides interacted with all 303 of the known serine and threonine kinases. Using a computational model to analyze the interactions they observed, the researchers were able to identify the kinases capable of phosphorylating every one of the 90,000 known phosphorylation sites that have been reported in human cells, for those two classes of kinases.

To their surprise, the researchers found that many kinases with very different amino acid sequences have evolved to bind and phosphorylate the same motifs on their substrates. They also showed that about half of the kinases they studied target one of three major classes of motifs, while the remaining half are specific to one of about a dozen smaller classes.

Decoding networks

This new kinase atlas can help researchers identify signaling pathways that differ between normal and cancerous cells, or between treated and untreated cancer cells, Yaffe says.

“This atlas of kinase motifs now lets us decode signaling networks,” he says. “We can look at all those phosphorylated peptides, and we can map them back onto a specific kinase.”

To demonstrate this approach, the researchers analyzed cells treated with an anticancer drug that inhibits a kinase called Plk1, which regulates cell division. When they analyzed the expression of phosphorylated proteins, they found that many of those affected were controlled by Plk1, as they expected. To their surprise, they also discovered that this treatment increased the activity of two kinases that are involved in the cellular response to DNA damage.

Yaffe’s lab is now interested in using this atlas to try to find other dysfunctional signaling pathways that drive cancer development, particularly in certain types of cancer for which no genetic drivers have been found.

“We can now use phosphoproteomics to say, maybe in this patient’s tumor, these pathways are upregulated or these pathways are downregulated,” he says. “It’s likely to identify signaling pathways that drive cancer in conditions where it isn’t obvious what the genetics that drives the cancer are.”

The research was funded by the Leukemia and Lymphoma Society, the National Institutes of Health, Cancer Research UK, the Brain Tumour Charity, the Charles and Marjorie Holloway foundation, the MIT Center for Precision Cancer Medicine, and the Koch Institute Support (core) grant from the National Cancer Institute.

The molecules behind metastasis
Greta Friar | Whitehead Institute
January 4, 2023

Many cancer cells never leave their original tumors. Some cancer cells evolve the ability to migrate to other tissues, but once there cannot manage to form new tumors, and so remain dormant. The deadliest cancer cells are those that can not only migrate to, but also thrive and multiply in distant tissues. These metastatic cancer cells are responsible for most of the deaths associated with cancer. Understanding what enables some cancer cells to metastasize—to spread and form new tumors—is an important goal for researchers, as it will help them develop therapies to prevent or reverse those deadly occurrences.

Past research from Whitehead Institute Member Robert Weinberg and others suggests that cancer cells are best able to form metastatic tumors when the cells are in a particular state called the quasi-mesenchymal (qM) state. New research from Weinberg and Arthur Lambert, once a postdoc in Weinberg’s lab and now an associate director of translational medicine at AstraZeneca, has identified two gene-regulating molecules as important for keeping cancer cells in the qM state. The research, published in the journal Developmental Cell on December 19, shows that these molecules, ΔNp63 and p73, enable breast cancer cells to form new tumors in mice, and illuminates important aspects of how they do so.

Most potent in the middle

Cells enter the qM state by undergoing the epithelial-mesenchymal transition (EMT), a developmental process that can be co-opted by cancer cells. In the EMT, cells transition from an epithelial state through a spectrum of more mesenchymal states, which allows them to become more mobile and aggressive. Cells in the qM state have only transitioned partway through the EMT, becoming more, but not fully, mesenchymal. This middle ground is perfect for metastasis, whereas cells on either end of the spectrum—cells that are excessively epithelial or excessively mesenchymal—lose their metastatic abilities.

Lambert and colleagues wanted to understand more about how cancer stem cells, which can seed metastases and recurrent tumors, remain in a metastasis-prone qM state. They analyzed how gene activity was regulated in those cells and identified two transcription factors—molecules that influence the activity of target genes—as important. One of the transcription factors, ΔNp63, appeared to most directly control cancer stem cells’ ability to maintain a qM state. The other molecule, p73, seemed to have a similar role because it can activate ΔNp63. When either transcription factor was inactivated, the cancer stem cells transitioned to the far end of the EMT spectrum and so were unable to metastasize.

Next, the researchers looked at what genes ΔNp63 regulates in cancer stem cells. They expected to find a pattern of gene regulation resembling what they would see in healthy breast stem cells. Instead they found a pattern closely resembling what one would see in cells involved in wound healing and regeneration. Notably, ΔNp63 stimulates EGFR signaling, which is used in wound healing to promote rapid multiplication of cells.

“Although this is not what we expected to see, it makes a lot of sense because the process of metastasis requires active proliferation,” Lambert says. “Metastatic cancer cells need both the properties of stem cells—such as the ability to self-renew and differentiate into different cell types—and the ability to multiply their numbers to grow new tumors.”

This finding may help to explain why qM cells are so uniquely good at metastasizing. Only in the qM state can the cells strongly stimulate EGFR signaling and so promote their own proliferation.

“This work gives us some mechanistic understanding of what it is about the quasi-mesenchymal state that drives metastatic tumor growth,” says Weinberg, who is also the Daniel K. Ludwig Professor for Cancer Research at the Massachusetts Institute of Technology.

The researchers hope that these insights could eventually contribute to therapies that prevent metastasis. They also hope to pursue further research into the role of ΔNp63. For example, this work illuminated a possible connection between ΔNp63 and the activation of dormant cancer cells, the cells that travel to new tissues but then cannot proliferate after they arrive there. Such dormant cells are viewed as ticking time bombs, as at any point they may reawaken. Lambert hopes that further research may reveal new insights into what causes dormant cancer cells to eventually gain the ability to grow tumors, adding to our understanding of the mechanisms of metastatic cancer.

Notes

Arthur W. Lambert, Christopher Fiore, Yogesh Chutake, Elisha R. Verhaar, Patrick C. Strasser, Mei Wei Chen, Daneyal Farouq, Sunny Das, Xin Li, Elinor Ng Eaton, Yun Zhang, Joana Liu Donaher, Ian Engstrom, Ferenc Reinhardt, Bingbing Yuan, Sumeet Gupta, Bruce Wollison, Matthew Eaton, Brian Bierie, John Carulli, Eric R. Olson, Matthew G. Guenther, Robert A. Weinberg. “ΔNp63/p73 drive metastatic colonization by controlling a regenerative epithelial stem cell program in quasi-mesenchymal cancer stem cells.” Developmental Cell, Volume 57, Issue 24,
2022, 2714-2730.e8, https://doi.org/10.1016/j.devcel.2022.11.015.

Scientists discover a new way of sharing genetic information in a common ocean microbe

Prochlorococcus, the world’s most abundant photosynthetic organism, reveals a gene-transfer mechanism that may be key to its abundance and diversity.

David L. Chandler | MIT News Office
January 5, 2023

From the tropics to the poles, from the sea surface to hundreds of feet below, the world’s oceans are teeming with one of the tiniest of organisms: a type of bacteria called Prochlorococcus, which despite their minute size are collectively responsible for a sizable portion of the oceans’ oxygen production. But the remarkable ability of these diminutive organisms to diversify and adapt to such profoundly different environments has remained something of a mystery.

Now, new research reveals that these tiny bacteria exchange genetic information with one another, even when widely separated, by a previously undocumented mechanism. This enables them to transmit whole blocks of genes, such as those conferring the ability to metabolize a particular kind of nutrient or to defend themselves from viruses, even in regions where their population in the water is relatively sparse.

The findings describe a new class of genetic agents involved in horizontal gene transfer, in which genetic information is passed directly between organisms — whether of the same or different species — through means other than lineal descent. The researchers have dubbed the agents that carry out this transfer “tycheposons,” which are sequences of DNA that can include several entire genes as well as surrounding sequences, and can spontaneously separate out from the surrounding DNA. Then, they can be transported to other organisms by one or another possible carrier system including tiny bubbles known as vesicles that cells can produce from their own membranes.

The research, which included studying hundreds of Prochlorococcus genomes from different ecosystems around the world, as well as lab-grown samples of different variants, and even evolutionary processes carried out and observed in the lab, is reported today in the journal Cell, in a paper by former MIT postdocs Thomas Hackl and Raphaël Laurenceau, visiting postdoc Markus Ankenbrand, Institute Professor Sallie “Penny” Chisholm, and 16 others at MIT and other institutions.

Chisholm, who played a role in the discovery of these ubiquitous organisms in 1988, says of the new findings, “We’re very excited about it because it’s a new horizontal gene-transfer agent for bacteria, and it explains a lot of the patterns that we see in Prochlorococcus in the wild, the incredible diversity.” Now thought to be the world’s most abundant photosynthetic organism, the tiny variants of what are known as cyanobacteria are also the smallest of all photosynthesizers.

Hackl, who is now at the University of Groningen in the Netherlands, says the work began by studying the 623 reported genome sequences of different species of Prochlorococcus from different regions, trying to figure out how they were able to so readily lose or gain particular functions despite their apparent lack of any of the known systems that promote/boost horizontal gene transfer, such as plasmids or viruses known as prophages.

What Hackl, Laurenceau, and Ankenbrand investigated were “islands” of genetic material that seemed to be hotspots of variability and often contained genes that were associated with known key survival processes such as the ability to    assimilate essential, and often limiting, nutrients such as iron, or nitrogen, or phosphates. These islands contained genes that varied enormously between different species, but they always occurred in the same parts of the genome and sometimes were nearly identical even in widely different species — a strong indicator of horizontal transfer.

But the genomes showed none of the usual features associated with what are known as mobile genetic elements, so initially this remained a puzzle. It gradually became apparent that this system of gene transfer and diversification was different from any of the several other mechanisms that have been observed in other organisms, including in humans.

Hackl describes what they found as being something like a genetic LEGO set, with chunks of DNA bundled together in ways that could almost instantly confer the ability to adapt to a particular environment. For example, a species limited by the availability of particular nutrients could acquire genes necessary to enhance the uptake of that nutrient.

The microbes appear to use a variety of mechanisms to transport these tycheposons (a name derived from the name of the Greek goddess Tyche, daughter of Oceanus). One is the use of membrane vesicles, little bubbles pouched off from the surface of a bacterial cell and released with tycheposons inside it. Another is by “hijacking” virus or phage infections and allowing them to carry the tycheposons along with their own infectious particles, called capsids. These are efficient solutions, Hackl says, “because in the open ocean, these cells rarely have cell-to-cell contacts, so it’s difficult for them to exchange genetic information without a vehicle.”

And sure enough, when capsids or vesicles collected from the open ocean were studied, “they’re actually quite enriched” in these genetic elements, Hackl says. The packets of useful genetic coding are “actually swimming around in these extracellular particles and potentially being able to be taken up by other cells.”

Chisholm says that “in the world of genomics, there’s a lot of different types of these elements” — sequences of DNA that are capable of being transferred from one genome to another. However, “this is a new type,” she says. Hackl adds that “it’s a distinct family of mobile genetic elements. It has similarities to others, but no really tight connections to any of them.”

While this study was specific to Prochlorococcus, Hackl says the team believes the phenomenon may be more generalized. They have already found similar genetic elements in other, unrelated marine bacteria, but have not yet analyzed these samples in detail. “Analogous elements have been described in other bacteria, and we now think that they may function similarly,” he says.

“It’s kind of a plug-and-play mechanism, where you can have pieces that you can play around with and make all these different combinations,” he says. “And with the enormous population size of Prochlorococcus, it can play around a lot, and try a lot of different combinations.”

Nathan Ahlgren, an assistant professor of biology at Clark University who was not associated with this research, says “The discovery of tycheposons is important and exciting because it provides a new mechanistic understanding of how Prochlorococcus are able to swap in and out new genes, and thus ecologically important traits. Tycheposons provide a new mechanistic explanation for how it’s done.” He says “they took a creative way to fish out and characterize these new genetic elements ‘hiding’ in the genomes of Prochlorococcus.

He adds that genomic islands, the portions of the genome where these tycheposons were found, “are found in many bacteria, not just marine bacteria, so future work on tycheposons has wider implications for our understanding of the evolution of bacterial genomes.”

The team included researchers at MIT’s Department of Civil and Environmental Engineering, the University of Wuerzburg in Germany, the University of Hawaii at Manoa, Ohio State University, Oxford Nanopore Technologies in California, Bigelow Laboratory for Ocean Sciences in Maine, and Wellesley College. The work was supported by the Simons Foundation, the Gordon and Betty Moore Foundation, the U.S. Department of Energy, and the U.S. National Science Foundation.

Scientists unveil the functional landscape of essential genes

Researchers harness new pooled, image-based screening method to probe the functions of over 5,000 essential genes in human cells.

Nicole Davis | Whitehead Institute
November 21, 2022

A team of scientists at the Whitehead Institute for Biomedical Research and the Broad Institute of MIT and Harvard has systematically evaluated the functions of over 5,000 essential human genes using a novel, pooled, imaged-based screening method. Their analysis harnesses CRISPR-Cas9 to knock out gene activity and forms a first-of-its-kind resource for understanding and visualizing gene function in a wide range of cellular processes with both spatial and temporal resolution. The team’s findings span over 31 million individual cells and include quantitative data on hundreds of different parameters that enable predictions about how genes work and operate together. The new study appears in the Nov. 7 online issue of the journal Cell.

“For my entire career, I’ve wanted to see what happens in cells when the function of an essential gene is eliminated,” says MIT Professor Iain Cheeseman, who is a senior author of the study and a member of Whitehead Institute. “Now, we can do that, not just for one gene but for every single gene that matters for a human cell dividing in a dish, and it’s enormously powerful. The resource we’ve created will benefit not just our own lab, but labs around the world.”

Systematically disrupting the function of essential genes is not a new concept, but conventional methods have been limited by various factors, including cost, feasibility, and the ability to fully eliminate the activity of essential genes. Cheeseman, who is the Herman and Margaret Sokol Professor of Biology at MIT, and his colleagues collaborated with MIT Associate Professor Paul Blainey and his team at the Broad Institute to define and realize this ambitious joint goal. The Broad Institute researchers have pioneered a new genetic screening technology that marries two approaches — large-scale, pooled, genetic screens using CRISPR-Cas9 and imaging of cells to reveal both quantitative and qualitative differences. Moreover, the method is inexpensive compared to other methods and is practiced using commercially available equipment.

“We are proud to show the incredible resolution of cellular processes that are accessible with low-cost imaging assays in partnership with Iain’s lab at the Whitehead Institute,” says Blainey, a senior author of the study, an associate professor in the Department of Biological Engineering at MIT, a member of the Koch Institute for Integrative Cancer Research at MIT, and a core institute member at the Broad Institute. “And it’s clear that this is just the tip of the iceberg for our approach. The ability to relate genetic perturbations based on even more detailed phenotypic readouts is imperative, and now accessible, for many areas of research going forward.”

Cheeseman adds, “The ability to do pooled cell biological screening just fundamentally changes the game. You have two cells sitting next to each other and so your ability to make statistically significant calculations about whether they are the same or not is just so much higher, and you can discern very small differences.”

Cheeseman, Blainey, lead authors Luke Funk and Kuan-Chung Su, and their colleagues evaluated the functions of 5,072 essential genes in a human cell line. They analyzed four markers across the cells in their screen — DNA; the DNA damage response, a key cellular pathway that detects and responds to damaged DNA; and two important structural proteins, actin and tubulin. In addition to their primary screen, the scientists also conducted a smaller, follow-up screen focused on some 200 genes involved in cell division (also called “mitosis”). The genes were identified in their initial screen as playing a clear role in mitosis but had not been previously associated with the process. These data, which are made available via a companion website, provide a resource for other scientists to investigate the functions of genes they are interested in.

“There’s a huge amount of information that we collected on these cells. For example, for the cells’ nucleus, it is not just how brightly stained it is, but how large is it, how round is it, are the edges smooth or bumpy?” says Cheeseman. “A computer really can extract a wealth of spatial information.”

Flowing from this rich, multi-dimensional data, the scientists’ work provides a kind of cell biological “fingerprint” for each gene analyzed in the screen. Using sophisticated computational clustering strategies, the researchers can compare these fingerprints to each other and construct potential regulatory relationships among genes. Because the team’s data confirms multiple relationships that are already known, it can be used to confidently make predictions about genes whose functions and/or interactions with other genes are unknown.

There are a multitude of notable discoveries to emerge from the researchers’ screening data, including a surprising one related to ion channels. Two genes, AQP7 and ATP1A1, were identified for their roles in mitosis, specifically the proper segregation of chromosomes. These genes encode membrane-bound proteins that transport ions into and out of the cell. “In all the years I’ve been working on mitosis, I never imagined ion channels were involved,” says Cheeseman.

He adds, “We’re really just scratching the surface of what can be unearthed from our data. We hope many others will not only benefit from — but also build upon — this resource.”

This work was supported by grants from the U.S. National Institutes of Health as well as support from the Gordon and Betty Moore Foundation, a National Defense Science and Engineering Graduate Fellowship, and a Natural Sciences and Engineering Research Council Fellowship.

Unusual Labmates: How C. elegans Wormed Its Way into Science Stardom
Greta Friar | Whitehead Institute
September 20, 2022

 

Introduction

Michael Stubna, a graduate student in Whitehead Institute Member David Bartel’s lab, peers into his microscope at the Petri dish full of agar gel below. He spots one of his research specimens, a millimeter-long nematode worm known as Caenorhabditis elegans (C. elegans), slithering across the coating of bacteria–the worm’s food source–on the surface of the gel. The worm leaves sinuous tracks in its wake like a skier slaloming down a slope.

 

Michael looks up from the microscope and grabs his worm pick, a metal wire sticking out of a glass tube. He runs the end of the pick through a Bunsen burner flame until the wire glows red, using heat to sanitize the tool. Then he returns his attention to the microscope. He nudges the Petri dish to re-center the worm, and, once the pick has cooled, he coats the tip with some of the sticky bacterial food and uses it to skillfully pluck the worm from the surface of the gel. He puts a fresh dish of agar under the microscope, and presses the pick, with the worm still adhered, to the surface. Almost immediately, the worm sets off, carving fresh tracks into the pristine bacterial lawn.

Michael is cultivating C. elegans in order to use them to study microRNAs, tiny RNA molecules involved in gene regulation. Right now, Michael is the only researcher in the Bartel lab using the worms, but in the wider research world, C. elegans is a popular model organism. At first glance, C. elegans is a rather unassuming animal. Barely large enough to see with the naked eye, in nature the worms reside in soil and decomposing vegetation, feasting on bacteria. Except for their heads, their bodies can only bend up and down, so the animals crawl on their sides. The worms have simple tube bodies and are capable of a limited range of behaviors. Nevertheless, researchers frequently turn to C. elegans to learn about not only their biology, but our own. C. elegans is one of the most intimately understood species in biology—the first animal to have its complete genome sequenced or its neural circuitry completely mapped. How did this simple worm become so well studied and a fixture in laboratories around the world?

Making a model

The species C. elegans was first identified, and used in research, after being found in the soil in Algeria around 1900. However, its popularity as a research model skyrocketed in the 1970s, after biologist Sydney Brenner, then at the Medical Research Council  Laboratory of Molecular Biology and later the founder of the Molecular Sciences Institute, made the case for it as the best new model species for the field of molecular biology. [1], [2]

What makes C. elegans such a good model organism? The worm exists in a “just right” zone of biological complexity: it is complex enough to have many of the features that researchers want to study, but simple enough that those same features can be examined comprehensively. For example, each C. elegans has 302 neurons, which is enough to be a useful model for everything from questions about how brains form, to how they sense and respond to stimuli, to how neuronal pathways give rise to specific behaviors, to how different diseases cause neurodegeneration. At the same time, 302 neurons is a small enough number for researchers to be able to study each individual  neuron and its connections thoroughly. (In comparison, a fruit fly has around 100,000 neurons.)

In the same way, C. elegans has just enough complexity to be used to model other common aspects of animal biology, including muscle function, reproduction, digestion, wound healing, aging, and more. It shares many genes with humans and can even be used to model human disease. For example, researchers have used C. elegans to model neurodegenerative diseases such as Alzheimer’s and Parkinson’s.

Additionally, C. elegans has many advantages as a research subject. The worm’s skin is transparent, so researchers can easily observe and capture images of changes occurring inside of its body down to a cellular or even sub-cellular level.

C. elegans is small, hardy, and easy to rear in the laboratory. Fed a simple diet of bacteria and kept at 20 degrees Celsius, C. elegans will mature from an egg, through four larval stages, to a fertile adult in three days. It can then rapidly reproduce to provide researchers with thousands of specimens. C. elegans live for about three weeks, allowing for quick generational turnover, but if researchers want to keep the worms alive for longer, this is easy to do by putting the worms in stasis.

In nature, it is common to find the worms in a state of suspended animation, in which they can survive for months. [3] During poor environmental conditions, such as when food is scarce, instead of maturing into their usual third larval stage, the worms will enter what is known as a dauer stage, a hardier but inactive larval form. When environmental conditions improve, the worms exit the dauer stage and resume normal development. Researchers can recreate this process in the lab.

If a researcher is going out of town for a few days, they can keep their worms in a refrigerator at 4 degrees Celsius. They also have a simple solution for storing worms long term: freezing them. Worms put in a negative 80-degree Celsius freezer can survive for years and still be recovered. This makes the worms much easier to maintain than other common model organisms, which need constant maintenance to keep them alive, fed, and reproducing.

“With most model organisms, if you go away on vacation you need to find someone to look after your specimens while you are gone,” Stubna says. “That’s not necessary with worms.”

Another advantage of C. elegans is how easy it is to generate large numbers of them. Most C. elegans are self-fertilizing hermaphrodites. They cannot mate with each other, but each worm can generate hundreds of offspring on its own. The self-fertilizing nature of C. elegans generally works in researchers’ favor, as along with making it easy to generate lots of new worms, self-fertilization makes genetic inheritance simpler to predict and manipulate in order to maintain a desired mutation throughout the generations. However, sometimes researchers may want to cross-breed their specimens, for example to combine mutations found in separate adults in an offspring.
Fortunately for researchers, the rare C. elegans worm is male—around .1-.2% in the wild [4] — and males can fertilize hermaphrodites’ eggs, enabling cross-breeding. C. elegans males arise in the wild due to a deviation during sex cell division. Hermaphrodite C. elegans have two X chromosomes. When one of these fails to form correctly, the resulting worm with its one X chromosome will be male, having some key anatomical differences. If researchers need lots of males for their work, they can increase the percentage of a hermaphrodite’s offspring that are male by exposing the worm to heat before it reproduces or by using genetic manipulation. Then, researchers can selectively breed male-heavy populations to further increase the ratio over time.
Researchers have developed a variety of tools and approaches over the years with which to manipulate C. elegans genetically. These have enabled researchers to learn a lot about both the worm’s genes, and genes that it shares with humans and other animals. One useful approach is the development of marker strains.

Getting to know C. elegans inside and out

After Brenner landed on C. elegans as an ideal model, his research group began several ambitious projects to comprehensively understand the worm’s biology. In the following decades, the worm’s anatomy and genome would be detailed in unprecedented detail. The more that researchers learned about the worm, the better a model it became.

Creating a complete cell lineage and neural map

C. elegans is remarkable in that every worm has the same exact number of cells: 959 in the adult hermaphrodite (not counting the cells that will become eggs or sperm). 302 of these cells are neurons. Researchers in Brenner’s group created two first-of-their-kind resources documenting the details of this biology. First, they mapped the worms’ complete cell lineage, recording every cell division that occurs during the worms’ development from fertilized egg to adult. This resource makes it easy for researchers to study how different factors contribute to—or can alter—this development.

Then, the researchers created a wiring diagram, or connectome, of the hermaphrodite worm’s 302 neurons and their thousands of synapses—the junction points where neurons interact. Researchers have used this wiring diagram to identify neurons involved in many different behaviors in C. elegans, as well as to understand how brains form and function across animal species. [5,6] C. elegans was the first, and as of 2021 the only, animal to have had its brain completely mapped.

Capturing the complete genome

In 1998, C. elegans made the news as the first animal to have its complete genome sequenced. The completion of the 15-year-long sequencing project, helmed by the C. elegans Sequencing Consortium, was announced in a special issue of Science. Researchers had previously compiled complete genomes for a variety of single-celled species, but as an animal, C. elegans had a significantly larger genome. The complete genome provided many useful insights into individual genes, and the relationships between genes both within C. elegans and between species. The ambitious project also proved instructive for how to sequence large genomes. In fact, the Human Genome Project helped to fund the sequencing of C. elegans as a stepping stone to the ultimate goal of sequencing the human genome, which was achieved in 2003.

With C. elegans’ genetics, anatomy, and other biology so thoroughly documented, the worms became an even more potent model organism. Researchers now had a wealth of foundational knowledge about the worm that they could use to make and test hypotheses about specific questions.

Worm culture

As the use of C. elegans in science grew, a community formed among the worm’s researchers. The C. elegans community was quick to develop and share resources. The Worm Breeder’s Gazette is a semi-annual newsletter first published in 1975, which shares information of interest to the C. elegans community such as experimental techniques and new findings. The Caenorhabditis Genetics Center (CGC), founded in 1979, is a central repository from which researchers can order thousands of different strains of C. elegans for use in their own research.

From the early years, prominent researchers working with C. elegans believed strongly in sharing data both among researchers and with the public. This openness set the tone for the field of molecular biology more broadly; for example, open data sharing policies around the sequencing of the C. elegans genome encouraged the Human Genome Project to follow suit.

The worm community often had to build its own tools in order to share data on the scales its members desired. One big project was the creation of ACeDB (A C. elegans Database), a database management system capable of storing and displaying many different kinds of biological information about C. elegans, including its complete genome, in a user-friendly way. The current iteration of ACeDB, known as WormBase, contains the annotated genomes of C. elegans and related nematodes, information on every known C. elegans gene and its function, genetic maps, the C. elegans cell lineage and connectome, and much more. The ACeDB software was soon used to create similar databases for other model organisms. Such databases now exist for many different species, making detailed biological data widely available to everyone. These databases are also often used to share the latest research, maintain a consensus around scientific terminology and gene annotation, and provide educational resources on the model organism. Anyone looking for general information on C. elegans can also visit WormBook, an open access, online review of C. elegans biology.

With these resources and others, the C. elegans community fostered a culture of sharing and scientific openness that continues to this day.

A few of the many discoveries and further tool development

C. elegans researchwith its wealth of experimental tools and methods, pre-existing data with which to build and test hypotheses, and a worldwide community happy to share resources, has been the source of many important discoveries over the years. Many of these discoveries have also added to researchers’ toolkits, providing new ways to experiment with C. elegans and other research specimens. A few of these myriad impactful discoveries are highlighted below.

Insights into development and programmed cell death

Brenner and two researchers whom he had mentored, John Sulston, then at the The Wellcome Trust Sanger Institute, and Robert Horvitz, then at the Massachusetts Institute of Technology (MIT), were awarded the first Nobel Prize for work done in C. elegans in 2002. When the researchers were creating the C. elegans cell lineage map, they saw that some cells created during development died off at particular moments, and that this programmed destruction of cells, called apoptosis, was an essential part of creating the adult body. They identified key genes that regulate apoptosis, and their work led to insights into the role of apoptosis in human development, as well as in health and disease. For example, cancer cells are able to avoid apoptosis, and many modern cancer therapies work by reenabling apoptosis of cancer cells.

Andrew Fire at the Stanford University School of Medicine and Craig Mello at the University of Massachusetts Medical School used C. elegans to discover RNA interference (RNAi), a process that cells use to stop genes from being expressed. RNAi became an important research tool after researchers figured out how to tailor RNAi to turn off genes that they are interested in studying in different cells and species. Researchers turn off a gene and see what changes, which helps them figure out the gene’s function. People have also found uses for RNAi in medicine and industry. RNAi is easy to use in C. elegans — researchers can apply it to worms by simply feeding them modified bacteria — so this tool made the worms an even better model for genetics research. Fire and Mellow earned a Nobel Prize for their discovery in 2006.

Introducing a new visual tag

C. elegans also contributed to the development of another popular and powerful research tool, green fluorescent protein (GFP). GFP is a protein first found in jellyfish. It glows green under certain light waves. Martin Chalfie at Columbia University showed in C. elegans that the genetic code for GFP could be added as a tag to genes of interest, and then the products of those genes would glow, providing researchers with a great visual marker of where and when the genes were expressed. Chalfie shared the 2008 Nobel Prize in Chemistry for this work, and researchers now frequently use GFP and similar molecules as visual markers in experiments across species and cell types.

A model for aging

In 1993, Cynthia Kenyon at the University of California, San Francisco and colleagues discovered that mutations to a single gene, daf-2, along with the normal activity of a second gene, daf-16, could more than double the lifespan of C. elegans. Kenyon and others intrigued by this  discovery would go on to use C. elegans to ask questions about the molecular mechanisms governing aging. Researchers have also studied how equivalent genes affect aging in other animals, including humans.

A model for sex determination, reproduction, and development

C. elegans has been used to explore questions related to sex, reproduction, and development. Barbara Meyer, then at MIT, now at the University of California, Berkeley, discovered the mechanism of sex determination in the worm, and has uncovered mechanisms by which gene expression is regulated to compensate an animal having one or two X chromosomes. Other researchers have used C. elegans to make important discoveries about germ cells, the cells that give rise to eggs and sperm. Judith Kimble and John White, then at the MRC Laboratory of Molecular Biology, now at the University of Wisconsin–Madison discovered the first germline stem cell niche in C. elegans, which is the place where animals maintain a pool of stem cells with which to keep producing new germ cells over time. This finding had implications for fertility and regeneration research. Geraldine Seydoux at Johns Hopkins University has used C. elegans to investigate unique features of germ cells, as well as how sperm and egg interact and how the early embryo prepares to form a complex adult body.

Understanding sense of smell

Thanks in large part to C. elegans having such a well-mapped nervous system, the worm has been a common model for researchers studying how animals sense and respond to stimuli in their environments. Cori Bargmann, an alumna of Whitehead Institute, now at Rockefeller University, studies how C. elegans sense and process outside stimuli, how those stimuli can trigger changes in behavior, and how the brain can be rewired to modify behaviors over time. Bargmann’s research has particularly illuminated the worm’s sense of smell. She found the first evidence of a receptor for a specific smell, and her work more broadly shed light on how animals are able to recognize many different types of smells.

A rich history of discovery

This is just a small sampling of the important discoveries that have been made in C. elegans. WormBook has compiled a list of many such achievements, including the discovery of multiple key molecules and pathways present across animals.

Worms at Whitehead Institute

Michael uses C. elegans to study microRNAs.

C. elegans have long played an important role in microRNA research; in fact, microRNAs were first discovered in C. elegansVictor Ambros and colleagues, and Gary Ruvkun and colleagues, published papers describing the first identified microRNA, lin-4, and its target, in 1993. At first, researchers thought that the small gene-regulating molecule might be an oddity. However, in 2000, Ruvkun discovered a second microRNA, and by the next year researchers—including Whitehead Institute Member David Bartel—had identified many more microRNAs in C. elegans, as well as microRNAs in other species. [12] Collectively, this research implied that microRNAs were a common and important regulator of gene expression across species. The field of microRNA research exploded, and microRNAs became the focus of Bartel’s lab.

In spite of the importance of C. elegans in establishing microRNA research, both in the field at large and in the Bartel lab specifically, no one in the lab was using C. elegans as a research model when Michael joined. However, as Michael–who had previous experience working with C. elegans–began to plan his graduate research, he realized that the worm would be the perfect model in which to explore his topic of interest: how microRNAs are regulated. Bartel agreed, and so C. elegans made their triumphant return to Whitehead Institute.

“It’s great to return to C. elegans,” says Bartel, who is also a professor of biology at MIT and a HHMI investigator.  “Michael is working on sets of microRNAs that we discovered over 20 years ago. Since then, we and others have learned a lot about microRNAs, using a variety of research models. It will be fun to see what new things we will learn with C. elegans.

Michael is using C. elegans to better understand how microRNAs, which degrade messenger RNAs, are themselves degraded. In recent years, researchers discovered a surprising mechanism of microRNA degradation: in some instances, when a microRNA pairs with a messenger RNA, instead of this leading to the destruction of the messenger RNA, it leads to the destruction of the microRNA.

“The normal regulatory logic is completely flipped,” Michael says. “This was discovered in mammalian cells, and our lab and others have been working out the mechanism for how this happens.”

The Bartel lab found that a particular gene is necessary for this process. When Michael joined the lab, he wondered whether that gene’s equivalent in C. elegans serves the same role. He found that it does. However, there are differences between how this process works in C. elegans and mammals, in particular in the way that the RNAs pair to trigger destruction of the microRNA. Those differences, and what they reveal about how microRNAs are regulated, are what Michael is studying now.

“What is the underlying principle of what’s required for microRNAs to be degraded through this pathway in worms? It’s not known, and that’s what I’m trying to find out,” Michael says.

Michael hopes that what he learns from the worms will shed light on the logic for how microRNAs are regulated across animal species. This will in turn give researchers a better understanding of how cells are able to so precisely tailor their gene expression. The prospect of deciphering such a central facet of cell biology is exciting for Michael, but it’s just another day for C. elegans. Decades of results have shown that there’s no limit to what these simple worms can be used to discover.

 

New players in an essential pathway to destroy microRNAs

In a study from the lab of Whitehead Institute Member David Bartel, researchers have identified genetic sequences that can lead to the degradation of cellular regulators called microRNAs in the fruit fly Drosophila melanogaster.

Eva Frederick | Whitehead Institute
September 26, 2022

In a study from the lab of Whitehead Institute Member David Bartel, researchers have identified genetic sequences that can lead to the degradation of cellular regulators called microRNAs in the fruit fly Drosophila melanogaster. The findings were published September 22 in Molecular Cell.

“This is an exciting study that paves the way for a deeper understanding of the microRNA degradation pathway,” says Bartel, who is also a professor of biology at the Massachusetts Institute of Technology and investigator with the Howard Hughes Medical Institute. “Finding these ‘trigger’ sequences will allow us to more precisely probe the workings of this pathway in the lab, which is likely critical for flies — and possibly other species — to survive to adulthood.”

In order to produce new proteins, cells transcribe their DNA into messenger RNAs (or mRNAs), which provide information required to make the proteins . When a given mRNA has served its purpose, it’s degraded. The process of degradation is often led by tiny RNA sequences called microRNAs.

In previous work, researchers showed that certain mRNA or non coding RNA transcripts, rather than being degraded by microRNAs, can instead turn the tables on the microRNAs and lead to their destruction through a pathway called target-directed microRNA degradation, or TDMD. “This pathway leads to rapid turnover of certain microRNAs within the cell,” says former Bartel Lab graduate student Elena Kingston.

Kingston wanted to further understand the functions of the TDMD pathway in cells. “I wanted to get at the ‘why,’” she said. “Why are microRNAs regulated in this way, and why does it matter in an organism?”

Previous work on the TDMD pathway was primarily conducted in cultured cells. For the new study, the researchers decided to use the fruit fly Drosophila melanogaster.  A fly model could provide more insight into how the pathway worked in a live organism — including whether or not it had an effect on the organism’s fitness or was essential for survival.

The researchers created a model to study TDMD by using flies with mutations in an essential TDMD pathway gene called Dora (the equivalent human gene is called ZSWIM8, as detailed in this paper). Very few flies with mutations in Dora were seen to make it to adulthood. Most died early in development, suggesting the TDMD pathway was likely important for their embryonic viability.

Putting a finger on the triggers of the TDMD pathway

While microRNAs don’t need many complementary base pairs to bind and regulate their mRNA targets, the opposite is true in the TDMD pathway. In order to work properly, the TDMD pathway needs a highly specific trigger, which can either be a mRNA that codes for proteins, or a non-coding RNA. “What’s unique about a trigger is it has a site that the microRNA can bind to that has a lot of complementarity to the microRNA,” Kingston said.

During the isolation of the early Covid-19 pandemic, Kingston set out to write a program that could pick out probable triggers of microRNA degradation in Drosophila based on their sequencesThe program returned thousands of hits, and the researchers set to work narrowing down which sites were the best candidates to test in flies.

“As soon as we were able to get back into lab [after the lockdown], I took our top 10 or so candidates and tried perturbing them in flies,” she said. “Fortunately for me, about half of them ended up working out.”

These six new triggers more than double the list of known RNA sequences that can direct degradation of microRNAs. To take this finding a step further, the researchers conducted an analysis of what happened to the flies when a trigger was disrupted.

The researchers found that one of the triggers — a long non-coding RNA — plays a role in proper development of the cuticle, or the waterproof outer shell of a fly embryo. “We noticed that when we perturbed this trigger, the cuticles of fly embryos had altered elasticity,” Kingston said. “When we popped the embryos out of their egg shells, we could see these cuticles expand up and bloat.”

Because of the bloated phenotype, Kingston decided to name the long non-coding RNA marge after Aunt Marge, a character in the Harry Potter series. In “Harry Potter and the Prisoner of Azkaban”, Aunt Marge’s taunts lead Harry to accidentally perform magic on her, causing her to inflate and float away.

In the future, Kingston, who has since graduated and begun a career in the biotech industry, hopes researchers will pick up the torch on learning the roles of other TDMD triggers. “We still have several other triggers [from this paper] where there’s no known biological role for them in the fly,” she said. “I think this opens up the field for others to go in and to ask the questions, ‘Where are these triggers acting? What are they doing? And what’s the phenotype when you lose them?’”

Notes

Elena Kingston, Lianne Blodgett and David Bartel. “Endogenous transcripts direct microRNA degradation in Drosophila, and this targeted degradation is required for proper embryonic development.” Molecular Cell, September 22, 2022. DOI: https://doi.org/10.1016/j.molcel.2022.08.029

Biologists glean insight into repetitive protein sequences

A computational analysis reveals that many repetitive sequences are shared across proteins and are similar in species from bacteria to humans.

Anne Trafton | MIT News Office
September 13, 2022

About 70 percent of all human proteins include at least one sequence consisting of a single amino acid repeated many times, with a few other amino acids sprinkled in. These “low-complexity regions” are also found in most other organisms.

The proteins that contain these sequences have many different functions, but MIT biologists have now come up with a way to identify and study them as a unified group. Their technique allows them to analyze similarities and differences between LCRs from different species, and helps them to determine the functions of these sequences and the proteins in which they are found.

Using their technique, the researchers have analyzed all of the proteins found in eight different species, from bacteria to humans. They found that while LCRs can vary between proteins and species, they often share a similar role — helping the protein in which they’re found to join a larger-scale assembly such as the nucleolus, an organelle found in nearly all human cells.

“Instead of looking at specific LCRs and their functions, which might seem separate because they’re involved in different processes, our broader approach allows us to see similarities between their properties, suggesting that maybe the functions of LCRs aren’t so disparate after all,” says Byron Lee, an MIT graduate student.

The researchers also found some differences between LCRs of different species and showed that these species-specific LCR sequences correspond to species-specific functions, such as forming plant cell walls.

Lee and graduate student Nima Jaberi-Lashkari are the lead authors of the study, which appears today in eLife. Eliezer Calo, an assistant professor of biology at MIT, is the senior author of the paper.

Large-scale study

Previous research has revealed that LCRs are involved in a variety of cellular processes, including cell adhesion and DNA binding. These LCRs are often rich in a single amino acid such as alanine, lysine, or glutamic acid.

Finding these sequences and then studying their functions individually is a time-consuming process, so the MIT team decided to use bioinformatics — an approach that uses computational methods to analyze large sets of biological data — to evaluate them as a larger group.

“What we wanted to do is take a step back and instead of looking at individual LCRs, to try to take a look at all of them and to see if we could observe some patterns on a larger scale that might help us figure out what the ones that have assigned functions are doing, and also help us learn a bit about what the ones that don’t have assigned functions are doing,” Jaberi-Lashkari says.

To do that, the researchers used a technique called dotplot matrix, which is a way to visually represent amino acid sequences, to generate images of each protein under study. They then used computational image processing methods to compare thousands of these matrices at the same time.

Using this technique, the researchers were able to categorize LCRs based on which amino acids were most frequently repeated in the LCR. They also grouped LCR-containing proteins by the number of copies of each LCR type found in the protein. Analyzing these traits helped the researchers to learn more about the functions of these LCRs.

As one demonstration, the researchers picked out a human protein, known as RPA43, that has three lysine-rich LCRs. This protein is one of many subunits that make up an enzyme called RNA polymerase 1, which synthesizes ribosomal RNA. The researchers found that the copy number of lysine-rich LCRs is important for helping the protein integrate into the nucleolus, the organelle responsible for synthesizing ribosomes.

Biological assemblies

In a comparison of the proteins found in eight different species, the researchers found that some LCR types are highly conserved between species, meaning that the sequences have changed very little over evolutionary timescales. These sequences tend to be found in proteins and cell structures that are also highly conserved, such as the nucleolus.

“These sequences seem to be important for the assembly of certain parts of the nucleolus,” Lee says. “Some of the principles that are known to be important for higher order assembly seem to be at play because the copy number, which might control how many interactions a protein can make, is important for the protein to integrate into that compartment.”

The researchers also found differences between LCRs seen in two different types of proteins that are involved in nucleolus assembly. They discovered that a nucleolar protein known as TCOF contains many glutamine-rich LCRs that can help scaffold the formation of assemblies, while nucleolar proteins with only a few of these glutamic acid-rich LCRs could be recruited as clients (proteins that interact with the scaffold).

Another structure that appears to have many conserved LCRs is the nuclear speckle, which is found inside the cell nucleus. The researchers also found many similarities between LCRs that are involved in forming larger-scale assemblies such as the extracellular matrix, a network of molecules that provides structural support to cells in plants and animals.

The research team also found examples of structures with LCRs that seem to have diverged between species. For example, plants have distinctive LCR sequences in the proteins that they use to scaffold their cell walls, and these LCRs are not seen in other types of organisms.

The researchers now plan to expand their LCR analysis to additional species.

“There’s so much to explore, because we can expand this map to essentially any species,” Lee says. “That gives us the opportunity and the framework to identify new biological assemblies.”

The research was funded by the National Institute of General Medical Sciences, National Cancer Institute, the Ludwig Center at MIT, a National Institutes of Health Pre-Doctoral Training Grant, and the Pew Charitable Trusts.