Yamashita elected to American Academy of Arts and Sciences for 2023

The prestigious honor society announces more than 250 new members, including MIT Biology Professor Yukiko Yamashita.

MIT News Office
April 24, 2023

Eight MIT faculty members are among more than 250 leaders from academia, the arts, industry, public policy, and research elected to the American Academy of Arts and Sciences, the academy announced April 19.

One of the nation’s most prestigious honorary societies, the academy is also a leading center for independent policy research. Members contribute to academy publications, as well as studies of science and technology policy, energy and global security, social policy and American institutions, the humanities and culture, and education.

Those elected from MIT in 2023 are:

  • Arnaud Costinot, professor of economics;
  • James J. DiCarlo, the Peter de Florez Professor of Brain and Cognitive Sciences and director of the MIT Quest for Intelligence;
  • Piotr Indyk, the Thomas D. and Virginia W. Cabot Professor of Electrical Engineering and Computer Science;
  • Senthil Todadri, professor of physics;
  • Evelyn N. Wang, the Ford Professor of Engineering (on leave) and director of the Department of Energy’s Advanced Research Projects Agency-Energy;
  • Boleslaw Wyslouch, professor of physics and director of the Laboratory for Nuclear Science and Bates Research and Engineering Center;
  • Yukiko Yamashita, professor of biology and core member of the Whitehead Institute; and
  • Wei Zhang, professor of mathematics.

“With the election of these members, the academy is honoring excellence, innovation, and leadership and recognizing a broad array of stellar accomplishments. We hope every new member celebrates this achievement and joins our work advancing the common good,” says David W. Oxtoby, president of the academy.

Since its founding in 1780, the academy has elected leading thinkers from each generation, including George Washington and Benjamin Franklin in the 18th century, Maria Mitchell and Daniel Webster in the 19th century, and Toni Morrison and Albert Einstein in the 20th century. The current membership includes more than 250 Nobel and Pulitzer Prize winners.

The Human Genome Project Turns 20: Here’s How It Altered the World

"It simply changed the way that people thought that biology could be done."

Ed Cara | Gizmodo
April 11, 2023

On April 14 2003, scientists announced the end to one of the most remarkable achievements in history: the first (nearly) complete sequencing of a human genome. It was the culmination of a decade-plus endeavor that involved thousands of scientists across the globe. Many people hoped the accomplishment would change the world for the better.

For the 20-year anniversary of this historic event, we took a look back at the Human Genome Project and its impact. How did it shape science moving forward? How many of the expected goals have been reached since? And what lies ahead for the study of genetics?

A genome is the entire set of genetic information that makes up an organism. This information is packaged into sequences of DNA we call genes, which in humans are spread along 23 pairs of chromosomes. Only a small portion of these genes contain the instructions for coding the many proteins essential for life, but much of the rest is still thought to be important to our functioning. As scientists would eventually confirm, one copy of the human genome has around 3 billion base pairs of DNA. The sheer magnitude of the effort needed to map all this wasn’t lost on the researchers involved in the project, especially given the technology available decades ago.

“There’s been lots of analogies that people have put forward—like us being Lewis and Clark. We didn’t really have a map,” said Richard Gibbs, founder and director of the Baylor College of Medicine Human Genome Sequencing Center in Texas, one of the major institutions involved in the project.

Gibbs and his many colleagues knew they had to make compromises. Despite the advancements in sequencing technology since the official start of the project in 1990, they couldn’t fill in every gap with their current tools. And the first human genome was a composite of several blood donors in the U.S., not a single person. Along the way, private company Celera entered the picture, promising that it would complete a separate genome project using its own techniques even faster. Ultimately, both groups finished ahead of schedule around the same time, with the first draft sequences released in 2000, though Celera announced its success a few months earlier.

Regardless of the victor, the feat certainly did usher in a new era of genetics research—one that has seen great leaps in speed and efficiency since 2003.

“I think the very most important accomplishment in the past 20 years has been the advent of next-generation sequencing. The ability to perform sequencing in a massively parallel way, so that you could do it far more quickly and cheaply,” said Stacey Gabriel, director of the Genomics Platform at the Broad Institute of MIT and Harvard, another major research site involved in the Human Genome Project. “And that has come with all of the associated advancement in our computational abilities, too, to really be able to take that data and analyze it at a massive scale as well.”

The original project cost $2.7 billion, with most of the genome being mapped over a two-year span. Nowadays, the current speed record for sequencing a genome is around five hours (more often, though, it takes weeks), and this past fall, the company Illumina unveiled a machine that it claims will cost as little as $200 per sequence, down from the recently typical $600 cost.

Greg Findlay leads the Genome Function Laboratory at the Francis Crick Institute in the UK. His team is one of many around the world that is building on the work of the original project. They’re currently trying to identify and understand how certain variants in tumor-suppressor genes can raise our risk of cancer.

“So what my lab tries to do is actually understand what variants do to the genome. That is, we want to know exactly which variants cause disease and why they cause disease. Right now, we’re focused on certain genetic changes that lead to cancer, and I think this particular field has really been revolutionized by the Human Genome Project,” Findlay said. “We now know there are many, many different genetic paths by which cells can turn into cancer. And we know this, because we’ve been able to actually sequence the DNA in so many different tumors repeatedly, across all different types of cancer, to see what are the mutations that actually lead to cancer forming.”

Perhaps equally important was the project’s impact on scientific collaboration. The effort directly led to an international agreement meant to ensure open access to DNA sequences. It also made clear that great things could be possible when large groups of scientists worked together, according to Gibbs.

“It simply changed the way that people thought that biology could be done,” he said. “It built a model for team science that was not there before.”

Even at the time, though, the project knowingly left some things unfinished. They had mapped roughly 92% of the genome by 2003, but it would take almost 20 more years for other scientists to track down the remaining 8%. This missing “dark matter” of our genome could very well provide new clues about how humans evolved or our susceptibility to various diseases.

Much of the genetic information collected and analyzed since the project ended has come from white and European populations—a disparity that hampers our ability to truly understand the impact of genetics on everyone’s health. But scientists today are working on bridging that gap through initiatives like the Human Pangenome Project, which will sequence and make available the full genomes of over 300 people intended to represent the breadth of human diversity around the globe.

“There’s genetic variation that exists across all the world’s populations. And if you only use variations from a sliver of the world’s populations, and you try to apply that to everybody else, it just doesn’t work very well, because we all have different backgrounds,” said Lucinda Antonacci-Fulton, one of the project’s coordinators and director of project development & new initiatives at Washington University in St. Louis’s McDonnell Genome Institute. “So the more inclusive you can be, the better off you are in terms of treatments that you want to bring into the clinic.”

As important as genetics research has been, some of the expectations fueled by the Human Genome Project likely were too lofty. In 2000, for instance, President Bill Clinton claimed that the project would “revolutionize the diagnosis, prevention and treatment of most, if not all, human diseases.” While we do continue to discover new gene variants that strongly predict the odds of developing a specific disease or trait, there are countless others that we’re still in the dark about. Elsewhere, it’s become clear that our genome often only plays a small or negligible role in why we get sick or experience something in a particular manner. So although the project has helped unlock some of the mysteries of the world, there are so many more questions out there about why we are the way we are, and our genes are probably not going to provide a neat answer to many of them.

“I think where things are oversold, sometimes, is with this notion that just because the human genome is done, you’re going to be able to read it off for some sort of deterministic answer—where finding genes for disease becomes like falling off a log,” Gabriel notes. “But often human disease, especially the diseases that impact us the most, these are not simple genetic diseases. They’re multifactorial. They’re combinations of your genes, your behavior, exposure to the environment, sometimes just bad luck.”

None of this is to sell short the potential of genetics. Hans Lehrach, a former director at the Max Planck Institute for Molecular Genetics in Germany, was one of the first researchers involved in the Human Genome Project. He’s also one of many scientists who believe that we’ll someday be able to cheaply and easily scan a person’s genome at a moment’s notice and that this information, along with other aspects of our molecular make-up, will help guide the specific drugs or interventions doctors prescribe—a concept known as personalized medicine. Notably, the treatment of some cancers is already influenced by the variants that underlie their growth. Some experts even argue that widespread whole genome sequencing should start as early as birth, and there are already small-scale programs in the U.S., UK, and elsewhere testing out its potential benefits and risks.

“Not knowing about your genome is a bit like crossing the street while closing your eyes because you don’t want to see a bus coming. If we don’t sequence our genomes, the buses keep coming anyway—it just lets us open our eyes and maybe see the kind of danger that we can escape or do something about,” Lehrach said.

The Human Genome Project truly has changed the scientific landscape, but we’re still only at the very beginning of seeing the world that it’s made possible.

Balance between proteins keeps sperm swimming swiftly

Developing sperm cells swap out histones for proteins called protamines to coil DNA tightly enough to fit inside the hydrodynamic shape ideal for the task of swimming swiftly to an egg in order to fertilize it. If the balance of protamines in the sperm is wrong, however, the sperm may become misshapen and die, making the animal infertile. Whitehead Institute Member Yukiko Yamashita and former graduate student Jun Park have discovered why this imbalance causes infertility in the fruit fly.

Greta Friar | Whitehead Institute
April 10, 2023

Sperm must swim swiftly to an egg in order to fertilize it, and so they have evolved hydrodynamic shapes. Most of the space in the head of sperm cells is taken up by the DNA they carry, so the cells coil up their DNA super tightly to stay small and streamlined. In most cell types, DNA is coiled around proteins called histones. These do not package DNA tightly enough for sperm, so when a sperm cell is developing, it swaps out histones for another type of protein called protamines that coil DNA very tightly.

Many species, including humans, mice, and flies, have multiple types of protamines. If the balance between the different types is wrong, then the sperm’s DNA may not be packaged correctly and it may become misshapen and die, making the animal infertile. Whitehead Institute Member Yukiko Yamashita and former graduate student Jun Park have discovered why this imbalance causes infertility in the fruit fly (Drosophila melanogaster). The finding, published in the Proceedings of the National Academy of Sciences on April 10, showed a mechanism that balances different types of protamines to ensure male fertility.

Mst77F is a major fruit fly protamine. Yamashita and Park determined that the fruit fly protamine Mst77Y, which is related to Mst77F, can interfere with the function of Mst77F. Fruit flies usually make a lot of Mst77F and a little of Mst77Y. The researchers found that when expression of the Mst77Y gene is too high, especially when expression of Mst77F is low, it disrupts the process of DNA packaging, leading to infertility.

How does Mst77Y interfere with Mst77F? The researchers discovered that this is because the Mst77Y gene makes faulty protamines. There are multiple copies of Mst77Y on the fly’s Y chromosome. They likely evolved from a copy of Mst77F, which is not on a sex chromosome. However, the different versions of Mst77Y have lost or altered parts that they need in order to function, so unlike the Mst77F protamine, Mst77Y protamines likely cannot coil DNA tightly around themselves. In spite of the fact that the Mst77Y protamines do not work correctly, they are dominant: when they are present, the sperm cell will use them over the functional Mst77F protamines.

“Mst77Y is a half-broken tool,” says Yamashita, who is also a professor of biology at the Massachusetts Institute of Technology and a Howard Hughes Medical Institute investigator. “It is able to take the place of the working tool, Mst77F, but not to do its job, so when too much Mst77Y is present, the sperm cell does not have enough working tools in place to compact its DNA.”

The researchers also figured out how sperm cells keep expression of Mst77F high and Mst77Y low: with the help of a protein called Modulo. In order for an RNA read from a gene to be made into a protein, it needs to have a tail added to it made up of a string of adenines—one of the four building blocks that make up RNA. Modulo makes sure that the cell preferentially adds this tail to the RNA coding for Mst77F. Although Yamashita and Park did not determine the exact mechanism by which Modulo ensures this preferential treatment, they did observe that Modulo and the Mst77F RNA group together in the same part of the cell, the nucleolus, whereas Mst77Y does not.

Altogether, these findings explain why and how fruit fly sperm cells carefully balance the levels of these two protamines. However, the research raises the question, what are sperm cells using the non-functional Mst77Y protamines for? Yamashita and Park can only speculate, but the answer may have to do with their observation that high levels of Mst77Y killed off more X-chromosome bearing sperm than Y-chromosome bearing sperm. Past research has suggested that protamines may be involved in a process called meiotic drive, which animals can use to skew the sex ratio of their offspring. This new work is not only consistent with that hypothesis, but provides a possible mechanism to explain how protamines contribute. The researchers note that they did not see a strong effect on the sex ratio of offspring in this experiment, but hope that this work could set the stage to understand the role of non-functional protamines in meiotic drive.

“At the cell level, we were able to show that there’s some basis for this protamine to be involved in biasing whether X or Y chromosome bearing sperm survive,” Park says. “An interesting next question would be to see if there are certain conditions in which this mechanism more clearly acts as a driver at the level of offspring’s sex ratio.”

Notes

Park, Jun I., George W. Bell, and Yukiko M. Yamashita. 2023. “Derepression of Y-linked multicopy protamine-like genes interferes with sperm nuclear compaction in D. melanogaster,” PNAS 120 (16). https://www.pnas.org/doi/10.1073/pnas.2220576120

Not so inactive X chromosome

Whitehead Institute Member David Page has spent his career understanding how the differences between X and Y contribute to these sex differences, but a recent project is taking his lab in a new direction: understanding how the differences between X chromosomes contribute to sex differences.

Greta Friar | Whitehead Institute
February 7, 2023

Nearly every cell in our body contains pairs of each of our chromosomes, and these pairs are identical in all but one case: that of our sex chromosomes. Males typically have one X and one Y sex chromosome, while females typically have two X chromosomes. In recent years, research has suggested that these different chromosomes can influence far more than sex determination. Gene expression from the sex chromosomes appears to contribute to sex differences in health and disease, which males and females experience in everything from the incidence of getting certain diseases, to the symptoms of diseases, to responses to drugs, and more. For example, women are more likely to develop autoimmune disorders, while men are more likely to develop heart conditions.

Whitehead Institute Member David Page has spent his career understanding how the differences between X and Y contribute to these sex differences, but a recent project is taking his lab in a new direction: understanding how the differences between X chromosomes contribute to sex differences. Although females’ pair of X chromosomes contain the same genes, they have different patterns of gene expression. New research from Page and postdoc Adrianna San Roman reveals just how different the two types of X chromosomes are. The findings, published in the journal Cell Genomics on February 8, show that one type of X chromosome, known as the inactive X chromosome, can modulate the gene expression of the other type of X chromosome, known as the active X chromosome. Their work indicates that inactive X chromosomes have underappreciated roles in gene regulation and, most likely, in sex differences in health and disease.

Difference rooted in history

Females’ two X chromosomes have different gene expression activity because of the sex chromosomes’ evolutionary history. The X and Y sex chromosomes evolved from a pair of identical non-sex chromosomes. Because of this ancestry, the sex chromosomes still contain genes that are important outside of regulating sex differences, such as genes that contribute to our immune system or regulate gene expression throughout the body. However, over time the Y chromosome shrank and lost most of its genes. Researchers think that in order to make up for the loss of necessary genes on the Y, expression of the corresponding genes on the X chromosome increased. This ensured that males still had the necessary levels of gene expression from their sex chromosomes, but now females, with two copies of X both working overtime, had levels of gene expression that were too high. To solve this problem, our bodies developed a process called X chromosome inactivation, by which the majority of genes on all but one copy of the X chromosome in each cell are silenced, or turned off. This means that everyone, male and female alike, has one copy of the X chromosome working at full strength–the active X chromosome. In males, the active X chromosome is paired with a Y chromosome, and in females, it is paired with a so-called inactive X chromosome, on which most of the genes are turned off.

In spite of the evolution of X chromosome inactivation, some percentage of genes on the inactive X chromosome are still expressed, such as genes that have an active counterpart on the Y chromosome. Previous research has indicated that about a quarter of the genes on the inactive X are, in fact, active, so researchers have long been aware that the chromosome is not completely silent. However, it’s still often painted as a passive copy playing backup for its more active partner. San Roman’s work shows that the inactive X chromosome’s gene expression is much more potent and complex than that.

A spectrum of sex chromosomes

In order to understand the inactive X chromosome’s contributions to gene expression, San Roman and colleagues in the Page lab collected blood and skin samples from people born with unusual combinations of sex chromosomes—everything from X0 (one X chromosome) to XXXXY. People with these different sets of chromosomes often have health issues; for example, X0 females have Turner syndrome, which can cause heart defects, hearing impairment, and more; and XXY males have Klinefelter syndrome, which can cause infertility, weak muscles, and more. Page and San Roman hope their research could provide useful insights into these health issues as well as into sex differences between XY males and XX females.

In people with more than one X chromosome, every X but one is an inactive X. The researchers graphed sex chromosome gene expression, measuring the change in expression level of each gene with the addition of each inactive X, for people with anything from zero to three inactive X chromosomes, as well as different numbers of Y chromosomes. They also looked at the relative contribution to overall expression from the active versus inactive X chromosomes. One might expect the graphs they made to be relatively straightforward: for genes that are turned off on the inactive X chromosome, the gene expression level would not change at all as the number of copies of the inactive X increased. For genes that are turned on, the gene expression level would double with two X chromosomes, triple with three X chromosomes, and so on. When the researchers looked at chromosomes other than X with extra copies—namely, Y and chromosome 21—this is essentially the pattern they observed. Gene expression from additional X chromosomes, however, was not so straightforward.

Each additional inactive X chromosome changes gene expression by the same amount. However, the researchers found a surprising diversity in expression levels across X chromosome genes. The presence of each additional inactive X might increase one gene’s expression by 20 percent and another’s by 70 percent. Then the results grew more surprising: for some genes, the addition of an inactive X decreased their expression. For some genes that are only expressed on the active X chromosome, and completely silent on the inactive X, additional inactive X chromosomes nonetheless changed their expression level.

These discrepancies led the researchers to a startling finding. The X chromosomes do not function independently of each other. Instead, the inactive X chromosome can modulate expression of genes on the active X chromosome. In other words, some genes on the inactive X chromosome regulate genes on the active X chromosome, dialing their expression up or down. Altogether, the researchers found that 38% of the X chromosome genes in the two cell types that they tested are affected by the presence of inactive X chromosomes, either because the genes are expressed on the inactive X, or because the inactive X regulates their expression on the active X, or through some combination of the two mechanisms.

These findings show that the inactive X plays a much more active role in gene expression and regulation than was previously appreciated. Rather than just playing second fiddle to the active X chromosome, the inactive X is sometimes harmonizing with and sometimes even conducting its partner.

Rethinking the role of the inactive X in health and disease

Page and San Roman hope that their findings will help refocus research into sex differences. Previous research into the mechanisms behind these differences has focused on the effects of having X versus Y chromosomes. Page and San Roman’s work show that researchers also need to consider how the presence (in females) or absence (in males) of an inactive X chromosome contributes to sex differences.

“Everybody on the planet carries one active X chromosome, so that first X chromosome really does not contribute, we think, to differences between males and females,” says Page, who is also a professor of biology at the Massachusetts Institute of Technology and Investigator with the Howard Hughes Medical Institute. “If we transition from saying that females are XX and males are XY, to saying that females are Xi [have an inactive X] and males are Y, that really focuses the question.”

Page lab researchers have already begun using their findings to identify X chromosome genes that are likely to be important for sex differences in health and disease. From their list of genes that change in expression based on the presence of an inactive X, the researchers narrowed in on a top ten list of genes that need to maintain a specific expression level or else there will be severe negative consequences. These genes are also likely to be responsible for causing the health issues associated with different atypical sex chromosome compositions, because changes in their expression level are most likely to have strong effects on cells.

“This is a new way of thinking about how the X chromosome is expressed and how it might be impacting our biology,” San Roman says. “This top ten list will be really interesting to consider in the future in terms of how the level of expression of these genes affects cells and tissues in very fundamental ways.”

Notes

Citation:

Adrianna K. San Roman, Alexander K. Godfrey, Helen Skaletsky, Daniel W. Bellott, Abigail F. Groff, Hannah L. Harris, Laura V. Blanton, Jennifer F. Hughes, Laura Brown, Sidaly Phou, Ashley Buscetta, Paul Kruszka, Nicole Banks, Amalia Dutra, Evgenia Pak, Patricia C. Lasutschinkow, Colleen Keen, Shanlee M. Davis, Nicole R. Tartaglia, Carole Samango-Sprouse, Maximilian Muenke, and David C. Page. (2023). The human inactive X chromosome modulates expression of the active X chromosome. Cell Genomics. https://doi.org/10.1016/j.xgen.2023.100259

Daniel Lew

Education

  • Graduate: PhD, 1990, Rockefeller University
  • Undergraduate: BA, 1984, Genetics, Cambridge University

Research Summary

Different cells take on an astonishing variety of shapes, which are often critical to be able to perform specialized cell functions like absorbing nutrients or contracting muscles. We study how different cell shapes arise and how cells control the spatial distribution of their internal constituents. We take advantage of the tractability of fungal model systems, and address these questions using approaches from cell biology, genetics, and computational biology to understand molecular mechanisms. 

Honors and Awards

  • Fellow, American Academy of Microbiology, 2008
  • Fellow, American Association for the Advancement of Science, 2010
  • Duke Equity, Diversity, and Inclusion Award, 2019
Enzyme “atlas” helps researchers decipher cellular pathways

Biologists have mapped out more than 300 protein kinases and their targets, which they hope could yield new leads for cancer drugs.

Anne Trafton | MIT News Office
January 11, 2023

One of the most important classes of human enzymes are protein kinases — signaling molecules that regulate nearly all cellular activities, including growth, cell division, and metabolism. Dysfunction in these cellular pathways can lead to a variety of diseases, particularly cancer.

Identifying the protein kinases involved in cellular dysfunction and cancer development could yield many new drug targets, but for the vast majority of these kinases, scientists don’t have a clear picture of which cellular pathways they are involved in, or what their substrates are.

“We have a lot of sequencing data for cancer genomes, but what we’re missing is the large-scale study of signaling pathway and protein kinase activation states in cancer. If we had that information, we would have a much better idea of how to drug particular tumors,” says Michael Yaffe, who is a David H. Koch Professor of Science at MIT, the director of the MIT Center for Precision Cancer Medicine, a member of MIT’s Koch Institute for Integrative Cancer Research, and one of the senior authors of the new study.

Yaffe and other researchers have now created a comprehensive atlas of more than 300 of the protein kinases found in human cells, and identified which proteins they likely target and control. This information could help scientists decipher many cellular signaling pathways, and help them to discover what happens to those pathways when cells become cancerous or are treated with specific drugs.

Lewis Cantley, a professor of cell biology at Harvard Medical School and Dana Farber Cancer Institute, and Benjamin Turk, an associate professor of pharmacology at Yale School of Medicine, are also senior authors of the paper, which appears today in Nature. The paper’s lead authors are Jared Johnson, an instructor in pharmacology at Weill Cornell Medical College, and Tomer Yaron, a graduate student at Weill Cornell Medical College.

“A Rosetta stone”

The human genome includes more than 500 protein kinases, which activate or deactivate other proteins by tagging them with a chemical modification known as a phosphate group. For most of these kinases, the proteins they target are unknown, although research into kinases such as MEK and RAF, which are both involved in cellular pathways that control growth, has led to new cancer drugs that inhibit those kinases.

To identify additional pathways that are dysregulated in cancer cells, researchers rely on phosphoproteomics using mass spectrometry — a technique that separates molecules based on their mass and charge — to discover proteins that are more highly phosphorylated in cancer cells or healthy cells. However, until now, there has been no easy way to interrogate the mass spectrometry data to determine which protein kinases are responsible for phosphorylating those proteins. Because of that, it has remained unknown how those proteins are regulated or misregulated in disease.

“For most of the phosphopeptides that are measured, we don’t know where they fit in a signaling pathway. We don’t have a Rosetta stone that you could use to look at these peptides and say, this is the pathway that the data is telling us about,” Yaffe says. “The reason for this is that for most protein kinases, we don’t know what their substrates are.”

Twenty-five years ago, while a postdoc in Cantley’s lab, Yaffe began studying the role of protein kinases in signaling pathways. Turk joined the lab shortly after, and the three have since spent decades studying these enzymes in their own research groups.

“This is a collaboration that began when Ben and I were in Lew’s lab 25 years ago, and now it’s all finally really coming together, driven in large part by what the lead authors, Jared and Tomer, did,” Yaffe says.

In this study, the researchers analyzed two classes of kinases — serine kinases and threonine kinases, which make up about 85 percent of the protein kinases in the human body — based on what type of structural motif they put phosphate groups onto.

Working with a library of peptides that Cantley and Turk had previously created to search for motifs that kinases interact with, the researchers measured how the peptides interacted with all 303 of the known serine and threonine kinases. Using a computational model to analyze the interactions they observed, the researchers were able to identify the kinases capable of phosphorylating every one of the 90,000 known phosphorylation sites that have been reported in human cells, for those two classes of kinases.

To their surprise, the researchers found that many kinases with very different amino acid sequences have evolved to bind and phosphorylate the same motifs on their substrates. They also showed that about half of the kinases they studied target one of three major classes of motifs, while the remaining half are specific to one of about a dozen smaller classes.

Decoding networks

This new kinase atlas can help researchers identify signaling pathways that differ between normal and cancerous cells, or between treated and untreated cancer cells, Yaffe says.

“This atlas of kinase motifs now lets us decode signaling networks,” he says. “We can look at all those phosphorylated peptides, and we can map them back onto a specific kinase.”

To demonstrate this approach, the researchers analyzed cells treated with an anticancer drug that inhibits a kinase called Plk1, which regulates cell division. When they analyzed the expression of phosphorylated proteins, they found that many of those affected were controlled by Plk1, as they expected. To their surprise, they also discovered that this treatment increased the activity of two kinases that are involved in the cellular response to DNA damage.

Yaffe’s lab is now interested in using this atlas to try to find other dysfunctional signaling pathways that drive cancer development, particularly in certain types of cancer for which no genetic drivers have been found.

“We can now use phosphoproteomics to say, maybe in this patient’s tumor, these pathways are upregulated or these pathways are downregulated,” he says. “It’s likely to identify signaling pathways that drive cancer in conditions where it isn’t obvious what the genetics that drives the cancer are.”

The research was funded by the Leukemia and Lymphoma Society, the National Institutes of Health, Cancer Research UK, the Brain Tumour Charity, the Charles and Marjorie Holloway foundation, the MIT Center for Precision Cancer Medicine, and the Koch Institute Support (core) grant from the National Cancer Institute.

The molecules behind metastasis
Greta Friar | Whitehead Institute
January 4, 2023

Many cancer cells never leave their original tumors. Some cancer cells evolve the ability to migrate to other tissues, but once there cannot manage to form new tumors, and so remain dormant. The deadliest cancer cells are those that can not only migrate to, but also thrive and multiply in distant tissues. These metastatic cancer cells are responsible for most of the deaths associated with cancer. Understanding what enables some cancer cells to metastasize—to spread and form new tumors—is an important goal for researchers, as it will help them develop therapies to prevent or reverse those deadly occurrences.

Past research from Whitehead Institute Member Robert Weinberg and others suggests that cancer cells are best able to form metastatic tumors when the cells are in a particular state called the quasi-mesenchymal (qM) state. New research from Weinberg and Arthur Lambert, once a postdoc in Weinberg’s lab and now an associate director of translational medicine at AstraZeneca, has identified two gene-regulating molecules as important for keeping cancer cells in the qM state. The research, published in the journal Developmental Cell on December 19, shows that these molecules, ΔNp63 and p73, enable breast cancer cells to form new tumors in mice, and illuminates important aspects of how they do so.

Most potent in the middle

Cells enter the qM state by undergoing the epithelial-mesenchymal transition (EMT), a developmental process that can be co-opted by cancer cells. In the EMT, cells transition from an epithelial state through a spectrum of more mesenchymal states, which allows them to become more mobile and aggressive. Cells in the qM state have only transitioned partway through the EMT, becoming more, but not fully, mesenchymal. This middle ground is perfect for metastasis, whereas cells on either end of the spectrum—cells that are excessively epithelial or excessively mesenchymal—lose their metastatic abilities.

Lambert and colleagues wanted to understand more about how cancer stem cells, which can seed metastases and recurrent tumors, remain in a metastasis-prone qM state. They analyzed how gene activity was regulated in those cells and identified two transcription factors—molecules that influence the activity of target genes—as important. One of the transcription factors, ΔNp63, appeared to most directly control cancer stem cells’ ability to maintain a qM state. The other molecule, p73, seemed to have a similar role because it can activate ΔNp63. When either transcription factor was inactivated, the cancer stem cells transitioned to the far end of the EMT spectrum and so were unable to metastasize.

Next, the researchers looked at what genes ΔNp63 regulates in cancer stem cells. They expected to find a pattern of gene regulation resembling what they would see in healthy breast stem cells. Instead they found a pattern closely resembling what one would see in cells involved in wound healing and regeneration. Notably, ΔNp63 stimulates EGFR signaling, which is used in wound healing to promote rapid multiplication of cells.

“Although this is not what we expected to see, it makes a lot of sense because the process of metastasis requires active proliferation,” Lambert says. “Metastatic cancer cells need both the properties of stem cells—such as the ability to self-renew and differentiate into different cell types—and the ability to multiply their numbers to grow new tumors.”

This finding may help to explain why qM cells are so uniquely good at metastasizing. Only in the qM state can the cells strongly stimulate EGFR signaling and so promote their own proliferation.

“This work gives us some mechanistic understanding of what it is about the quasi-mesenchymal state that drives metastatic tumor growth,” says Weinberg, who is also the Daniel K. Ludwig Professor for Cancer Research at the Massachusetts Institute of Technology.

The researchers hope that these insights could eventually contribute to therapies that prevent metastasis. They also hope to pursue further research into the role of ΔNp63. For example, this work illuminated a possible connection between ΔNp63 and the activation of dormant cancer cells, the cells that travel to new tissues but then cannot proliferate after they arrive there. Such dormant cells are viewed as ticking time bombs, as at any point they may reawaken. Lambert hopes that further research may reveal new insights into what causes dormant cancer cells to eventually gain the ability to grow tumors, adding to our understanding of the mechanisms of metastatic cancer.

Notes

Arthur W. Lambert, Christopher Fiore, Yogesh Chutake, Elisha R. Verhaar, Patrick C. Strasser, Mei Wei Chen, Daneyal Farouq, Sunny Das, Xin Li, Elinor Ng Eaton, Yun Zhang, Joana Liu Donaher, Ian Engstrom, Ferenc Reinhardt, Bingbing Yuan, Sumeet Gupta, Bruce Wollison, Matthew Eaton, Brian Bierie, John Carulli, Eric R. Olson, Matthew G. Guenther, Robert A. Weinberg. “ΔNp63/p73 drive metastatic colonization by controlling a regenerative epithelial stem cell program in quasi-mesenchymal cancer stem cells.” Developmental Cell, Volume 57, Issue 24,
2022, 2714-2730.e8, https://doi.org/10.1016/j.devcel.2022.11.015.

Scientists discover a new way of sharing genetic information in a common ocean microbe

Prochlorococcus, the world’s most abundant photosynthetic organism, reveals a gene-transfer mechanism that may be key to its abundance and diversity.

David L. Chandler | MIT News Office
January 5, 2023

From the tropics to the poles, from the sea surface to hundreds of feet below, the world’s oceans are teeming with one of the tiniest of organisms: a type of bacteria called Prochlorococcus, which despite their minute size are collectively responsible for a sizable portion of the oceans’ oxygen production. But the remarkable ability of these diminutive organisms to diversify and adapt to such profoundly different environments has remained something of a mystery.

Now, new research reveals that these tiny bacteria exchange genetic information with one another, even when widely separated, by a previously undocumented mechanism. This enables them to transmit whole blocks of genes, such as those conferring the ability to metabolize a particular kind of nutrient or to defend themselves from viruses, even in regions where their population in the water is relatively sparse.

The findings describe a new class of genetic agents involved in horizontal gene transfer, in which genetic information is passed directly between organisms — whether of the same or different species — through means other than lineal descent. The researchers have dubbed the agents that carry out this transfer “tycheposons,” which are sequences of DNA that can include several entire genes as well as surrounding sequences, and can spontaneously separate out from the surrounding DNA. Then, they can be transported to other organisms by one or another possible carrier system including tiny bubbles known as vesicles that cells can produce from their own membranes.

The research, which included studying hundreds of Prochlorococcus genomes from different ecosystems around the world, as well as lab-grown samples of different variants, and even evolutionary processes carried out and observed in the lab, is reported today in the journal Cell, in a paper by former MIT postdocs Thomas Hackl and Raphaël Laurenceau, visiting postdoc Markus Ankenbrand, Institute Professor Sallie “Penny” Chisholm, and 16 others at MIT and other institutions.

Chisholm, who played a role in the discovery of these ubiquitous organisms in 1988, says of the new findings, “We’re very excited about it because it’s a new horizontal gene-transfer agent for bacteria, and it explains a lot of the patterns that we see in Prochlorococcus in the wild, the incredible diversity.” Now thought to be the world’s most abundant photosynthetic organism, the tiny variants of what are known as cyanobacteria are also the smallest of all photosynthesizers.

Hackl, who is now at the University of Groningen in the Netherlands, says the work began by studying the 623 reported genome sequences of different species of Prochlorococcus from different regions, trying to figure out how they were able to so readily lose or gain particular functions despite their apparent lack of any of the known systems that promote/boost horizontal gene transfer, such as plasmids or viruses known as prophages.

What Hackl, Laurenceau, and Ankenbrand investigated were “islands” of genetic material that seemed to be hotspots of variability and often contained genes that were associated with known key survival processes such as the ability to    assimilate essential, and often limiting, nutrients such as iron, or nitrogen, or phosphates. These islands contained genes that varied enormously between different species, but they always occurred in the same parts of the genome and sometimes were nearly identical even in widely different species — a strong indicator of horizontal transfer.

But the genomes showed none of the usual features associated with what are known as mobile genetic elements, so initially this remained a puzzle. It gradually became apparent that this system of gene transfer and diversification was different from any of the several other mechanisms that have been observed in other organisms, including in humans.

Hackl describes what they found as being something like a genetic LEGO set, with chunks of DNA bundled together in ways that could almost instantly confer the ability to adapt to a particular environment. For example, a species limited by the availability of particular nutrients could acquire genes necessary to enhance the uptake of that nutrient.

The microbes appear to use a variety of mechanisms to transport these tycheposons (a name derived from the name of the Greek goddess Tyche, daughter of Oceanus). One is the use of membrane vesicles, little bubbles pouched off from the surface of a bacterial cell and released with tycheposons inside it. Another is by “hijacking” virus or phage infections and allowing them to carry the tycheposons along with their own infectious particles, called capsids. These are efficient solutions, Hackl says, “because in the open ocean, these cells rarely have cell-to-cell contacts, so it’s difficult for them to exchange genetic information without a vehicle.”

And sure enough, when capsids or vesicles collected from the open ocean were studied, “they’re actually quite enriched” in these genetic elements, Hackl says. The packets of useful genetic coding are “actually swimming around in these extracellular particles and potentially being able to be taken up by other cells.”

Chisholm says that “in the world of genomics, there’s a lot of different types of these elements” — sequences of DNA that are capable of being transferred from one genome to another. However, “this is a new type,” she says. Hackl adds that “it’s a distinct family of mobile genetic elements. It has similarities to others, but no really tight connections to any of them.”

While this study was specific to Prochlorococcus, Hackl says the team believes the phenomenon may be more generalized. They have already found similar genetic elements in other, unrelated marine bacteria, but have not yet analyzed these samples in detail. “Analogous elements have been described in other bacteria, and we now think that they may function similarly,” he says.

“It’s kind of a plug-and-play mechanism, where you can have pieces that you can play around with and make all these different combinations,” he says. “And with the enormous population size of Prochlorococcus, it can play around a lot, and try a lot of different combinations.”

Nathan Ahlgren, an assistant professor of biology at Clark University who was not associated with this research, says “The discovery of tycheposons is important and exciting because it provides a new mechanistic understanding of how Prochlorococcus are able to swap in and out new genes, and thus ecologically important traits. Tycheposons provide a new mechanistic explanation for how it’s done.” He says “they took a creative way to fish out and characterize these new genetic elements ‘hiding’ in the genomes of Prochlorococcus.

He adds that genomic islands, the portions of the genome where these tycheposons were found, “are found in many bacteria, not just marine bacteria, so future work on tycheposons has wider implications for our understanding of the evolution of bacterial genomes.”

The team included researchers at MIT’s Department of Civil and Environmental Engineering, the University of Wuerzburg in Germany, the University of Hawaii at Manoa, Ohio State University, Oxford Nanopore Technologies in California, Bigelow Laboratory for Ocean Sciences in Maine, and Wellesley College. The work was supported by the Simons Foundation, the Gordon and Betty Moore Foundation, the U.S. Department of Energy, and the U.S. National Science Foundation.

Scientists unveil the functional landscape of essential genes

Researchers harness new pooled, image-based screening method to probe the functions of over 5,000 essential genes in human cells.

Nicole Davis | Whitehead Institute
November 21, 2022

A team of scientists at the Whitehead Institute for Biomedical Research and the Broad Institute of MIT and Harvard has systematically evaluated the functions of over 5,000 essential human genes using a novel, pooled, imaged-based screening method. Their analysis harnesses CRISPR-Cas9 to knock out gene activity and forms a first-of-its-kind resource for understanding and visualizing gene function in a wide range of cellular processes with both spatial and temporal resolution. The team’s findings span over 31 million individual cells and include quantitative data on hundreds of different parameters that enable predictions about how genes work and operate together. The new study appears in the Nov. 7 online issue of the journal Cell.

“For my entire career, I’ve wanted to see what happens in cells when the function of an essential gene is eliminated,” says MIT Professor Iain Cheeseman, who is a senior author of the study and a member of Whitehead Institute. “Now, we can do that, not just for one gene but for every single gene that matters for a human cell dividing in a dish, and it’s enormously powerful. The resource we’ve created will benefit not just our own lab, but labs around the world.”

Systematically disrupting the function of essential genes is not a new concept, but conventional methods have been limited by various factors, including cost, feasibility, and the ability to fully eliminate the activity of essential genes. Cheeseman, who is the Herman and Margaret Sokol Professor of Biology at MIT, and his colleagues collaborated with MIT Associate Professor Paul Blainey and his team at the Broad Institute to define and realize this ambitious joint goal. The Broad Institute researchers have pioneered a new genetic screening technology that marries two approaches — large-scale, pooled, genetic screens using CRISPR-Cas9 and imaging of cells to reveal both quantitative and qualitative differences. Moreover, the method is inexpensive compared to other methods and is practiced using commercially available equipment.

“We are proud to show the incredible resolution of cellular processes that are accessible with low-cost imaging assays in partnership with Iain’s lab at the Whitehead Institute,” says Blainey, a senior author of the study, an associate professor in the Department of Biological Engineering at MIT, a member of the Koch Institute for Integrative Cancer Research at MIT, and a core institute member at the Broad Institute. “And it’s clear that this is just the tip of the iceberg for our approach. The ability to relate genetic perturbations based on even more detailed phenotypic readouts is imperative, and now accessible, for many areas of research going forward.”

Cheeseman adds, “The ability to do pooled cell biological screening just fundamentally changes the game. You have two cells sitting next to each other and so your ability to make statistically significant calculations about whether they are the same or not is just so much higher, and you can discern very small differences.”

Cheeseman, Blainey, lead authors Luke Funk and Kuan-Chung Su, and their colleagues evaluated the functions of 5,072 essential genes in a human cell line. They analyzed four markers across the cells in their screen — DNA; the DNA damage response, a key cellular pathway that detects and responds to damaged DNA; and two important structural proteins, actin and tubulin. In addition to their primary screen, the scientists also conducted a smaller, follow-up screen focused on some 200 genes involved in cell division (also called “mitosis”). The genes were identified in their initial screen as playing a clear role in mitosis but had not been previously associated with the process. These data, which are made available via a companion website, provide a resource for other scientists to investigate the functions of genes they are interested in.

“There’s a huge amount of information that we collected on these cells. For example, for the cells’ nucleus, it is not just how brightly stained it is, but how large is it, how round is it, are the edges smooth or bumpy?” says Cheeseman. “A computer really can extract a wealth of spatial information.”

Flowing from this rich, multi-dimensional data, the scientists’ work provides a kind of cell biological “fingerprint” for each gene analyzed in the screen. Using sophisticated computational clustering strategies, the researchers can compare these fingerprints to each other and construct potential regulatory relationships among genes. Because the team’s data confirms multiple relationships that are already known, it can be used to confidently make predictions about genes whose functions and/or interactions with other genes are unknown.

There are a multitude of notable discoveries to emerge from the researchers’ screening data, including a surprising one related to ion channels. Two genes, AQP7 and ATP1A1, were identified for their roles in mitosis, specifically the proper segregation of chromosomes. These genes encode membrane-bound proteins that transport ions into and out of the cell. “In all the years I’ve been working on mitosis, I never imagined ion channels were involved,” says Cheeseman.

He adds, “We’re really just scratching the surface of what can be unearthed from our data. We hope many others will not only benefit from — but also build upon — this resource.”

This work was supported by grants from the U.S. National Institutes of Health as well as support from the Gordon and Betty Moore Foundation, a National Defense Science and Engineering Graduate Fellowship, and a Natural Sciences and Engineering Research Council Fellowship.

Unusual Labmates: How C. elegans Wormed Its Way into Science Stardom
Greta Friar | Whitehead Institute
September 20, 2022

 

Introduction

Michael Stubna, a graduate student in Whitehead Institute Member David Bartel’s lab, peers into his microscope at the Petri dish full of agar gel below. He spots one of his research specimens, a millimeter-long nematode worm known as Caenorhabditis elegans (C. elegans), slithering across the coating of bacteria–the worm’s food source–on the surface of the gel. The worm leaves sinuous tracks in its wake like a skier slaloming down a slope.

 

Michael looks up from the microscope and grabs his worm pick, a metal wire sticking out of a glass tube. He runs the end of the pick through a Bunsen burner flame until the wire glows red, using heat to sanitize the tool. Then he returns his attention to the microscope. He nudges the Petri dish to re-center the worm, and, once the pick has cooled, he coats the tip with some of the sticky bacterial food and uses it to skillfully pluck the worm from the surface of the gel. He puts a fresh dish of agar under the microscope, and presses the pick, with the worm still adhered, to the surface. Almost immediately, the worm sets off, carving fresh tracks into the pristine bacterial lawn.

Michael is cultivating C. elegans in order to use them to study microRNAs, tiny RNA molecules involved in gene regulation. Right now, Michael is the only researcher in the Bartel lab using the worms, but in the wider research world, C. elegans is a popular model organism. At first glance, C. elegans is a rather unassuming animal. Barely large enough to see with the naked eye, in nature the worms reside in soil and decomposing vegetation, feasting on bacteria. Except for their heads, their bodies can only bend up and down, so the animals crawl on their sides. The worms have simple tube bodies and are capable of a limited range of behaviors. Nevertheless, researchers frequently turn to C. elegans to learn about not only their biology, but our own. C. elegans is one of the most intimately understood species in biology—the first animal to have its complete genome sequenced or its neural circuitry completely mapped. How did this simple worm become so well studied and a fixture in laboratories around the world?

Making a model

The species C. elegans was first identified, and used in research, after being found in the soil in Algeria around 1900. However, its popularity as a research model skyrocketed in the 1970s, after biologist Sydney Brenner, then at the Medical Research Council  Laboratory of Molecular Biology and later the founder of the Molecular Sciences Institute, made the case for it as the best new model species for the field of molecular biology. [1], [2]

What makes C. elegans such a good model organism? The worm exists in a “just right” zone of biological complexity: it is complex enough to have many of the features that researchers want to study, but simple enough that those same features can be examined comprehensively. For example, each C. elegans has 302 neurons, which is enough to be a useful model for everything from questions about how brains form, to how they sense and respond to stimuli, to how neuronal pathways give rise to specific behaviors, to how different diseases cause neurodegeneration. At the same time, 302 neurons is a small enough number for researchers to be able to study each individual  neuron and its connections thoroughly. (In comparison, a fruit fly has around 100,000 neurons.)

In the same way, C. elegans has just enough complexity to be used to model other common aspects of animal biology, including muscle function, reproduction, digestion, wound healing, aging, and more. It shares many genes with humans and can even be used to model human disease. For example, researchers have used C. elegans to model neurodegenerative diseases such as Alzheimer’s and Parkinson’s.

Additionally, C. elegans has many advantages as a research subject. The worm’s skin is transparent, so researchers can easily observe and capture images of changes occurring inside of its body down to a cellular or even sub-cellular level.

C. elegans is small, hardy, and easy to rear in the laboratory. Fed a simple diet of bacteria and kept at 20 degrees Celsius, C. elegans will mature from an egg, through four larval stages, to a fertile adult in three days. It can then rapidly reproduce to provide researchers with thousands of specimens. C. elegans live for about three weeks, allowing for quick generational turnover, but if researchers want to keep the worms alive for longer, this is easy to do by putting the worms in stasis.

In nature, it is common to find the worms in a state of suspended animation, in which they can survive for months. [3] During poor environmental conditions, such as when food is scarce, instead of maturing into their usual third larval stage, the worms will enter what is known as a dauer stage, a hardier but inactive larval form. When environmental conditions improve, the worms exit the dauer stage and resume normal development. Researchers can recreate this process in the lab.

If a researcher is going out of town for a few days, they can keep their worms in a refrigerator at 4 degrees Celsius. They also have a simple solution for storing worms long term: freezing them. Worms put in a negative 80-degree Celsius freezer can survive for years and still be recovered. This makes the worms much easier to maintain than other common model organisms, which need constant maintenance to keep them alive, fed, and reproducing.

“With most model organisms, if you go away on vacation you need to find someone to look after your specimens while you are gone,” Stubna says. “That’s not necessary with worms.”

Another advantage of C. elegans is how easy it is to generate large numbers of them. Most C. elegans are self-fertilizing hermaphrodites. They cannot mate with each other, but each worm can generate hundreds of offspring on its own. The self-fertilizing nature of C. elegans generally works in researchers’ favor, as along with making it easy to generate lots of new worms, self-fertilization makes genetic inheritance simpler to predict and manipulate in order to maintain a desired mutation throughout the generations. However, sometimes researchers may want to cross-breed their specimens, for example to combine mutations found in separate adults in an offspring.
Fortunately for researchers, the rare C. elegans worm is male—around .1-.2% in the wild [4] — and males can fertilize hermaphrodites’ eggs, enabling cross-breeding. C. elegans males arise in the wild due to a deviation during sex cell division. Hermaphrodite C. elegans have two X chromosomes. When one of these fails to form correctly, the resulting worm with its one X chromosome will be male, having some key anatomical differences. If researchers need lots of males for their work, they can increase the percentage of a hermaphrodite’s offspring that are male by exposing the worm to heat before it reproduces or by using genetic manipulation. Then, researchers can selectively breed male-heavy populations to further increase the ratio over time.
Researchers have developed a variety of tools and approaches over the years with which to manipulate C. elegans genetically. These have enabled researchers to learn a lot about both the worm’s genes, and genes that it shares with humans and other animals. One useful approach is the development of marker strains.

Getting to know C. elegans inside and out

After Brenner landed on C. elegans as an ideal model, his research group began several ambitious projects to comprehensively understand the worm’s biology. In the following decades, the worm’s anatomy and genome would be detailed in unprecedented detail. The more that researchers learned about the worm, the better a model it became.

Creating a complete cell lineage and neural map

C. elegans is remarkable in that every worm has the same exact number of cells: 959 in the adult hermaphrodite (not counting the cells that will become eggs or sperm). 302 of these cells are neurons. Researchers in Brenner’s group created two first-of-their-kind resources documenting the details of this biology. First, they mapped the worms’ complete cell lineage, recording every cell division that occurs during the worms’ development from fertilized egg to adult. This resource makes it easy for researchers to study how different factors contribute to—or can alter—this development.

Then, the researchers created a wiring diagram, or connectome, of the hermaphrodite worm’s 302 neurons and their thousands of synapses—the junction points where neurons interact. Researchers have used this wiring diagram to identify neurons involved in many different behaviors in C. elegans, as well as to understand how brains form and function across animal species. [5,6] C. elegans was the first, and as of 2021 the only, animal to have had its brain completely mapped.

Capturing the complete genome

In 1998, C. elegans made the news as the first animal to have its complete genome sequenced. The completion of the 15-year-long sequencing project, helmed by the C. elegans Sequencing Consortium, was announced in a special issue of Science. Researchers had previously compiled complete genomes for a variety of single-celled species, but as an animal, C. elegans had a significantly larger genome. The complete genome provided many useful insights into individual genes, and the relationships between genes both within C. elegans and between species. The ambitious project also proved instructive for how to sequence large genomes. In fact, the Human Genome Project helped to fund the sequencing of C. elegans as a stepping stone to the ultimate goal of sequencing the human genome, which was achieved in 2003.

With C. elegans’ genetics, anatomy, and other biology so thoroughly documented, the worms became an even more potent model organism. Researchers now had a wealth of foundational knowledge about the worm that they could use to make and test hypotheses about specific questions.

Worm culture

As the use of C. elegans in science grew, a community formed among the worm’s researchers. The C. elegans community was quick to develop and share resources. The Worm Breeder’s Gazette is a semi-annual newsletter first published in 1975, which shares information of interest to the C. elegans community such as experimental techniques and new findings. The Caenorhabditis Genetics Center (CGC), founded in 1979, is a central repository from which researchers can order thousands of different strains of C. elegans for use in their own research.

From the early years, prominent researchers working with C. elegans believed strongly in sharing data both among researchers and with the public. This openness set the tone for the field of molecular biology more broadly; for example, open data sharing policies around the sequencing of the C. elegans genome encouraged the Human Genome Project to follow suit.

The worm community often had to build its own tools in order to share data on the scales its members desired. One big project was the creation of ACeDB (A C. elegans Database), a database management system capable of storing and displaying many different kinds of biological information about C. elegans, including its complete genome, in a user-friendly way. The current iteration of ACeDB, known as WormBase, contains the annotated genomes of C. elegans and related nematodes, information on every known C. elegans gene and its function, genetic maps, the C. elegans cell lineage and connectome, and much more. The ACeDB software was soon used to create similar databases for other model organisms. Such databases now exist for many different species, making detailed biological data widely available to everyone. These databases are also often used to share the latest research, maintain a consensus around scientific terminology and gene annotation, and provide educational resources on the model organism. Anyone looking for general information on C. elegans can also visit WormBook, an open access, online review of C. elegans biology.

With these resources and others, the C. elegans community fostered a culture of sharing and scientific openness that continues to this day.

A few of the many discoveries and further tool development

C. elegans researchwith its wealth of experimental tools and methods, pre-existing data with which to build and test hypotheses, and a worldwide community happy to share resources, has been the source of many important discoveries over the years. Many of these discoveries have also added to researchers’ toolkits, providing new ways to experiment with C. elegans and other research specimens. A few of these myriad impactful discoveries are highlighted below.

Insights into development and programmed cell death

Brenner and two researchers whom he had mentored, John Sulston, then at the The Wellcome Trust Sanger Institute, and Robert Horvitz, then at the Massachusetts Institute of Technology (MIT), were awarded the first Nobel Prize for work done in C. elegans in 2002. When the researchers were creating the C. elegans cell lineage map, they saw that some cells created during development died off at particular moments, and that this programmed destruction of cells, called apoptosis, was an essential part of creating the adult body. They identified key genes that regulate apoptosis, and their work led to insights into the role of apoptosis in human development, as well as in health and disease. For example, cancer cells are able to avoid apoptosis, and many modern cancer therapies work by reenabling apoptosis of cancer cells.

Andrew Fire at the Stanford University School of Medicine and Craig Mello at the University of Massachusetts Medical School used C. elegans to discover RNA interference (RNAi), a process that cells use to stop genes from being expressed. RNAi became an important research tool after researchers figured out how to tailor RNAi to turn off genes that they are interested in studying in different cells and species. Researchers turn off a gene and see what changes, which helps them figure out the gene’s function. People have also found uses for RNAi in medicine and industry. RNAi is easy to use in C. elegans — researchers can apply it to worms by simply feeding them modified bacteria — so this tool made the worms an even better model for genetics research. Fire and Mellow earned a Nobel Prize for their discovery in 2006.

Introducing a new visual tag

C. elegans also contributed to the development of another popular and powerful research tool, green fluorescent protein (GFP). GFP is a protein first found in jellyfish. It glows green under certain light waves. Martin Chalfie at Columbia University showed in C. elegans that the genetic code for GFP could be added as a tag to genes of interest, and then the products of those genes would glow, providing researchers with a great visual marker of where and when the genes were expressed. Chalfie shared the 2008 Nobel Prize in Chemistry for this work, and researchers now frequently use GFP and similar molecules as visual markers in experiments across species and cell types.

A model for aging

In 1993, Cynthia Kenyon at the University of California, San Francisco and colleagues discovered that mutations to a single gene, daf-2, along with the normal activity of a second gene, daf-16, could more than double the lifespan of C. elegans. Kenyon and others intrigued by this  discovery would go on to use C. elegans to ask questions about the molecular mechanisms governing aging. Researchers have also studied how equivalent genes affect aging in other animals, including humans.

A model for sex determination, reproduction, and development

C. elegans has been used to explore questions related to sex, reproduction, and development. Barbara Meyer, then at MIT, now at the University of California, Berkeley, discovered the mechanism of sex determination in the worm, and has uncovered mechanisms by which gene expression is regulated to compensate an animal having one or two X chromosomes. Other researchers have used C. elegans to make important discoveries about germ cells, the cells that give rise to eggs and sperm. Judith Kimble and John White, then at the MRC Laboratory of Molecular Biology, now at the University of Wisconsin–Madison discovered the first germline stem cell niche in C. elegans, which is the place where animals maintain a pool of stem cells with which to keep producing new germ cells over time. This finding had implications for fertility and regeneration research. Geraldine Seydoux at Johns Hopkins University has used C. elegans to investigate unique features of germ cells, as well as how sperm and egg interact and how the early embryo prepares to form a complex adult body.

Understanding sense of smell

Thanks in large part to C. elegans having such a well-mapped nervous system, the worm has been a common model for researchers studying how animals sense and respond to stimuli in their environments. Cori Bargmann, an alumna of Whitehead Institute, now at Rockefeller University, studies how C. elegans sense and process outside stimuli, how those stimuli can trigger changes in behavior, and how the brain can be rewired to modify behaviors over time. Bargmann’s research has particularly illuminated the worm’s sense of smell. She found the first evidence of a receptor for a specific smell, and her work more broadly shed light on how animals are able to recognize many different types of smells.

A rich history of discovery

This is just a small sampling of the important discoveries that have been made in C. elegans. WormBook has compiled a list of many such achievements, including the discovery of multiple key molecules and pathways present across animals.

Worms at Whitehead Institute

Michael uses C. elegans to study microRNAs.

C. elegans have long played an important role in microRNA research; in fact, microRNAs were first discovered in C. elegansVictor Ambros and colleagues, and Gary Ruvkun and colleagues, published papers describing the first identified microRNA, lin-4, and its target, in 1993. At first, researchers thought that the small gene-regulating molecule might be an oddity. However, in 2000, Ruvkun discovered a second microRNA, and by the next year researchers—including Whitehead Institute Member David Bartel—had identified many more microRNAs in C. elegans, as well as microRNAs in other species. [12] Collectively, this research implied that microRNAs were a common and important regulator of gene expression across species. The field of microRNA research exploded, and microRNAs became the focus of Bartel’s lab.

In spite of the importance of C. elegans in establishing microRNA research, both in the field at large and in the Bartel lab specifically, no one in the lab was using C. elegans as a research model when Michael joined. However, as Michael–who had previous experience working with C. elegans–began to plan his graduate research, he realized that the worm would be the perfect model in which to explore his topic of interest: how microRNAs are regulated. Bartel agreed, and so C. elegans made their triumphant return to Whitehead Institute.

“It’s great to return to C. elegans,” says Bartel, who is also a professor of biology at MIT and a HHMI investigator.  “Michael is working on sets of microRNAs that we discovered over 20 years ago. Since then, we and others have learned a lot about microRNAs, using a variety of research models. It will be fun to see what new things we will learn with C. elegans.

Michael is using C. elegans to better understand how microRNAs, which degrade messenger RNAs, are themselves degraded. In recent years, researchers discovered a surprising mechanism of microRNA degradation: in some instances, when a microRNA pairs with a messenger RNA, instead of this leading to the destruction of the messenger RNA, it leads to the destruction of the microRNA.

“The normal regulatory logic is completely flipped,” Michael says. “This was discovered in mammalian cells, and our lab and others have been working out the mechanism for how this happens.”

The Bartel lab found that a particular gene is necessary for this process. When Michael joined the lab, he wondered whether that gene’s equivalent in C. elegans serves the same role. He found that it does. However, there are differences between how this process works in C. elegans and mammals, in particular in the way that the RNAs pair to trigger destruction of the microRNA. Those differences, and what they reveal about how microRNAs are regulated, are what Michael is studying now.

“What is the underlying principle of what’s required for microRNAs to be degraded through this pathway in worms? It’s not known, and that’s what I’m trying to find out,” Michael says.

Michael hopes that what he learns from the worms will shed light on the logic for how microRNAs are regulated across animal species. This will in turn give researchers a better understanding of how cells are able to so precisely tailor their gene expression. The prospect of deciphering such a central facet of cell biology is exciting for Michael, but it’s just another day for C. elegans. Decades of results have shown that there’s no limit to what these simple worms can be used to discover.