Study reveals how egg cells get so big

Oocyte growth relies on physical phenomena that drive smaller cells to dump their contents into a larger cell.

Anne Trafton | MIT News Office
March 10, 2021

Egg cells are by far the largest cells produced by most organisms. In humans, they are several times larger than a typical body cell and about 10,000 times larger than sperm cells.

There’s a reason why egg cells, or oocytes, are so big: They need to accumulate enough nutrients to support a growing embryo after fertilization, plus mitochondria to power all of that growth. However, biologists don’t yet understand the full picture of how egg cells become so large.

A new study in fruit flies, by a team of MIT biologists and mathematicians, reveals that the process through which the oocyte grows significantly and rapidly before fertilization relies on physical phenomena analogous to the exchange of gases between balloons of different sizes. Specifically, the researchers showed that “nurse cells” surrounding the much larger oocyte dump their contents into the larger cell, just as air flows from a smaller balloon into a larger one when they are connected by small tubes in an experimental setup.

“The study shows how physics and biology come together, and how nature can use physical processes to create this robust mechanism,” says Jörn Dunkel, an MIT associate professor of physical applied mathematics. “If you want to develop as an embryo, one of the goals is to make things very reproducible, and physics provides a very robust way of achieving certain transport processes.”

Dunkel and Adam Martin, an MIT associate professor of biology, are the senior authors of the paper, which appears this week in the Proceedings of the National Academy of Sciences. The study’s lead authors are postdoc Jasmin Imran Alsous and graduate student Nicolas Romeo. Jonathan Jackson, a Harvard University graduate student, and Frank Mason, a research assistant professor at Vanderbilt University School of Medicine, are also authors of the paper.

A physical process

In female fruit flies, eggs develop within cell clusters known as cysts. An immature oocyte undergoes four cycles of cell division to produce one egg cell and 15 nurse cells. However, the cell separation is incomplete, and each cell remains connected to the others by narrow channels that act as valves that allow material to pass between cells.

Members of Martin’s lab began studying this process because of their longstanding interest in myosin, a class of proteins that can act as motors and help muscle cells contract. Imran Alsous performed high-resolution, live imaging of egg formation in fruit flies and found that myosin does indeed play a role, but only in the second phase of the transport process. During the earliest phase, the researchers were puzzled to see that the cells did not appear to be increasing their contractility at all, suggesting that a mechanism other than “squeezing” was initiating the transport.

“The two phases are strikingly obvious,” Martin says. “After we saw this, we were mystified, because there’s really not a change in myosin associated with the onset of this process, which is what we were expecting to see.”

cluster of cells

Martin and his lab then joined forces with Dunkel, who studies the physics of soft surfaces and flowing matter. Dunkel and Romeo wondered if the cells might be behaving the same way that balloons of different sizes behave when they are connected. While one might expect that the larger balloon would leak air to the smaller until they are the same size, what actually happens is that air flows from the smaller to the larger.

This happens because the smaller balloon, which has greater curvature, experiences more surface tension, and therefore higher pressure, than the larger balloon. Air is therefore forced out of the smaller balloon and into the larger one. “It’s counterintuitive, but it’s a very robust process,” Dunkel says.

Adapting mathematical equations that had already been derived to explain this “two-balloon effect,” the researchers came up with a model that describes how cell contents are transferred from the 15 small nurse cells to the large oocyte, based on their sizes and their connections to each other. The nurse cells in the layer closest to the oocyte transfer their contents first, followed by the cells in more distant layers.

“After I spent some time building a more complicated model to explain the 16-cell problem, we realized that the simulation of the simpler 16-balloon system looked very much like the 16-cell network. It is surprising to see that such counterintuitive but mathematically simple ideas describe the process so well,” Romeo says.

The first phase of nurse cell dumping appears to coincide with when the channels connecting the cells become large enough for cytoplasm to move through them. Once the nurse cells shrink to about 25 percent of their original size, leaving them only slightly larger than their nuclei, the second phase of the process is triggered and myosin contractions force the remaining contents of the nurse cells into the egg cell.

“In the first part of the process, there’s very little squeezing going on, and the cells just shrink uniformly. Then this second process kicks in toward the end where you start to get more active squeezing, or peristalsis-like deformations of the cell, that complete the dumping process,” Martin says.

Cell cooperation

The findings demonstrate how cells can coordinate their behavior, using both biological and physical mechanisms, to bring about tissue-level behavior, Imran Alsous says.

“Here, you have several nurse cells whose job it is to nurse the future egg cell, and to do so, these cells appear to transport their contents in a coordinated and directional manner to the oocyte,” she says.

Oocyte and early embryonic development in fruit flies and other invertebrates bears some similarities to those of mammals, but it’s unknown if the same mechanism of egg cell growth might be seen in humans or other mammals, the researchers say.

“There’s evidence in mice that the oocyte develops as a cyst with other interconnected cells, and that there is some transport between them, but we don’t know if the mechanisms that we’re seeing here operate in mammals,” Martin says.

The researchers are now studying what triggers the second, myosin-powered phase of the dumping process to start. They are also investigating how changes to the original sizes of the nurse cells might affect egg formation.

The research was funded by the National Institute of General Medical Sciences, a Complex Systems Scholar Award from the James S. McDonnell Foundation, and the Robert E. Collins Distinguished Scholarship Fund.

Cells are known by the company they keep
Eva Frederick
March 2, 2021

In the paper, published online March 1 in the journal Cell Metabolism, researchers at Whitehead Institute and the Morgridge Institute for Research performed CRISPR-based genetic screens of cells cultured in either traditional media or a new physiologic medium previously designed in the Sabatini Lab at Whitehead Institute designed to more closely reflect the nutrient composition of human blood. The screen revealed that different genes became essential for survival and reproduction in the various conditions.

“This work underscores the importance of using more human-like, physiologically relevant media for culturing human cancer cell lines,” said Whitehead Institute Member and co-senior author David Sabatini, who is also a professor of biology at the Massachusetts Institute of Technology and an investigator of the Howard Hughes Medical Institute. “The information we can learn from screens in human plasma-like media — or media designed to mimic other fluids throughout the body — may inform new therapeutic methods down the line.”

The widespread use of a human plasma-like medium could open the door for many researchers to conduct experiments in the lab that could have more relevance to human disease, but without complicated methods or prohibitive costs.

“Medium composition is both relatively accessible and quite flexible,” said co-senior author Jason Cantor, an Investigator at the Morgridge Institute for Research and an assistant professor of biochemistry at the University of Wisconsin-Madison, and a former postdoc in Sabatini’s lab. “Not all researchers have access to specialized tissue culture incubators, nor can everyone easily pursue some of the more complex 3D and co-culture methods, but most can get their hands on a bottle of media.”

The big screen

The idea that different environmental conditions may lead to different genes being essential is not a new one. “People have done this in microorganisms, and shown that if you throw [bacteria] into different growth conditions — the contributions of different genes to cell fitness can change,” Cantor said.

With this reasoning in mind — that medium composition could affect which genes become necessary for human cells to grow — the researchers set up screens to identify essential genes in a single leukemia cell line in different kinds of culture media. One batch was grown in a traditional medium, and another cultured in the lab’s new medium called Human Plasma-Like Medium, or HPLM, which has a metabolic composition more reflective of that in human blood.

The approach they used — called a CRISPR screen —  takes advantage of CRISPR-Cas9 gene editing technology to systematically snip and knock out genes across the genome, with the goal of creating a population of cells in which every possible gene knockout is represented. The expression of some genes is essential to survival, and cells grow substantially slower or die when those genes are deleted. Other cells may have trouble functioning, and some may grow even faster. Once the pooled cells have had a chance to grow and proliferate, researchers sequence the genetic material of the entire population to determine which genes were critical for survival within the given screen.

Once they completed the initial screens, the researchers identified hundreds of genes that were conditionally essential — that is, necessary for cell growth in one medium versus another. Interestingly, these medium-dependent essential genes collectively had roles in a number of different biological processes.

To determine how much these genes were dependent on the type of cells studied, the researchers then ran similar screens across a panel of human blood cancer cell lines, and then pursued follow-up work to understand why certain genes were identified as conditionally essential.

Ultimately, they uncovered the underlying gene-nutrient interactions, and specifically for these hit genes, traced the effects to availability of certain metabolites — the nutrients and small molecules necessary for metabolism — that are uniquely defined in HPLM versus the traditional media.

The next steps

CRISPR screens can help scientists identify potential drug targets and map out important cellular interactions to inform cancer therapies. “There are so many ways that people use CRISPR screens right now,” said Cantor. “What this study is showing is that the availability metabolites can have a major impact on which genes are important for cell growth, and so I think there are a lot of implications here in terms of how these types of screens could be performed in the future in order to potentially increase the fidelity of what we see in the lab and what might happen in the body.”

The research also establishes more nuanced relationships between cells’ genes and their environment. “What this allows us to do down the line, theoretically, is to tune how important a gene is — how important the encoded protein is — by manipulating metabolite levels in the blood,” said Cantor. “That’s one of our bigger-picture ideas.”

In the future, these relationships could inform cancer treatments. For example, if scientists could “tune” the importance of a specific gene for cancer cell growth, then the protein encoded by that gene could become a more promising drug target — in effect, tricking cancer cells into revealing possible context-dependent vulnerabilities. “The idea of targeting metabolites to treat cancer isn’t itself new — in fact, it [underlies] a well-established anti-cancer therapeutic enzyme still in use today — but I think our work maybe enables us to look for ways to couple this type of approach with other targeted therapies.”

“At our core, we are a basic cell biology lab,” added Nicholas Rossiter, a technician in Cantor’s lab and the first author of the study. “But whenever you’re studying basic cell biology, there’s the potential to translate it into therapeutic strategy. Our plan is just to keep on chugging along in our lab and studying how exactly cell biology can be influenced by these environmental factors. We do the basics, and then there will hopefully be some auspicious findings that can be carried on into therapeutics.”

Seychelle Vos investigates how the genome is organized so it can fit inside the cell — and how that careful organization affects gene expression.

February 24, 2021

The Davis and Berger labs combined cryo-electron microscopy and machine learning to visualize molecules in 3D.

February 4, 2021
Machine-learning model helps determine protein structures

New technique reveals many possible conformations that a protein may take.

Anne Trafton | MIT News Office
February 4, 2021

Cryo-electron microscopy (cryo-EM) allows scientists to produce high-resolution, three-dimensional images of tiny molecules such as proteins. This technique works best for imaging proteins that exist in only one conformation, but MIT researchers have now developed a machine-learning algorithm that helps them identify multiple possible structures that a protein can take.

Unlike AI techniques that aim to predict protein structure from sequence data alone, protein structure can also be experimentally determined using cryo-EM, which produces hundreds of thousands, or even millions, of two-dimensional images of protein samples frozen in a thin layer of ice. Computer algorithms then piece together these images, taken from different angles, into a three-dimensional representation of the protein in a process termed reconstruction.

In a Nature Methods paper, the MIT researchers report a new AI-based software for reconstructing multiple structures and motions of the imaged protein — a major goal in the protein science community. Instead of using the traditional representation of protein structure as electron-scattering intensities on a 3D lattice, which is impractical for modeling multiple structures, the researchers introduced a new neural network architecture that can efficiently generate the full ensemble of structures in a single model.

“With the broad representation power of neural networks, we can extract structural information from noisy images and visualize detailed movements of macromolecular machines,” says Ellen Zhong, an MIT graduate student and the lead author of the paper.

With their software, they discovered protein motions from imaging datasets where only a single static 3D structure was originally identified. They also visualized large-scale flexible motions of the spliceosome — a protein complex that coordinates the splicing of the protein coding sequences of transcribed RNA.

“Our idea was to try to use machine-learning techniques to better capture the underlying structural heterogeneity, and to allow us to inspect the variety of structural states that are present in a sample,” says Joseph Davis, the Whitehead Career Development Assistant Professor in MIT’s Department of Biology.

Davis and Bonnie Berger, the Simons Professor of Mathematics at MIT and head of the Computation and Biology group at the Computer Science and Artificial Intelligence Laboratory, are the senior authors of the study, which appears today in Nature Methods. MIT postdoc Tristan Bepler is also an author of the paper.

Visualizing a multistep process

The researchers demonstrated the utility of their new approach by analyzing structures that form during the process of assembling ribosomes — the cell organelles responsible for reading messenger RNA and translating it into proteins. Davis began studying the structure of ribosomes while a postdoc at the Scripps Research Institute. Ribosomes have two major subunits, each of which contains many individual proteins that are assembled in a multistep process.

To study the steps of ribosome assembly in detail, Davis stalled the process at different points and then took electron microscope images of the resulting structures. At some points, blocking assembly resulted in accumulation of just a single structure, suggesting that there is only one way for that step to occur. However, blocking other points resulted in many different structures, suggesting that the assembly could occur in a variety of ways.

Because some of these experiments generated so many different protein structures, traditional cryo-EM reconstruction tools did not work well to determine what those structures were.

“In general, it’s an extremely challenging problem to try to figure out how many states you have when you have a mixture of particles,” Davis says.

After starting his lab at MIT in 2017, he teamed up with Berger to use machine learning to develop a model that can use the two-dimensional images produced by cryo-EM to generate all of the three-dimensional structures found in the original sample.

In the new Nature Methods study, the researchers demonstrated the power of the technique by using it to identify a new ribosomal state that hadn’t been seen before. Previous studies had suggested that as a ribosome is assembled, large structural elements, which are akin to the foundation for a building, form first. Only after this foundation is formed are the “active sites” of the ribosome, which read messenger RNA and synthesize proteins, added to the structure.

In the new study, however, the researchers found that in a very small subset of ribosomes, about 1 percent, a structure that is normally added at the end actually appears before assembly of the foundation. To account for that, Davis hypothesizes that it might be too energetically expensive for cells to ensure that every single ribosome is assembled in the correct order.

“The cells are likely evolved to find a balance between what they can tolerate, which is maybe a small percentage of these types of potentially deleterious structures, and what it would cost to completely remove them from the assembly pathway,” he says.

Viral proteins

The researchers are now using this technique to study the coronavirus spike protein, which is the viral protein that binds to receptors on human cells and allows them to enter cells. The receptor binding domain (RBD) of the spike protein has three subunits, each of which can point either up or down.

“For me, watching the pandemic unfold over the past year has emphasized how important front-line antiviral drugs will be in battling similar viruses, which are likely to emerge in the future. As we start to think about how one might develop small molecule compounds to force all of the RBDs into the ‘down’ state so that they can’t interact with human cells, understanding exactly what the ‘up’ state looks like and how much conformational flexibility there is will be informative for drug design. We hope our new technique can reveal these sorts of structural details,” Davis says.

The research was funded by the National Science Foundation Graduate Research Fellowship Program, the National Institutes of Health, and the MIT Jameel Clinic for Machine Learning and Health. This work was supported by MIT Satori computation cluster hosted at the MGHPCC.

A new database of potential antibiotic targets
Raleigh McElvery
January 20, 2021

Many cells, including bacteria, are covered in a sugar-rich coating that protects their membrane and internal components. These sugars are often joined to other macromolecules, like proteins or lipids, to form glycoconjugates. The glycoconjugates that encrust bacteria help prevent them from “popping” under environmental stress, and facilitate host-pathogen interactions. Because the sugary layer perpetuates survival and virulence, researchers are looking for ways to create chinks in this microbial armor — or better yet, to prevent it from being made in the first place.

Glycoconjugates are built by many enzymes working in close succession at the cell membrane. One enzyme family, comprised of phosphoglycosyl transferases (PGTs), is responsible for catalyzing the first step in the assembly line. Of this large enzyme family, one subtype in particular stands out: “monotopic” PGTs, which are unique to bacteria and could serve as antibiotic targets. If researchers can develop drugs that inhibit monoPGTs, the sugar armor wouldn’t be built and noxious bacteria could be easier to defeat.

new PNAS study co-authored by Professor of Biology and Chemistry, Barbara Imperiali, highlights the diversity and significance of these potential drug targets. Imperiali teamed up with graduate student Katherine O’Toole and Professor of Chemistry Karen Allen from Boston University to categorize over 38,000 different monoPGTs, compiling this information into the first database of its kind.

“We’ve taken an enzyme family that was once considered quirky and insignificant, and demonstrated that it’s actually very prevalent,” Imperiali says. “Hopefully the database will help us better understand these enzymes, their molecular pathways, and the human pathogens they support.”

Imperiali and her colleagues used sequence analysis of known monoPGTs to define a “signature” amino acid sequence. They leveraged this signature to identify the entire superfamily of monoPGTs amidst the 63,152 sequences downloaded from an online portal, which they then clustered into closely-related subtypes. The researchers also created a family tree, which included over 100 monoPGTs from diverse bacterial species. Imperiali hopes others will take advantage of this new information to pinpoint monoPGTs in pathogens of interest, and explore similarities and differences in related microbes and their enzymes.

The researchers’ analyses also revealed strange, new proteins that appeared to include two enzymes in one — a monoPGT fused to one of the other enzymes that typically play a separate role in the same sugar-modifying pathway. “It’s essentially one protein with two functions,” Imperiali explains. These fusion enzymes could reveal which enzymes “talk” to one another and work sequentially during the glycoconjugate-building process, she adds, revealing the complicated chain of events that creates the bacterial sugar shield.

The team even found cases where one monoPGT was fused to a member of a different PGT family — polytopic PGTs (polyPGTs). MonoPGTs and polyPGTs are involved in different pathways that each build glycoconjugates, so having a dual-function protein could allow cells to easily switch between mechanisms. Bacterial cells lack the organizational compartments that human and other eukaryotic cells have, so perhaps these fusion enzymes help exert control and order at different points in the cell cycle, Imperiali speculates. At the moment, though, the hybrid PGTs remain an evolutionary mystery.

While some researchers parse these ancient puzzles, others may use the database to inspire new drugs to combat antibiotic resistance. “At the end of the day,” Imperiali says, “we’ve shed light on a set of enzymes that could become pivotal therapeutic targets.”

RNA molecules are masters of their own destiny
Eva Frederick | Whitehead Institute
December 16, 2020

At any given moment in the human body, in about 30 trillion cells, DNA is being “read” into molecules of messenger RNA, the intermediary step between DNA and proteins, in a process called transcription.

Scientists have a pretty good idea of how transcription gets started: proteins called RNA polymerases are recruited to specific regions of the DNA molecules and begin skimming their way down the strand, synthesizing mRNA molecules as they go. But part of this process is less well understood: how does the cell know when to stop transcribing?

Now, new work from the labs of Whitehead Institute Member Richard Young, also a professor of biology at Massachusetts Institute of Technology (MIT), and Arup K. Chakraborty, professor of chemical engineering, physics and chemistry at MIT, suggests that RNA molecules themselves are responsible for regulating their formation through a feedback loop. Too few RNA molecules, and the cell initiates transcription to create more. Then, at a certain threshold, too many RNA molecules cause transcription to draw to a halt.

The research, published in Cell on December 16, represents a collaboration between biologists and physicists, and provides some insight into the potential roles of the thousands of RNAs that are not translated into any proteins, called noncoding RNAs, which are common in mammals and have mystified scientists for decades.

A question of condensates

Previous work in Young’s lab has focused on transcriptional condensates, small cellular droplets that bring together the molecules needed to transcribe DNA to RNA. Scientists in the lab discovered the transcriptional droplets in 2018, noticing that they typically formed when transcription began and dissolved a few seconds or minutes later when the process was finished.

The researchers wondered if the force that governed the dissolution of the transcriptional condensates could be related to the chemical properties of the RNA they produced — specifically, its highly negative charge. If this were the case, it would be the latest example of cellular processes being regulated via a feedback mechanism — an elegant, efficient system used in the cell to control biological functions such as red blood cell production and DNA repair.

As an initial test, the researchers used an in vitro experiment to test whether the amount of RNA had an effect on condensate formation. They found that within the range of physiological levels observed in cells, low levels of RNA encouraged droplet formation and high levels of RNA discouraged it.

Thinking outside the biology box 

With these results in mind, Young Lab postdocs and co-first authors Ozgur Oksuz and Jon Henninger teamed up with physicist and co-first author Krishna Shrinivas, a graduate student in Arup Chakraborty’s lab, to investigate what physical forces were at play.

Shrinivas proposed that the team build a computational model to study the physical and chemical interactions between actively transcribed RNA and condensates formed by transcriptional proteins. The goal of the model was not to simply reproduce existing results, but to create a platform with which to test a variety of situations.

“The way most people study these kinds of problems is to take mixtures of molecules in a test tube, shake it and see what happens,” Shrinivas said. “That is as far away from what happens in a cell as one can imagine. Our thought was, ‘Can we try to study this problem in its biological context, which is this out-of-equilibrium, complex process?’”

Studying the problem from a physics perspective allowed the researchers to take a step back from traditional biology methods. “As a biologist, it’s difficult to come up with new hypotheses, new approaches to understanding how things work from available data,” Henninger said. “You can do screens, you can identify new players, new proteins, new RNAs that may be involved in a process, but you’re still limited by our classical understanding of how all these things interact. Whereas when talking with a physicist, you’re in this theoretical space extending beyond what the data can currently give you. Physicists love to think about how something would behave, given certain parameters.”

Once the model was complete, the researchers could ask it questions about situations that may arise in cells — for instance, what happens to condensates when RNAs of different lengths are produced at different rates as time ensues? — and then follow it up with an experiment at the lab bench. “We ended up with a very nice convergence of model and experiment,” Henninger said. “To me, it’s like the model helps distill the simplest features of this type of system, and then you can do more predictive experiments in cells to see if it fits that model.”

The charge is in charge

Through a series of modeling and experiments at the lab bench, the researchers were able to confirm their hypothesis that the effect of RNA on transcription is due to RNAs molecules’ highly negative charge. Furthermore, it was predicted that initial low levels of RNA enhance and subsequent higher levels dissolve condensates formed by transcriptional proteins. Because the charge is carried by the RNAs’ phosphate backbone, the effective charge of a given RNA molecule is directly proportional to its length.

In order to test this finding in a living cell, the researchers engineered mouse embryonic stem cells to have glowing condensates, then treated them with a chemical to disrupt the elongation phase of transcription. Consistent with the model’s predictions, the resulting dearth of condensate-dissolving RNA molecules increased the size and lifetime of condensates in the cell. Conversely, when the researchers engineered cells to induce the production of extra RNAs, transcriptional condensates at these sites dissolved. “These results highlight the importance of understanding how non-equilibrium feedback mechanisms regulate the functions of the biomolecular condensates present in cells,” said Chakraborty.

Confirmation of this feedback mechanism might help answer a long-standing mystery of the mammalian genome: the purpose of non-coding RNAs, which make up a large portion of genetic material. “While we know a lot about how proteins work, there are tens of thousands of noncoding RNA species, and we don’t know the functions of most of these molecules,” said Young. “The finding that RNA molecules can regulate transcriptional condensates makes us wonder if many of the noncoding species just function locally to tune gene expression throughout the genome. Then this giant mystery of what all these RNAs do has a potential solution.”

The researchers are optimistic that understanding this new role for RNA in the cell could inform therapies for a wide range of diseases. “Some diseases are actually caused by increased or decreased expression of a single gene,” said Oksuz, a co-first author. “We now know that if you modulate the levels of RNA, you have a predictable effect on condensates. So you could hypothetically tune up or down the expression of a disease gene to restore the expression — and possibly restore the phenotype — that you want, in order to treat a disease.”

Young added that a deeper understanding of RNA behavior could inform therapeutics more generally. In the last 10 years, a variety of drugs have been developed that directly target RNA successfully. “RNA is an important target,” Young said. “Understanding mechanistically how RNA molecules regulate gene expression bridges the gap between gene dysregulation in disease and new therapeutic approaches that target RNA.”