Rethinking transcription factors and gene expression

Study shows that, like proteins, genomes must fold appropriately to function properly and that some transcription factors provide the structural support.

Nicole Giese Rura | Whitehead Institute
December 7, 2017

Transcription — the reading of a segment of DNA into an RNA template for protein synthesis — is fundamental for nearly all cellular processes, including growth, responding to stimuli, and reproduction. Now, Whitehead Institute researchers have upended our understanding of how transcription is controlled and the role of transcription factors in the process.

The paradigm shift, described in an article online on Dec. 7 in the journal Cell, hinges on a small protein that plays a key role in genome structure and gives us new insights into how changes in the control of transcription and gene expression can lead to disease.

Transcription has several important players that must all be in the right place at the right time: the transcription machinery, transcription factors, promoters, and enhancers.  According to the existing model, transcription factors are proteins that bind to enhancer regions of the genome and recruit the transcription machinery to the promoter DNA regions, which then initiate the genes’ transcription.

“We’ve always assumed that the role of transcription factors was to recruit the transcription machinery to genes to turn them on or turn them off,” says Richard Young, a Whitehead Insistute member and professor of biology at MIT. “But we never imagined that the transcription factors we’ve studied for three decades actually contribute to the genome’s structure. And as a consequence, they regulate genes. So we now look at genomes like proteins: They have to fold up appropriately in order to control genes.”

Scientists have known that the genome’s structure — how it bends and folds — is essential for efficiently compressing two meters of DNA into each human cell, which is the equivalent of packing a strand ten football fields long into a space the size of a marble. Yet until recently, researchers have not had the tools necessary to appreciate this architecture’s importance in fine control of gene expression or study the genome’s structure at sites ready for transcription.

In 2014, Young and his lab determined that portions of the genome reside in loop-based structures, creating insulated neighborhoods that bring enhancers, promoters, and genes into close proximity. Each loop is tied at the top by a pair of molecules, called CTCF, that are bound together. This structure is essential for proper gene control: If the loop structure is broken, gene expression is altered, and cells can become diseased or die.

In the current research, Young along with co-first authors Abraham Weintraub and Charles Li took a closer look at a protein that is well known but not well understood: Yin Yang 1 (YY1). Hundreds of scientific papers have linked YY1 dysfunction to diseases such as viral infections, cancer, and arthritis, and yet the studies produced seemingly contradictory observations of YY1’s function.

According to Young and colleagues, YY1 is a unique transcription factor that occupies both enhancers and promoters, is essential for cell survival, and is found in almost every cell type in humans and mice. Like CTCF, YY1 can also pair with itself and bind to DNA to form loops that enhance DNA transcription.

“YY1 is expressed broadly, and it is necessary for establishing enhancer-promoter loops in multiple cell types,” says Weintraub. “That’s its job, not recruiting the transcription apparatus. When the structure created by YY1 is removed, the genome is no longer folded properly, gene control is lost and transcription of the affected genes is significantly diminished, which can cause dysfunction.”

This model of YY1’s function could account for its association with a number of disparate diseases. Earlier this year, scientists reported YY1 syndrome — a genetic syndrome causing cognitive disabilities in people with mutations in their YY1 gene.

According to Young, YY1 is probably not the only transcription factor with this loop-forming role, and his lab will be searching for additional factors with similar functions.

“YY1 is most likely just the first one, and there are probably a bunch of collaborators that have similar roles,” says Young. “Instead of the classic function that we thought these transcription factors had — interacting with the transcription apparatus and giving instructions on how much or how little of a gene’s transcript to produce — they are bringing together regulatory elements with the gene. The whole job of these transcription factors is just making structure. We are realizing that the things that form physical structures are much more important than we had appreciated.”

The researchers’ work was supported by the National Institutes of Health, the Ludwig Graduate Fellowship funds, the National Science Foundation, the American Cancer Society, a Margaret and Herman Sokol Postdoctoral Award, the Damon Runyon Cancer Research Foundation, and the Cancer Research Institute. The Whitehead Institute has filed a patent application based on this study.

Robert A. Weinberg

Education

  • PhD, 1969, MIT
  • SB, 1964, Biology, MIT

Research Summary

We investigate three broad questions related to the origin and spread of cancer. First, how do cancer cells within a primary tumor acquire the ability to invade and metastasize? Second, how are the stem-cell state and the epithelial-mesenchymal transition interrelated? Third, how are the regulators of the epithelial-mesenchymal transition able to activate this profound change in cell phenotype?

Awards

  • Japan Prize, Japan Prize Foundation, 2021
  • Salk Institute Medal for Research Excellence, 2016
  • Breakthrough Prize in Life Sciences, 2013
  • Wolf Foundation Prize, 2004
  • Institute of Medicine, Member, 2000
  • Keio Medical Science Foundation Prize, 1997
  • National Science Foundation, National Medal of Science, 1997
  • Harvey Prize, 1994
  • American Academy of Arts and Sciences, Fellow, 1987
  • Sloan Prize, GM Cancer Research Foundation, 1987
  • National Academy of Sciences, Member, 1985
  • Robert Koch Foundation Prize, 1983
Monty Krieger

Education

  • PhD, 1976, California Institute of Technology
  • BS, 1971, Chemistry, Tulane University

Research Summary

We use genetic, biochemical, physiologic, chemical, cellular and molecular biological methods to study cell surface receptor structure and function. We focus on lipoprotein receptors — in particular, the High Density Lipoprotein (HDL) receptor called Scavenger Receptor, Class B, Type I (SR-BI). Our analyses have provided insight into basic biological processes, contributed to our understanding of atherosclerosis and coronary heart disease (CHD) and have uncovered an unexpected connection between cholesterol and mammalian female infertility.

No longer accepting new students.

Awards 

  • Tulane University School of Science and Engineering Outstanding Alumnus Award, 2010
  • National Academy of Sciences, Member, 2009
  • Outstanding Achievement Award for Contributions to Atherosclerosis Research, International Atherosclerosis Society, 2009
  • Margaret MacVicar Faculty Fellow, 1993-2003
Sebastian Lourido

Education

  • PhD, 2012, Washington University in St. Louis
  • BS, 2004, Cellular and Molecular Biology and Studio Art, Tulane University

Research Summary

Our lab is interested in the molecular events that enable apicomplexan parasites to remain widespread and deadly infectious agents. We study many important human pathogens, including Toxoplasma gondii, to model features conserved throughout the phylum. We seek to expand our understanding of eukaryotic diversity and identify specific features that can be targeted to treat parasite infections.

Awards

  • Odyssey Award, Smith Family Foundation, 2021
Adam C. Martin

Education

  • PhD, 2006, University of California, Berkeley
  • BS, 2000, Biology and Genetics, Cornell University

Research Summary

We study how cells and tissues change shape during embryonic development, giving rise to different body parts. We visualize these changes to determine how mechanical forces drive massive tissue movements in the fruit fly, Drosophila melanogaster. In addition, we also study the regulation of tissue integrity, investigating the processes that regulate the epithelial-to-mesenchymal transition or EMT.

Thomas U. Schwartz

Education

  • PhD, 2000, Free University of Berlin
  • MS, 1996, Biochemistry, Free University of Berlin
  • BS, 1993, Biochemistry, Free University of Berlin

Research Summary

Our primary goal is to understand how signals and molecules are transmitted between the nucleus and cytoplasm across the nuclear envelope. We work to decipher the mechanism and structure of the machinery that executes these cellular processes.

Rebecca Lamason

Education

  • PhD, 2011, The Johns Hopkins University School of Medicine
  • BS, 2002, Molecular Biology and Biotechnology, Millersville University

Research Summary

In the Lamason lab, we investigate how intracellular bacterial pathogens hijack host cell processes to promote infection. In particular, we study how Rickettsia parkeri and Listeria monocytogenes move through our tissues via a process called cell-to-cell spread. We utilize cellular, molecular, genetic, biochemical and biophysical approaches to elucidate the mechanisms of spread in order to reveal key aspects of pathogenesis and host cell biology.

Awards

  • NIH Pathway to Independence Award, 2015
Joseph (Joey) Davis

Education

  • PhD, 2010, MIT
  • BA,  2003, Computer Science, University of California, Berkeley
  • BS, 2003, Biological Engineering, University of California, Berkeley

Research Summary

The Davis lab is working to uncover how cells construct and degrade complex molecular machines rapidly and efficiently. We apply a variety of biochemical, biophysical, and structural approaches including quantitative mass spectrometry and single particle cryo-electron microscopy to understand the detailed molecular mechanisms of these processes. Ongoing projects in the lab are focused on autophagy, an essential eukaryotic protein and organelle degradation pathway, and assembly of the ribosome, which is essential in all cells.

Awards

  • Sloan Research Fellowship, Alfred P. Sloan Foundation, 2021
  • National Institute on Aging R00 Fellowship, 2017
  • National Institute on Aging K99 Fellowship, 2015
Stephen Bell

Education 

  • PhD, 1990, University of California, Berkeley
  • BS, 1985, Integrated Science Program and Biochemistry, Molecular Biology and Cell Biology, Northwestern University

Research Summary

We focus on the events that occur at the starting points of chromosome duplication. These DNA sequences — called “origins of replication” — are found at multiple sites on each eukaryotic chromosome and direct the assembly of replisomes, which replicate the DNA on both sides of the origin. We study this assembly process to understand how chromosomes are replicated, and how these events are regulated during the cell cycle to ensure genome maintenance.

Awards

  • National Academy of Sciences, Member, 2017
  • National Academy of Sciences Award in Molecular Biology, 2009
  • Howard Hughes Medical Institute, HHMI Investigator, 2000
Revealing an imperfect actor in plant biotechnology

Whitehead Institute researchers detect the chemical mistakes of a common herbicide-resistance enzyme, then successfully re-engineer it for enhanced precision.

Nicole Davis | Whitehead Institute
November 29, 2017

A research team led by MIT’s Whitehead Institute for Biomedical Research has harnessed metabolomic technologies to unravel the molecular activities of a key protein that enables plants to withstand a common herbicide.

Their findings reveal how the protein — a kind of catalyst or enzyme first isolated in bacteria and introduced into plants such as corn and soybeans in the 1990s — can sometimes act imprecisely, and how it can be successfully re-engineered to be more precise. The new study, which appears online in the journal Nature Plants, raises the standards for bioengineering in the 21st century.

“Our work underscores a critical aspect of bioengineering that we are now becoming technically able to address,” says senior author Jing-Ke Weng, a member of the Whitehead Institute and an assistant professor of biology at MIT. “We know that enzymes can behave indiscriminately. Now, we have the scientific capabilities to detect their molecular side effects, and we can leverage those insights to design smarter enzymes with enhanced specificity.”

Plants provide an extraordinary model for scientists to study how metabolism changes over time. Because they cannot escape from predators or search for new food sources when supplies run low, plants must often grapple with an array of environmental insults using what is readily available — their own internal biochemistry.

“Although they appear to be stationary, plants have rapidly evolving metabolic systems,” Weng explains. “Now, we can gain an unprecedented view of these changes because of cutting-edge techniques like metabolomics, allowing us to analyze metabolites and other biochemicals on a broad scale.”

Key players in this evolutionary process, and a major focus of research in Weng’s laboratory, are enzymes. Traditionally, these naturally occurring catalysts have been viewed as mini-machines, taking the proper starting material (or substrate) and flawlessly converting it to the correct product. But Weng and other scientists now recognize that they make mistakes, often by latching on to an unintended substrate.

“This concept, known as enzyme promiscuity, has a variety of implications, both in enzyme evolution and more broadly, in human disease,” Weng says.

It also has implications for bioengineering, as Bastien Christ, a postdoctoral fellow in Weng’s laboratory, and his colleagues recently discovered.

Christ, then a graduate student in Stefan Hörtensteiner’s lab at the University of Zurich in Switzerland, was studying a particular strain of the flowering plant Arabidopsis thaliana as part of a separate project when he made a puzzling observation. He found that two biochemical compounds were present at unusually high levels in the plant’s leaves.

Strangely, these compounds (called acetyl-aminoadipate and acetyl-tryptophan) weren’t present in any of the normal, so-called wild type plants. As he and his colleagues searched for an explanation, they narrowed in on the source: an enzyme, called BAR, that was engineered into the plants as a kind of chemical beacon, enabling scientists to more readily study them.

But BAR is more than just a tool for scientists. It is also one of the most commonly deployed traits in genetically modified crops such as soybeans, corn, and cotton, enabling them to withstand a widely-used herbicide (known as phosphinothricin or glufosinate).

For decades, scientists have known that BAR, originally isolated from bacteria, can render the herbicide inactive by tacking on a short string of chemicals, made of two carbons and one oxygen (also called an acetyl group). As the researchers describe in their Nature Plants paper, BAR has a promiscuous side, and can work on other substrates, too, such as the amino acids tryptophan and aminoadipate (a lysine derivative).

That explains why they can detect the unintended products (acetyl-tryptophan and acetyl-aminoadipate) in crops genetically engineered to carry BAR, such as soybeans and canola.

Their research included detailed studies of the BAR protein, including crystal structures of the protein bound to its substrates. This provided them with a blueprint for how to strategically modify BAR to make it less promiscuous, and favor only the herbicide as a substrate and not the amino acids. Christ and his colleagues created several versions that lack the non-specific activity of the original BAR protein.

“These are natural catalysts, so when we borrow them from an organism and put them into another, they may not necessarily be perfect for our purposes,” Christ says. “Gathering this kind of fundamental knowledge about how enzymes work and how their structure influences function can teach us how to select the best tools for bioengineering.”

There are other important lessons, too. When the BAR trait was first evaluated by the U.S. Food and Drug Administration (FDA) in 1995 for use in canola, and in subsequent years for other crops, metabolomics was largely non-existent as a technology for biomedical research. Therefore, it could not be applied toward the characterization of genetically engineered plants and foods, as part of their regulatory review. Nevertheless, acetyl-aminoadipate and acetyl-tryptophan, which are normally present in humans, have been reviewed by the FDA and are safe for human and animal consumption.

Weng and his colleagues believe their study makes a strong case for considering metabolomics analyses as part of the review process for future genetically engineered crops.

“This is a cautionary tale,” Weng says.

The work was supported by the Swiss National Science Foundation, the EU-funded Plant Fellows program, the Pew Scholar Program in the Biomedical Sciences, and the Searle Scholars Program.