| 
  • If you are citizen of an European Union member nation, you may not use this service unless you are at least 16 years old.

  • Finally, you can manage your Google Docs, uploads, and email attachments (plus Dropbox and Slack files) in one convenient place. Claim a free account, and in less than 2 minutes, Dokkio (from the makers of PBworks) can automatically organize your content for you.

View
 

Transcription Regluatory Network

Page history last edited by PBworks 13 years, 12 months ago

Transcription Regulatory Network

by Rosa Ng


 

What is Transcription Regulatory Network?

Transcriptional regulation of gene expression in a cell is important to the survival and development of all organisms. For example, bacteria living in an environment of high lactose but no glucose (which is their “food source”) need to be able to alter the expression of certain genes to produce a protein that can break down lactose into glucose in order to survive and reproduce. Similarly, a cell in the human body needs to be able to respond to environmental stimuli such as harmful toxins by controlling its gene expressions so that certain proteins won’t be produced to stop the cell from moving onward to cell division. Cellular specialization of a multi-cellular organism also requires a coordinated effort of turning genes “on” and “off” at specific times so that different cells can become specialized in different ways according to the proteins they synthesize. These tasks of regulating gene transcriptions are achieved by collections of regulatory proteins and their interactions with specific sequences in the promoter regions of targeted genes. A “Transcriptional Regulatory Network” describes such regulatory proteins and interactions.

 

Figure: A Transcription regulatory network. (1)

 

Transcriptional regulatory networks in viruses and prokaryotes

 

Transcription regulatory proteins, often known as “activators” (activate gene transcriptions) or “repressors” (repress gene transcriptions), can be found to function in the relatively simpler genomes of viruses and prokaryokes. For example, the bacteriophage lambda alters its gene expression depending on whether it is entering the lytic cycle or the lysogenic cycle.

 

Although the phage lambda genome is only 48,502 base pairs long, the regulatory network that allows the phage to choose and follow one of the two cycles is quite complex already and still remains to be fully elucidated. (2)

 

Figure: The phage lambda genome. (2)

 

Figure: The phage lambda regulatory network. (2)

For bacteria, the transcription regulatory network has been most extensively studied in E.coli. We have mentioned the need for the bacteria to metabolize lactose in the absence of glucose. In E.coli, the lac operon , a piece of DNA that contains a promoter region, an operator region and three genes, serves to regulate this process. The lac operon represents perhaps the simplest transcription regulatory mechanism; it involves a repressor protein, which is transcribed by the lacI gene and binds to the operator region, and the operator sequence itself. The presence of lactose serves to inactivate the repressor and thus allow for the transcriptions of the genes. Still, this transcription regulatory mechanism is only part of a bigger E.coli metabolic regulatory network, which may consist of as many as 149 genes, including genes that code for 16 regulatory proteins and 73 enzymes! (3)

 

Figure: The transcription regulatory network for metabolism in E.coli. (3)

 

Transcription regulation networks in eukaryotes

 

As you can imagine, transcription regulatory networks in eukaryotes are even more complicated. Besides elements such as promoters, activators and repressors, eukaryotic transcription features other cis-regulatory elements including inducers and enhancers. The best-studied eukaryotic model for transcriptional regulation is yeast Saccharomyces cerevisiae. Even still, for this “simple” eukaryotic organism, there are more than 200 regulatory proteins for the transcription of its >6000 genes. (4) In order to map these transcription regulation networks, a variety of experimental and computational tools are used, as described below.

 

Figure: An artistic rendition of the yeast gene regulatory network from Professor M. Synder's website.

 

Deciphering Transcription Regulatory Networks

In deciphering transcription regulatory networks, we want to know how each gene is controlled by transcription regulation. We can think of this as a reverse engineering problem. We see the end results of the working of a system (for example, we can observe the physiological changes in the development of an organism), but we don’t know how the system works. Since the expression of a gene is largely determined by transcription factors (TFs) and their binding sites, we also want to know which gene is regulated by which TF. Developments in genomic tools and computational algorithms have allowed us to better study these questions, illuminating the workings of transcription regulatory networks.

 

Gene expression analysis

To understand transcription regulation networks, we first have to identify which gene are transcribed. Microarray expression analysis allows us to define the targets of transcription factors as we delete or overexpress TF-encoding genes in cells under studied. Then, using singular value decomposition analysis , we can find patterns in the expression data and try to construct the actions of the TFs. (For more details on this approach, see 5 , 6 , 7.) Alternatively, researchers have also used Bayesian belief networks to analyze expression data. (For experiments using this approach, see 8 , 9) Still, microarray expression data alone cannot produce the whole picture of transcription regulation network. First of all, the observed gene expressions could be due to secondary effects not directly attributable to TFs. Also, each gene could be activated or repressed by a number of TFs, so TF functions could be redundant and compensated for in gene expression data. In addition, transcription regulation could be conditionally-dependent. For example, depending on certain growth condition, the TF may not be active, or it may regulate a different numbers of genes, or it may regulate a different set of genes.

 

Figure: Transcription regulation is conditionally specific. (10)

 

ChIP-chip

The ChIP-chip technology is also used to study transcription regulatory networks. The ChIP-chip methodology is described in details elsewhere, but briefly: it involves the chromatic immunoprecipitation of DNA fragments bound by the transcription factor of interest, and then hybridization to a DNA microarray. The following figure shows the schematic of ChIP-chip analyses compared to genome-wide expression analyses.

 

Figure: (a) cDNA microarray genome-wide expression analyses. (b) ChIP-chip. (4)

The advantage of ChIP-chip data is that they provide a direct measure of TF-DNA binding, thus eliminating possible secondary effects in regular gene expression analysis. For example, ChIP-chip studies have revealed that although the DNA sequence targeted by a certain TF may occur at many other sites in the genome, the TF actually only bind specifically to a subset of these sites in vivo. The study of the transcriptional activator Rap1 by Lieb et al. is a classic example. They found that the consensus binding sequence of Rap1, ACACCCRYACAYM (remember HWA2?), albeit can be matched to many sites of the yeast genome, is preferentially bound by the transcriptional activator only at sites close to promoters. (11) Thus, ChIP-chip allows for a “location” analysis of gene regulation by TFs.

 

ChIP-chip analyses have been used on many studies of transcription regulatory networks, including a seminal study on 106 TFs in yeast by Dr. Richard Young’s lab at MIT. (12) In this study, Young and colleagues were able to obtain a distribution of TF binding, i.e. how many promoter regions does each TF bind to, and how many TFs bind to each promoter region. This not only gives us a glimpse into the complexity of the transcription regulatory network, but also allows us to begin constructing the transcription regulatory networks. Still, one caveat on ChIP-chip studies is that ChIP-chip data may be conditionally-dependent as well. For example, TFs may not bind to promoters in an inactive state. (13) Recent studies include investigating TF binding profiles under different growth conditions. (Review in (13).)

 

Figure: (left) Number of transcription regulators bound per promoter region. The red circles reflect results from actual data, whereas the white circles represent originally predicted results. (right) Distribution of bound promoter regions per transcription regulator.(12)

 

Promoter elements

Another approach in understanding transcription regulation and gene expression is to search for promoter motifs to which TFs recognize and bind. Computational algorithms that are used to search for TF binding sites (TFBSs) have been described in detail elsewhere, and will not be repeated here. By identifying TF motif sequence, location and orientation, we can potentially predict gene expression regulation. For example, the presence of multiple TF motifs may suggest the regulatioin of the gene by multiple TF, and the presence of co-occurring motifs may suggest that the regulatory proteins interact with each other in order to control the transcription of the gene. Of course, simply identifying TFBSs in promoter elements is insufficient to decipher transcription regulatory networks, as motif match is always a probability, and, as mentioned above, we don't know whether the motif actually functions as TFBSs in cells.

 

Figure: Some common promoter architectures in yeast. (10)

 

Epigenetics

We have to realize that there are also epigenetics factors in transcription regulatory networks that cannot be elucidated by simply looking at gene expression, ChIP-chip and promoter elements data. For example, as mentioned above, DNA sequences that can be considered as TF motifs may be found at many sites in the genome, but only a fraction of these motifs are actually bound by TF. The mechanism that allows the TFs to selectively bind to these specific motifs is still unknown, but has been proposed to involve nucleosome position and histone modification. A recent study by a group at the Harvard Bauer Center for Genomics Research (now Center for Systems Biology) has further linked nucleosome position to TF binding, showing that most functional TFBSs are "devoid of nucleosomes," making these motifs accessible to TF binding. (14)

 

Figure: Trancription factor motifs that are actually bound by transcription factors (i.e. functional) are mostly found in the linker regions of the chromatin, which are more accessible than the nucleosome regions. (14)

 

Figure: Histone modification can regulate transcription by allowing or disallowing TFs to bind to DNA through acetylation and deacetylation.

 

Network Features

In the previous section, we described how in elucidating transcription regulatory networks, we are interested in figuring out which gene is regulated by which protein, and how. But that is not all! Besides identifying the regulatory factors in transcription and their interactions with DNA, we also need to be able to organize and assemble these information in order to produce the full picture of a transcription regulatory network. Although each transcription regulatory network will be unique depending on its participating genes and proteins, there are some common features in the architecture of transcription regulatory networks. Scientists use these features to organize gene/protein interactions data and to construct models of what the transcription regulatory networks look like. The following sub-sections focus on what scientists have observed in yeast Saccharomyces cerevisiae, but one can imagine that such coordinations and network motifs can be found in other organisms as well, including human.

 

Network motifs

In a transcription regulatory network, there are often recurring patterns of interactions among the regulatory proteins. These regulatory patterns are called “network motifs” (not to be confused with DNA binding motifs). We can imagine them as the basic units that make up the network architecture. For example, consider two genes, A and B. It was found in the cis-regulatory element of gene A that there are binding sites for both the regulatory protein for A and Protein B, the product of Gene B. Indeed, it was observed that Gene A (or rather, the product of gene A) regulates Gene B, whose product further activates Gene A. This mode of interactions between A and B, called a positive feedback loop, can be considered as a network motif, because it is a pattern embedded in a larger circuit-like network, which involves other genes such as C and D that activate or inactivate other cellular factors.

 

Figure: Network motif as a part of the transcription regulatory network. (15)

The frequency that a motif is used by cells also reveals the evolutionary selection for the regulatory strategy. Some common types of network motifs include Autoregulation, Feedforward Loop, Multi-Component Loop, Regulatory Chain, Single-Input Motif and Multi-input Motif, as depicted in the following image and described in the following sub-sections.

 

Figure: Common network motifs in yeast. (12)

Autoregulation

In autoregulation, the regulatory protein binds to its own promoter to induce transcription of more regulatory proteins. The autoregulation motif is found for yeast regulatory proteins ARO80, NRG1, RAP1, RCS1, SMP1, STE12, SUM1, SWI4, YAP6 and ZAP1. (12)

 

Feedforward Loop (FFL)

In a feedforward loop, a regulatory protein regulates a second regulatory protein, and both proteins bind and regulate a common target gene. In yeast, the Young lab has found 39 regulatory proteins being involved in 49 FFLs, controlling about 240 genes. (12)

 

Multi-Component Loop

In a multi-component loop, there are two or more proteins participating in the regulatory circuit. For example, a regulatory protein can bind to the promoter of another regulatory protein, which induce the synthesis of that regulatory protein, which back-regulates the first regulatory protein by controlling its promoter. (12)

 

Regulator Chain

A regulator chain consisnts of a series of three or more regulatory proteins that regulate one another like a chain. (12)

 

Single-input Motif

In a single-input motif, one regulatory factor binds to a set of target genes under specific conditions. (12)

 

Multi-input Motif

On the other hand, in a mulit-input motif, a set of regulatory proteins bind together to a set of target genes with the same binding pattern. (12)

 

Coordination in regulation

As mentioned above, transcription regulation of a gene may depend on more than one regulatory protein. The interactions between TFs are thus as important as the interactions between TFs and promoter for gene regulation. Coordination between regulatory proteins can sometimes be revealed through the study of promoter architecture. In their recent study, Beer and Tavazoie combined gene expression analysis with sequence data in investigating transcription regulation, and found that there are constraints in the locations and orientations of TFBSs that appear together in genes, suggesting coordinations by the TFs in regulating these genes. (16) For example, it was found that two yeast TF motifs, RRPE and PAC, have to both be present and be a certain distance away from the transcription start site in order for the expression pattern of genes to be correlated. In other words, the expression of the gene depends on both RRPE and PAC, i.e gene transcription is "co-regulated" here. Similarly, RAP1, another TF motif, has to be present in a specific orientation with either another copy of itself or motifs M213 or M230 in order for gene expression to be properly co-regulated. (16)

 

Figure: Position constraint in gene expression predicted by gene expression and sequence data. (A),(B) and (C) all show how both RRPE and PAC need to be a certain distance away from transcriptional start site for gene expression to be corelated, or co-regulated. (16)

 

Figure: Orientation and partner constraints in gene expression. (D),(E) and (F) all show how RAP1 has to be in the <- orientation with the presence of M230, M213 or another copy of itself for gene expression to be co-regulated. (16)

 

Summary and Conclusion

The study of transcription regulatory networks is important to the understanding of how cells alter its gene expression in response to environmental signals for growth, development, survival and reproduction. (19) In deciphering transcription regulatory networks, we want to learn what, when and how genes are controlled by regulatory proteins or transcription factors. There are multiple data sources available for investigations: Gene expression data from microarray studies, ChIP-chip data that reveals direct TF-DNA binding and thus the TF binding targets, promoter motifs and architectures, and epigenetic factors. Instead of looking at a single source of data, recent studies have followed a combined approach in using experimental and computational data in elucidating transcription regulation networks in different organisms. The final goal of all these studies is not just simply identify regulatory motifs and factors, but to understand their interactions at a genomic level, using common network architectures such as network motifs and TF co-regulation constraints to construct a global picture of gene regulation for an organism, such as those in images below. To date, the best studied organisms for transcription regulatory networks are probably E.coli and yeast. In fact, two database of regulatory network already exist for E.coli: RegulonDB and EcoCyc. (17) Still, progress is being made in identifying transcription regulatory networks in human; about 2000 TFs have been predicted in human, and a handful of their genomic binding sites have been mapped. (18) The following refereneces include selected journal articles on transcription regulatory network mentioned on this page, particularly, references (4), (13) and (18) are great recent review articles on the topic.

 

Figure: A circuit-like diagram of transcription regulatory network in E.coli, indicating modes of regulations. Can you identify any network motifs? (20)

 

Figure: Network of 106 transcription regulatory factors in yeast, arranged in a circle, binding to genes that encode other transcription regulatory factors, as indicated by the arrows. (12)

 

References

(1) U.S. Department of Energy Genomics:GTL Program, http://genomicsgtl.energy.gov.

(2) Dodd, Ian B., Shearwin, Keith E., and Egan, J. Barry. Revisited gene regulation in bacteriophage lambda. 2005. Current Opinion in Genetics & Development. 15 (2): 145-152.

(3) Covert, Markus W., and Palsson, Bernhard O. Transcriptional Regulation in Constraints-based Metabolic Models of Escherichia coli. 2002. J. Biol. Chem. 277(31): 28058-28064.

(4) Wyrick, John J., and Young, Richard A. Deciphering gene expression regulatory networks. 2002. Current Opinion in Genetics & Development. 12:130-136.

(5) Alter, O., Brown, P.O., and Botstein, D. Singular value decompositioiin for genome-wide expression data processing and modeling. 2000. PNAS. 97:10101-6.

(6) Holter, N.S., Maritan, A., Cieplak, M., Fedoroff, N.V., and Banavar, J.R. Dynamic modeling of gene expression data. 2001. PNAS. 98:1693-98.

(7) Holter, N.S., Mitra, M., Maritan, A., Cieplak, M., Banavar, J.R. and Federoff, N.V. Fundamental patterns underlying gene expression profiles: simplicity from complexity. 2000. PNAS. 97: 8409-8414.

(8) Gifford, D.K. Blazing pathways through genetic mountains. 2001. Science. 293:2049-51.

(9) Friedman, N., Linial, M., Nachman, I., and Pe'er, D. Using Bayesian networks to analyze expression data. 2000. Journal of Computational Biology. 7:601-620.

(10) Harbison, C.T., Gordon, D.B., Lee, T.I., Rinaldi, N.J., Macisaac, K.D., Danford, T.W., Hannett, N.M., Tagne, J.B., reynolds, D.B., Yoo, J., Jennings, E.G., Zietlinger, J., Pokholok, D.K., Kellis, M., Rolfe, P.A., Takusagawa, K.T., Lander, E.S., Gifford, D.K., Fraenkel, E., and Young, R.A. Transcriptional regulatory code of a eukaryotic genome. 2004. Nature 431:99.

(11) Lieb, J.D., Liu, X., Botstein, D., and Brown, P.O. Promoter-specific binding of Rap1 revealed by genome-wide maps of protein-DNA association. 2001. Nature Genetics. 28:327-334.

(12) Lee, T.I., Rinaldi, N.J., Robert, F., Odom, D.T., Bar-Joseph, Z., Gerber, G.K., Hannett, N.M., Harbison, C.T., Thompson, C.M., Simon, I., Zeitlinger, J., Jennings, E.G., Murray, H.L., Gordon, D.B., Ren, B., Wyric, J.J., Tagne, J., Volkert, T.L., Fraenkel, E., Gifford, D.K., and Young, R.A. Transcriptional Regulatory Networks in Saccharomyces cerevisiae. 2002. Science. 298:799-804. Supported by website: http://web.wi.mit.edu/young/regulator_network/

(13) Chua, G., Robinson, M.D., Morris, Q., and Hughes, T.R. Transcriptional networks: reverse-engineering gene regulation on a global scale. 2004. Current Opinion in Microbiology. 7:638-646.

(14) Yuan, G.C., Liu, Y.J., Dion, M.F., Slack, M.D., Wu, L.F., Altschuler, S.J., and Rando, O.J. Genome-Scale Identification of Nucleosome Positions in S. cerevisiae. 2005. Science. 309: 626-630.

(15) de-Leon, Smadar, B., and Davidson, E.H. Deciphering the underlying mechanism of specification and differentiation: The sea urchin gene regulatory network. 2006. Science STKE. 2006(361): pe47.

(16) Beer, Michael A., and Tavazoie, Saeed. Predicting gene expression from sequence. 2004. Cell. 117(2):185-198.

(17) Salgado, H., Santos-Zavaleta, A., Gama-Castro, S., Peralta-Gil, M., Penaloza-Spinola, M.I., Martinez-Antonio, A., Karp, P., and Collado-Vides, J. The comprehensive updated regulatory network of Escherichia coli K-12. 2006. BMC Bioinformatics. 7:5.

(18) Hawkins, R.D., and Ren, B. Genome-wide location analysis: insights on transcriptional regulation. 2006. Human Molecular Genetics. 15:R1-R7.

(19) Balazsi, G., and Oltvaii, Z.N. Sensing your surroundings: How transcription-regulatory networks of the cell discern environmental signals. 2005. Science STKE. 2005(282): pe20.

(20) Babu, M.M., and Teichmann, S.A. Evolution of transcription factors and the gene regulatory netowrk in Escherichia coli. 2003. Nucleic Acids Research. 31(4):1234-1244.

 

Further Information

 

  • Recommended readings: References (4), (13) and (18) above are great recent review articles on Transcription Regulatory Networks. Reference (12) is a seminal research paper on transcription regulatory network in yeast, supported by the website at http://web.wi.mit.edu/young/regulator_network/ and followed up by the Harbison et al paper in 2004 (Reference 10). Reference (16) is a very interesting paper explicitly mentioned in Prof. Liu's BIO280 lecture. Reference (19) is also an interesting, short and easy-to-read review article.

 

  • Professor Liu's BIO280 lecture notes on Transcription Regulatory Networks can be found here.

 

  • Some Laboratories studying Transcription Regulatory Networks:

 

Comments (0)

You don't have permission to comment on this page.