ABSTRACT
Background. Transcriptional regulation in eukaryotes is often multifactorial, involving multiple transcription factors binding to the same transcription control region (e.g., upstream activating sequences and enhancers), and to understand the regulatory content of eukaryotic genomes it is necessary to consider the co-occurrence and spatial relationships of individual binding sites. The identification of sequences conserved among related species (often known as phylogenetic footprinting) has been successfully used to identify individual transcription factor binding sites. Here, we extend this concept of functional conservation to higher-order features of transcription control regions involved in the multifactorial control of gene expression.Results. We used the genome sequences of four yeast species of the genus Saccharomyces to identify sequences potentially involved in multifactorial control of gene expression. We found 1,117 potential regulatory "templates": pairs of hexameric sequences that are jointly conserved in transcription regulatory regions and also exhibit non-random relative spacing. Many of the individual sequences in these templates correspond to known transcription factor binding sites, and the sets of genes containing a particular template in their transcription control regions tend to be differentially expressed in conditions where the corresponding transcription factors are known to be active.Conclusions. The incorporation of both joint conservation and spacing constraints of sequence pairs predicts groups of target genes that were specific for common patterns of gene expression. Our work suggests that positional information, especially the relative spacing between transcription factor binding sites, may represent a common organizing principle of transcription control regions.
- Composite regulatory elements: structure, function and classification. {http://www.gene-regulation.com/pub/databases/transcompel/compel.html}Google Scholar
- Wolberger C: Multiprotein-DNA complexes in transcriptional regulation. Annu Rev Biophys Biomol Struct 1999, 28:29--56.Google Scholar
- Kel OV, Romaschenko AG, Kel AE, Wingender E, Kolchanov NA: A compilation of composite regulatory elements affecting gene-transcription in vertebrates. Nucleic Acids Res 1995, 23:4097--4103.Google Scholar
- Gasch AP: The environmental stress response: a common yeast response to diverse environmental stresses. In: Yeast Stress Responses Edited by S Hohmann, WH Mager, vol. 1. pp. 11--70. Berlin: Springer; 2003: 11--70.Google Scholar
- Mead J, Bruning AR, Gill MK, Steiner AM, Acton TB, Vershon AK: Interactions of the Mcm1 MADS box protein with cofactors that regulate mating in yeast. Mol Cell Biol 2002, 22:4607--4621.Google Scholar
- Bhoite LT, Allen JM, Garcia E, Thomas LR, Gregory ID, Voth WP, Whelihan K, Rolfes RJ, Stillman DJ: Mutations in the Pho2 (Bas2) transcription factor that differentially affect activation with its partner proteins Bas1, Pho4, and Swi5. J Biol Chem 2002, 277:37612--37618.Google Scholar
- Verger A, Duterque-Coquillaud M: When Ets transcription factors meet their partners. Bioessays 2002, 24:362--370.Google Scholar
- Ambrosetti DC, Basilico C, Dailey L: Synergistic activation of the fibroblast growth factor 4 enhancer by Sox2 and Oct-3 depends on protein-protein interactions facilitated by a specific spatial arrangement of factor binding sites. Mol Cell Biol 1997, 17:6321--6329.Google Scholar
- Ludwig MZ, Patel NH, Kreitman M: Functional analysis of eve stripe 2 enhancer evolution in Drosophila: rules governing conservation and change. Development 1998, 125:949--958.Google Scholar
- Liu Z, Little JW: The spacing between binding sites controls the mode of cooperative DNA-protein interactions: implications for evolution of regulatory circuitry. J Mol Biol 1998, 278:331--338.Google Scholar
- Stormo GD: DNA binding sites: representation and discovery. Bioinformatics 2000, 16:16--23.Google Scholar
- Duret L, Bucher P: Searching for regulatory elements in human noncoding sequences. Curr Opin Struct Biol 1997, 7:399--406.Google Scholar
- Pennacchio LA, Rubin EM: Genomic strategies to identify mammalian regulatory sequences. Nat Rev Genet 2001, 2:100--109.Google Scholar
- Blanchette M, Tompa M: Discovery of regulatory elements by a computational method for phylogenetic footprinting. Genome Res. 2002, 12:739--748.Google ScholarCross Ref
- Loots GG, Ovcharenko I, Pachter L, Dubchak I, Rubin EM: rVista for comparative sequence-based discovery of functional transcription factor binding sites. Genome Res 2002, 12:832--839.Google Scholar
- Schwartz S, Zhang Z, Frazer KA, Smit A, Riemer C, Bouck J, Gibbs R, Hardison R, Miller W: PipMaker--a web server for aligning two genomic DNA sequences. Genome Res 2000, 10:577--586.Google Scholar
- Pilpel Y, Sudarsanam P, Church GM: Identifying regulatory networks by combinatorial analysis of promoter elements. Nat. Genet. 2001, 29:153--159.Google ScholarCross Ref
- Bussemaker HJ, Li H, Siggia ED: Regulatory element detection using correlation with expression. Nat. Genet. 2001, 27:167--171.Google ScholarCross Ref
- Keles S, van der Laan M, Eisen MB: Identification of regulatory elements using a feature selection method. Bioinformatics 2002, 18:1167--1175.Google Scholar
- Wang W, Cherry JM, Botstein D, Li H: A systematic approach to reconstructing transcription networks in Saccharomyces cerevisiae. Proc Natl Acad Sci USA 2002, 99:16893--16898.Google Scholar
- Wagner A: Genes regulated cooperatively by one or more transcription factors and their identification in whole eukaryotic genomes. Bioinformatics 1999, 15:776--784.Google Scholar
- Klingenhoff A, Frech K, Quandt K, Werner T: Functional promoter modules can be detected by formal models independent of overall nucleotide sequence similarity. Bioinformatics 1999, 15:180--186.Google Scholar
- Pavlidis P, Furey TS, Liberto M, Haussler D, Grundy WN: Promoter region-based classification of genes. Proceedings of the Pacific Symposium on Biocomputing 2001, 6:151--164.Google Scholar
- Kel-Margoulis OV, Ivanova TG, Wingender E, Kel AE: Automatic annotation of genomic regulatory sequences by searching for composite clusters. Pac Symp Biocomput 2002, 7:187--198.Google Scholar
- Kamvysselis M, Patterson N, Birren B, Berger B, Lander ES: Whole-genome comparative annotation and regulatory motif discovery in multiple yeast species. Proceedings of the 7th International Conference on Research in Computational Molecular Biology 2003, 7. Google ScholarDigital Library
- Chiang DY, Brown PO, Eisen MB: Visualizing associations between genome sequences and gene expression data using genome-mean expression profiles. Bioinformatics 2001, 17 Suppl 1:S49--S55.Google Scholar
- DeRisi JL, Iyer VR, Brown PO: Exploring the metabolic and genetic control of gene expression on a genomic scale. Science 1997, 278:680--686.Google Scholar
- Gasch AP, Spellman PT, Kao CM, Carmel-Harel O, Eisen MB, Storz G, Botstein D, Brown PO: Genomic expression programs in the response of yeast cells to environmental changes. Mol Biol Cell 2000, 11:4241--4257.Google Scholar
- Spellman PT, Sherlock G, Zhang MQ, Iyer VR, Anders K, Eisen MB, Brown PO, Botstein D, Futcher B: Comprehensive identification of cell cycle-regulated genes of the yeast Saccharomyces cerevisiae by microarray hybridization. Mol Biol Cell 1998, 9:3273--3297.Google Scholar
- Cho RJ, Campbell MJ, Winzeler EA, Steinmetz L, Conway A, Wodicka L, Wolfsberg TG, Gabrielian AE, Landsman D, Lockhart DJ, et al: A genome-wide transcriptional analysis of the mitotic cell cycle. Mol. Cell. 1998, 2:65--73.Google ScholarCross Ref
- Gasch AP, Huang M, Metzner S, Botstein D, Elledge SJ, Brown PO: Genomic expression responses to DNA-damaging agents and the regulatory role of the yeast ATR homolog Mec1p. Mol Biol Cell 2001, 12:2987--3003.Google Scholar
- Lee SE, Pellecioli A, Demeter J, Vaze MP, Gasch AP, Malkova A, Brown PO, Botstein D, Stearns T, Foiani M, et al: In: Biological Responses to DNA Damage, vol. 65. pp. 303-314. Cold Spring Harbor, NY: Cold Spring Harbor Laboratory Press; 2000: 303--314.Google Scholar
- Ogawa N, DeRisi J, Brown PO: New components of a system for phosphate accumulation and polyphosphate metabolism in Saccharomyces cerevisiae revealed by genomic expression analysis. Mol. Biol. Cell 2000, 11:4309--4321.Google Scholar
- Hughes TR, Marton MJ, Jones AR, Roberts CJ, Stoughton R, Armour CD, Bennett HA, Coffey E, Dai HY, He YDD, et al: Functional discovery via a compendium of expression profiles. Cell 2000, 102:109--126.Google Scholar
- Zhu J, Zhang MQ: SCPD: a promoter database of the yeast Saccharomyces cerevisiae. Bioinformatics 1999, 15:607--611.Google Scholar
- Ho Y, Costanzo M, Moore L, Kobayashi R, Andrews BJ: Regulation of transcription at the Saccharomyces cerevisiae start transition by Stb1, a Swi6-binding protein. Mol Cell Biol 1999, 19:5267--5278.Google Scholar
- Lee TI, Rinaldi NJ, Robert F, Odom DT, Bar-Joseph Z, Gerber GK, Hannett NM, Harbison CT, Thompson CM, Simon I, et al: Transcriptional regulatory networks in Saccharomyces cerevisiae. Science 2002, 298:799--804.Google Scholar
- Gancedo JM: Yeast carbon catabolite repression. Microbiol Mol Biol Rev 1998, 62:334--361.Google Scholar
- Kwast KE, Burke PV, Poyton RO: Oxygen sensing and the transcriptional regulation of oxygen-responsive genes in yeast. J Exp Biol 1998, 201:1177--1195.Google Scholar
- Vik A, Rine J: Upc2p and Ecm22p, dual regulators of sterol biosynthesis in Saccharomyces cerevisiae. Mol Cell Biol 2001, 21:6395--6405.Google Scholar
- Blaiseau PL, Thomas D: Multiple transcriptional activation complexes tether the yeast activator Met4 to DNA. EMBO J 1998, 17:6327--6336.Google Scholar
- Guarente L, Lalonde B, Gifford P, Alani E: Distinctly regulated tandem upstream activation sites mediate catabolite repression of the CYC1 gene of S. cerevisiae. Cell 1984, 36:503--511.Google Scholar
- Burke PV, Poyton RO: Structure/function of oxygen-regulated isoforms in cytochrome c oxidase. J Exp Biol 1998, 201:1163--1175.Google Scholar
- Lees ND, Skaggs B, Kirsch DR, Bard M: Cloning of the late genes in the ergosterol biosynthetic pathway of Saccharomyces cerevisiae--a review. Lipids 1995, 30:221--226.Google Scholar
- Liao GC, Rehm EJ, Rubin GM: Insertion site preferences of the P transposable element in Drosophila melanogaster. Proc Natl Acad Sci USA 2000, 97:3347--3351.Google Scholar
- Berman BP, Nibu Y, Pfeiffer BD, Tomancek P, Celniker SE, Levine M, Rubin GM, Eisen MB: Exploiting transcription factor binding site clustering to identify cis-regulatory modules involved in pattern formation in the Drosophila genome. Proc. Natl. Acad. Sci. USA 2002, 99:757--762.Google Scholar
- Cliften PF, Hillier LW, Fulton L, Graves T, Miner T, Gish WR, Waterston RH, Johnston M: Surveying Saccharomyces genomes to identify functional elements by comparative DNA sequence analysis. Genome Res. 2001, 11:1175--1186.Google ScholarCross Ref
- Saccharomyces Genome Database. {http://genome-www.stanford.edu/Saccharomyces/}Google Scholar
- Sherlock G, Hernandez-Boussard T, Kasarskis A, Binkley G, Matese JC, Dwight SS, Kaloper M, Weng S, Jin H, Ball CA, et al: The Stanford Microarray Database. Nucleic Acids Res 2001, 29:152--155.Google Scholar
- Benjamini Y, Hochberg Y: Controlling the false discovery rate: a practical and powerful approach to multiple testing. J. R. Statist. Soc. B 1995, 57:289--300.Google Scholar
- Press WH, Teukolsky SA, Vertterling WT, Flannery BP: Numerical Recipes in C, Second Edition. Cambridge: Cambridge University Press; 1992. Google ScholarDigital Library
Index Terms
- Phylogenetically and spatially conserved word pairs associated with gene expression changes in yeasts
Recommendations
Pathway to functional studies: pipeline linking phylogenetic footprinting and transcription-factor binding analysis
WISB '06: Proceedings of the 2006 workshop on Intelligent systems for bioinformatics - Volume 73Identification of transcription-factor binding sites is a critical first step in studying transcriptional regulation of genes. The comparative genomics method of phylogenetic footprinting is based on identifying sequence elements that are conserved ...
Identifying conserved gene clusters in the presence of orthologous groups
RECOMB '04: Proceedings of the eighth annual international conference on Research in computational molecular biologyCurrent biological evidence suggests a correlation between the function and the position of genes in chromosomes. Examples include operon structure in prokaryotic genomes and similar expression patterns of neighboring genes in some eukaryotic genomes. ...
Conserved orthology in mitochondrial genomes of distantly related nematodes
ISB '10: Proceedings of the International Symposium on BiocomputingIdentification of orthologous segments plays a very important role in comparative genomics studies. In the present study, we have identified orthologous segments shared between Radopholus similis and 15 other nematodes. Complete genomes of 16 nematodes ...
Comments