|
|
||||||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
TOPICAL REVIEW |
1 Genome Sciences Department, Lawrence Berkeley National Laboratory, One Cyclotron Road, Berkeley, CA 94720, USA2 US Department of Energy Joint Genome Institute, 2800 Mitchell Drive, Walnut Creek, CA 94598, USA
| Abstract |
|---|
|
|
|---|
(Received 8 July 2003;
accepted after revision 9 September 2003;
first published online 12 September 2003)
Corresponding author L. A. Pennacchio: Genome Sciences Department, Lawrence Berkeley National Laboratory, One Cyclotron Road, Berkely, CA 94720, USA. Email: lapennacchio{at}lbl.gov
| Introduction |
|---|
|
|
|---|
Genomics is the most recent branch of biology to employ comparison-based strategies. At the foundation of the evolutionary relationship of all vertebrates is conserved genetic information in the form of DNA sequence, which is assumed to underlie homologous functional and anatomical similarities between species. Technological progress in DNA cloning and sequencing has resulted in the generation of a large dataset of genomic sequence information. In the past two years, draft genome sequence has become available for six vertebrates: human, mouse, rat, zebrafish and two pufferfish (Fugu rubripes and Tetraodon nigroviridis) (Lander et al. 2001; Venter et al. 2001; Aparicio et al. 2002; Waterston et al. 2002). The sudden wealth of sequence data has allowed whole genome alignments to compare and contrast the evolution and content of vertebrate genomes. Such comparative strategies have identified pockets of DNA sequences conserved over evolutionary time, and such evolutionary conservation has been a powerful guide in sorting functional from non-functional DNA (Duret & Bucher, 1997; Hardison et al. 1997; Hardison, 2000; Loots et al. 2000; Pennacchio & Rubin, 2001; Gottgens et al. 2002). Accordingly, this review focuses on the biological insights derived from comparative sequence-based studies and their increasing utility as the amount of genome sequence data increases. Details on various computational tools used in these studies can be found in several recent reviews (Pennacchio & Rubin, 2001; Frazer et al. 2003; Pennacchio & Rubin, 2003b; Ureta-Vidal et al. 2003). A list of the most commonly used tools in comparative genomics is provided in Table 1.
|
| The power of varying evolutionary distance in comparative genomics |
|---|
|
|
|---|
In recent years, the availability of sequence from numerous species has allowed multiple species comparisons to aid in calibrating the ideal evolutionary distance required for the optimal identification of functionally conserved sequences (Koop & Hood, 1994; Hood et al. 1995; Dubchak et al. 2000; Pennacchio & Rubin, 2001; Gottgens et al. 2002). In this review we adopt a human-centric focus of comparative genomics, describing strategies where sequence-based analyses alone have been used to enable better understanding of functional sequences in the human genome. Examples are provided of the most commonly used vertebrate genomes in cross-species sequence comparisons (Fig. 1), highlighting the uniqueness, usefulness and limitations of each.
|
The evolutionary distance between humans and mice places these species at strategic positions for the identification of shared functionally conserved sequences. It has been estimated that the rate of divergence in independently evolving vertebrate genomes is on average 0.10.5% per million years, supporting the premise that the
80 million years separating humans and mice from their last common ancestor is sufficient for functionally important sequences to be identified (Tautz, 2000). A number of recent studies have reported the identification of functional sequences solely through the use of humanmouse genomic comparisons, thereby further validating this assumption (Loots et al. 2000; Pennacchio et al. 2001; Gottgens et al. 2002; Kappen & Yaworsky, 2003). The most standard applications of humanmouse comparative sequence analyses involve: (1) the annotation of previously undefined genes; (2) the identification of large (801000 bp) functional gene-regulatory elements; and (3) the detailed characterization of transcription factor binding sites (phylogenetic footprints) present in larger conserved non-coding regions.
Identification of new genes. The first application of humanmouse comparative genomics relates to the discovery of new genes within the human and mouse genomes, which have previously been invisible to extensive computational and experimental investigation. Since coding sequences of active genes are commonly under strong negative selection, humanmouse sequence comparisons are expected to unveil sequences corresponding to previously unidentified genes, thus expanding the complete gene catalogue of each organism. The discovery of the apolipoprotein A5 gene (APOA5) exemplifies this principle. Solely through the use of humanmouse genomic sequence comparisons, this evolutionary paralog (a related gene or sequence arisen from the duplication of an ancestral gene or sequence) of the neighbouring APOA4 gene was identified based on its high degree of sequence conservation within a previously well-studied cluster of apolipoproteins (Pennacchio et al. 2001). Transcripts from this corresponding interval were identified in human and mouse liver tissue, serving as evidence that these conserved sequences correspond to a previously missed gene. Further studies in transgenic and knockout mice revealed that the newly described APOA5 gene is a pivotal determinant of plasma triglyceride levels (Pennacchio et al. 2001). In addition, these findings were extended to human physiology when strong genetic associations between common APOA5 polymorphisms and plasma triglyceride levels were uncovered in a wide range of studies (reviewed by Pennacchio & Rubin, 2003a). Similar strategies will be useful in identifying un-annotated genes that are still predicted to exist in the human and mouse genomes.
Identification of gene regulatory sequences.
While it is intuitive that comparative sequence analysis is suitable to identify exons based on conservation, its ability to uncover conserved gene regulatory sequences is less obvious owing to the small size of transcription factor binding sites (
612 bp in size). Nevertheless, the architecture of the majority of characterized enhancers in metazoan genomes is thought to be determined by a combination of multiple transcription factor binding sites, arranged in a modular fashion within large clusters. Thus, the size of these enhancer elements is expected to be similar to many exons. An early study of humanmouse comparative sequence analysis as a starting point to identify gene regulatory elements was performed on a human interleukin gene cluster, which has long been known to harbour genes involved in several human inflammatory conditions (Noguchi et al. 1997; Rioux et al. 2001). In this work, humanmouse comparisons revealed a highly conserved 401 bp non-coding sequence within a genomic interval containing the interleukin-4, -5 and -13 genes (Loots et al. 2000). Subsequent deletion of this conserved non-coding sequence from mice revealed inappropriate expression of all three interleukins upon TH2 cytokine stimulation (Loots et al. 2000; Mohrs et al. 2001), thus demonstrating that the 401 bp conserved element corresponds to a regulatory element able to coordinately modulate the expression of three interleukin genes spread over 120 kb of sequence. This coordinated expression of interleukins had been previously proposed, but several studies using traditional approaches failed to uncover the sequence so clearly revealed by a comparative approach (Noguchi et al. 1997; Lacy et al. 2000).
Humanmouse sequence comparisons are thus expected to represent a powerful tool in the puzzle of the decoding of gene-regulatory sequence. A paucity of published studies reporting the identification of functional gene regulatory sequences through traditional approaches highlights the difficulty in defining functional non-coding sequences. With the availability of large amounts of humanmouse genomic sequence, cross-species comparisons are poised to dramatically increase our ability to decipher non-coding DNA. Nevertheless, a handful of characterized enhancers, originally identified through labourious experimental strategies, have been retrospectively shown to be highly conserved between human and mouse. Several studies using standard enhancer-trapping strategies identified and characterized three regulatory sequences within a segment 1.53.0 kb upstream of the human pancreatic duodenal homeobox 1 (PDX-1) gene promoter (Sharma et al. 1996; Ben-Shushan et al. 2001). As shown in Fig. 2, both exons of the PDX-1 gene as well as several non-coding sequences are well conserved between humans and mice in this interval. Inspection of the sequence upstream of PDX-1 shows three distinct segments of sequence conservation located approximately 1.62.8 kb upstream of the promoter, corresponding to the three enhancers previously shown to regulate PDX-1 expression. These sequences, easily highlighted by direct genomic comparisons, would probably have been prioritized for characterization of biological function based solely on a comparative strategy. Similar strategies will probably identify many human gene regulatory elements in the genome.
|
Though a wealth of important examples may reinforce the notion that humans and mice occupy a privileged position for cross-species sequence comparisons, they alone cannot capture all biologically active sequences. First, it has been well established that the degree of sequence conservation is heterogeneous among different genomic segments in human and mouse. For instance, the T-cell receptor locus has been shown to be extremely conserved in humanmouse sequence comparisons (Koop & Hood, 1994), while the
-globin locus has been found to be highly divergent (Hardison et al. 1991). Such interspecies variation is due to wide-ranging differences in the humanmouse nucleotide substitution rates across the genome. The result is a set of genomic regions with vast amounts of conservation (though probably not functional), and a set lacking significant conservation (though still containing functional elements). Such an observation carries significant implications for cross-species sequence comparisons since this strategy assumes that natural selection has constrained functional sequences to evolve at slower rates than non-functional sequences. In practice, humanmouse comparisons are not always feasible for deriving biological insights for a given genomic region.
To study regions of the human genome where humanmouse sequence comparisons are not ideal, examination of species occupying different evolutionary distances may be useful. In regions that are too well conserved between human and mouse, the comparison of humans to more distantly related species is warranted (i.e. birds, reptiles, amphibians), while in regions that are poorly conserved between humans and mice, the comparison of humans to closer species can be beneficial (i.e. primates, dogs, rabbits). In the remainder of this review, we will describe the utility of human genomic comparisons to species other than mouse.
Humanchicken sequence comparisons
The shared ancestor that gave rise to birds and mammals existed approximately 300 million years ago during the vertebrate radiation (Kumar & Hedges, 1998), placing the distance between humans and chickens at approximately 34 times that of humans and mice (Fig. 1). A deeper phylogenetic relationship suggests that most neutrally evolving sequences in humans and birds will have diverged significantly more than those between humans and mice. In general, conserved DNA between humans and birds is more likely to be functional than that found between humans and mice. While no entire avian genome sequence is currently available, small genomic fragments of chicken DNA have been sequenced for comparative studies.
As an example, the identification and characterization of a human cardiac-specific enhancer regulating the homeobox gene Nkx2-5 was aided by the addition of orthologous chicken sequence (i.e. related sequences in chicken that started to diverge after a speciation event). Initial examination of the region 10 kb upstream of Nkx2-5 between humans and mice revealed five conserved non-coding sequences, but the addition of the orthologous chicken sequence revealed that only one of these five was also conserved in chickens. Functional studies in transgenic mice confirmed that this segment corresponds to a cardiac-specific enhancer regulating Nkx2-5 expression (Lien et al. 2002). Further dissection of this enhancer through phylogenetic footprinting revealed the precise transcription factor binding sites responsible for the enhancer activity, aided by having the chicken genomic sequence. While humanmouse enhancer sequence comparisons revealed between 90 and 100% identity throughout the segment, making the identification of conserved footprints difficult, humanmousechicken enhancer sequence comparison decreased the overall conservation in the region to 70% and revealed that four Smad binding sites were conserved in all three species. A combination of mouse transgenics and mutagenesis later confirmed that one of the conserved Smad sites mediates the enhancer activation of Nkx2-5 in the developing heart (Lien et al. 2002).
An interesting observation from these data is that genome sequences obtained from organisms with little or no use as model organisms for experimental biology represent extremely important resources for annotating the human genome. This underscores the importance of prioritizing the choices for sequencing further vertebrate genomes based not simply on a hierarchical list of experimentally suitable models, but also on a composite of factors that take into account the potential uses of the data generated for applications such as comparative genomics. Indeed, while the chicken, honeybee and chimpanzee are not standard experimental models, their genomes have been prioritized for the next round of DNA sequencing (Boguski, 2002).
Humanfish sequence comparisons
Humanfish comparisons also provide a useful evolutionary position for comparative sequence-based discovery. Several species of fish have been fully sequenced, which include working drafts for zebrafish, and the two pufferfish, Fugu rubripes and Tetraodon nigroviridis (Aparicio et al. 2002). The phylogenetic relationship between fish and humans dates back 400450 million years, making fish the most distant vertebrates with available genomic sequence for comparison with humans (Fig. 1). Although this large evolutionary distance implies that only a fraction of the functional sequences in the human genome are still shared, comparison has revealed that most known human genes are also conserved in fish. Importantly, the annotation of conserved sequences between the human and Fugu rubripes genomes led to the rapid identification of over 1000 previously unidentified human genes (Abrahams et al. 2002; Aparicio et al. 2002). While the majority of conserved orthologous sequences between human and pufferfish represent coding sequences, thousands of conserved sequences that do not appear to correspond to genes are also present. This suggests that humanpufferfish genomic comparisons may result in the discovery of functionally important non-coding sequences in the human genome. However, these comparisons are likely to miss gene regulatory functions that are no longer preserved across such large evolutionary distances.
One of the most attractive features of pufferfish for its use in cross-species sequence comparisons is the compact size of its genome, totaling a mere 365 million bp (one-eighth the size of the human genome; Brenner et al. 1993). This compactness predicts that regulatory sequences shared between humans and pufferfish will be found much closer to a given pufferfish gene than its human ortholog, so humanpufferfish comparisons may identify distant regulatory elements in the human genome (Gilligan et al. 2002). A recent comparison of a 3.7 million bp sequence from humans with pufferfish identified 195 kb of sequence with orthology, revealing several genes in the region shared between the two species (Bagheri-Fam et al. 2001). Moreover, eight conserved sequences which were not predicted to be exons were identified within 750 kb of the human SOX9 transcription factor gene. In the pufferfish genome these conserved sequences are located within less than 80 kb of SOX9, suggesting that these may represent distant sequences that regulate SOX9 expression (Bagheri-Fam et al. 2001). Thus, the use of Fugu rubripes sequences in genomic comparisons may be a useful tool for the identification of both local and distant regulatory elements. However, given the extreme evolutionary distance between fish and mammals, and the extensive biological differences that separate these species, it is likely that these conserved sequences represent only a subset of the functional elements in the human genome. Since many aspects of embryologic development are extremely well conserved in all vertebrates, it is possible that the catalogue of conserved sequences between mammals and fish will be enriched for elements regulating the expression of genes involved in these developmental processes, and represent sequences whose biological impact in the organism is the most dramatic.
Interprimate sequence comparisons
Finally, comparison of the human sequence to that of other primate species is a strategy likely to identify functional regions of the human genome. The overall strategy previously described for cross-species sequence comparisons is based on using species of relatively distant phylogenetic positions to maximize the identification of functionally conserved sequences in the human genome. However, this strategy is limited in that it does not allow studies aimed at identifying primate-specific genes or regulatory sequences. For instance, the comparison of the human and mouse genomes identified
1% of mouse genes without a human ortholog (Waterston et al. 2002). In addition, this estimate does not take into account the numerous examples where tandem duplications lead to the formation and expansion of gene families in one species but not the other. To this end, only 80% of humanmouse genes have a 1:1 orthologous relationship (Waterston et al. 2002). Therefore, there is a need to develop strategies to characterize the catalogue of the 20% of genes and regulatory elements that do not have a true ortholog in both humans and mice. For these studies, comparing human sequences to those of closer evolutionary species, such as primates, may prove essential. However, the use of primate sequences for cross-species sequence comparisons poses a paradox, in that while primates are likely to share most genes present in the human genome, their close phylogenetic relationship results in high levels of sequence identity between orthologous sequences. For example, humans, chimpanzees and gorillas shared a common ancestor approximately 6.08.0 million years ago and their average rate of sequence conservation is 9899% even in non-coding intervals (Hacia, 2001).
Recently, a strategy named phylogenetic shadowing was introduced to overcome the excessive sequence identity shared by primates, which makes their use in cross-species sequence comparisons possible (Boffelli et al. 2003). The foundation of this approach is to analyse orthologous sequence from numerous primate species to increase the evolutionary distance of the sequence comparisons. Rather than performing only pairwise comparisons between human and mouse, human and chicken, or human and pufferfish, phylogenetic shadowing compares a dozen or more different primate species. The summation of these primate comparisons robustly identifies regions of increased variation and shadows representing conserved segments (Fig. 3A).
|
| Conclusions and future perspective |
|---|
|
|
|---|
The number of cross-species sequence comparisons will undoubtedly increase in use as additional genomes are sequenced. While we currently have access to a handful of vertebrate genome sequences and our tools for dealing with these datasets are rapidly improving, the computational challenges ahead are formidable. Current efforts have focused primarily on pairwise comparisons to annotate and explore a single species of interest (such as humans), but future methods will require the simultaneous analysis of sequence data from numerous species in the form of multiple alignments in order to catalogue the evolutionary extent of sequence conservation and divergence. In addition, the area of high-throughput experimental biology is a quickly evolving field with vast opportunities to exploit comparative sequence data.
For the biologist, the application of cross-species sequence analyses requires flexibility. It should be emphasized that no single pairwise comparison is sufficient to capture all biologically functional sequences based on conservation. Thus, the primary decision in the process of designing a comparative genomic-based study is the biological question under investigation and which two (or more) species are most appropriate for comparison. While there is no clear way to predict which repertoire of species is ideally suited for each cross-species comparison, the analysis of aligned humanmouse orthologous sequences provides an initial starting point for most biological studies. However, for example, if the study aims to identify regulatory elements of a primate-specific gene, it will not be useful to compare human-mouse or other lower vertebrates. In contrast, the study of basic vertebrate biological processes may be aided by distant species sequence comparisons (human-bird, human-amphibian, human-fish, etc.). Therefore, the biologist must make logical predictions about which species to compare and should readily adopt additional species as warranted based on their initial comparative analysis. The currently available vertebrate genome sequences are immediate resources for the community and additional vertebrate genomes are in the pipeline.
| References |
|---|
|
|
|---|
Aparicio
S, Chapman
J, Stupka
E, Putnam
N, Chia
JM, Dehal
P, Christoffels
A, Rash
S, Hoon
S, Smit
A
et al. (2002). Whole-genome shotgun assembly and analysis of the genome of Fugu rubripes. Science
297, 13011310.
Bagheri-Fam S, Ferraz C, Demaille J, Scherer G & Pfeifer D (2001). Comparative genomics of the SOX9 region in human and Fugu rubripes: conservation of short regulatory sequence elements within large intergenic regions. Genomics 78, 7382.[CrossRef][Medline]
Ben-Shushan
E, Marshak
S, Shoshkes
M, Cerasi
E
&
Melloul
D (2001). A pancreatic beta-cell-specific enhancer in the human PDX-1 gene is regulated by hepatocyte nuclear factor 3beta (HNF-3beta), HNF-1alpha, and SPs transcription factors. J Biol Chem
276, 1753317540.
Boffelli
D, Mcauliffe
J, Ovcharenko
D, Lewis
KD, Ovcharenko
I, Pachter
L
&
Rubin
EM (2003). Phylogenetic shadowing of primate sequences to find functional regions of the human genome. Science
299, 13911394.
Boguski MS (2002). Comparative genomics: the mouse that roared. Nature 420, 515516.[CrossRef][Medline]
Brenner S, Elgar G, Sandford R, Macrae A, Venkatesh B & Aparicio S (1993). Characterization of the pufferfish (Fugu) genome as a compact model vertebrate genome. Nature 366, 265268.[CrossRef][Medline]
Dubchak
I, Brudno
M, Loots
GG, Pachter
L, Mayor
C, Rubin
EM
&
Frazer
KA (2000). Active conservation of noncoding sequences revealed by three-way species comparisons. Genome Res
10, 13041306.
Duret L & Bucher P (1997). Searching for regulatory elements in human noncoding sequences. Curr Opin Struct Biol 7, 399406.[CrossRef][Medline]
Frazer
KA, Elnitski
L, Church
DM, Dubchak
I
&
Hardison
RC (2003). Cross-species sequence comparisons: a review of methods and available resources. Genome Res
13, 112.
Gilligan P, Brenner S & Venkatesh B (2002). Fugu and human sequence comparison identifies novel human genes and conserved non-coding sequences. Gene 294, 35.[CrossRef][Medline]
Gottgens
B, Barton
LM, Chapman
MA, Sinclair
AM, Knudsen
B, Grafham
D, Gilbert
JG, Rogers
J, Bentley
DR
&
Green
AR (2002). Transcriptional regulation of the stem cell leukemia gene (SCL) comparative analysis of five vertebrate SCL loci. Genome Res
12, 749759.
Gumucio DL, Shelton DA, Zhu W, Millinoff D, Gray T, Bock JH, Slightom JL & Goodman M (1996). Evolutionary strategies for the elucidation of cis and trans factors that regulate the developmental switching programs of the beta-like globin genes. Mol Phylogenet Evol 5, 1832.[CrossRef][Medline]
Hacia JG (2001). Genome of the apes. Trends Genet 17, 637645.[CrossRef][Medline]
Hardison RC (2000). Conserved noncoding sequences are reliable guides to regulatory elements. Trends Genet 16, 369372.[CrossRef][Medline]
Hardison R, Krane D, Vandenbergh D, Cheng JF, Mansberger J, Taddie J, Schwartz S, Huang XQ & Miller W (1991). Sequence and comparative analysis of the rabbit alpha-like globin gene cluster reveals a rapid mode of evolution in a G + C-rich region of mammalian genomes. J Mol Biol 222, 233249.[CrossRef][Medline]
Hardison
RC, Oeltjen
J
&
Miller
W (1997). Long human-mouse sequence alignments reveal novel regulatory elements: a reason to sequence the mouse genome. Genome Res
7, 959966.
Hood L, Rowen L & Koop BF (1995). Human and mouse T-cell receptor loci: genomics, evolution, diversity, and serendipity. Ann N Y Acad Sci 758, 390412.[Medline]
Kappen C & Yaworsky PJ (2003). Mutation of a putative nuclear receptor binding site abolishes activity of the nestin midbrain enhancer. Biochim Biophys Acta 1625, 109115.[Medline]
Koop BF & Hood L (1994). Striking sequence similarity over almost 100 kilobases of human and mouse T-cell receptor DNA. Nat Genet 7, 4853.[CrossRef][Medline]
Kumar S & Hedges SB (1998). A molecular timescale for vertebrate evolution. Nature 392, 917920.[CrossRef][Medline]
Lacy
DA, Wang
ZE, Symula
DJ, Mcarthur
CJ, Rubin
EM, Frazer
KA
&
Locksley
RM (2000). Faithful expression of the human 5q31 cytokine cluster in transgenic mice. J Immunol
164, 45694574.
Lander ES, Linton LM, Birren B, Nusbaum C, Zody MC, Baldwin J, Devon K, Dewar K, Doyle M, Fitzhugh W et al. (2001). Initial sequencing and analysis of the human genome. Nature 409, 860921.[CrossRef][Medline]
Lien CL, McAnally J, Richardson JA & Olson EN (2002). Cardiac-specific activity of an Nkx2-5 enhancer requires an evolutionarily conserved Smad binding site. Dev Biol 244, 257266.[CrossRef][Medline]
Loots
GG, Locksley
RM, Blankespoor
CM, Wang
ZE, Miller
W, Rubin
EM
&
Frazer
KA (2000). Identification of a coordinate regulator of interleukins 4, 13, and 5 by cross-species sequence comparisons. Science
288, 136140.
Mohrs M, Blankespoor CM, Wang ZE, Loots GG, Afzal V, Hadeiba H, Shinkai K, Rubin EM & Locksley RM (2001). Deletion of a coordinate regulator of type 2 cytokine expression in mice. Nat Immunol 2, 842847.[CrossRef][Medline]
Noguchi
E, Shibasaki
M, Arinami
T, Takeda
K, Maki
T, Miyamoto
T, Kawashima
T, Kobayashi
K
&
Hamaguchi
H (1997). Evidence for linkage between asthma/atopy in childhood and chromosome 5q31-q33 in a Japanese population. Am J Respir Crit Care Med
156, 13901393.
Pennacchio
LA, Olivier
M, Hubacek
JA, Cohen
JC, Cox
DR, Fruchart
JC, Krauss
RM
&
Rubin
EM (2001). An apolipoprotein influencing triglycerides in humans and mice revealed by comparative sequencing. Science
294, 169173.
Pennacchio LA & Rubin EM (2001). Genomic strategies to identify mammalian regulatory sequences. Nat Rev Genet 2, 100109.[CrossRef][Medline]
Pennacchio
LA
&
Rubin
EM (2003a). Apolipoprotein A5: a newly identified gene impacting plasma triglyceride levels in humans and mice. Arteriosclerosis, Thrombosis, Vascular Biol
23, 529534.
Pennacchio LA & Rubin EM (2003b). Comparative genomic tools and databases: providing insights into the human genome. J Clin Invest 111, 10991106.[CrossRef][Medline]
Rioux JD, Daly MJ, Silverberg MS, Lindblad K, Steinhart H, Cohen Z, Delmonte T, Kocher K, Miller K, Guschwan S, Kulbokas EJ, O'Leary S, Winchester E, Dewar K, Green T, Stone V, Chow C, Cohen A, Langelier D, Lapointe G, Gaudet D, Faith J, Branco N, Bull SB, McLeod RS, Griffiths AM, Bitton A, Greenberg GR, Lander ES, Siminovitch KA & Hudson TJ (2001). Genetic variation in the 5q31 cytokine gene cluster confers susceptibility to Crohn disease. Nat Genet 29, 223228.[CrossRef][Medline]
Sharma
S, Leonard
J, Lee
S, Chapman
HD, Leiter
EH
&
Montminy
MR (1996). Pancreatic islet expression of the homeobox factor STF-1 relies on an E-box motif that binds USF. J Biol Chem
271, 22942299.
Tautz D (2000). Evolution of transcriptional regulation. Curr Opin Genet Dev 10, 575579.[CrossRef][Medline]
Ureta-Vidal A, Ettwiller L & Birney E (2003). Comparative genomics: genome-wide analysis in metazoan eukaryotes. Nat Rev Genet 4, 251262.[Medline]
Venter
JC, Adams
MD, Myers
EW, Li
PW, Mural
RJ, Sutton
GG, Smith
HO, Yandell
M, Evans
CA, Holt
RA
et al. (2001). The sequence of the human genome. Science
291, 13041351.
Waterston RH, Lindblad-Toh K, Birney E, Rogers J, Abril JF, Agarwal P, Agarwala R, Ainscough R, Alexandersson M, An P et al. (2002). Initial sequencing and comparative analysis of the mouse genome. Nature 420, 520562.[CrossRef][Medline]
| Acknowledgements |
|---|
This article has been cited by other articles:
![]() |
D. M. McGaughey, R. M. Vinton, J. Huynh, A. Al-Saif, M. A. Beer, and A. S. McCallion Metrics of sequence constraint overlook regulatory sequences in an exhaustive analysis at phox2b Genome Res., February 1, 2008; 18(2): 252 - 260. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. A. Cogburn, T. E. Porter, M. J. Duclos, J. Simon, S. C. Burgess, J. J. Zhu, H. H. Cheng, J. B. Dodgson, and J. Burnside Functional Genomics of the Chicken A Model Organism Poult. Sci., October 1, 2007; 86(10): 2059 - 2094. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. R. Copley, M. Totrov, J. Linnell, S. Field, J. Ragoussis, and I. A. Udalova Functional conservation of Rel binding sites in drosophilid genomes Genome Res., September 1, 2007; 17(9): 1327 - 1335. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. H. Margulies, G. M. Cooper, G. Asimenos, D. J. Thomas, C. N. Dewey, A. Siepel, E. Birney, D. Keefe, A. S. Schwartz, M. Hou, et al. Analyses of deep mammalian sequence alignments and constraint predictions for 1% of the human genome Genome Res., June 1, 2007; 17(6): 760 - 774. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Ramos, W. Chi Ho, S. Forte, K. Dickson, J. Boutilier, K. Favell, and P. A. Barker Hypo-Osmolar Stress Induces p75NTR Expression by Activating Sp1-Dependent Transcription J. Neurosci., February 7, 2007; 27(6): 1498 - 1506. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Mussi, C. Yu, B. W. O'Malley, and J. Xu Stimulation of Steroid Receptor Coactivator-3 (SRC-3) Gene Overexpression by a Positive Regulatory Loop of E2F1 and SRC-3 Mol. Endocrinol., December 1, 2006; 20(12): 3105 - 3119. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Reue and L. Vergnes Thematic review series: Systems Biology Approaches to Metabolic and Cardiovascular Disorders. Approaches to lipid metabolism gene identification and characterization in the postgenomic era J. Lipid Res., September 1, 2006; 47(9): 1891 - 1907. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. J. Mullins, M. A. Bailey, and J. J. Mullins Hypertension, Kidney, and Transgenics: A Fresh Perspective Physiol Rev, April 1, 2006; 86(2): 709 - 746. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Weikard, C. Kuhn, T. Goldammer, G. Freyer, and M. Schwerin The bovine PPARGC1A gene: molecular characterization and association of an SNP with variation of milk fat synthesis Physiol Genomics, March 21, 2005; 21(1): 1 - 13. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. H. Margulies, NISC Comparative Sequencing Program, V. V. B. Maduro, P. J. Thomas, J. P. Tomkins, C. T. Amemiya, M. Luo, and E. D. Green Comparative sequencing provides insights about the structure and conservation of marsupial and monotreme genomes PNAS, March 1, 2005; 102(9): 3354 - 3359. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. H. Epstein and J. A. Epstein A Perspective on the Value of Aquatic Models in Biomedical Research Experimental Biology and Medicine, January 1, 2005; 230(1): 1 - 7. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. Valverde-Garduno, B. Guyot, E. Anguita, I. Hamlett, C. Porcher, and P. Vyas Differences in the chromatin structure and cis-element organization of the human and mouse GATA1 loci: implications for cis-element identification Blood, November 15, 2004; 104(10): 3106 - 3116. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. W. Blakesley, N. F. Hansen, J. C. Mullikin, P. J. Thomas, J. C. McDowell, B. Maskeri, A. C. Young, B. Benjamin, S. Y. Brooks, B. I. Coleman, et al. An intermediate grade of finished genomic sequence suitable for comparative analyses Genome Res., November 1, 2004; 14(11): 2235 - 2244. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Castrignano, A. Canali, G. Grillo, S. Liuni, F. Mignone, and G. Pesole CSTminer: a web tool for the identification of coding and noncoding conserved sequence tags through cross-species genome comparison Nucleic Acids Res., July 1, 2004; 32(suppl_2): W624 - W627. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Halling-Brown, C. Sansom, D. S. Moss, G. Elgar, and Y. J. K. Edwards A Fugu-Human Genome Synteny Viewer: web software for graphical display and annotation reports of synteny between Fugu genomic sequence and human genes Nucleic Acids Res., May 11, 2004; 32(8): 2618 - 2622. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |