PMC:1624833 / 24642-29133
Annnotations
2_test
{"project":"2_test","denotations":[{"id":"16952321-14555958-1695224","span":{"begin":271,"end":273},"obj":"14555958"},{"id":"16952321-12399584-1695225","span":{"begin":274,"end":276},"obj":"12399584"},{"id":"16952321-12399584-1695226","span":{"begin":1140,"end":1142},"obj":"12399584"},{"id":"16952321-14555958-1695227","span":{"begin":1187,"end":1189},"obj":"14555958"},{"id":"16952321-11997479-1695228","span":{"begin":3077,"end":3079},"obj":"11997479"},{"id":"16952321-9242640-1695229","span":{"begin":3080,"end":3082},"obj":"9242640"},{"id":"16952321-10604478-1695230","span":{"begin":3310,"end":3312},"obj":"10604478"},{"id":"16952321-11390663-1695231","span":{"begin":3880,"end":3882},"obj":"11390663"}],"text":"4.2 Transcriptional regulation in S. cerevisiae\nTo demonstrate the ability of our visualization algorithm to highlight differences between biclusters in similar datasets, we analyzed datasets of transcriptional regulation in two experimental conditions in S. cerevisiae [30,31]. Each dataset is a binary matrix whose columns represent transcription factors and whose rows represent genes in S. cerevisiae. A matrix entry contains a one if a ChIP-on-chip experiment indicates that the transcription factor binds to the promoter of the gene with a p-value at most 0.001. An important problem that arises in the analysis of this data is determining if a set of genes are collectively regulated by a set of transcription factors and whether this combinatorial regulation changes when the cell is exposed to stress. Although ChIP-on-chip data is noisy and significant effort may be needed to clean it up, the analysis we present next demonstrates that a combination of biclustering and our layout algorithm yields biologically useful results.\nThe two protein-DNA datasets we study correspond to the growth of S. cerevisiae cells in rich medium [31] and to growth under exposure to rapamycin [30], a condition that mimics nutrient starvation. We restricted our attention to transcription factors studied in both papers. We ran our implementation of the Apriori algorithm [32] that computes closed biclusters (as defined in Section 1) on both these datasets, applied our layout algorithm on biclusters with at least two genes and at least two transcription factors, and obtained the layout in Figure 4(a). Biclusters obtained from the data under growth in rich medium are shown as blue boxes and rapamycin-induced biclusters are shown as red boxes. A cell in the figure is dark grey (respectively, light grey) if the transcription factor binds to the gene's promoter in both (respectively, one) condition. The image strikingly demonstrates that under exposure to rapamycin, the transcriptional regulatory network activated in the cell is very different from the network activated under growth in rich medium. The rich medium data contains only four biclusters involving these transcription factors while the rapamycin data contains 38 biclusters. We conclude that very few genes are co-regulated by the same set of transcription factors in both conditions.\nFigure 4 Bicluster layouts. Visualizations of the layouts computed by our algorithm. Since the layout may contain repeated rows and columns, a bicluster may appear at multiple locations in the layout. We only highlight only one occurrence of each bicluster. The layout on the left displays biclusters representing combinatorial control of transcription in S. cerevisiae. The layout on the right displays biclusters in gene expression data for ALL and AML. To illustrate the use of our web interface, we used it to search for biclusters that included the transcription factors RTG3 and GLN3. RTG3 is a transcription factor that forms a complex with RTG1 to activate the retrograde (RTG) and target of rapamycin (TOR) pathways [33,34]. GLN3 encodes a transcription factor that is phosphorylated and localised to the cytoplasm when the cell is grown in nitrogen-rich media.\nRapamycin treatment can induce the dephosphorylation and subsequent activation of GLN3 [35]. Figure 5 displays the layout of all the biclusters containing these two transcription factors. We note that all but one bicluster also includes either the transcription factor GAT1 or the transcription factor GCN4. GAT1 is a transcriptional activator of genes involved in nitrogen catabolite repression; the activity and localization of these genes is regulated by nitrogen limitation. GCN4 is another transcription activator that is a master regulator of gene expression during amino acid starvation in S. cerevisiae and is activated in multiple stress responses [36]. Thus, it is not surprising that GAT1 and GCN4 co-regulate genes with GLN3 and RTG3. The functional annotations of the set of nine genes targeted by GCN4, GLN3, and RTG3 is enriched in the Gene Ontology biological process \"glutamine family amino acid biosynthesis\" (p-value of 2 × 10-8, based on the hypergeometric distribution), indicating that this pathway may be activated by the three transcription factors upon rapamycin treatment.\nFigure 5 Genes combinatorially controlled by GLN3 and RTG3. A layout of nine biclusters of genes combinatorially controlled by GLN3 and RTG3 under exposure to rapamycin."}