PMC:3475481 / 3685-5967 JSONTXT

Annnotations TAB JSON ListView MergeView

{"target":"https://pubannotation.org/docs/sourcedb/PMC/sourceid/3475481","sourcedb":"PMC","sourceid":"3475481","source_url":"https://www.ncbi.nlm.nih.gov/pmc/3475481","text":"Methods\n\nDataset\nTwo microarray datasets were used for this study. We acquired these datasets from a public database (Gene Expression Omnibus, GEO). One was the expression dataset of 16 tumors and 4 normal tissues from 16 patients, using Affymetrix U133A gene chips (Affymetrix, Santa Clara, CA, USA). The other microarray dataset consisted of expression profiles of 22 tumors and 5 normal tissues. These two datasets were experimented on under the same platform, Affymetrix U133A. The datasets are summarized in Table 1.\n\nProcess for combining datasets\nFor combining datasets, gene expression ratios are rearranged in order of expression ratios by each gene in each dataset, and the ranks are matched with the corresponding experimental group. If the experimental groups are homogenous, the ranks within the same experimental group would be neighboring. The process of discretization of gene expressions is summarized in the following steps [3]:\nRank the gene expression ratios within a gene for each dataset.List in order of the ranks, and assign the order of gene expressions to the corresponding experimental groups.Summarize the result of (2) in the form of a contingency table for each gene.Combine the contingency tables that have been summarized for each dataset. \nWhen there are three datasets to be combined, the datasets can be added as a single entry, as shown in Table 2, after the transformation of each dataset by rank.\n\nIdentification of significant genes from a combined dataset\nAfter the summarization of gene expression ratios in the form of a contingency table for each gene, as shown in Table 3, a nonparametric statistical method was applied to the datasets for independence testing between gene expression patterns and experimental groups. The test statistics are calculated as follows for each gene:\n\nWhen the sample size is small - generally Ê(nij) less than 5 - Fisher's exact test is recommended rather than chi-square test.\nThe significant genes can be selected by an independence test between the phenotypes and gene expressions using this type of summarized dataset. ci and ri represent the marginal sums of the ith column and row, respectively. nij is the number of experiments belonging to Ei and Pj, and n represents the total number of experiments.","divisions":[{"label":"Title","span":{"begin":0,"end":7}},{"label":"Section","span":{"begin":9,"end":521}},{"label":"Title","span":{"begin":9,"end":16}},{"label":"Section","span":{"begin":523,"end":1434}},{"label":"Title","span":{"begin":523,"end":553}},{"label":"Title","span":{"begin":1436,"end":1495}}],"tracks":[{"project":"2_test","denotations":[{"id":"23105925-18554423-44845470","span":{"begin":943,"end":944},"obj":"18554423"}],"attributes":[{"subj":"23105925-18554423-44845470","pred":"source","obj":"2_test"}]}],"config":{"attribute types":[{"pred":"source","value type":"selection","values":[{"id":"2_test","color":"#ec9a93","default":true}]}]}}