PMC:3091641 / 37686-38365 JSONTXT

Annnotations TAB JSON ListView MergeView

{"target":"http://pubannotation.org/docs/sourcedb/PMC/sourceid/3091641","sourcedb":"PMC","sourceid":"3091641","source_url":"https://www.ncbi.nlm.nih.gov/pmc/3091641","text":"Estimation of Gene Family Size\nThe likelihood function for the observed set S of sequences is a function of the true number, g, of genes and the sample size, n, by summing over the unobserved number, c, of distinct genes sampled.14\nThe pmf on c is given by the recursion15\nwith Pc (1 | g,1) = 1 and Pc (c | g,1) = 0 for c ≠ 1.\nPS (S | c) is approximated by minimizing the number, m, of mutations over the assignments of observed sequences to c groups. PS is then the posterior probability of getting m mutations given the observed number of mutations in the IFNB and IFNK datasets.\nNote that this technique is independent of the assembly methods described elsewhere in the paper.","divisions":[{"label":"title","span":{"begin":0,"end":30}},{"label":"p","span":{"begin":31,"end":231}},{"label":"label","span":{"begin":229,"end":231}},{"label":"p","span":{"begin":232,"end":272}},{"label":"label","span":{"begin":270,"end":272}},{"label":"p","span":{"begin":273,"end":326}},{"label":"p","span":{"begin":327,"end":581}}],"tracks":[]}