PMC:1630425 / 10083-11328 JSONTXT

Annnotations TAB JSON ListView MergeView

{"target":"https://pubannotation.org/docs/sourcedb/PMC/sourceid/1630425","sourcedb":"PMC","sourceid":"1630425","source_url":"https://www.ncbi.nlm.nih.gov/pmc/1630425","text":"This report, and the iterative mapping field in general, mixes terminology from two distinct approaches to sequence analysis which are noteworthy elaboration for the sake of clarity. \"Scale\" and \"resolution\" are used as generic terms for a concept that is sometimes precised as \"sequence length\" or Markovian \"order\". \"Length\" is the term used in word-statistics and corresponds to the length of the L-tuple. \"Order\" describes the same concept but is more commonly used in the context of Markov models. To add to the confusion, L-tuple/word \"length\" is one unit smaller than \"order\". For example a simple 4 × 4 transition matrix between nucleotides resolves Markovian succession with order 1 and the conditional probabilities in each of the 16 squares correspond to the frequency of all possible dinucleotides (length 2). Another example, the \"scale of L-tuple distribution\" designates the length of the tuple for which all frequencies where determined. A variation on this theme is the use of \"alphabet size\" to access scale: it designates the number of unique symbols available for use in by the sequence. Along the same line of thought, \"vocabulary\" (not used in this report) would designate the number of possible L-tuples of a given length.","tracks":[]}