PMC:4331677 / 10918-11483
Annotations
{"target":"https://pubannotation.org/docs/sourcedb/PMC/sourceid/4331677","sourcedb":"PMC","sourceid":"4331677","source_url":"https://www.ncbi.nlm.nih.gov/pmc/4331677","text":"Therefore, we first broke the binding site sequences into trimers. For example, the amino acid sequence NGMGN produces three trimers: G(NM), M(GG) and G(MN). Since G(NM) and G(MN) are equivalent, we could combine them by adding a count. Then, we cast all the trimers into 199 clusters, and counted the occurrence frequency of each cluster in every binding site. Finally, all cluster frequencies were normalized to unit L2 norm to obtain the 199-dimensional feature vectors for the binding sites. For example, the sequence NGMGN can be represented as follows:","tracks":[]}
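The trimer counting and L2 normalization described in the passage can be sketched in Python. This is a minimal illustration under stated assumptions, not the authors' implementation: the function names `trimers` and `feature_vector` and the optional `cluster_of` mapping are hypothetical, and because the excerpt does not give the assignment of trimers to the 199 clusters, the sketch falls back to using each canonical trimer as its own feature when no mapping is supplied.

```python
from collections import Counter
import math


def trimers(seq):
    """Split a sequence into canonical trimers (center, sorted flanks).

    Sorting the two flanking residues makes G(NM) and G(MN) identical,
    which is how the two equivalent trimers end up sharing one count.
    """
    out = []
    for i in range(1, len(seq) - 1):
        flanks = "".join(sorted(seq[i - 1] + seq[i + 1]))
        out.append((seq[i], flanks))
    return out


def feature_vector(seq, cluster_of=None):
    """Count trimers (or their clusters) and scale to unit L2 norm.

    `cluster_of` is a hypothetical dict mapping a canonical trimer to a
    cluster id; the real 199-cluster assignment is not given in the text.
    """
    keys = trimers(seq)
    if cluster_of is not None:
        keys = [cluster_of[k] for k in keys]
    counts = Counter(keys)
    norm = math.sqrt(sum(c * c for c in counts.values()))
    if norm == 0:
        return {}  # sequences shorter than three residues have no trimers
    return {k: c / norm for k, c in counts.items()}


vec = feature_vector("NGMGN")
```

For NGMGN the sketch yields two distinct features, G(MN) with count 2 and M(GG) with count 1, so the normalized values are 2/√5 and 1/√5. With a real cluster mapping, the dictionary keys would be cluster ids and the result could be laid out as a dense 199-element vector.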