> top > docs > PMC:7371427 > spans > 56324-60550 > annotations

PMC:7371427 / 56324-60550 JSONTXT

Annnotations TAB JSON ListView MergeView

LitCovid-PD-FMA-UBERON

Id Subject Object Predicate Lexical cue fma_id
T653 818-822 Body_part denotes hand http://purl.org/sig/ont/fma/fma9712
T654 840-854 Body_part denotes neural network http://purl.org/sig/ont/fma/fma74616
T655 3764-3767 Body_part denotes RNA http://purl.org/sig/ont/fma/fma67095
T656 3817-3824 Body_part denotes protein http://purl.org/sig/ont/fma/fma67257
T657 3825-3832 Body_part denotes protein http://purl.org/sig/ont/fma/fma67257

LitCovid-PD-UBERON

Id Subject Object Predicate Lexical cue uberon_id
T270 440-446 Body_part denotes corpus http://purl.obolibrary.org/obo/UBERON_3000645
T271 588-594 Body_part denotes corpus http://purl.obolibrary.org/obo/UBERON_3000645
T272 818-822 Body_part denotes hand http://purl.obolibrary.org/obo/UBERON_0002398

LitCovid-PD-CLO

Id Subject Object Predicate Lexical cue
T939 176-177 http://purl.obolibrary.org/obo/CLO_0001020 denotes a
T940 196-197 http://purl.obolibrary.org/obo/CLO_0001020 denotes a
T941 310-311 http://purl.obolibrary.org/obo/CLO_0001020 denotes a
T942 429-430 http://purl.obolibrary.org/obo/CLO_0001020 denotes a
T943 596-597 http://purl.obolibrary.org/obo/CLO_0001020 denotes A
T944 872-875 http://purl.obolibrary.org/obo/CLO_0051582 denotes has
T945 1014-1015 http://purl.obolibrary.org/obo/CLO_0001020 denotes a
T946 1149-1150 http://purl.obolibrary.org/obo/CLO_0001020 denotes a
T947 1369-1370 http://purl.obolibrary.org/obo/CLO_0001020 denotes a
T948 1860-1861 http://purl.obolibrary.org/obo/CLO_0001020 denotes a
T949 1891-1892 http://purl.obolibrary.org/obo/CLO_0001020 denotes a
T950 2039-2040 http://purl.obolibrary.org/obo/CLO_0001020 denotes a
T951 2103-2110 http://purl.obolibrary.org/obo/SO_0000418 denotes Signals
T952 2643-2644 http://purl.obolibrary.org/obo/CLO_0001020 denotes a
T953 2908-2909 http://purl.obolibrary.org/obo/CLO_0001020 denotes a
T954 2947-2948 http://purl.obolibrary.org/obo/CLO_0001020 denotes a
T955 3039-3040 http://purl.obolibrary.org/obo/CLO_0001020 denotes a
T956 3142-3143 http://purl.obolibrary.org/obo/CLO_0001020 denotes a
T957 3646-3653 http://purl.obolibrary.org/obo/SO_0000418 denotes signals
T958 3789-3794 http://purl.obolibrary.org/obo/NCBITaxon_9606 denotes human
T959 4161-4168 http://purl.obolibrary.org/obo/SO_0000418 denotes signals
T960 4205-4212 http://purl.obolibrary.org/obo/SO_0000418 denotes Signals

LitCovid-PD-CHEBI

Id Subject Object Predicate Lexical cue chebi_id
T68445 3817-3824 Chemical denotes protein http://purl.obolibrary.org/obo/CHEBI_36080
T67615 3825-3832 Chemical denotes protein http://purl.obolibrary.org/obo/CHEBI_36080

LitCovid-PD-GO-BP

Id Subject Object Predicate Lexical cue
T62 34-43 http://purl.obolibrary.org/obo/GO_0009058 denotes synthesis
T63 1543-1551 http://purl.obolibrary.org/obo/GO_0007612 denotes learning
T64 3890-3898 http://purl.obolibrary.org/obo/GO_0007612 denotes learning
T65 4151-4160 http://purl.obolibrary.org/obo/GO_0009058 denotes synthesis

LitCovid-sentences

Id Subject Object Predicate Lexical cue
T309 0-74 Sentence denotes Unstructured biomedical knowledge synthesis and triangulation capabilities
T310 75-258 Sentence denotes In order to capture biomedical literature-based associations, the nferX platform defines two scores: a ‘local score’ and a ‘global score’, as described previously (Park et al., 2020).
T311 259-595 Sentence denotes Briefly, the local score is obtained from applying a traditional natural language processing technique which captures the strength of association between two concepts in a selected corpus of biomedical literature based on the frequency of their co-occurrence normalized by the frequency of each individual concept throughout the corpus.
T312 596-786 Sentence denotes A higher local score between Concept X and Concept Y indicates that these concepts are frequently mentioned in close proximity to each other more frequently than would be expected by chance.
T313 787-934 Sentence denotes The global score, on the other hand, is based on the neural network renaissance that has recently taken place in Natural Language Processing (NLP).
T314 935-1065 Sentence denotes To compute global scores, all tokens (e.g. words and phrases) are projected in a high-dimensional vector space of word embeddings.
T315 1066-1165 Sentence denotes These vectors serve to represent the ‘neighborhood’ of concepts which occur around a given concept.
T316 1166-1391 Sentence denotes The cosine similarity between any two vectors measures the similarity of these neighborhoods and is the basis for our global score metric, where concepts which are more similar in this vector space have a higher global score.
T317 1392-1632 Sentence denotes While the global scores in this work are computed in the embedding space of word2vec model, it can also be computed in the embedding space of any deep learning model including recent transformer-based models like BERT (Devlin et al., 2019).
T318 1633-1794 Sentence denotes These may have complementary benefits to word2vec embeddings since the embeddings are context sensitive having different vectors for different sentence contexts.
T319 1795-2033 Sentence denotes However, despite the context sensitive nature of BERT embeddings a global score computation for a phrase may still be of value given the score is computed across sentence embeddings capturing the context sensitive nature of those phrases.
T320 2034-2260 Sentence denotes From a visualization perspective, the local score and global score (‘Signals’) are represented in the platform using bubbles where bubble size corresponds to the local score and color intensity corresponds to the global score.
T321 2261-2386 Sentence denotes This allows users to rapidly determine the strength of association between any two concepts throughout biomedical literature.
T322 2387-2545 Sentence denotes We consider concepts which show both high local and global scores to be ‘concordant’ and have found that these typically recapitulate well-known associations.
T323 2546-2699 Sentence denotes One key aspect of the nferX platform is that it allows the user to query associated concepts for a virtually unbounded number of possible query concepts.
T324 2700-2742 Sentence denotes This is achieved by means of two features:
T325 2743-2995 Sentence denotes Firstly, the nferX platform allows the user to compose queries using the logical AND, OR and NOT operators to logically combine any number of biomedical concepts in a query, each combination amounting to a gross or nuanced composite biomedical concept.
T326 2996-3498 Sentence denotes Secondly, since logical combinations yield a virtually unbounded number of biomedical concepts that can be queries, the nferX platform implements a completely dynamic method of computing local scores on the fly by using novel high performance parallel and distributed algorithms that, in real time, scan hundreds of millions of documents to quickly locate user query related text fragments and count co-occurring biomedical concepts for computing strength of association scores and their significances.
T327 3499-3847 Sentence denotes The platform further leverages statistical inference to calculate ‘enrichments’ based on structured data, thus enabling real-time triangulation of signals from the unstructured biomedical knowledge graph various other structured databases (e.g. curated ontologies, RNA-sequencing datasets, human genetic associations, protein-protein interactions).
T328 3848-4019 Sentence denotes This facilitates unbiased hypothesis-free learning and faster pattern recognition, and it allows users to more holistically determine the veracity of concept associations.
T329 4020-4226 Sentence denotes Finally, the platform allows the user to identify and further examine the documents and textual fragments from which the knowledge synthesis signals are derived using the Documents and Signals applications.