> top > users > Yue Wang

User 'Yue Wang'

Projects

NameTDescription# Ann.Updated at Status
1-10 / 15 show all
CyanoBaseCyanobacteria are prokaryotic organisms that have served as important model organisms for studying oxygenic photosynthesis and have played a significant role in the Earthfs history as primary producers of atmospheric oxygen. Publication: http://www.aclweb.org/anthology/W12-24301.1 K2016-05-17Released
AnEM_abstracts250 documents selected randomly from citation abstracts Entity types: organism subdivision, anatomical system, organ, multi-tissue structure, tissue, cell, developing anatomical structure, cellular component, organism substance, immaterial anatomical entity and pathological formation Together with AnEM_full-texts, it is probably the largest manually annotated corpus on anatomical entities.1.95 K2016-06-07Released
AnEM_full-texts250 documents selected randomly from full-text papers Entity types: organism subdivision, anatomical system, organ, multi-tissue structure, tissue, cell, developing anatomical structure, cellular component, organism substance, immaterial anatomical entity and pathological formation Together with AnEM_abstracts, it is probably the largest manually annotated corpus on anatomical entities.6892016-07-27Uploading
PIR-corpus1The Protein Information Resource (PIR) is not biased towards any particular biomedical domain, and is expected to provide more diverse protein names in a given sample size. Annotation category: protein, compound-protein, acronym.4.44 K2016-11-14Released
bionlp-st-epi-2011-trainingThe training dataset from the Epigenetics and Post-translational Modifications (EPI) task in the BioNLP Shared Task 2011. The core entities of the task are genes and gene products (RNA and proteins), identified in the data simply as "Protein" annotations. 7.6 K2016-12-06Released
bionlp-st-cg-2013-trainingThe training dataset from the cancer genetics task in the BioNLP Shared Task 2013. Composed of anatomical and molecular entities.10.9 K2016-12-06Released
PennBioIEThe PennBioIE corpus (0.9) covers two domains of biomedical knowledge. One is the inhibition of the cytochrome P450 family of enzymes (CYP450 or CYP for short) , and the other domain is the molecular genetics of dance (oncology or onco for short).23.9 K2016-12-06Released
PIR-corpus2The protein tag was used to tag proteins, or protein-associated or -related objects, such as domains, pathways, expression of gene. Annotation guideline: http://pir.georgetown.edu/pirwww/about/doc/manietal.pdf5.52 K2017-03-07Released
FSU-PRGEA new broad-coverage corpus composed of 3,306 MEDLINE abstracts dealing with gene and protein mentions. The annotation process was semi-automatic. Publication: http://aclweb.org/anthology/W/W10/W10-1838.pdf59.5 K2017-03-08Released
SCAI-TestA small corpus for the evaluation of dictionaries containing chemical entities. Publication: http://www.scai.fraunhofer.de/fileadmin/images/bio/data_mining/paper/kolarik2008.pdf Original source: https://www.scai.fraunhofer.de/en/business-research-areas/bioinformatics/downloads/corpora-for-chemical-entity-recognition.html1.21 K2017-04-03Released

Automatic annotators

none

Editors

none