Projects

Name	Description	# Ann.	Author	Maintainer	Updated_at	Status

« 1 2 3 4 5 6 7 8 9 10 11 ... 29 30 » 121-140 / 590 show all
metamap-sample	Sample annotation of MetaMep, produced by Aronson, et al. An overview of MetaMap: historical perspective and recent advances, JAMIA 2010	10.9 K	Alan R Aronson	Jin-Dong Kim	2023-11-27	Testing
bionlp-st-cg-2013-training	The training dataset from the cancer genetics task in the BioNLP Shared Task 2013. Composed of anatomical and molecular entities.	10.9 K	NaCTeM	Yue Wang	2023-11-28	Released
test3		10.7 K		ykyao	2023-11-29
FA_Top100Plus-GeneProtein	Top100＋本来Top100に入るべきだった７レビューの計、107レビュー中101レビュー。 5414, 6076, 6930, 8403, 9643, 18544は、0denotationでドキュメント自体登録していない。 attributesの詳細はconfig参照。ドキュメントのソースDBが@AikoHIRAKIとなっているものはTypo修正がPubAnnotationの公式FirstAuthorsドキュメントに反映された段階で置き換えます。	10.4 K		yucca	2023-11-29	Uploading
Zoonoses	This is a main data sets of Zoonoses project used by PanZoora.	10.3 K		AikoHIRAKI	2023-11-26	Developing
FA_Top107-forWeb	※※※　！要データ加工！　webリンク用には、この結果を加工して使っています。その他で使われる場合に、末尾記載の問題を別途解決する必要があります。　！要データ加工！　※※※ Top100＋本来Top100に入るべきだった７レビューの計、107レビュー中99レビュー。 5414, 6076, 6930, 8403, 9643, 12112, 18544, 18829は、0denotationでドキュメント自体登録していません。 @AikoHIRAKIはtypoを修正したレビューフォルダ。 attributesの詳細はconfig参照。 ※※※　！注意！ webリンク側のしばりで、選択文字列は複数のUniProtIDに対応していません。（例）Protein1～Protein7とある場合、 Protein1, 2, 3, 4, 5, 6, 7をさし、かつ全てにUniProtIDがあったとしても、1と7のみUniProtIDをとってきています。 "～"は、Protein2, 3, 4, 5, 6を意味していますが、positionではなく文字列で検索をかけているのと、見せ方の仕様上、これらのIDは全て未取得となっています。⇔GeneProteinでは"～"に2-6のIDsをもたせていました。該当レビュー；14898(～=MAPK2, MAPK3, MAPK4, MAPK5, MAPK6), 10471(～=Ago2, Ago3) --------------------------------------- （例）ProteinAB...ProteinCD...ProteinB...ProteinDとある場合、 ProteinABは、ProteinAとBというLexical_Cueになっています。ProteinCDも同様に、ProteinCとD。BとDだけでは、このレビュー内ではProteinBやProteinDをさすことが分かるのですが、それ以外で使用する場合に、BとDにそれぞれ該当UniProtIDをあてるのは不適切です。該当レビュー；11957(β4=itgb4, β1=itgb1, β5=itgb5, β3=itgb3) 他の例が出てきたら順次、ここに記載していきます。当座、これらは削除する必要があります。 attributeで削除フラグをつけるか、Jakeの機能がTextAEに実装されれば解決するか、検討して、何かしら分かるようにしておきます。（例）ProteinA/B　とある場合、 webリンクでは、"ProteinA"にUniProtID-Aを、"/B"にUniProtID-Bをつけています（リンク側のしばり）。webリンク以外で使われる場合には、別プロジェクトのFA_Top100Plus-GeneProteinで行っていたようなRelationを使って、"/B"ではなく、"ProteinB"として、UniProtID-Bと対応させる必要があります。現状のとり方ですと、要Relation箇所は救済出来ません。 Lexical cueには"/B"とありますが、Objectには"ProteinB"と残してあるので、Objectを参照して下さい。但し、言語処理のようなpositionがご入用な場合には上では対応出来ていません。該当レビュー；11935(/4=BMP4), 14898(/2=LATS2), 7412(/2=TSC2), 4629(/2=CtBP2) （webリンクでは、レビュー毎に完結しているので、"/B"がそのレビューで他の意味をなしていなければ対応出来るのと、文字列合致でリンクを貼っているためです。）！注意！　※※※ RelationのmergedはTextAEの既存機能で既に出来ます。	10.3 K		AikoHIRAKI	2023-11-29	Beta
silkwormbase		10 K		sakaniwa	2023-11-29	Testing
silkworm		9.87 K		Masakazu SAGA	2023-11-29	Testing
LitCovid-OGER	Using OGER (http://www.ontogene.org/resources/oger) to detect entities from 10 different vocabularies	9.31 K	Fabio Rinaldi	Nico Colic	2023-11-29	Released
bionlp-st-ge-2016-test-tees	NER and event extraction produced by TEES (with the default GE11 model) for the 14 full papers used in the BioNLP 2016 GE task test corpus.	9.17 K	Nico Colic	Nico Colic	2023-11-29	Released
sonoma2	sonoma2	9.09 K	Standigm	chanung	2023-11-29	Beta
LitCoin-GeneOrGeneProduct-v2	threshold = 0.93	8.98 K		Jin-Dong Kim	2023-11-29
Grays_part2	Osteology w/o 21	8.63 K		okubo	2023-11-29	Testing
Grays_part2_test		8.57 K		Jin-Dong Kim	2023-11-29	Testing
bionlp-st-gro-2013-training	The training data set of the BioNLP-ST 2013 GRO task, including 150 MEDLINE abstracts that are annotated with concepts and relations of the Gene Regulation Ontology (GRO; http://www.ebi.ac.uk/Rebholz-srv/GRO/GRO.html)	8.02 K	Jung-jae Kim	Jung-jae Kim	2023-11-29	Testing
bionlp-st-ge-2016-test	It is the benchmark test data set of the BioNLP-ST 2016 GE task. It includes Genia-style event annotations to 14 full paper articles which are about NFκB proteins. For testing purpose, however, annotations are all blinded, which means users cannot see the annotations in this project. Instead, annotations in any other project can be compared to the hidden annotations in this project, then the annotations in the project will be automatically evaluated based on the comparison. A participant of GE task can get the evaluation of his/her result of automatic annotation, through following process: Create a new project. Import documents from the project, bionlp-st-2016-test-proteins to your project. Import annotations from the project, bionlp-st-2016-test-proteins to your project. At this point, you may want to compare you project to this project, the benchmark data set. It will show that protein annotations in your project is 100% correct, but other annotations, e.g., events, are 0%. Produce event annotations, using your system, upon the protein annotations. Upload your event annotations to your project. Compare your project to this project, to get evaluation. GE 2016 benchmark data set is provided as multi-layer annotations which include: bionlp-st-ge-2016-reference: benchmark reference data set bionlp-st-ge-2016-test: benchmark test data set (this project) bionlp-st-ge-2016-test-proteins: protein annotation to the benchmark test data set Following is supporting resources: bionlp-st-ge-2016-coref: coreference annotation bionlp-st-ge-2016-uniprot: Protein annotation with UniProt IDs. pmc-enju-pas: dependency parsing result produced by Enju UBERON-AE: annotation for anatomical entities as defined in UBERON ICD10: annotation for disease names as defined in ICD10 GO-BP: annotation for biological process names as defined in GO GO-CC: annotation for cellular component names as defined in GO A SPARQL-driven search interface is provided at http://bionlp.dbcls.jp/sparql.	7.99 K	DBCLS	Jin-Dong Kim	2023-11-29	Released
bionlp-st-pc-2013-training	The training dataset from the pathway curation (PC) task in the BioNLP Shared Task 2013. The entity types defined in the PC task are simple chemical, gene or gene product, complex and cellular component.	7.86 K	NaCTeM and KISTI	Yue Wang	2023-11-27	Released
bionlp-st-epi-2011-training	The training dataset from the Epigenetics and Post-translational Modifications (EPI) task in the BioNLP Shared Task 2011. The core entities of the task are genes and gene products (RNA and proteins), identified in the data simply as "Protein" annotations.	7.59 K	GENIA	Yue Wang	2023-11-29	Released
Staphylococcus		7.46 K	haruo	haruo	2023-11-29	Testing
BioLarkPubmedHPO	228 abstracts manually annotated with Human Phenotype Ontology (HPO) concepts and harmonized by three curators, which can be used as a reference standard for free text annotation of human phenotypes. For more info, please see Groza et al. "Automatic concept recognition using the human phenotype ontology reference and test suite corpora", 2015.	7.16 K	Tudor Groza	simon	2023-11-29	Released

Name	# Ann.	Author	Maintainer	Updated_at	Status

« 1 2 3 4 5 6 7 8 9 10 11 ... 29 30 » 121-140 / 590 show all
metamap-sample	10.9 K	Alan R Aronson	Jin-Dong Kim	2023-11-27	Testing
bionlp-st-cg-2013-training	10.9 K	NaCTeM	Yue Wang	2023-11-28	Released
test3	10.7 K		ykyao	2023-11-29
FA_Top100Plus-GeneProtein	10.4 K		yucca	2023-11-29	Uploading
Zoonoses	10.3 K		AikoHIRAKI	2023-11-26	Developing
FA_Top107-forWeb	10.3 K		AikoHIRAKI	2023-11-29	Beta
silkwormbase	10 K		sakaniwa	2023-11-29	Testing
silkworm	9.87 K		Masakazu SAGA	2023-11-29	Testing
LitCovid-OGER	9.31 K	Fabio Rinaldi	Nico Colic	2023-11-29	Released
bionlp-st-ge-2016-test-tees	9.17 K	Nico Colic	Nico Colic	2023-11-29	Released
sonoma2	9.09 K	Standigm	chanung	2023-11-29	Beta
LitCoin-GeneOrGeneProduct-v2	8.98 K		Jin-Dong Kim	2023-11-29
Grays_part2	8.63 K		okubo	2023-11-29	Testing
Grays_part2_test	8.57 K		Jin-Dong Kim	2023-11-29	Testing
bionlp-st-gro-2013-training	8.02 K	Jung-jae Kim	Jung-jae Kim	2023-11-29	Testing
bionlp-st-ge-2016-test	7.99 K	DBCLS	Jin-Dong Kim	2023-11-29	Released
bionlp-st-pc-2013-training	7.86 K	NaCTeM and KISTI	Yue Wang	2023-11-27	Released
bionlp-st-epi-2011-training	7.59 K	GENIA	Yue Wang	2023-11-29	Released
Staphylococcus	7.46 K	haruo	haruo	2023-11-29	Testing
BioLarkPubmedHPO	7.16 K	Tudor Groza	simon	2023-11-29	Released