PMC:1420271 / 4318-5092 JSONTXT 4 Projects

Annnotations TAB TSV DIC JSON TextAE

Id Subject Object Predicate Lexical cue
T4031 0-2 TO denotes To
T4032 3-12 VB denotes determine
T4034 13-16 DT denotes the
T4036 17-28 JJ denotes chromosomal
T4035 29-42 NNS denotes localizations
T4037 43-45 IN denotes of
T4038 46-49 DT denotes the
T4040 50-55 NN denotes mouse
T4041 56-60 NN denotes ESG1
T4039 61-65 NN denotes gene
T4042 66-69 CC denotes and
T4043 70-81 NNS denotes pseudogenes
T4044 81-83 , denotes ,
T4045 83-85 PRP denotes we
T4033 86-95 VBD denotes performed
T4046 96-97 DT denotes a
T4048 98-103 NNP denotes Blast
T4047 104-112 NN denotes analysis
T4049 113-115 IN denotes of
T4050 116-119 DT denotes the
T4052 120-125 NN denotes mouse
T4053 126-133 JJ denotes genomic
T4051 134-142 NN denotes database
T4054 143-147 IN denotes with
T4055 148-151 DT denotes the
T4057 152-156 NN denotes ESG1
T4058 157-161 NN denotes cDNA
T4056 162-170 NN denotes sequence
T4059 171-173 IN denotes as
T4060 174-175 DT denotes a
T4061 176-181 NN denotes query
T4062 181-182 . denotes .
T4063 182-300 sentence denotes We identified several putative pseudogenes without introns on chromosomes 1, 5, 11, 12, 14, 16, 17, and X (Figure 1).
T4064 183-185 PRP denotes We
T4065 186-196 VBD denotes identified
T4066 197-204 JJ denotes several
T4068 205-213 JJ denotes putative
T4067 214-225 NNS denotes pseudogenes
T4069 226-233 IN denotes without
T4070 234-241 NNS denotes introns
T4071 242-244 IN denotes on
T4072 245-256 NNS denotes chromosomes
T4074 257-258 CD denotes 1
T4075 258-260 , denotes ,
T4076 260-261 CD denotes 5
T4077 261-263 , denotes ,
T4078 263-265 CD denotes 11
T4079 265-267 , denotes ,
T4080 267-269 CD denotes 12
T4081 269-271 , denotes ,
T4082 271-273 CD denotes 14
T4083 273-275 , denotes ,
T4084 275-277 CD denotes 16
T4085 277-279 , denotes ,
T4086 279-281 CD denotes 17
T4087 281-283 , denotes ,
T4088 283-286 CC denotes and
T4073 287-288 NN denotes X
T4089 289-290 -LRB- denotes (
T4090 290-296 NN denotes Figure
T4091 297-298 CD denotes 1
T4092 298-299 -RRB- denotes )
T4093 299-300 . denotes .
T4094 300-432 sentence denotes In addition, two intron-less pseudogenes were identified in DNA fragments for which the chromosomal localization remained unmapped.
T4095 301-303 IN denotes In
T4097 304-312 NN denotes addition
T4098 312-314 , denotes ,
T4099 314-317 CD denotes two
T4101 318-329 JJ denotes intron-less
T4100 330-341 NNS denotes pseudogenes
T4102 342-346 VBD denotes were
T4096 347-357 VBN denotes identified
T4103 358-360 IN denotes in
T4104 361-364 NN denotes DNA
T4105 365-374 NNS denotes fragments
T4106 375-378 IN denotes for
T4108 379-384 WDT denotes which
T4109 385-388 DT denotes the
T4111 389-400 JJ denotes chromosomal
T4110 401-413 NN denotes localization
T4107 414-422 VBD denotes remained
T4112 423-431 JJ denotes unmapped
T4113 431-432 . denotes .
T4114 432-572 sentence denotes While these pseudodgenes have significant homology to ESG1 cDNA, they could not produce functional proteins, because of critical mutations.
T4115 433-438 IN denotes While
T4117 439-444 DT denotes these
T4118 445-457 NNS denotes pseudodgenes
T4116 458-462 VBP denotes have
T4120 463-474 JJ denotes significant
T4121 475-483 NN denotes homology
T4122 484-486 IN denotes to
T4123 487-491 NN denotes ESG1
T4124 492-496 NN denotes cDNA
T4125 496-498 , denotes ,
T4126 498-502 PRP denotes they
T4127 503-508 MD denotes could
T4128 509-512 RB denotes not
T4119 513-520 VB denotes produce
T4129 521-531 JJ denotes functional
T4130 532-540 NN denotes proteins
T4131 540-542 , denotes ,
T4132 542-549 IN denotes because
T4133 550-552 IN denotes of
T4134 553-561 JJ denotes critical
T4135 562-571 NNS denotes mutations
T4136 571-572 . denotes .
T4137 572-680 sentence denotes This result suggests that there are a larger number of intron-less pseudogenes than previously anticipated.
T4138 573-577 DT denotes This
T4139 578-584 NN denotes result
T4140 585-593 VBZ denotes suggests
T4141 594-598 IN denotes that
T4143 599-604 EX denotes there
T4142 605-608 VBP denotes are
T4144 609-610 DT denotes a
T4146 611-617 JJR denotes larger
T4145 618-624 NN denotes number
T4147 625-627 IN denotes of
T4148 628-639 JJ denotes intron-less
T4149 640-651 NNS denotes pseudogenes
T4150 652-656 IN denotes than
T4152 657-667 RB denotes previously
T4151 668-679 VBN denotes anticipated
T4153 679-680 . denotes .
T4154 680-774 sentence denotes Existence of multiple retropseudogenes is a hallmark of pluripotent cell-specific genes [18].
T4155 681-690 NN denotes Existence
T4157 691-693 IN denotes of
T4158 694-702 JJ denotes multiple
T4159 703-719 NNS denotes retropseudogenes
T4156 720-722 VBZ denotes is
T4160 723-724 DT denotes a
T4161 725-733 NN denotes hallmark
T4162 734-736 IN denotes of
T4163 737-748 JJ denotes pluripotent
T4165 749-753 NN denotes cell
T4167 753-754 HYPH denotes -
T4166 754-762 JJ denotes specific
T4164 763-768 NNS denotes genes
T4168 769-770 -LRB- denotes [
T4169 770-772 CD denotes 18
T4170 772-773 -RRB- denotes ]
T4171 773-774 . denotes .
R743 T4031 T4032 aux To,determine
R744 T4032 T4033 advcl determine,performed
R745 T4034 T4035 det the,localizations
R746 T4035 T4032 dobj localizations,determine
R747 T4036 T4035 amod chromosomal,localizations
R748 T4037 T4035 prep of,localizations
R749 T4038 T4039 det the,gene
R750 T4039 T4037 pobj gene,of
R751 T4040 T4039 compound mouse,gene
R752 T4041 T4039 compound ESG1,gene
R753 T4042 T4039 cc and,gene
R754 T4043 T4039 conj pseudogenes,gene
R755 T4044 T4033 punct ", ",performed
R756 T4045 T4033 nsubj we,performed
R757 T4046 T4047 det a,analysis
R758 T4047 T4033 dobj analysis,performed
R759 T4048 T4047 compound Blast,analysis
R760 T4049 T4047 prep of,analysis
R761 T4050 T4051 det the,database
R762 T4051 T4049 pobj database,of
R763 T4052 T4051 nmod mouse,database
R764 T4053 T4051 amod genomic,database
R765 T4054 T4033 prep with,performed
R766 T4055 T4056 det the,sequence
R767 T4056 T4054 pobj sequence,with
R768 T4057 T4056 compound ESG1,sequence
R769 T4058 T4056 compound cDNA,sequence
R770 T4059 T4056 prep as,sequence
R771 T4060 T4061 det a,query
R772 T4061 T4059 pobj query,as
R773 T4062 T4033 punct .,performed
R774 T4064 T4065 nsubj We,identified
R775 T4066 T4067 amod several,pseudogenes
R776 T4067 T4065 dobj pseudogenes,identified
R777 T4068 T4067 amod putative,pseudogenes
R778 T4069 T4067 prep without,pseudogenes
R779 T4070 T4069 pobj introns,without
R780 T4071 T4065 prep on,identified
R781 T4072 T4073 nmod chromosomes,X
R782 T4073 T4071 pobj X,on
R783 T4074 T4073 nummod 1,X
R784 T4075 T4073 punct ", ",X
R785 T4076 T4073 nummod 5,X
R786 T4077 T4073 punct ", ",X
R787 T4078 T4073 nummod 11,X
R788 T4079 T4073 punct ", ",X
R789 T4080 T4073 nummod 12,X
R790 T4081 T4073 punct ", ",X
R791 T4082 T4073 nummod 14,X
R792 T4083 T4073 punct ", ",X
R793 T4084 T4073 nummod 16,X
R794 T4085 T4073 punct ", ",X
R795 T4086 T4073 nummod 17,X
R796 T4087 T4073 punct ", ",X
R797 T4088 T4073 cc and,X
R798 T4089 T4090 punct (,Figure
R799 T4090 T4065 parataxis Figure,identified
R800 T4091 T4090 nummod 1,Figure
R801 T4092 T4090 punct ),Figure
R802 T4093 T4065 punct .,identified
R803 T4095 T4096 prep In,identified
R804 T4097 T4095 pobj addition,In
R805 T4098 T4096 punct ", ",identified
R806 T4099 T4100 nummod two,pseudogenes
R807 T4100 T4096 nsubjpass pseudogenes,identified
R808 T4101 T4100 amod intron-less,pseudogenes
R809 T4102 T4096 auxpass were,identified
R810 T4103 T4096 prep in,identified
R811 T4104 T4105 compound DNA,fragments
R812 T4105 T4103 pobj fragments,in
R813 T4106 T4107 prep for,remained
R814 T4107 T4105 relcl remained,fragments
R815 T4108 T4106 pobj which,for
R816 T4109 T4110 det the,localization
R817 T4110 T4107 nsubj localization,remained
R818 T4111 T4110 amod chromosomal,localization
R819 T4112 T4107 acomp unmapped,remained
R820 T4113 T4096 punct .,identified
R821 T4115 T4116 mark While,have
R822 T4116 T4119 advcl have,produce
R823 T4117 T4118 det these,pseudodgenes
R824 T4118 T4116 nsubj pseudodgenes,have
R825 T4120 T4121 amod significant,homology
R826 T4121 T4116 dobj homology,have
R827 T4122 T4121 prep to,homology
R828 T4123 T4124 compound ESG1,cDNA
R829 T4124 T4122 pobj cDNA,to
R830 T4125 T4119 punct ", ",produce
R831 T4126 T4119 nsubj they,produce
R832 T4127 T4119 aux could,produce
R833 T4128 T4119 neg not,produce
R834 T4129 T4130 amod functional,proteins
R835 T4130 T4119 dobj proteins,produce
R836 T4131 T4119 punct ", ",produce
R837 T4132 T4119 prep because,produce
R838 T4133 T4132 pcomp of,because
R839 T4134 T4135 amod critical,mutations
R840 T4135 T4132 pobj mutations,because
R841 T4136 T4119 punct .,produce
R842 T4138 T4139 det This,result
R843 T4139 T4140 nsubj result,suggests
R844 T4141 T4142 mark that,are
R845 T4142 T4140 ccomp are,suggests
R846 T4143 T4142 expl there,are
R847 T4144 T4145 det a,number
R848 T4145 T4142 attr number,are
R849 T4146 T4145 amod larger,number
R850 T4147 T4145 prep of,number
R851 T4148 T4149 amod intron-less,pseudogenes
R852 T4149 T4147 pobj pseudogenes,of
R853 T4150 T4151 mark than,anticipated
R854 T4151 T4145 advcl anticipated,number
R855 T4152 T4151 advmod previously,anticipated
R856 T4153 T4140 punct .,suggests
R857 T4155 T4156 nsubj Existence,is
R858 T4157 T4155 prep of,Existence
R859 T4158 T4159 amod multiple,retropseudogenes
R860 T4159 T4157 pobj retropseudogenes,of
R861 T4160 T4161 det a,hallmark
R862 T4161 T4156 attr hallmark,is
R863 T4162 T4161 prep of,hallmark
R864 T4163 T4164 amod pluripotent,genes
R865 T4164 T4162 pobj genes,of
R866 T4165 T4166 npadvmod cell,specific
R867 T4166 T4164 amod specific,genes
R868 T4167 T4166 punct -,specific
R869 T4168 T4169 punct [,18
R870 T4169 T4161 parataxis 18,hallmark
R871 T4170 T4169 punct ],18
R872 T4171 T4156 punct .,is