PMC:329117 / 7140-8303

Annotations

Id Subject Object Predicate Lexical cue
T2024 0-9 JJ denotes Olfactory
T2025 10-18 NN denotes receptor
T2026 19-24 NNS denotes genes
T2027 25-29 VBP denotes have
T2028 30-32 DT denotes an
T2030 33-43 JJ denotes intronless
T2031 44-50 VBG denotes coding
T2029 51-57 NN denotes region
T2032 57-59 , denotes ,
T2033 59-70 VBG denotes simplifying
T2034 71-75 CC denotes both
T2035 76-89 JJ denotes computational
T2037 90-93 CC denotes and
T2038 94-106 JJ denotes experimental
T2039 107-116 JJ denotes olfactory
T2040 117-125 NN denotes receptor
T2036 126-140 NN denotes identification
T2041 140-141 . denotes .
T2042 141-220 sentence denotes For a small number of olfactory receptors, gene structure has been determined.
T2043 142-145 IN denotes For
T2045 146-147 DT denotes a
T2047 148-153 JJ denotes small
T2046 154-160 NN denotes number
T2048 161-163 IN denotes of
T2049 164-173 JJ denotes olfactory
T2050 174-183 NNS denotes receptors
T2051 183-185 , denotes ,
T2052 185-189 NN denotes gene
T2053 190-199 NN denotes structure
T2054 200-203 VBZ denotes has
T2055 204-208 VBN denotes been
T2044 209-219 VBN denotes determined
T2056 219-220 . denotes .
T2057 220-332 sentence denotes Additional 5' untranslated exons lie upstream of the coding region and can be alternatively spliced [19,23-26].
T2058 221-231 JJ denotes Additional
T2060 232-233 CD denotes 5
T2061 233-234 SYM denotes '
T2062 235-247 JJ denotes untranslated
T2059 248-253 NNS denotes exons
T2063 254-257 VBP denotes lie
T2064 258-266 RB denotes upstream
T2065 267-269 IN denotes of
T2066 270-273 DT denotes the
T2068 274-280 VBG denotes coding
T2067 281-287 NN denotes region
T2069 288-291 CC denotes and
T2070 292-295 MD denotes can
T2072 296-298 VB denotes be
T2073 299-312 RB denotes alternatively
T2071 313-320 VBN denotes spliced
T2074 321-322 -LRB- denotes [
T2076 322-324 CD denotes 19
T2077 324-325 , denotes ,
T2075 325-327 CD denotes 23
T2078 327-328 SYM denotes -
T2079 328-330 CD denotes 26
T2080 330-331 -RRB- denotes ]
T2081 331-332 . denotes .
T2082 332-384 sentence denotes The 3' untranslated region is typically intronless.
T2083 333-336 DT denotes The
T2085 337-338 CD denotes 3
T2086 338-339 SYM denotes '
T2087 340-352 JJ denotes untranslated
T2084 353-359 NN denotes region
T2088 360-362 VBZ denotes is
T2089 363-372 RB denotes typically
T2090 373-383 JJ denotes intronless
T2091 383-384 . denotes .
T2092 384-517 sentence denotes Exceptions to this stereotyped structure have been described for some human olfactory receptors, but are thought to be rare [25-27].
T2093 385-395 NNS denotes Exceptions
T2095 396-398 IN denotes to
T2096 399-403 DT denotes this
T2098 404-415 JJ denotes stereotyped
T2097 416-425 NN denotes structure
T2099 426-430 VBP denotes have
T2100 431-435 VBN denotes been
T2094 436-445 VBN denotes described
T2101 446-449 IN denotes for
T2102 450-454 DT denotes some
T2104 455-460 JJ denotes human
T2105 461-470 JJ denotes olfactory
T2103 471-480 NNS denotes receptors
T2106 480-482 , denotes ,
T2107 482-485 CC denotes but
T2108 486-489 VBP denotes are
T2109 490-497 VBN denotes thought
T2110 498-500 TO denotes to
T2111 501-503 VB denotes be
T2112 504-508 JJ denotes rare
T2113 509-510 -LRB- denotes [
T2114 510-512 CD denotes 25
T2115 512-513 SYM denotes -
T2116 513-515 CD denotes 27
T2117 515-516 -RRB- denotes ]
T2118 516-517 . denotes .
T2119 517-641 sentence denotes cDNA identification and RACE data have been used to determine gene structure for about 30 genes, see, for example, [19,23].
T2120 518-522 NN denotes cDNA
T2121 523-537 NN denotes identification
T2123 538-541 CC denotes and
T2124 542-546 NN denotes RACE
T2125 547-551 NNS denotes data
T2126 552-556 VBP denotes have
T2127 557-561 VBN denotes been
T2122 562-566 VBN denotes used
T2128 567-569 TO denotes to
T2129 570-579 VB denotes determine
T2130 580-584 NN denotes gene
T2131 585-594 NN denotes structure
T2132 595-598 IN denotes for
T2133 599-604 RB denotes about
T2134 605-607 CD denotes 30
T2135 608-613 NNS denotes genes
T2136 613-615 , denotes ,
T2137 615-618 VB denotes see
T2138 618-620 , denotes ,
T2139 620-623 IN denotes for
T2141 624-631 NN denotes example
T2142 631-633 , denotes ,
T2143 633-634 -LRB- denotes [
T2144 634-636 CD denotes 19
T2145 636-637 , denotes ,
T2140 637-639 CD denotes 23
T2146 639-640 -RRB- denotes ]
T2147 640-641 . denotes .
T2148 641-793 sentence denotes However, computational prediction of the location of 5' upstream exons and the extent of the 3' UTR from genomic sequence has been extremely difficult.
T2149 642-649 RB denotes However
T2151 649-651 , denotes ,
T2152 651-664 JJ denotes computational
T2153 665-675 NN denotes prediction
T2154 676-678 IN denotes of
T2155 679-682 DT denotes the
T2156 683-691 NN denotes location
T2157 692-694 IN denotes of
T2158 695-696 CD denotes 5
T2160 696-697 SYM denotes '
T2161 698-706 JJ denotes upstream
T2159 707-712 NNS denotes exons
T2162 713-716 CC denotes and
T2163 717-720 DT denotes the
T2164 721-727 NN denotes extent
T2165 728-730 IN denotes of
T2166 731-734 DT denotes the
T2168 735-736 CD denotes 3
T2169 736-737 SYM denotes '
T2167 738-741 NN denotes UTR
T2170 742-746 IN denotes from
T2171 747-754 JJ denotes genomic
T2172 755-763 NN denotes sequence
T2173 764-767 VBZ denotes has
T2150 768-772 VBN denotes been
T2174 773-782 RB denotes extremely
T2175 783-792 JJ denotes difficult
T2176 792-793 . denotes .
T2177 793-967 sentence denotes A combination of splice site predictions and similarity to other olfactory receptors has allowed some investigators to predict 5' exon locations for around 15 genes [25,28].
T2178 794-795 DT denotes A
T2179 796-807 NN denotes combination
T2181 808-810 IN denotes of
T2182 811-817 NN denotes splice
T2183 818-822 NN denotes site
T2184 823-834 NNS denotes predictions
T2185 835-838 CC denotes and
T2186 839-849 NN denotes similarity
T2187 850-852 IN denotes to
T2188 853-858 JJ denotes other
T2190 859-868 JJ denotes olfactory
T2189 869-878 NNS denotes receptors
T2191 879-882 VBZ denotes has
T2180 883-890 VBN denotes allowed
T2192 891-895 DT denotes some
T2193 896-909 NNS denotes investigators
T2195 910-912 TO denotes to
T2194 913-920 VB denotes predict
T2196 921-922 CD denotes 5
T2198 922-923 SYM denotes '
T2197 924-928 NN denotes exon
T2199 929-938 NNS denotes locations
T2200 939-942 IN denotes for
T2201 943-949 RB denotes around
T2202 950-952 CD denotes 15
T2203 953-958 NNS denotes genes
T2204 959-960 -LRB- denotes [
T2206 960-962 CD denotes 25
T2207 962-963 , denotes ,
T2205 963-965 CD denotes 28
T2208 965-966 -RRB- denotes ]
T2209 966-967 . denotes .
T2210 967-1055 sentence denotes Experimental validation shows that some, but not all, predictions are accurate [24,25].
T2211 968-980 JJ denotes Experimental
T2212 981-991 NN denotes validation
T2213 992-997 VBZ denotes shows
T2214 998-1002 IN denotes that
T2216 1003-1007 DT denotes some
T2217 1007-1009 , denotes ,
T2218 1009-1012 CC denotes but
T2219 1013-1016 RB denotes not
T2220 1017-1020 DT denotes all
T2221 1020-1022 , denotes ,
T2222 1022-1033 NNS denotes predictions
T2215 1034-1037 VBP denotes are
T2223 1038-1046 JJ denotes accurate
T2224 1047-1048 -LRB- denotes [
T2226 1048-1050 CD denotes 24
T2227 1050-1051 , denotes ,
T2225 1051-1053 CD denotes 25
T2228 1053-1054 -RRB- denotes ]
T2229 1054-1055 . denotes .
T2230 1055-1163 sentence denotes The total number of olfactory receptors for which gene structure is known is vastly increased by our study.
T2231 1056-1059 DT denotes The
T2233 1060-1065 JJ denotes total
T2232 1066-1072 NN denotes number
T2235 1073-1075 IN denotes of
T2236 1076-1085 JJ denotes olfactory
T2237 1086-1095 NNS denotes receptors
T2238 1096-1099 IN denotes for
T2240 1100-1105 WDT denotes which
T2241 1106-1110 NN denotes gene
T2242 1111-1120 NN denotes structure
T2243 1121-1123 VBZ denotes is
T2239 1124-1129 VBN denotes known
T2244 1130-1132 VBZ denotes is
T2245 1133-1139 RB denotes vastly
T2234 1140-1149 VBN denotes increased
T2246 1150-1152 IN denotes by
T2247 1153-1156 PRP$ denotes our
T2248 1157-1162 NN denotes study
T2249 1162-1163 . denotes .
R1048 T2024 T2025 amod Olfactory,receptor
R1055 T2025 T2026 compound receptor,genes
R1058 T2026 T2027 nsubj genes,have
R1062 T2028 T2029 det an,region
R1067 T2029 T2027 dobj region,have
R1071 T2030 T2029 amod intronless,region
R1074 T2031 T2029 amod coding,region
R1078 T2032 T2027 punct ", ",have
R1083 T2033 T2027 advcl simplifying,have
R1086 T2034 T2035 preconj both,computational
R1090 T2035 T2036 amod computational,identification
R1094 T2036 T2033 dobj identification,simplifying
R1097 T2037 T2035 cc and,computational
R1100 T2038 T2035 conj experimental,computational
R1106 T2039 T2040 amod olfactory,receptor
R1109 T2040 T2036 compound receptor,identification
R1113 T2041 T2027 punct .,have
R1118 T2043 T2044 prep For,determined
R1119 T2045 T2046 det a,number
R1120 T2046 T2043 pobj number,For
R1121 T2115 T2116 punct -,27
R1122 T2047 T2046 amod small,number
R1123 T2116 T2114 prep 27,25
R1124 T2117 T2114 punct ],25
R1125 T2118 T2094 punct .,described
R1126 T2048 T2046 prep of,number
R1127 T2120 T2121 compound cDNA,identification
R1128 T2049 T2050 amod olfactory,receptors
R1129 T2121 T2122 nsubjpass identification,used
R1130 T2123 T2121 cc and,identification
R1131 T2050 T2048 pobj receptors,of
R1132 T2124 T2125 compound RACE,data
R1133 T2125 T2121 conj data,identification
R1134 T2126 T2122 aux have,used
R1135 T2051 T2044 punct ", ",determined
R1136 T2127 T2122 auxpass been,used
R1137 T2128 T2129 aux to,determine
R1138 T2129 T2122 advcl determine,used
R1139 T2052 T2053 compound gene,structure
R1140 T2130 T2131 compound gene,structure
R1141 T2131 T2129 dobj structure,determine
R1142 T2132 T2129 prep for,determine
R1143 T2053 T2044 nsubjpass structure,determined
R1144 T2133 T2134 advmod about,30
R1145 T2134 T2135 nummod 30,genes
R1146 T2054 T2044 aux has,determined
R1147 T2135 T2132 pobj genes,for
R1148 T2136 T2137 punct ", ",see
R1149 T2137 T2122 parataxis see,used
R1150 T2055 T2044 auxpass been,determined
R1151 T2138 T2137 punct ", ",see
R1152 T2139 T2140 prep for,23
R1153 T2140 T2137 dobj 23,see
R1154 T2056 T2044 punct .,determined
R1155 T2141 T2139 pobj example,for
R1156 T2142 T2140 punct ", ",23
R1157 T2143 T2140 punct [,23
R1158 T2144 T2140 nummod 19,23
R1159 T2145 T2140 punct ",",23
R1160 T2058 T2059 amod Additional,exons
R1161 T2146 T2137 punct ],see
R1162 T2147 T2122 punct .,used
R1163 T2149 T2150 advmod However,been
R1164 T2059 T2063 nsubj exons,lie
R1165 T2151 T2150 punct ", ",been
R1166 T2152 T2153 amod computational,prediction
R1167 T2153 T2150 nsubj prediction,been
R1168 T2060 T2059 nummod 5,exons
R1169 T2154 T2153 prep of,prediction
R1170 T2155 T2156 det the,location
R1171 T2156 T2154 pobj location,of
R1172 T2061 T2060 punct ',5
R1173 T2157 T2156 prep of,location
R1174 T2158 T2159 nummod 5,exons
R1175 T2159 T2157 pobj exons,of
R1176 T2062 T2059 amod untranslated,exons
R1177 T2160 T2158 punct ',5
R1178 T2161 T2159 amod upstream,exons
R1179 T2162 T2153 cc and,prediction
R1180 T2064 T2063 advmod upstream,lie
R1181 T2163 T2164 det the,extent
R1182 T2164 T2153 conj extent,prediction
R1183 T2165 T2164 prep of,extent
R1184 T2166 T2167 det the,UTR
R1185 T2065 T2064 prep of,upstream
R1186 T2167 T2165 pobj UTR,of
R1187 T2168 T2167 nummod 3,UTR
R1188 T2066 T2067 det the,region
R1189 T2169 T2168 punct ',3
R1190 T2170 T2153 prep from,prediction
R1191 T2171 T2172 amod genomic,sequence
R1192 T2067 T2065 pobj region,of
R1193 T2172 T2170 pobj sequence,from
R1194 T2173 T2150 aux has,been
R1195 T2174 T2175 advmod extremely,difficult
R1196 T2068 T2067 amod coding,region
R1197 T2175 T2150 acomp difficult,been
R1198 T2176 T2150 punct .,been
R1199 T2069 T2063 cc and,lie
R1200 T2178 T2179 det A,combination
R1201 T2179 T2180 nsubj combination,allowed
R1202 T2070 T2071 aux can,spliced
R1203 T2181 T2179 prep of,combination
R1204 T2182 T2183 compound splice,site
R1205 T2071 T2063 conj spliced,lie
R1206 T2183 T2184 compound site,predictions
R1207 T2184 T2181 pobj predictions,of
R1208 T2185 T2184 cc and,predictions
R1209 T2186 T2184 conj similarity,predictions
R1210 T2187 T2186 prep to,similarity
R1211 T2188 T2189 amod other,receptors
R1212 T2072 T2071 auxpass be,spliced
R1213 T2189 T2187 pobj receptors,to
R1214 T2190 T2189 amod olfactory,receptors
R1215 T2073 T2071 advmod alternatively,spliced
R1216 T2074 T2075 punct [,23
R1217 T2191 T2180 aux has,allowed
R1218 T2075 T2071 parataxis 23,spliced
R1219 T2192 T2193 det some,investigators
R1220 T2076 T2075 dep 19,23
R1221 T2193 T2194 nsubj investigators,predict
R1222 T2194 T2180 ccomp predict,allowed
R1223 T2077 T2075 punct ",",23
R1224 T2195 T2194 aux to,predict
R1225 T2196 T2197 nummod 5,exon
R1226 T2197 T2199 compound exon,locations
R1227 T2078 T2079 punct -,26
R1228 T2198 T2196 punct ',5
R1229 T2199 T2194 dobj locations,predict
R1230 T2200 T2194 prep for,predict
R1231 T2079 T2075 prep 26,23
R1232 T2201 T2202 advmod around,15
R1233 T2202 T2203 nummod 15,genes
R1234 T2080 T2075 punct ],23
R1235 T2203 T2200 pobj genes,for
R1236 T2204 T2205 punct [,28
R1237 T2205 T2180 parataxis 28,allowed
R1238 T2206 T2205 nummod 25,28
R1239 T2081 T2063 punct .,lie
R1240 T2207 T2205 punct ",",28
R1241 T2208 T2205 punct ],28
R1242 T2209 T2180 punct .,allowed
R1243 T2083 T2084 det The,region
R1244 T2211 T2212 amod Experimental,validation
R1245 T2212 T2213 nsubj validation,shows
R1246 T2084 T2088 nsubj region,is
R1247 T2214 T2215 mark that,are
R1248 T2215 T2213 ccomp are,shows
R1249 T2085 T2084 nummod 3,region
R1250 T2216 T2215 nsubj some,are
R1251 T2217 T2216 punct ", ",some
R1252 T2218 T2216 cc but,some
R1253 T2086 T2085 punct ',3
R1254 T2219 T2218 neg not,but
R1255 T2220 T2216 conj all,some
R1256 T2087 T2084 amod untranslated,region
R1257 T2089 T2088 advmod typically,is
R1258 T2090 T2088 acomp intronless,is
R1259 T2221 T2220 punct ", ",all
R1260 T2222 T2220 conj predictions,all
R1261 T2223 T2215 acomp accurate,are
R1262 T2224 T2225 punct [,25
R1263 T2091 T2088 punct .,is
R1264 T2225 T2213 parataxis 25,shows
R1265 T2226 T2225 nummod 24,25
R1266 T2227 T2225 punct ",",25
R1267 T2093 T2094 nsubjpass Exceptions,described
R1268 T2228 T2225 punct ],25
R1269 T2229 T2213 punct .,shows
R1270 T2231 T2232 det The,number
R1271 T2095 T2093 prep to,Exceptions
R1272 T2232 T2234 nsubjpass number,increased
R1273 T2233 T2232 amod total,number
R1274 T2235 T2232 prep of,number
R1275 T2096 T2097 det this,structure
R1276 T2236 T2237 amod olfactory,receptors
R1277 T2237 T2235 pobj receptors,of
R1278 T2238 T2239 prep for,known
R1279 T2097 T2095 pobj structure,to
R1280 T2239 T2232 relcl known,number
R1281 T2240 T2238 pobj which,for
R1282 T2098 T2097 amod stereotyped,structure
R1283 T2241 T2242 compound gene,structure
R1284 T2242 T2239 nsubjpass structure,known
R1285 T2243 T2239 auxpass is,known
R1286 T2099 T2094 aux have,described
R1287 T2244 T2234 auxpass is,increased
R1288 T2245 T2234 advmod vastly,increased
R1289 T2246 T2234 agent by,increased
R1290 T2100 T2094 auxpass been,described
R1291 T2247 T2248 poss our,study
R1292 T2248 T2246 pobj study,by
R1293 T2249 T2234 punct .,increased
R1294 T2101 T2094 prep for,described
R1296 T2102 T2103 det some,receptors
R1299 T2103 T2101 pobj receptors,for
R1302 T2104 T2103 amod human,receptors
R1306 T2105 T2103 amod olfactory,receptors
R1310 T2106 T2094 punct ", ",described
R1317 T2107 T2094 cc but,described
R1320 T2108 T2109 auxpass are,thought
R1322 T2109 T2094 conj thought,described
R1325 T2110 T2111 aux to,be
R1329 T2111 T2109 xcomp be,thought
R1332 T2112 T2111 acomp rare,be
R1333 T2113 T2114 punct [,25
R1335 T2114 T2109 parataxis 25,thought
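
The rows above can be loaded into simple records for further processing. The following is a minimal sketch, assuming each row is whitespace-separated in the column order given by the header (Id, Subject, Object, Predicate, Lexical cue), that `T` rows carry a `begin-end` character span, and that `R` rows link two `T` ids. The names `Denotation`, `Relation`, and `parse_rows` are hypothetical and are not part of any annotation tool's API.

```python
from dataclasses import dataclass


@dataclass
class Denotation:
    id: str      # e.g. "T2024"
    begin: int   # start offset of the span
    end: int     # end offset of the span
    label: str   # POS tag, or "sentence"
    text: str    # lexical cue


@dataclass
class Relation:
    id: str      # e.g. "R1048"
    subj: str    # id of the dependent token
    obj: str     # id of the head token
    pred: str    # dependency label, e.g. "amod"


def parse_rows(lines):
    """Split rows into denotations (keyed by id) and relations.

    Assumption: the first four columns never contain whitespace, so
    everything after the fourth split is the lexical cue.
    """
    denotations, relations = {}, []
    for line in lines:
        parts = line.split(None, 4)
        if len(parts) < 4:
            continue  # blank or non-data line
        row_id, subj, obj, pred = parts[:4]
        cue = parts[4] if len(parts) == 5 else ""
        if not row_id[1:].isdigit():
            continue  # header or other non-data line
        if row_id.startswith("T") and pred == "denotes":
            begin, _, end = subj.partition("-")
            denotations[row_id] = Denotation(row_id, int(begin), int(end), obj, cue)
        elif row_id.startswith("R"):
            relations.append(Relation(row_id, subj, obj, pred))
    return denotations, relations


# Usage with a few rows copied from the listing above.
rows = [
    "Id Subject Object Predicate Lexical cue",
    "T2024 0-9 JJ denotes Olfactory",
    "T2025 10-18 NN denotes receptor",
    "T2042 141-220 sentence denotes For a small number of olfactory receptors, gene structure has been determined.",
    "R1048 T2024 T2025 amod Olfactory,receptor",
]
dens, rels = parse_rows(rows)
tokens = [d for d in dens.values() if d.label != "sentence"]
```

Keeping sentence rows in the same dictionary as tokens mirrors the listing, where both use the `denotes` predicate; filtering on the label separates them when only tokens are needed.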