Id |
Subject |
Object |
Predicate |
Lexical cue |
T5515 |
0-3 |
DT |
denotes |
The |
T5516 |
16-25 |
NNS |
denotes |
sequences |
T5517 |
4-11 |
JJ |
denotes |
genomic |
T5518 |
12-15 |
NN |
denotes |
DNA |
T5519 |
157-166 |
VBN |
denotes |
extracted |
T5520 |
26-39 |
VBG |
denotes |
corresponding |
T5521 |
40-42 |
IN |
denotes |
to |
T5522 |
43-46 |
DT |
denotes |
the |
T5523 |
64-69 |
NNS |
denotes |
genes |
T5524 |
47-58 |
JJ |
denotes |
orthologous |
T5525 |
59-63 |
NN |
denotes |
TACC |
T5526 |
70-72 |
IN |
denotes |
of |
T5527 |
73-78 |
JJ |
denotes |
human |
T5528 |
78-80 |
, |
denotes |
, |
T5529 |
80-85 |
NN |
denotes |
mouse |
T5530 |
85-87 |
, |
denotes |
, |
T5531 |
87-90 |
NN |
denotes |
rat |
T5532 |
90-92 |
, |
denotes |
, |
T5533 |
92-102 |
NN |
denotes |
pufferfish |
T5534 |
102-104 |
, |
denotes |
, |
T5535 |
104-106 |
NNP |
denotes |
C. |
T5536 |
107-119 |
NNP |
denotes |
intestinalis |
T5537 |
119-121 |
, |
denotes |
, |
T5538 |
121-123 |
NNP |
denotes |
D. |
T5539 |
124-136 |
NNP |
denotes |
melanogaster |
T5540 |
137-140 |
CC |
denotes |
and |
T5541 |
141-143 |
NNP |
denotes |
C. |
T5542 |
144-151 |
NNP |
denotes |
elegans |
T5543 |
152-156 |
VBD |
denotes |
were |
T5544 |
167-170 |
CC |
denotes |
and |
T5545 |
171-179 |
VBN |
denotes |
analyzed |
T5546 |
180-182 |
IN |
denotes |
by |
T5547 |
183-191 |
NNP |
denotes |
Genescan |
T5548 |
192-195 |
CC |
denotes |
and |
T5549 |
196-201 |
NNP |
denotes |
BLAST |
T5550 |
202-204 |
TO |
denotes |
to |
T5551 |
205-214 |
VB |
denotes |
determine |
T5552 |
215-218 |
DT |
denotes |
the |
T5553 |
227-236 |
NN |
denotes |
structure |
T5554 |
219-226 |
JJ |
denotes |
genomic |
T5555 |
237-239 |
IN |
denotes |
of |
T5556 |
240-244 |
DT |
denotes |
each |
T5557 |
250-254 |
NN |
denotes |
gene |
T5558 |
245-249 |
NN |
denotes |
TACC |
T5559 |
254-255 |
. |
denotes |
. |
T5560 |
255-422 |
sentence |
denotes |
In some cases, for rat and pufferfish, exons were added or modified based on the best similarity of translated peptides to the corresponding mouse and human proteins. |
T5561 |
256-258 |
IN |
denotes |
In |
T5562 |
306-311 |
VBN |
denotes |
added |
T5563 |
259-263 |
DT |
denotes |
some |
T5564 |
264-269 |
NNS |
denotes |
cases |
T5565 |
269-271 |
, |
denotes |
, |
T5566 |
271-274 |
IN |
denotes |
for |
T5567 |
275-278 |
NN |
denotes |
rat |
T5568 |
279-282 |
CC |
denotes |
and |
T5569 |
283-293 |
NN |
denotes |
pufferfish |
T5570 |
293-295 |
, |
denotes |
, |
T5571 |
295-300 |
NNS |
denotes |
exons |
T5572 |
301-305 |
VBD |
denotes |
were |
T5573 |
312-314 |
CC |
denotes |
or |
T5574 |
315-323 |
VBN |
denotes |
modified |
T5575 |
324-329 |
VBN |
denotes |
based |
T5576 |
330-332 |
IN |
denotes |
on |
T5577 |
333-336 |
DT |
denotes |
the |
T5578 |
342-352 |
NN |
denotes |
similarity |
T5579 |
337-341 |
JJS |
denotes |
best |
T5580 |
353-355 |
IN |
denotes |
of |
T5581 |
356-366 |
VBN |
denotes |
translated |
T5582 |
367-375 |
NNS |
denotes |
peptides |
T5583 |
376-378 |
IN |
denotes |
to |
T5584 |
379-382 |
DT |
denotes |
the |
T5585 |
413-421 |
NN |
denotes |
proteins |
T5586 |
383-396 |
VBG |
denotes |
corresponding |
T5587 |
397-402 |
NN |
denotes |
mouse |
T5588 |
403-406 |
CC |
denotes |
and |
T5589 |
407-412 |
JJ |
denotes |
human |
T5590 |
421-422 |
. |
denotes |
. |
T5591 |
422-615 |
sentence |
denotes |
For regions with low sequence similarity in T. rubripes, genomic sequences from the fresh water pufferfish, Tetraodon nigroviridis were used as additional means to verify the predicted exons. |
T5592 |
423-426 |
IN |
denotes |
For |
T5593 |
560-564 |
VBN |
denotes |
used |
T5594 |
427-434 |
NNS |
denotes |
regions |
T5595 |
435-439 |
IN |
denotes |
with |
T5596 |
440-443 |
JJ |
denotes |
low |
T5597 |
453-463 |
NN |
denotes |
similarity |
T5598 |
444-452 |
NN |
denotes |
sequence |
T5599 |
464-466 |
IN |
denotes |
in |
T5600 |
467-469 |
NNP |
denotes |
T. |
T5601 |
470-478 |
NNP |
denotes |
rubripes |
T5602 |
478-480 |
, |
denotes |
, |
T5603 |
480-487 |
JJ |
denotes |
genomic |
T5604 |
488-497 |
NNS |
denotes |
sequences |
T5605 |
499-503 |
IN |
denotes |
from |
T5606 |
504-507 |
DT |
denotes |
the |
T5607 |
520-530 |
NN |
denotes |
pufferfish |
T5608 |
508-513 |
JJ |
denotes |
fresh |
T5609 |
514-519 |
NN |
denotes |
water |
T5610 |
530-532 |
, |
denotes |
, |
T5611 |
532-541 |
NNP |
denotes |
Tetraodon |
T5612 |
542-554 |
NNP |
denotes |
nigroviridis |
T5613 |
555-559 |
VBD |
denotes |
were |
T5614 |
565-567 |
IN |
denotes |
as |
T5615 |
568-578 |
JJ |
denotes |
additional |
T5616 |
579-584 |
NNS |
denotes |
means |
T5617 |
585-587 |
TO |
denotes |
to |
T5618 |
588-594 |
VB |
denotes |
verify |
T5619 |
595-598 |
DT |
denotes |
the |
T5620 |
609-614 |
NNS |
denotes |
exons |
T5621 |
599-608 |
VBN |
denotes |
predicted |
T5622 |
614-615 |
. |
denotes |
. |
R3411 |
T5515 |
T5516 |
det |
The,sequences |
R3412 |
T5516 |
T5519 |
nsubj |
sequences,extracted |
R3413 |
T5517 |
T5516 |
amod |
genomic,sequences |
R3414 |
T5518 |
T5516 |
compound |
DNA,sequences |
R3415 |
T5520 |
T5516 |
acl |
corresponding,sequences |
R3416 |
T5521 |
T5520 |
prep |
to,corresponding |
R3417 |
T5522 |
T5523 |
det |
the,genes |
R3418 |
T5523 |
T5521 |
pobj |
genes,to |
R3419 |
T5524 |
T5523 |
amod |
orthologous,genes |
R3420 |
T5525 |
T5523 |
compound |
TACC,genes |
R3421 |
T5526 |
T5523 |
prep |
of,genes |
R3422 |
T5527 |
T5526 |
pobj |
human,of |
R3423 |
T5528 |
T5527 |
punct |
", ",human |
R3424 |
T5529 |
T5527 |
conj |
mouse,human |
R3425 |
T5530 |
T5529 |
punct |
", ",mouse |
R3426 |
T5531 |
T5529 |
conj |
rat,mouse |
R3427 |
T5532 |
T5531 |
punct |
", ",rat |
R3428 |
T5533 |
T5531 |
conj |
pufferfish,rat |
R3429 |
T5534 |
T5533 |
punct |
", ",pufferfish |
R3430 |
T5535 |
T5536 |
compound |
C.,intestinalis |
R3431 |
T5536 |
T5533 |
conj |
intestinalis,pufferfish |
R3432 |
T5537 |
T5536 |
punct |
", ",intestinalis |
R3433 |
T5538 |
T5539 |
compound |
D.,melanogaster |
R3434 |
T5539 |
T5536 |
conj |
melanogaster,intestinalis |
R3435 |
T5540 |
T5539 |
cc |
and,melanogaster |
R3436 |
T5541 |
T5542 |
compound |
C.,elegans |
R3437 |
T5542 |
T5539 |
conj |
elegans,melanogaster |
R3438 |
T5543 |
T5519 |
aux |
were,extracted |
R3439 |
T5544 |
T5519 |
cc |
and,extracted |
R3440 |
T5545 |
T5519 |
conj |
analyzed,extracted |
R3441 |
T5546 |
T5545 |
prep |
by,analyzed |
R3442 |
T5547 |
T5546 |
pobj |
Genescan,by |
R3443 |
T5548 |
T5547 |
cc |
and,Genescan |
R3444 |
T5549 |
T5547 |
conj |
BLAST,Genescan |
R3445 |
T5550 |
T5551 |
aux |
to,determine |
R3446 |
T5551 |
T5519 |
advcl |
determine,extracted |
R3447 |
T5552 |
T5553 |
det |
the,structure |
R3448 |
T5553 |
T5551 |
dobj |
structure,determine |
R3449 |
T5554 |
T5553 |
amod |
genomic,structure |
R3450 |
T5555 |
T5553 |
prep |
of,structure |
R3451 |
T5556 |
T5557 |
det |
each,gene |
R3452 |
T5557 |
T5555 |
pobj |
gene,of |
R3453 |
T5558 |
T5557 |
compound |
TACC,gene |
R3454 |
T5559 |
T5519 |
punct |
.,extracted |
R3455 |
T5561 |
T5562 |
prep |
In,added |
R3456 |
T5563 |
T5564 |
det |
some,cases |
R3457 |
T5564 |
T5561 |
pobj |
cases,In |
R3458 |
T5565 |
T5562 |
punct |
", ",added |
R3459 |
T5566 |
T5562 |
prep |
for,added |
R3460 |
T5567 |
T5566 |
pobj |
rat,for |
R3461 |
T5568 |
T5567 |
cc |
and,rat |
R3462 |
T5569 |
T5567 |
conj |
pufferfish,rat |
R3463 |
T5570 |
T5562 |
punct |
", ",added |
R3464 |
T5571 |
T5562 |
nsubjpass |
exons,added |
R3465 |
T5572 |
T5562 |
auxpass |
were,added |
R3466 |
T5573 |
T5562 |
cc |
or,added |
R3467 |
T5574 |
T5562 |
conj |
modified,added |
R3468 |
T5575 |
T5574 |
prep |
based,modified |
R3469 |
T5576 |
T5575 |
prep |
on,based |
R3470 |
T5577 |
T5578 |
det |
the,similarity |
R3471 |
T5578 |
T5576 |
pobj |
similarity,on |
R3472 |
T5579 |
T5578 |
amod |
best,similarity |
R3473 |
T5580 |
T5578 |
prep |
of,similarity |
R3474 |
T5581 |
T5582 |
amod |
translated,peptides |
R3475 |
T5582 |
T5580 |
pobj |
peptides,of |
R3476 |
T5583 |
T5578 |
prep |
to,similarity |
R3477 |
T5584 |
T5585 |
det |
the,proteins |
R3478 |
T5585 |
T5583 |
pobj |
proteins,to |
R3479 |
T5586 |
T5585 |
amod |
corresponding,proteins |
R3480 |
T5587 |
T5585 |
nmod |
mouse,proteins |
R3481 |
T5588 |
T5587 |
cc |
and,mouse |
R3482 |
T5589 |
T5587 |
conj |
human,mouse |
R3483 |
T5590 |
T5562 |
punct |
.,added |
R3484 |
T5592 |
T5593 |
prep |
For,used |
R3485 |
T5594 |
T5592 |
pobj |
regions,For |
R3486 |
T5595 |
T5594 |
prep |
with,regions |
R3487 |
T5596 |
T5597 |
amod |
low,similarity |
R3488 |
T5597 |
T5595 |
pobj |
similarity,with |
R3489 |
T5598 |
T5597 |
compound |
sequence,similarity |
R3490 |
T5599 |
T5594 |
prep |
in,regions |
R3491 |
T5600 |
T5601 |
compound |
T.,rubripes |
R3492 |
T5601 |
T5599 |
pobj |
rubripes,in |
R3493 |
T5602 |
T5593 |
punct |
", ",used |
R3494 |
T5603 |
T5604 |
amod |
genomic,sequences |
R3495 |
T5604 |
T5593 |
nsubjpass |
sequences,used |
R3496 |
T5605 |
T5604 |
prep |
from,sequences |
R3497 |
T5606 |
T5607 |
det |
the,pufferfish |
R3498 |
T5607 |
T5605 |
pobj |
pufferfish,from |
R3499 |
T5608 |
T5609 |
amod |
fresh,water |
R3500 |
T5609 |
T5607 |
compound |
water,pufferfish |
R3501 |
T5610 |
T5607 |
punct |
", ",pufferfish |
R3502 |
T5611 |
T5612 |
compound |
Tetraodon,nigroviridis |
R3503 |
T5612 |
T5607 |
appos |
nigroviridis,pufferfish |
R3504 |
T5613 |
T5593 |
auxpass |
were,used |
R3505 |
T5614 |
T5593 |
prep |
as,used |
R3506 |
T5615 |
T5616 |
amod |
additional,means |
R3507 |
T5616 |
T5614 |
pobj |
means,as |
R3508 |
T5617 |
T5618 |
aux |
to,verify |
R3509 |
T5618 |
T5616 |
advcl |
verify,means |
R3510 |
T5619 |
T5620 |
det |
the,exons |
R3511 |
T5620 |
T5618 |
dobj |
exons,verify |
R3512 |
T5621 |
T5620 |
amod |
predicted,exons |
R3513 |
T5622 |
T5593 |
punct |
.,used |