> top > docs > PMC:441373 > spans > 21981-22596 > annotations

PMC:441373 / 21981-22596 JSONTXT

Annnotations TAB JSON ListView MergeView

craft-sa-dev

Id Subject Object Predicate Lexical cue
T5515 0-3 DT denotes The
T5516 16-25 NNS denotes sequences
T5517 4-11 JJ denotes genomic
T5518 12-15 NN denotes DNA
T5519 157-166 VBN denotes extracted
T5520 26-39 VBG denotes corresponding
T5521 40-42 IN denotes to
T5522 43-46 DT denotes the
T5523 64-69 NNS denotes genes
T5524 47-58 JJ denotes orthologous
T5525 59-63 NN denotes TACC
T5526 70-72 IN denotes of
T5527 73-78 JJ denotes human
T5528 78-80 , denotes ,
T5529 80-85 NN denotes mouse
T5530 85-87 , denotes ,
T5531 87-90 NN denotes rat
T5532 90-92 , denotes ,
T5533 92-102 NN denotes pufferfish
T5534 102-104 , denotes ,
T5535 104-106 NNP denotes C.
T5536 107-119 NNP denotes intestinalis
T5537 119-121 , denotes ,
T5538 121-123 NNP denotes D.
T5539 124-136 NNP denotes melanogaster
T5540 137-140 CC denotes and
T5541 141-143 NNP denotes C.
T5542 144-151 NNP denotes elegans
T5543 152-156 VBD denotes were
T5544 167-170 CC denotes and
T5545 171-179 VBN denotes analyzed
T5546 180-182 IN denotes by
T5547 183-191 NNP denotes Genescan
T5548 192-195 CC denotes and
T5549 196-201 NNP denotes BLAST
T5550 202-204 TO denotes to
T5551 205-214 VB denotes determine
T5552 215-218 DT denotes the
T5553 227-236 NN denotes structure
T5554 219-226 JJ denotes genomic
T5555 237-239 IN denotes of
T5556 240-244 DT denotes each
T5557 250-254 NN denotes gene
T5558 245-249 NN denotes TACC
T5559 254-255 . denotes .
T5560 255-422 sentence denotes In some cases, for rat and pufferfish, exons were added or modified based on the best similarity of translated peptides to the corresponding mouse and human proteins.
T5561 256-258 IN denotes In
T5562 306-311 VBN denotes added
T5563 259-263 DT denotes some
T5564 264-269 NNS denotes cases
T5565 269-271 , denotes ,
T5566 271-274 IN denotes for
T5567 275-278 NN denotes rat
T5568 279-282 CC denotes and
T5569 283-293 NN denotes pufferfish
T5570 293-295 , denotes ,
T5571 295-300 NNS denotes exons
T5572 301-305 VBD denotes were
T5573 312-314 CC denotes or
T5574 315-323 VBN denotes modified
T5575 324-329 VBN denotes based
T5576 330-332 IN denotes on
T5577 333-336 DT denotes the
T5578 342-352 NN denotes similarity
T5579 337-341 JJS denotes best
T5580 353-355 IN denotes of
T5581 356-366 VBN denotes translated
T5582 367-375 NNS denotes peptides
T5583 376-378 IN denotes to
T5584 379-382 DT denotes the
T5585 413-421 NN denotes proteins
T5586 383-396 VBG denotes corresponding
T5587 397-402 NN denotes mouse
T5588 403-406 CC denotes and
T5589 407-412 JJ denotes human
T5590 421-422 . denotes .
T5591 422-615 sentence denotes For regions with low sequence similarity in T. rubripes, genomic sequences from the fresh water pufferfish, Tetraodon nigroviridis were used as additional means to verify the predicted exons.
T5592 423-426 IN denotes For
T5593 560-564 VBN denotes used
T5594 427-434 NNS denotes regions
T5595 435-439 IN denotes with
T5596 440-443 JJ denotes low
T5597 453-463 NN denotes similarity
T5598 444-452 NN denotes sequence
T5599 464-466 IN denotes in
T5600 467-469 NNP denotes T.
T5601 470-478 NNP denotes rubripes
T5602 478-480 , denotes ,
T5603 480-487 JJ denotes genomic
T5604 488-497 NNS denotes sequences
T5605 499-503 IN denotes from
T5606 504-507 DT denotes the
T5607 520-530 NN denotes pufferfish
T5608 508-513 JJ denotes fresh
T5609 514-519 NN denotes water
T5610 530-532 , denotes ,
T5611 532-541 NNP denotes Tetraodon
T5612 542-554 NNP denotes nigroviridis
T5613 555-559 VBD denotes were
T5614 565-567 IN denotes as
T5615 568-578 JJ denotes additional
T5616 579-584 NNS denotes means
T5617 585-587 TO denotes to
T5618 588-594 VB denotes verify
T5619 595-598 DT denotes the
T5620 609-614 NNS denotes exons
T5621 599-608 VBN denotes predicted
T5622 614-615 . denotes .
R3411 T5515 T5516 det The,sequences
R3412 T5516 T5519 nsubj sequences,extracted
R3413 T5517 T5516 amod genomic,sequences
R3414 T5518 T5516 compound DNA,sequences
R3415 T5520 T5516 acl corresponding,sequences
R3416 T5521 T5520 prep to,corresponding
R3417 T5522 T5523 det the,genes
R3418 T5523 T5521 pobj genes,to
R3419 T5524 T5523 amod orthologous,genes
R3420 T5525 T5523 compound TACC,genes
R3421 T5526 T5523 prep of,genes
R3422 T5527 T5526 pobj human,of
R3423 T5528 T5527 punct ", ",human
R3424 T5529 T5527 conj mouse,human
R3425 T5530 T5529 punct ", ",mouse
R3426 T5531 T5529 conj rat,mouse
R3427 T5532 T5531 punct ", ",rat
R3428 T5533 T5531 conj pufferfish,rat
R3429 T5534 T5533 punct ", ",pufferfish
R3430 T5535 T5536 compound C.,intestinalis
R3431 T5536 T5533 conj intestinalis,pufferfish
R3432 T5537 T5536 punct ", ",intestinalis
R3433 T5538 T5539 compound D.,melanogaster
R3434 T5539 T5536 conj melanogaster,intestinalis
R3435 T5540 T5539 cc and,melanogaster
R3436 T5541 T5542 compound C.,elegans
R3437 T5542 T5539 conj elegans,melanogaster
R3438 T5543 T5519 aux were,extracted
R3439 T5544 T5519 cc and,extracted
R3440 T5545 T5519 conj analyzed,extracted
R3441 T5546 T5545 prep by,analyzed
R3442 T5547 T5546 pobj Genescan,by
R3443 T5548 T5547 cc and,Genescan
R3444 T5549 T5547 conj BLAST,Genescan
R3445 T5550 T5551 aux to,determine
R3446 T5551 T5519 advcl determine,extracted
R3447 T5552 T5553 det the,structure
R3448 T5553 T5551 dobj structure,determine
R3449 T5554 T5553 amod genomic,structure
R3450 T5555 T5553 prep of,structure
R3451 T5556 T5557 det each,gene
R3452 T5557 T5555 pobj gene,of
R3453 T5558 T5557 compound TACC,gene
R3454 T5559 T5519 punct .,extracted
R3455 T5561 T5562 prep In,added
R3456 T5563 T5564 det some,cases
R3457 T5564 T5561 pobj cases,In
R3458 T5565 T5562 punct ", ",added
R3459 T5566 T5562 prep for,added
R3460 T5567 T5566 pobj rat,for
R3461 T5568 T5567 cc and,rat
R3462 T5569 T5567 conj pufferfish,rat
R3463 T5570 T5562 punct ", ",added
R3464 T5571 T5562 nsubjpass exons,added
R3465 T5572 T5562 auxpass were,added
R3466 T5573 T5562 cc or,added
R3467 T5574 T5562 conj modified,added
R3468 T5575 T5574 prep based,modified
R3469 T5576 T5575 prep on,based
R3470 T5577 T5578 det the,similarity
R3471 T5578 T5576 pobj similarity,on
R3472 T5579 T5578 amod best,similarity
R3473 T5580 T5578 prep of,similarity
R3474 T5581 T5582 amod translated,peptides
R3475 T5582 T5580 pobj peptides,of
R3476 T5583 T5578 prep to,similarity
R3477 T5584 T5585 det the,proteins
R3478 T5585 T5583 pobj proteins,to
R3479 T5586 T5585 amod corresponding,proteins
R3480 T5587 T5585 nmod mouse,proteins
R3481 T5588 T5587 cc and,mouse
R3482 T5589 T5587 conj human,mouse
R3483 T5590 T5562 punct .,added
R3484 T5592 T5593 prep For,used
R3485 T5594 T5592 pobj regions,For
R3486 T5595 T5594 prep with,regions
R3487 T5596 T5597 amod low,similarity
R3488 T5597 T5595 pobj similarity,with
R3489 T5598 T5597 compound sequence,similarity
R3490 T5599 T5594 prep in,regions
R3491 T5600 T5601 compound T.,rubripes
R3492 T5601 T5599 pobj rubripes,in
R3493 T5602 T5593 punct ", ",used
R3494 T5603 T5604 amod genomic,sequences
R3495 T5604 T5593 nsubjpass sequences,used
R3496 T5605 T5604 prep from,sequences
R3497 T5606 T5607 det the,pufferfish
R3498 T5607 T5605 pobj pufferfish,from
R3499 T5608 T5609 amod fresh,water
R3500 T5609 T5607 compound water,pufferfish
R3501 T5610 T5607 punct ", ",pufferfish
R3502 T5611 T5612 compound Tetraodon,nigroviridis
R3503 T5612 T5607 appos nigroviridis,pufferfish
R3504 T5613 T5593 auxpass were,used
R3505 T5614 T5593 prep as,used
R3506 T5615 T5616 amod additional,means
R3507 T5616 T5614 pobj means,as
R3508 T5617 T5618 aux to,verify
R3509 T5618 T5616 advcl verify,means
R3510 T5619 T5620 det the,exons
R3511 T5620 T5618 dobj exons,verify
R3512 T5621 T5620 amod predicted,exons
R3513 T5622 T5593 punct .,used

craft-ca-core-ex-dev

Below, discontinuous spans are shown in the chain model. You can change it to the bag model.

Id Subject Object Predicate Lexical cue
T5362 4-15 SO_EXT:genomic_DNA denotes genomic DNA
T5363 12-15 CHEBI_SO_EXT:DNA denotes DNA
T5364 16-25 SO_EXT:biological_sequence denotes sequences
T5365 47-58 SO:0000858 denotes orthologous
T5366 64-69 SO_EXT:0000704 denotes genes
T5367 73-78 NCBITaxon:9606 denotes human
T5368 80-85 NCBITaxon:10088 denotes mouse
T5369 87-90 NCBITaxon:10114 denotes rat
T5370 92-102 NCBITaxon:31031 denotes pufferfish
T5371 104-119 NCBITaxon:7719 denotes C. intestinalis
T5372 121-136 NCBITaxon:7227 denotes D. melanogaster
T5373 141-151 NCBITaxon:6239 denotes C. elegans
T5374 219-226 SO_EXT:0001026 denotes genomic
T5375 250-254 SO_EXT:0000704 denotes gene
T5376 275-278 NCBITaxon:10114 denotes rat
T5377 283-293 NCBITaxon:31031 denotes pufferfish
T5378 295-300 SO_EXT:0000147 denotes exons
T5379 315-323 SO_EXT:sequence_alteration_process denotes modified
T5380 356-366 GO:0006412 denotes translated
T5381 367-375 CHEBI_SO_EXT:peptide_or_peptide_region denotes peptides
T5382 397-402 NCBITaxon:10088 denotes mouse
T5383 407-412 NCBITaxon:9606 denotes human
T5384 413-421 CHEBI_PR_EXT:protein denotes proteins
T5385 444-452 SO_EXT:biological_sequence denotes sequence
T5386 467-478 NCBITaxon:31033 denotes T. rubripes
T5387 480-487 SO_EXT:0001026 denotes genomic
T5388 488-497 SO_EXT:biological_sequence denotes sequences
T5389 514-519 CHEBI:15377 denotes water
T5390 520-530 NCBITaxon:31031 denotes pufferfish
T5391 532-554 NCBITaxon:99883 denotes Tetraodon nigroviridis
T5392 609-614 SO_EXT:0000147 denotes exons

craft-ca-core-dev

Below, discontinuous spans are shown in the chain model. You can change it to the bag model.

Id Subject Object Predicate Lexical cue
T5267 4-11 SO:0001026 denotes genomic
T5268 47-58 SO:0000858 denotes orthologous
T5269 64-69 SO:0000704 denotes genes
T5270 73-78 NCBITaxon:9606 denotes human
T5271 80-85 NCBITaxon:10088 denotes mouse
T5272 87-90 NCBITaxon:10114 denotes rat
T5273 92-102 NCBITaxon:31031 denotes pufferfish
T5274 104-119 NCBITaxon:7719 denotes C. intestinalis
T5275 121-136 NCBITaxon:7227 denotes D. melanogaster
T5276 141-151 NCBITaxon:6239 denotes C. elegans
T5277 219-226 SO:0001026 denotes genomic
T5278 250-254 SO:0000704 denotes gene
T5279 275-278 NCBITaxon:10114 denotes rat
T5280 283-293 NCBITaxon:31031 denotes pufferfish
T5281 295-300 SO:0000147 denotes exons
T5282 356-366 GO:0006412 denotes translated
T5283 397-402 NCBITaxon:10088 denotes mouse
T5284 407-412 NCBITaxon:9606 denotes human
T5285 467-478 NCBITaxon:31033 denotes T. rubripes
T5286 480-487 SO:0001026 denotes genomic
T5287 514-519 CHEBI:15377 denotes water
T5288 520-530 NCBITaxon:31031 denotes pufferfish
T5289 532-554 NCBITaxon:99883 denotes Tetraodon nigroviridis
T5290 609-614 SO:0000147 denotes exons