> top > docs > PMC:329117 > spans > 22181-26476 > annotations

PMC:329117 / 22181-26476 JSONTXT

Annnotations TAB JSON ListView MergeView

craft-sa-dev

Id Subject Object Predicate Lexical cue
T6417 0-1 DT denotes A
T6418 2-8 NN denotes subset
T6419 32-37 VBZ denotes shows
T6420 9-11 IN denotes of
T6421 12-21 JJ denotes olfactory
T6422 22-31 NNS denotes receptors
T6423 38-45 JJ denotes unusual
T6424 46-54 NN denotes splicing
T6425 54-250 sentence denotes We identified 62 cDNAs (5% of all olfactory receptor clones) from 38 intact olfactory receptors and one olfactory receptor pseudogene where a splice site within the protein-coding region is used.
T6426 55-57 PRP denotes We
T6427 58-68 VBD denotes identified
T6428 69-71 CD denotes 62
T6429 72-77 NNS denotes cDNAs
T6430 78-79 -LRB- denotes (
T6431 80-81 NN denotes %
T6432 79-80 CD denotes 5
T6433 82-84 IN denotes of
T6434 85-88 DT denotes all
T6435 108-114 NNS denotes clones
T6436 89-98 JJ denotes olfactory
T6437 99-107 NN denotes receptor
T6438 114-115 -RRB- denotes )
T6439 116-120 IN denotes from
T6440 121-123 CD denotes 38
T6441 141-150 NNS denotes receptors
T6442 124-130 JJ denotes intact
T6443 131-140 JJ denotes olfactory
T6444 151-154 CC denotes and
T6445 155-158 CD denotes one
T6446 178-188 NN denotes pseudogene
T6447 159-168 JJ denotes olfactory
T6448 169-177 NN denotes receptor
T6449 189-194 WRB denotes where
T6450 245-249 VBN denotes used
T6451 195-196 DT denotes a
T6452 204-208 NN denotes site
T6453 197-203 NN denotes splice
T6454 209-215 IN denotes within
T6455 216-219 DT denotes the
T6456 235-241 NN denotes region
T6457 220-227 NN denotes protein
T6458 228-234 VBG denotes coding
T6459 227-228 HYPH denotes -
T6460 242-244 VBZ denotes is
T6461 249-250 . denotes .
T6462 250-457 sentence denotes For two genes (top two cDNAs, Figure 6), the predicted protein appears to be an intact olfactory receptor with three or ten amino acids, including the initiating methionine, contributed by an upstream exon.
T6463 251-254 IN denotes For
T6464 314-321 VBZ denotes appears
T6465 255-258 CD denotes two
T6466 259-264 NNS denotes genes
T6467 265-266 -LRB- denotes (
T6468 281-287 NN denotes Figure
T6469 266-269 JJ denotes top
T6470 274-279 NNS denotes cDNAs
T6471 270-273 CD denotes two
T6472 279-281 , denotes ,
T6473 288-289 CD denotes 6
T6474 289-290 -RRB- denotes )
T6475 290-292 , denotes ,
T6476 292-295 DT denotes the
T6477 306-313 NN denotes protein
T6478 296-305 VBN denotes predicted
T6479 322-324 TO denotes to
T6480 325-327 VB denotes be
T6481 328-330 DT denotes an
T6482 348-356 NN denotes receptor
T6483 331-337 JJ denotes intact
T6484 338-347 JJ denotes olfactory
T6485 357-361 IN denotes with
T6486 362-367 CD denotes three
T6487 381-386 NNS denotes acids
T6488 368-370 CC denotes or
T6489 371-374 CD denotes ten
T6490 375-380 NN denotes amino
T6491 386-388 , denotes ,
T6492 388-397 VBG denotes including
T6493 398-401 DT denotes the
T6494 413-423 NN denotes methionine
T6495 402-412 VBG denotes initiating
T6496 423-425 , denotes ,
T6497 425-436 VBN denotes contributed
T6498 437-439 IN denotes by
T6499 440-442 DT denotes an
T6500 452-456 NN denotes exon
T6501 443-451 JJ denotes upstream
T6502 456-457 . denotes .
T6503 457-544 sentence denotes A similar gene structure was described previously for a human olfactory receptor [25].
T6504 458-459 DT denotes A
T6505 473-482 NN denotes structure
T6506 460-467 JJ denotes similar
T6507 468-472 NN denotes gene
T6508 487-496 VBN denotes described
T6509 483-486 VBD denotes was
T6510 497-507 RB denotes previously
T6511 508-511 IN denotes for
T6512 512-513 DT denotes a
T6513 530-538 NN denotes receptor
T6514 514-519 JJ denotes human
T6515 520-529 JJ denotes olfactory
T6516 539-540 -LRB- denotes [
T6517 540-542 CD denotes 25
T6518 542-543 -RRB- denotes ]
T6519 543-544 . denotes .
T6520 544-634 sentence denotes One of these two mouse genes has no start codon in its otherwise intact main coding exon.
T6521 545-548 CD denotes One
T6522 574-577 VBZ denotes has
T6523 549-551 IN denotes of
T6524 552-557 DT denotes these
T6525 568-573 NNS denotes genes
T6526 558-561 CD denotes two
T6527 562-567 NN denotes mouse
T6528 578-580 DT denotes no
T6529 587-592 NN denotes codon
T6530 581-586 NN denotes start
T6531 593-595 IN denotes in
T6532 596-599 PRP$ denotes its
T6533 629-633 NN denotes exon
T6534 600-609 RB denotes otherwise
T6535 610-616 JJ denotes intact
T6536 617-621 JJ denotes main
T6537 622-628 VBG denotes coding
T6538 633-634 . denotes .
T6539 634-714 sentence denotes The unusual splicing thus rescues what would otherwise be a dysfunctional gene.
T6540 635-638 DT denotes The
T6541 647-655 NN denotes splicing
T6542 639-646 JJ denotes unusual
T6543 661-668 VBZ denotes rescues
T6544 656-660 RB denotes thus
T6545 669-673 WP denotes what
T6546 690-692 VB denotes be
T6547 674-679 MD denotes would
T6548 680-689 RB denotes otherwise
T6549 693-694 DT denotes a
T6550 709-713 NN denotes gene
T6551 695-708 JJ denotes dysfunctional
T6552 713-714 . denotes .
T6553 714-971 sentence denotes In most cases (60 out of 62 cDNAs), the unusual transcript appears to be an aberrant splice form - the transcript would probably not encode a functional protein because the splice introduces a frameshift or removes conserved functional residues (Figure 6).
T6554 715-717 IN denotes In
T6555 848-854 VB denotes encode
T6556 718-722 JJS denotes most
T6557 723-728 NNS denotes cases
T6558 729-730 -LRB- denotes (
T6559 730-732 CD denotes 60
T6560 740-742 CD denotes 62
T6561 733-736 IN denotes out
T6562 737-739 IN denotes of
T6563 743-748 NNS denotes cDNAs
T6564 748-749 -RRB- denotes )
T6565 749-751 , denotes ,
T6566 751-754 DT denotes the
T6567 763-773 NN denotes transcript
T6568 755-762 JJ denotes unusual
T6569 774-781 VBZ denotes appears
T6570 782-784 TO denotes to
T6571 785-787 VB denotes be
T6572 788-790 DT denotes an
T6573 807-811 NN denotes form
T6574 791-799 JJ denotes aberrant
T6575 800-806 NN denotes splice
T6576 812-813 : denotes -
T6577 814-817 DT denotes the
T6578 818-828 NN denotes transcript
T6579 829-834 MD denotes would
T6580 835-843 RB denotes probably
T6581 844-847 RB denotes not
T6582 855-856 DT denotes a
T6583 868-875 NN denotes protein
T6584 857-867 JJ denotes functional
T6585 876-883 IN denotes because
T6586 895-905 VBZ denotes introduces
T6587 884-887 DT denotes the
T6588 888-894 NN denotes splice
T6589 906-907 DT denotes a
T6590 908-918 NN denotes frameshift
T6591 919-921 CC denotes or
T6592 922-929 VBZ denotes removes
T6593 930-939 VBN denotes conserved
T6594 951-959 NNS denotes residues
T6595 940-950 JJ denotes functional
T6596 960-961 -LRB- denotes (
T6597 961-967 NN denotes Figure
T6598 968-969 CD denotes 6
T6599 969-970 -RRB- denotes )
T6600 970-971 . denotes .
T6601 971-1102 sentence denotes For two clones (bottom two cDNAs, Figure 6), exon order in the cDNA clone is inconsistent with the corresponding genomic sequence.
T6602 972-975 IN denotes For
T6603 1046-1048 VBZ denotes is
T6604 976-979 CD denotes two
T6605 980-986 NNS denotes clones
T6606 987-988 -LRB- denotes (
T6607 1006-1012 NN denotes Figure
T6608 988-994 JJ denotes bottom
T6609 999-1004 NNS denotes cDNAs
T6610 995-998 CD denotes two
T6611 1004-1006 , denotes ,
T6612 1013-1014 CD denotes 6
T6613 1014-1015 -RRB- denotes )
T6614 1015-1017 , denotes ,
T6615 1017-1021 NN denotes exon
T6616 1022-1027 NN denotes order
T6617 1028-1030 IN denotes in
T6618 1031-1034 DT denotes the
T6619 1040-1045 NN denotes clone
T6620 1035-1039 NN denotes cDNA
T6621 1049-1061 JJ denotes inconsistent
T6622 1062-1066 IN denotes with
T6623 1067-1070 DT denotes the
T6624 1093-1101 NN denotes sequence
T6625 1071-1084 JJ denotes corresponding
T6626 1085-1092 JJ denotes genomic
T6627 1101-1102 . denotes .
T6628 1102-1263 sentence denotes It is difficult to imagine what kind of cloning artefact resulted in these severely scrambled cDNAs: we suggest that they derive from real but rare transcripts.
T6629 1103-1105 PRP denotes It
T6630 1106-1108 VBZ denotes is
T6631 1207-1214 VBP denotes suggest
T6632 1109-1118 JJ denotes difficult
T6633 1119-1121 TO denotes to
T6634 1122-1129 VB denotes imagine
T6635 1130-1134 WDT denotes what
T6636 1135-1139 NN denotes kind
T6637 1160-1168 VBD denotes resulted
T6638 1140-1142 IN denotes of
T6639 1143-1150 VBG denotes cloning
T6640 1151-1159 NN denotes artefact
T6641 1169-1171 IN denotes in
T6642 1172-1177 DT denotes these
T6643 1197-1202 NNS denotes cDNAs
T6644 1178-1186 RB denotes severely
T6645 1187-1196 VBN denotes scrambled
T6646 1202-1204 : denotes :
T6647 1204-1206 PRP denotes we
T6648 1215-1219 IN denotes that
T6649 1225-1231 VBP denotes derive
T6650 1220-1224 PRP denotes they
T6651 1232-1236 IN denotes from
T6652 1237-1241 JJ denotes real
T6653 1251-1262 NNS denotes transcripts
T6654 1242-1245 CC denotes but
T6655 1246-1250 JJ denotes rare
T6656 1262-1263 . denotes .
T6657 1263-1454 sentence denotes However, their low frequency in our cDNA collection suggests that splicing contrary to genomic organization does not contribute significantly to the olfactory receptor transcript repertoire.
T6658 1264-1271 RB denotes However
T6659 1316-1324 VBZ denotes suggests
T6660 1271-1273 , denotes ,
T6661 1273-1278 PRP$ denotes their
T6662 1283-1292 NN denotes frequency
T6663 1279-1282 JJ denotes low
T6664 1293-1295 IN denotes in
T6665 1296-1299 PRP$ denotes our
T6666 1305-1315 NN denotes collection
T6667 1300-1304 NN denotes cDNA
T6668 1325-1329 IN denotes that
T6669 1381-1391 VB denotes contribute
T6670 1330-1338 NN denotes splicing
T6671 1339-1347 JJ denotes contrary
T6672 1348-1350 IN denotes to
T6673 1351-1358 JJ denotes genomic
T6674 1359-1371 NN denotes organization
T6675 1372-1376 VBZ denotes does
T6676 1377-1380 RB denotes not
T6677 1392-1405 RB denotes significantly
T6678 1406-1408 IN denotes to
T6679 1409-1412 DT denotes the
T6680 1443-1453 NN denotes repertoire
T6681 1413-1422 JJ denotes olfactory
T6682 1423-1431 NN denotes receptor
T6683 1432-1442 NN denotes transcript
T6684 1453-1454 . denotes .
T6685 1454-1630 sentence denotes For 21 of the 26 genes for which unusually spliced cDNAs were found, we also observe an alternative ('normal') isoform that does not use splice sites within the coding region.
T6686 1455-1458 IN denotes For
T6687 1532-1539 VBP denotes observe
T6688 1459-1461 CD denotes 21
T6689 1469-1471 CD denotes 26
T6690 1462-1464 IN denotes of
T6691 1465-1468 DT denotes the
T6692 1472-1477 NNS denotes genes
T6693 1478-1481 IN denotes for
T6694 1517-1522 VBN denotes found
T6695 1482-1487 WDT denotes which
T6696 1488-1497 RB denotes unusually
T6697 1498-1505 VBN denotes spliced
T6698 1506-1511 NNS denotes cDNAs
T6699 1512-1516 VBD denotes were
T6700 1522-1524 , denotes ,
T6701 1524-1526 PRP denotes we
T6702 1527-1531 RB denotes also
T6703 1540-1542 DT denotes an
T6704 1566-1573 NN denotes isoform
T6705 1543-1554 JJ denotes alternative
T6706 1555-1556 -LRB- denotes (
T6707 1556-1557 `` denotes '
T6708 1557-1563 JJ denotes normal
T6709 1563-1564 '' denotes '
T6710 1564-1565 -RRB- denotes )
T6711 1574-1578 WDT denotes that
T6712 1588-1591 VB denotes use
T6713 1579-1583 VBZ denotes does
T6714 1584-1587 RB denotes not
T6715 1592-1598 NN denotes splice
T6716 1599-1604 NNS denotes sites
T6717 1605-1611 IN denotes within
T6718 1612-1615 DT denotes the
T6719 1623-1629 NN denotes region
T6720 1616-1622 VBG denotes coding
T6721 1629-1630 . denotes .
T6722 1630-1784 sentence denotes (For the remaining 13 of the 3' genes showing odd splicing, we have identified only one cDNA so have not determined whether normal isoforms are present.)
T6723 1631-1632 -LRB- denotes (
T6724 1699-1709 VBN denotes identified
T6725 1632-1635 IN denotes For
T6726 1636-1639 DT denotes the
T6727 1650-1652 CD denotes 13
T6728 1640-1649 VBG denotes remaining
T6729 1653-1655 IN denotes of
T6730 1656-1659 DT denotes the
T6731 1663-1668 NNS denotes genes
T6732 1660-1661 CD denotes 3
T6733 1661-1662 SYM denotes '
T6734 1669-1676 VBG denotes showing
T6735 1677-1680 JJ denotes odd
T6736 1681-1689 NN denotes splicing
T6737 1689-1691 , denotes ,
T6738 1691-1693 PRP denotes we
T6739 1694-1698 VBP denotes have
T6740 1710-1714 RB denotes only
T6741 1719-1723 NN denotes cDNA
T6742 1715-1718 CD denotes one
T6743 1724-1726 CC denotes so
T6744 1727-1731 VBP denotes have
T6745 1736-1746 VBN denotes determined
T6746 1732-1735 RB denotes not
T6747 1747-1754 IN denotes whether
T6748 1771-1774 VBP denotes are
T6749 1755-1761 JJ denotes normal
T6750 1762-1770 NNS denotes isoforms
T6751 1775-1782 JJ denotes present
T6752 1782-1783 . denotes .
T6753 1783-1784 -RRB- denotes )
T6754 1784-2210 sentence denotes We were intrigued both by previous reports of splicing of human olfactory receptors near the major histocompatibility complex (MHC) cluster, where several genes splice over long distances to a common upstream exon [26,27] and by the idea that olfactory receptor transcriptional control could be achieved by DNA recombination mechanisms, perhaps with the result that transcripts would contain some sequence from another locus.
T6755 1785-1787 PRP denotes We
T6756 1793-1802 VBN denotes intrigued
T6757 1788-1792 VBD denotes were
T6758 1803-1807 CC denotes both
T6759 1808-1810 IN denotes by
T6760 1811-1819 JJ denotes previous
T6761 1820-1827 NNS denotes reports
T6762 1828-1830 IN denotes of
T6763 1831-1839 NN denotes splicing
T6764 1840-1842 IN denotes of
T6765 1843-1848 JJ denotes human
T6766 1859-1868 NNS denotes receptors
T6767 1849-1858 JJ denotes olfactory
T6768 1869-1873 IN denotes near
T6769 1874-1877 DT denotes the
T6770 1903-1910 NN denotes complex
T6771 1878-1883 JJ denotes major
T6772 1884-1902 NN denotes histocompatibility
T6773 1917-1924 NN denotes cluster
T6774 1911-1912 -LRB- denotes (
T6775 1912-1915 NN denotes MHC
T6776 1915-1916 -RRB- denotes )
T6777 1924-1926 , denotes ,
T6778 1926-1931 WRB denotes where
T6779 1946-1952 VBP denotes splice
T6780 1932-1939 JJ denotes several
T6781 1940-1945 NNS denotes genes
T6782 1953-1957 IN denotes over
T6783 1958-1962 JJ denotes long
T6784 1963-1972 NNS denotes distances
T6785 1973-1975 IN denotes to
T6786 1976-1977 DT denotes a
T6787 1994-1998 NN denotes exon
T6788 1978-1984 JJ denotes common
T6789 1985-1993 JJ denotes upstream
T6790 1999-2000 -LRB- denotes [
T6791 2003-2005 CD denotes 27
T6792 2000-2002 CD denotes 26
T6793 2002-2003 , denotes ,
T6794 2005-2006 -RRB- denotes ]
T6795 2007-2010 CC denotes and
T6796 2011-2013 IN denotes by
T6797 2014-2017 DT denotes the
T6798 2018-2022 NN denotes idea
T6799 2023-2027 IN denotes that
T6800 2080-2088 VBN denotes achieved
T6801 2028-2037 JJ denotes olfactory
T6802 2038-2046 NN denotes receptor
T6803 2063-2070 NN denotes control
T6804 2047-2062 JJ denotes transcriptional
T6805 2071-2076 MD denotes could
T6806 2077-2079 VB denotes be
T6807 2089-2091 IN denotes by
T6808 2092-2095 NN denotes DNA
T6809 2110-2120 NNS denotes mechanisms
T6810 2096-2109 NN denotes recombination
T6811 2120-2122 , denotes ,
T6812 2122-2129 RB denotes perhaps
T6813 2130-2134 IN denotes with
T6814 2135-2138 DT denotes the
T6815 2139-2145 NN denotes result
T6816 2146-2150 IN denotes that
T6817 2169-2176 VB denotes contain
T6818 2151-2162 NNS denotes transcripts
T6819 2163-2168 MD denotes would
T6820 2177-2181 DT denotes some
T6821 2182-2190 NN denotes sequence
T6822 2191-2195 IN denotes from
T6823 2196-2203 DT denotes another
T6824 2204-2209 NN denotes locus
T6825 2209-2210 . denotes .
T6826 2210-2443 sentence denotes We therefore verified that the entire sequence of each olfactory receptor EST matches the corresponding gene's genomic 'territory' (defined for this purpose as from 1 kb after the preceding gene to 1 kb after the gene's stop codon).
T6827 2211-2213 PRP denotes We
T6828 2224-2232 VBD denotes verified
T6829 2214-2223 RB denotes therefore
T6830 2233-2237 IN denotes that
T6831 2289-2296 VBZ denotes matches
T6832 2238-2241 DT denotes the
T6833 2249-2257 NN denotes sequence
T6834 2242-2248 JJ denotes entire
T6835 2258-2260 IN denotes of
T6836 2261-2265 DT denotes each
T6837 2285-2288 NN denotes EST
T6838 2266-2275 JJ denotes olfactory
T6839 2276-2284 NN denotes receptor
T6840 2297-2300 DT denotes the
T6841 2315-2319 NN denotes gene
T6842 2301-2314 VBG denotes corresponding
T6843 2331-2340 NN denotes territory
T6844 2319-2321 POS denotes 's
T6845 2322-2329 JJ denotes genomic
T6846 2330-2331 `` denotes '
T6847 2340-2341 '' denotes '
T6848 2342-2343 -LRB- denotes (
T6849 2343-2350 VBN denotes defined
T6850 2351-2354 IN denotes for
T6851 2355-2359 DT denotes this
T6852 2360-2367 NN denotes purpose
T6853 2368-2370 IN denotes as
T6854 2371-2375 IN denotes from
T6855 2376-2377 CD denotes 1
T6856 2378-2380 NN denotes kb
T6857 2381-2386 IN denotes after
T6858 2387-2390 DT denotes the
T6859 2401-2405 NN denotes gene
T6860 2391-2400 VBG denotes preceding
T6861 2406-2408 IN denotes to
T6862 2409-2410 CD denotes 1
T6863 2411-2413 NN denotes kb
T6864 2414-2419 IN denotes after
T6865 2420-2423 DT denotes the
T6866 2424-2428 NN denotes gene
T6867 2436-2441 NN denotes codon
T6868 2428-2430 POS denotes 's
T6869 2431-2435 NN denotes stop
T6870 2441-2442 -RRB- denotes )
T6871 2442-2443 . denotes .
T6872 2443-2588 sentence denotes We found no cDNAs where introns encompassed other olfactory receptor genes, as reported for olfactory receptors in the human MHC region [26,27].
T6873 2444-2446 PRP denotes We
T6874 2447-2452 VBD denotes found
T6875 2453-2455 DT denotes no
T6876 2456-2461 NNS denotes cDNAs
T6877 2462-2467 WRB denotes where
T6878 2476-2487 VBD denotes encompassed
T6879 2468-2475 NNS denotes introns
T6880 2488-2493 JJ denotes other
T6881 2513-2518 NNS denotes genes
T6882 2494-2503 JJ denotes olfactory
T6883 2504-2512 NN denotes receptor
T6884 2518-2520 , denotes ,
T6885 2520-2522 IN denotes as
T6886 2523-2531 VBN denotes reported
T6887 2532-2535 IN denotes for
T6888 2536-2545 JJ denotes olfactory
T6889 2546-2555 NNS denotes receptors
T6890 2556-2558 IN denotes in
T6891 2559-2562 DT denotes the
T6892 2573-2579 NN denotes region
T6893 2563-2568 JJ denotes human
T6894 2569-2572 NN denotes MHC
T6895 2580-2581 -LRB- denotes [
T6896 2584-2586 CD denotes 27
T6897 2581-2583 CD denotes 26
T6898 2583-2584 , denotes ,
T6899 2586-2587 -RRB- denotes ]
T6900 2587-2588 . denotes .
T6901 2588-2719 sentence denotes Six cDNAs do extend further than a single gene's 'territory' and appear not to be artifacts of the sequencing or analysis process.
T6902 2589-2592 CD denotes Six
T6903 2593-2598 NNS denotes cDNAs
T6904 2602-2608 VB denotes extend
T6905 2599-2601 VBP denotes do
T6906 2609-2616 RBR denotes further
T6907 2617-2621 IN denotes than
T6908 2622-2623 DT denotes a
T6909 2631-2635 NN denotes gene
T6910 2624-2630 JJ denotes single
T6911 2639-2648 NN denotes territory
T6912 2635-2637 POS denotes 's
T6913 2638-2639 `` denotes '
T6914 2648-2649 '' denotes '
T6915 2650-2653 CC denotes and
T6916 2654-2660 VBP denotes appear
T6917 2661-2664 RB denotes not
T6918 2668-2670 VB denotes be
T6919 2665-2667 TO denotes to
T6920 2671-2680 NNS denotes artifacts
T6921 2681-2683 IN denotes of
T6922 2684-2687 DT denotes the
T6923 2711-2718 NN denotes process
T6924 2688-2698 NN denotes sequencing
T6925 2699-2701 CC denotes or
T6926 2702-2710 NN denotes analysis
T6927 2718-2719 . denotes .
T6928 2719-2867 sentence denotes In each of these cases, the clones use splice sites within the 3' UTR and thus extend further than the arbitrary 1 kb downstream of the stop codon.
T6929 2720-2722 IN denotes In
T6930 2755-2758 VBP denotes use
T6931 2723-2727 DT denotes each
T6932 2728-2730 IN denotes of
T6933 2731-2736 DT denotes these
T6934 2737-2742 NNS denotes cases
T6935 2742-2744 , denotes ,
T6936 2744-2747 DT denotes the
T6937 2748-2754 NNS denotes clones
T6938 2759-2765 NN denotes splice
T6939 2766-2771 NNS denotes sites
T6940 2772-2778 IN denotes within
T6941 2779-2782 DT denotes the
T6942 2786-2789 NN denotes UTR
T6943 2783-2784 CD denotes 3
T6944 2784-2785 SYM denotes '
T6945 2790-2793 CC denotes and
T6946 2794-2798 RB denotes thus
T6947 2799-2805 VBP denotes extend
T6948 2806-2813 RBR denotes further
T6949 2838-2848 JJ denotes downstream
T6950 2814-2818 IN denotes than
T6951 2819-2822 DT denotes the
T6952 2835-2837 NN denotes kb
T6953 2823-2832 JJ denotes arbitrary
T6954 2833-2834 CD denotes 1
T6955 2849-2851 IN denotes of
T6956 2852-2855 DT denotes the
T6957 2861-2866 NN denotes codon
T6958 2856-2860 NN denotes stop
T6959 2866-2867 . denotes .
T6960 2867-2997 sentence denotes Five of these six cDNAs also use splice-donor sites within the coding region and encode disrupted olfactory receptors (Figure 6).
T6961 2868-2872 CD denotes Five
T6962 2897-2900 VBP denotes use
T6963 2873-2875 IN denotes of
T6964 2876-2881 DT denotes these
T6965 2886-2891 NNS denotes cDNAs
T6966 2882-2885 CD denotes six
T6967 2892-2896 RB denotes also
T6968 2901-2907 NN denotes splice
T6969 2908-2913 NN denotes donor
T6970 2907-2908 HYPH denotes -
T6971 2914-2919 NNS denotes sites
T6972 2920-2926 IN denotes within
T6973 2927-2930 DT denotes the
T6974 2938-2944 NN denotes region
T6975 2931-2937 VBG denotes coding
T6976 2945-2948 CC denotes and
T6977 2949-2955 VBP denotes encode
T6978 2956-2965 VBN denotes disrupted
T6979 2976-2985 NNS denotes receptors
T6980 2966-2975 JJ denotes olfactory
T6981 2986-2987 -LRB- denotes (
T6982 2987-2993 NN denotes Figure
T6983 2994-2995 CD denotes 6
T6984 2995-2996 -RRB- denotes )
T6985 2996-2997 . denotes .
T6986 2997-3096 sentence denotes In the sixth cDNA, a 2.6-kb intron is spliced out of the 3' UTR, leaving the coding region intact.
T6987 2998-3000 IN denotes In
T6988 3036-3043 VBN denotes spliced
T6989 3001-3004 DT denotes the
T6990 3011-3015 NN denotes cDNA
T6991 3005-3010 JJ denotes sixth
T6992 3015-3017 , denotes ,
T6993 3017-3018 DT denotes a
T6994 3026-3032 NN denotes intron
T6995 3019-3022 CD denotes 2.6
T6996 3023-3025 NN denotes kb
T6997 3022-3023 HYPH denotes -
T6999 3044-3047 IN denotes out
T7000 3048-3050 IN denotes of
T7001 3051-3054 DT denotes the
T7002 3058-3061 NN denotes UTR
T7003 3055-3056 CD denotes 3
T7004 3056-3057 SYM denotes '
T7005 3061-3063 , denotes ,
T7006 3063-3070 VBG denotes leaving
T7007 3071-3074 DT denotes the
T7008 3082-3088 NN denotes region
T7009 3075-3081 VBG denotes coding
T7010 3089-3095 JJ denotes intact
T7011 3095-3096 . denotes .
T7012 3096-3331 sentence denotes If olfactory receptor transcriptional control is achieved by DNA recombination, the beginning of each transcript might derive from a donated promoter region, with the rest of the transcript coming from the native ORF-containing locus.
T7013 3097-3099 IN denotes If
T7014 3146-3154 VBN denotes achieved
T7015 3100-3109 JJ denotes olfactory
T7016 3110-3118 NN denotes receptor
T7017 3135-3142 NN denotes control
T7018 3119-3134 JJ denotes transcriptional
T7019 3143-3145 VBZ denotes is
T7020 3216-3222 VB denotes derive
T7021 3155-3157 IN denotes by
T7022 3158-3161 NN denotes DNA
T7023 3162-3175 NN denotes recombination
T7024 3175-3177 , denotes ,
T7025 3177-3180 DT denotes the
T7026 3181-3190 NN denotes beginning
T7027 3191-3193 IN denotes of
T7028 3194-3198 DT denotes each
T7029 3199-3209 NN denotes transcript
T7030 3210-3215 MD denotes might
T7031 3223-3227 IN denotes from
T7032 3228-3229 DT denotes a
T7033 3247-3253 NN denotes region
T7034 3230-3237 VBN denotes donated
T7035 3238-3246 NN denotes promoter
T7036 3253-3255 , denotes ,
T7037 3255-3259 IN denotes with
T7038 3260-3263 DT denotes the
T7039 3264-3268 NN denotes rest
T7040 3287-3293 VBG denotes coming
T7041 3269-3271 IN denotes of
T7042 3272-3275 DT denotes the
T7043 3276-3286 NN denotes transcript
T7044 3294-3298 IN denotes from
T7045 3299-3302 DT denotes the
T7046 3325-3330 NN denotes locus
T7047 3303-3309 JJ denotes native
T7048 3310-3313 NN denotes ORF
T7049 3314-3324 VBG denotes containing
T7050 3313-3314 HYPH denotes -
T7051 3330-3331 . denotes .
T7052 3331-3490 sentence denotes In order to examine the recombination hypothesis, we analyzed 115 cDNA clones for which sim4 failed to align 20 bp or more to the corresponding genomic locus.
T7053 3332-3334 IN denotes In
T7054 3385-3393 VBD denotes analyzed
T7055 3335-3340 NN denotes order
T7056 3341-3343 TO denotes to
T7057 3344-3351 VB denotes examine
T7058 3352-3355 DT denotes the
T7059 3370-3380 NN denotes hypothesis
T7060 3356-3369 NN denotes recombination
T7061 3380-3382 , denotes ,
T7062 3382-3384 PRP denotes we
T7063 3394-3397 CD denotes 115
T7064 3403-3409 NNS denotes clones
T7065 3398-3402 NN denotes cDNA
T7066 3410-3413 IN denotes for
T7067 3425-3431 VBD denotes failed
T7068 3414-3419 WDT denotes which
T7069 3420-3424 NN denotes sim4
T7070 3432-3434 TO denotes to
T7071 3435-3440 VB denotes align
T7072 3441-3443 CD denotes 20
T7073 3444-3446 NNS denotes bp
T7074 3447-3449 CC denotes or
T7075 3450-3454 JJR denotes more
T7076 3455-3457 IN denotes to
T7077 3458-3461 DT denotes the
T7078 3484-3489 NN denotes locus
T7079 3462-3475 VBG denotes corresponding
T7080 3476-3483 JJ denotes genomic
T7081 3489-3490 . denotes .
T7082 3490-3659 sentence denotes In most cases, the missing sequence was explained by gaps in the genomic sequence or by matches that fell below our percent identity-based cutoff for reporting matches.
T7083 3491-3493 IN denotes In
T7084 3531-3540 VBN denotes explained
T7085 3494-3498 JJS denotes most
T7086 3499-3504 NNS denotes cases
T7087 3504-3506 , denotes ,
T7088 3506-3509 DT denotes the
T7089 3518-3526 NN denotes sequence
T7090 3510-3517 JJ denotes missing
T7091 3527-3530 VBD denotes was
T7092 3541-3543 IN denotes by
T7093 3544-3548 NNS denotes gaps
T7094 3549-3551 IN denotes in
T7095 3552-3555 DT denotes the
T7096 3564-3572 NN denotes sequence
T7097 3556-3563 JJ denotes genomic
T7098 3573-3575 CC denotes or
T7099 3576-3578 IN denotes by
T7100 3579-3586 NNS denotes matches
T7101 3587-3591 WDT denotes that
T7102 3592-3596 VBD denotes fell
T7103 3597-3602 IN denotes below
T7104 3603-3606 PRP$ denotes our
T7105 3630-3636 NN denotes cutoff
T7106 3607-3614 NN denotes percent
T7107 3615-3623 NN denotes identity
T7108 3624-3629 VBN denotes based
T7109 3623-3624 HYPH denotes -
T7110 3637-3640 IN denotes for
T7111 3641-3650 VBG denotes reporting
T7112 3651-3658 NNS denotes matches
T7113 3658-3659 . denotes .
T7114 3659-3796 sentence denotes For three cDNAs (from three different olfactory receptors), we found that the missing piece of sequence matched elsewhere in the genome.
T7115 3660-3663 IN denotes For
T7116 3723-3728 VBD denotes found
T7117 3664-3669 CD denotes three
T7118 3670-3675 NNS denotes cDNAs
T7119 3676-3677 -LRB- denotes (
T7120 3677-3681 IN denotes from
T7121 3682-3687 CD denotes three
T7122 3708-3717 NNS denotes receptors
T7123 3688-3697 JJ denotes different
T7124 3698-3707 JJ denotes olfactory
T7125 3717-3718 -RRB- denotes )
T7126 3718-3720 , denotes ,
T7127 3720-3722 PRP denotes we
T7128 3729-3733 IN denotes that
T7129 3764-3771 VBN denotes matched
T7130 3734-3737 DT denotes the
T7131 3746-3751 NN denotes piece
T7132 3738-3745 JJ denotes missing
T7133 3752-3754 IN denotes of
T7134 3755-3763 NN denotes sequence
T7135 3772-3781 RB denotes elsewhere
T7136 3782-3784 IN denotes in
T7137 3785-3788 DT denotes the
T7138 3789-3795 NN denotes genome
T7139 3795-3796 . denotes .
T7140 3796-3876 sentence denotes Comparison with the public mouse genome assembly confirmed the distant matches.
T7141 3797-3807 NN denotes Comparison
T7142 3846-3855 VBD denotes confirmed
T7143 3808-3812 IN denotes with
T7144 3813-3816 DT denotes the
T7145 3837-3845 NN denotes assembly
T7146 3817-3823 JJ denotes public
T7147 3824-3829 NN denotes mouse
T7148 3830-3836 NN denotes genome
T7149 3856-3859 DT denotes the
T7150 3868-3875 NNS denotes matches
T7151 3860-3867 JJ denotes distant
T7152 3875-3876 . denotes .
T7153 3876-4081 sentence denotes With such a small number of cDNAs exhibiting a possible sign of DNA recombination (a sign that could also be interpreted as chimeric cDNA clones), we conclude that such rearrangement is unlikely to occur.
T7154 3877-3881 IN denotes With
T7155 4027-4035 VBP denotes conclude
T7156 3882-3886 PDT denotes such
T7157 3895-3901 NN denotes number
T7158 3887-3888 DT denotes a
T7159 3889-3894 JJ denotes small
T7160 3911-3921 VBG denotes exhibiting
T7161 3902-3904 IN denotes of
T7162 3905-3910 NNS denotes cDNAs
T7163 3922-3923 DT denotes a
T7164 3933-3937 NN denotes sign
T7165 3924-3932 JJ denotes possible
T7166 3938-3940 IN denotes of
T7167 3941-3944 NN denotes DNA
T7168 3945-3958 NN denotes recombination
T7169 3959-3960 -LRB- denotes (
T7170 3960-3961 DT denotes a
T7171 3962-3966 NN denotes sign
T7172 3967-3971 WDT denotes that
T7173 3986-3997 VBN denotes interpreted
T7174 3972-3977 MD denotes could
T7175 3978-3982 RB denotes also
T7176 3983-3985 VB denotes be
T7177 3998-4000 IN denotes as
T7178 4001-4009 JJ denotes chimeric
T7179 4015-4021 NNS denotes clones
T7180 4010-4014 NN denotes cDNA
T7181 4021-4022 -RRB- denotes )
T7182 4022-4024 , denotes ,
T7183 4024-4026 PRP denotes we
T7184 4036-4040 IN denotes that
T7185 4060-4062 VBZ denotes is
T7186 4041-4045 JJ denotes such
T7187 4046-4059 NN denotes rearrangement
T7188 4063-4071 JJ denotes unlikely
T7189 4072-4074 TO denotes to
T7190 4075-4080 VB denotes occur
T7191 4080-4081 . denotes .
T7192 4081-4295 sentence denotes However, the possibility remains that DNA recombination is responsible for olfactory receptor transcriptional regulation, with the donated region contributing only promoter sequences but no part of the transcript.
T7193 4082-4089 RB denotes However
T7194 4107-4114 VBZ denotes remains
T7195 4089-4091 , denotes ,
T7196 4091-4094 DT denotes the
T7197 4095-4106 NN denotes possibility
T7198 4115-4119 IN denotes that
T7199 4138-4140 VBZ denotes is
T7200 4120-4123 NN denotes DNA
T7201 4124-4137 NN denotes recombination
T7202 4141-4152 JJ denotes responsible
T7203 4153-4156 IN denotes for
T7204 4157-4166 JJ denotes olfactory
T7205 4167-4175 NN denotes receptor
T7206 4192-4202 NN denotes regulation
T7207 4176-4191 JJ denotes transcriptional
T7208 4202-4204 , denotes ,
T7209 4204-4208 IN denotes with
T7210 4209-4212 DT denotes the
T7211 4221-4227 NN denotes region
T7212 4213-4220 VBN denotes donated
T7213 4228-4240 VBG denotes contributing
T7214 4241-4245 RB denotes only
T7215 4255-4264 NNS denotes sequences
T7216 4246-4254 NN denotes promoter
T7217 4265-4268 CC denotes but
T7218 4269-4271 DT denotes no
T7219 4272-4276 NN denotes part
T7220 4277-4279 IN denotes of
T7221 4280-4283 DT denotes the
T7222 4284-4294 NN denotes transcript
T7223 4294-4295 . denotes .
T6998 3033-3035 VBZ denotes is
R3822 T6417 T6418 det A,subset
R3823 T6418 T6419 nsubj subset,shows
R3824 T6420 T6418 prep of,subset
R3825 T6421 T6422 amod olfactory,receptors
R3826 T6422 T6420 pobj receptors,of
R3827 T6423 T6424 amod unusual,splicing
R3828 T6424 T6419 dobj splicing,shows
R3829 T6426 T6427 nsubj We,identified
R3830 T6428 T6429 nummod 62,cDNAs
R3831 T6429 T6427 dobj cDNAs,identified
R3832 T6430 T6431 punct (,%
R3833 T6431 T6429 parataxis %,cDNAs
R3834 T6432 T6431 nummod 5,%
R3835 T6433 T6431 prep of,%
R3836 T6434 T6435 det all,clones
R3837 T6435 T6433 pobj clones,of
R3838 T6436 T6437 amod olfactory,receptor
R3839 T6437 T6435 compound receptor,clones
R3840 T6438 T6431 punct ),%
R3841 T6439 T6427 prep from,identified
R3842 T6440 T6441 nummod 38,receptors
R3843 T6441 T6439 pobj receptors,from
R3844 T6442 T6441 amod intact,receptors
R3845 T6443 T6441 amod olfactory,receptors
R3846 T6444 T6441 cc and,receptors
R3847 T6445 T6446 nummod one,pseudogene
R3848 T6446 T6441 conj pseudogene,receptors
R3849 T6447 T6448 amod olfactory,receptor
R3850 T6448 T6446 compound receptor,pseudogene
R3851 T6449 T6450 advmod where,used
R3852 T6450 T6446 relcl used,pseudogene
R3853 T6451 T6452 det a,site
R3854 T6452 T6450 nsubjpass site,used
R3855 T6453 T6452 compound splice,site
R3856 T6454 T6452 prep within,site
R3857 T6455 T6456 det the,region
R3858 T6456 T6454 pobj region,within
R3859 T6457 T6458 npadvmod protein,coding
R3860 T6458 T6456 amod coding,region
R3861 T6459 T6458 punct -,coding
R3862 T6460 T6450 auxpass is,used
R3863 T6461 T6427 punct .,identified
R3864 T6463 T6464 prep For,appears
R3865 T6465 T6466 nummod two,genes
R3866 T6466 T6463 pobj genes,For
R3867 T6467 T6468 punct (,Figure
R3868 T6468 T6466 parataxis Figure,genes
R3869 T6469 T6470 amod top,cDNAs
R3870 T6470 T6468 dep cDNAs,Figure
R3871 T6471 T6470 nummod two,cDNAs
R3872 T6472 T6468 punct ", ",Figure
R3873 T6473 T6468 nummod 6,Figure
R3874 T6474 T6468 punct ),Figure
R3875 T6475 T6464 punct ", ",appears
R3876 T6476 T6477 det the,protein
R3877 T6477 T6464 nsubj protein,appears
R3878 T6478 T6477 amod predicted,protein
R3879 T6479 T6480 aux to,be
R3880 T6480 T6464 xcomp be,appears
R3881 T6481 T6482 det an,receptor
R3882 T6482 T6480 attr receptor,be
R3883 T6483 T6482 amod intact,receptor
R3884 T6484 T6482 amod olfactory,receptor
R3885 T6485 T6482 prep with,receptor
R3886 T6486 T6487 nummod three,acids
R3887 T6487 T6485 pobj acids,with
R3888 T6488 T6486 cc or,three
R3889 T6489 T6486 conj ten,three
R3890 T6490 T6487 compound amino,acids
R3891 T6491 T6487 punct ", ",acids
R3892 T6492 T6487 prep including,acids
R3893 T6493 T6494 det the,methionine
R3894 T6494 T6492 pobj methionine,including
R3895 T6495 T6494 amod initiating,methionine
R3896 T6496 T6487 punct ", ",acids
R3897 T6497 T6487 acl contributed,acids
R3898 T6498 T6497 agent by,contributed
R3899 T6499 T6500 det an,exon
R3900 T6500 T6498 pobj exon,by
R3901 T6501 T6500 amod upstream,exon
R3902 T6502 T6464 punct .,appears
R3903 T6504 T6505 det A,structure
R3904 T6505 T6508 nsubjpass structure,described
R3905 T6506 T6505 amod similar,structure
R3906 T6507 T6505 compound gene,structure
R3907 T6509 T6508 auxpass was,described
R3908 T6510 T6508 advmod previously,described
R3909 T6511 T6508 prep for,described
R3910 T6512 T6513 det a,receptor
R3911 T6513 T6511 pobj receptor,for
R3912 T6514 T6513 amod human,receptor
R3913 T6515 T6513 amod olfactory,receptor
R3914 T6516 T6517 punct [,25
R3915 T6517 T6508 parataxis 25,described
R3916 T6518 T6517 punct ],25
R3917 T6519 T6508 punct .,described
R3918 T6521 T6522 nsubj One,has
R3919 T6523 T6521 prep of,One
R3920 T6524 T6525 det these,genes
R3921 T6525 T6523 pobj genes,of
R3922 T6526 T6525 nummod two,genes
R3923 T6527 T6525 compound mouse,genes
R3924 T6528 T6529 det no,codon
R3925 T6529 T6522 dobj codon,has
R3926 T6530 T6529 compound start,codon
R3927 T6531 T6529 prep in,codon
R3928 T6532 T6533 poss its,exon
R3929 T6533 T6531 pobj exon,in
R3930 T6534 T6535 advmod otherwise,intact
R3931 T6535 T6533 amod intact,exon
R3932 T6536 T6533 amod main,exon
R3933 T6537 T6533 amod coding,exon
R3934 T6538 T6522 punct .,has
R3935 T6540 T6541 det The,splicing
R3936 T6541 T6543 nsubj splicing,rescues
R3937 T6542 T6541 amod unusual,splicing
R3938 T6544 T6543 advmod thus,rescues
R3939 T6545 T6546 dep what,be
R3940 T6546 T6543 ccomp be,rescues
R3941 T6547 T6546 aux would,be
R3942 T6548 T6546 advmod otherwise,be
R3943 T6549 T6550 det a,gene
R3944 T6550 T6546 attr gene,be
R3945 T6551 T6550 amod dysfunctional,gene
R3946 T6552 T6543 punct .,rescues
R3947 T6554 T6555 prep In,encode
R3948 T6556 T6557 amod most,cases
R3949 T6557 T6554 pobj cases,In
R3950 T6558 T6557 punct (,cases
R3951 T6559 T6560 quantmod 60,62
R3952 T6560 T6563 nummod 62,cDNAs
R3953 T6561 T6560 quantmod out,62
R3954 T6562 T6560 quantmod of,62
R3955 T6563 T6557 appos cDNAs,cases
R3956 T6564 T6557 punct ),cases
R3957 T6565 T6555 punct ", ",encode
R3958 T6566 T6567 det the,transcript
R3959 T6567 T6569 nsubj transcript,appears
R3960 T6568 T6567 amod unusual,transcript
R3961 T6569 T6555 ccomp appears,encode
R3962 T6570 T6571 aux to,be
R3963 T6571 T6569 xcomp be,appears
R3964 T6572 T6573 det an,form
R3965 T6573 T6571 attr form,be
R3966 T6574 T6573 amod aberrant,form
R3967 T6575 T6573 compound splice,form
R3968 T6576 T6555 punct -,encode
R3969 T6577 T6578 det the,transcript
R3970 T6578 T6555 nsubj transcript,encode
R3971 T6579 T6555 aux would,encode
R3972 T6580 T6555 advmod probably,encode
R3973 T6581 T6555 neg not,encode
R3974 T6582 T6583 det a,protein
R3975 T6583 T6555 dobj protein,encode
R3976 T6584 T6583 amod functional,protein
R3977 T6585 T6586 mark because,introduces
R3978 T6586 T6555 advcl introduces,encode
R3979 T6587 T6588 det the,splice
R3980 T6588 T6586 nsubj splice,introduces
R3981 T6589 T6590 det a,frameshift
R3982 T6590 T6586 dobj frameshift,introduces
R3983 T6591 T6586 cc or,introduces
R3984 T6592 T6586 conj removes,introduces
R3985 T6593 T6594 amod conserved,residues
R3986 T6594 T6592 dobj residues,removes
R3987 T6595 T6594 amod functional,residues
R3988 T6596 T6597 punct (,Figure
R3989 T6597 T6555 parataxis Figure,encode
R3990 T6598 T6597 nummod 6,Figure
R3991 T6599 T6597 punct ),Figure
R3992 T6600 T6555 punct .,encode
R3993 T6602 T6603 prep For,is
R3994 T6604 T6605 nummod two,clones
R3995 T6605 T6602 pobj clones,For
R3996 T6606 T6607 punct (,Figure
R3997 T6607 T6605 parataxis Figure,clones
R3998 T6608 T6609 amod bottom,cDNAs
R3999 T6609 T6607 dep cDNAs,Figure
R4000 T6610 T6609 nummod two,cDNAs
R4001 T6611 T6607 punct ", ",Figure
R4002 T6612 T6607 nummod 6,Figure
R4003 T6613 T6607 punct ),Figure
R4004 T6614 T6603 punct ", ",is
R4005 T6615 T6616 compound exon,order
R4006 T6616 T6603 nsubj order,is
R4007 T6617 T6616 prep in,order
R4008 T6618 T6619 det the,clone
R4009 T6619 T6617 pobj clone,in
R4010 T6620 T6619 compound cDNA,clone
R4011 T6621 T6603 acomp inconsistent,is
R4012 T6622 T6621 prep with,inconsistent
R4013 T6623 T6624 det the,sequence
R4014 T6624 T6622 pobj sequence,with
R4015 T6625 T6624 amod corresponding,sequence
R4016 T6626 T6624 amod genomic,sequence
R4017 T6627 T6603 punct .,is
R4018 T6629 T6630 nsubj It,is
R4019 T6630 T6631 ccomp is,suggest
R4020 T6632 T6630 acomp difficult,is
R4021 T6633 T6634 aux to,imagine
R4022 T6634 T6630 xcomp imagine,is
R4023 T6635 T6636 det what,kind
R4024 T6636 T6637 dep kind,resulted
R4025 T6637 T6634 ccomp resulted,imagine
R4026 T6638 T6636 prep of,kind
R4027 T6639 T6640 amod cloning,artefact
R4028 T6640 T6638 pobj artefact,of
R4029 T6641 T6637 prep in,resulted
R4030 T6642 T6643 det these,cDNAs
R4031 T6643 T6641 pobj cDNAs,in
R4032 T6644 T6645 advmod severely,scrambled
R4033 T6645 T6643 amod scrambled,cDNAs
R4034 T6646 T6631 punct : ,suggest
R4035 T6647 T6631 nsubj we,suggest
R4036 T6648 T6649 mark that,derive
R4037 T6649 T6631 ccomp derive,suggest
R4038 T6650 T6649 nsubj they,derive
R4039 T6651 T6649 prep from,derive
R4040 T6652 T6653 amod real,transcripts
R4041 T6653 T6651 pobj transcripts,from
R4042 T6654 T6652 cc but,real
R4043 T6655 T6652 conj rare,real
R4044 T6656 T6631 punct .,suggest
R4045 T6658 T6659 advmod However,suggests
R4046 T6660 T6659 punct ", ",suggests
R4047 T6661 T6662 poss their,frequency
R4048 T6662 T6659 nsubj frequency,suggests
R4049 T6663 T6662 amod low,frequency
R4050 T6664 T6662 prep in,frequency
R4051 T6665 T6666 poss our,collection
R4052 T6666 T6664 pobj collection,in
R4053 T6667 T6666 compound cDNA,collection
R4054 T6668 T6669 mark that,contribute
R4055 T6669 T6659 ccomp contribute,suggests
R4056 T6670 T6669 nsubj splicing,contribute
R4057 T6671 T6670 amod contrary,splicing
R4058 T6672 T6671 prep to,contrary
R4059 T6673 T6674 amod genomic,organization
R4060 T6674 T6672 pobj organization,to
R4061 T6675 T6669 aux does,contribute
R4062 T6676 T6669 neg not,contribute
R4063 T6677 T6669 advmod significantly,contribute
R4064 T6678 T6669 prep to,contribute
R4065 T6679 T6680 det the,repertoire
R4066 T6680 T6678 pobj repertoire,to
R4067 T6681 T6682 amod olfactory,receptor
R4068 T6682 T6680 compound receptor,repertoire
R4069 T6683 T6680 compound transcript,repertoire
R4070 T6684 T6659 punct .,suggests
R4071 T6686 T6687 prep For,observe
R4072 T6688 T6689 quantmod 21,26
R4073 T6689 T6692 nummod 26,genes
R4074 T6690 T6689 quantmod of,26
R4075 T6691 T6689 quantmod the,26
R4076 T6692 T6686 pobj genes,For
R4077 T6693 T6694 prep for,found
R4078 T6694 T6692 relcl found,genes
R4079 T6695 T6693 pobj which,for
R4080 T6696 T6697 advmod unusually,spliced
R4081 T6697 T6698 amod spliced,cDNAs
R4082 T6698 T6694 nsubjpass cDNAs,found
R4083 T6699 T6694 auxpass were,found
R4084 T6700 T6687 punct ", ",observe
R4085 T6701 T6687 nsubj we,observe
R4086 T6702 T6687 advmod also,observe
R4087 T6703 T6704 det an,isoform
R4088 T6704 T6687 dobj isoform,observe
R4089 T6705 T6704 amod alternative,isoform
R4090 T6706 T6704 punct (,isoform
R4091 T6707 T6704 punct ',isoform
R4092 T6708 T6704 amod normal,isoform
R4093 T6709 T6704 punct ',isoform
R4094 T6710 T6704 punct ),isoform
R4095 T6711 T6712 dep that,use
R4096 T6712 T6704 relcl use,isoform
R4097 T6713 T6712 aux does,use
R4098 T6714 T6712 neg not,use
R4099 T6715 T6716 compound splice,sites
R4100 T6716 T6712 dobj sites,use
R4101 T6717 T6716 prep within,sites
R4102 T6718 T6719 det the,region
R4103 T6719 T6717 pobj region,within
R4104 T6720 T6719 amod coding,region
R4105 T6721 T6687 punct .,observe
R4106 T6723 T6724 punct (,identified
R4107 T6725 T6724 prep For,identified
R4108 T6726 T6727 det the,13
R4109 T6727 T6725 pobj 13,For
R4110 T6728 T6727 amod remaining,13
R4111 T6729 T6727 prep of,13
R4112 T6730 T6731 det the,genes
R4113 T6731 T6729 pobj genes,of
R4114 T6732 T6731 nummod 3,genes
R4115 T6733 T6732 punct ',3
R4116 T6734 T6731 acl showing,genes
R4117 T6735 T6736 amod odd,splicing
R4118 T6736 T6734 dobj splicing,showing
R4119 T6737 T6724 punct ", ",identified
R4120 T6738 T6724 nsubj we,identified
R4121 T6739 T6724 aux have,identified
R4122 T6740 T6741 advmod only,cDNA
R4123 T6741 T6724 dobj cDNA,identified
R4124 T6742 T6741 nummod one,cDNA
R4125 T6743 T6724 cc so,identified
R4126 T6744 T6745 aux have,determined
R4127 T6745 T6724 conj determined,identified
R4128 T6746 T6745 neg not,determined
R4129 T6747 T6748 mark whether,are
R4130 T6748 T6745 ccomp are,determined
R4131 T6749 T6750 amod normal,isoforms
R4132 T6750 T6748 nsubj isoforms,are
R4133 T6751 T6748 acomp present,are
R4134 T6752 T6724 punct .,identified
R4135 T6753 T6724 punct ),identified
R4136 T6755 T6756 nsubjpass We,intrigued
R4137 T6757 T6756 auxpass were,intrigued
R4138 T6758 T6759 preconj both,by
R4139 T6759 T6756 prep by,intrigued
R4140 T6760 T6761 amod previous,reports
R4141 T6761 T6759 pobj reports,by
R4142 T6762 T6761 prep of,reports
R4143 T6763 T6762 pobj splicing,of
R4144 T6764 T6763 prep of,splicing
R4145 T6765 T6766 amod human,receptors
R4146 T6766 T6764 pobj receptors,of
R4147 T6767 T6766 amod olfactory,receptors
R4148 T6768 T6763 prep near,splicing
R4149 T6769 T6770 det the,complex
R4150 T6770 T6773 nmod complex,cluster
R4151 T6771 T6770 amod major,complex
R4152 T6772 T6770 nmod histocompatibility,complex
R4153 T6773 T6768 pobj cluster,near
R4154 T6774 T6770 punct (,complex
R4155 T6775 T6770 appos MHC,complex
R4156 T6776 T6773 punct ),cluster
R4157 T6777 T6773 punct ", ",cluster
R4158 T6778 T6779 advmod where,splice
R4159 T6779 T6773 relcl splice,cluster
R4160 T6780 T6781 amod several,genes
R4161 T6781 T6779 nsubj genes,splice
R4162 T6782 T6779 prep over,splice
R4163 T6783 T6784 amod long,distances
R4164 T6784 T6782 pobj distances,over
R4165 T6785 T6779 prep to,splice
R4166 T6786 T6787 det a,exon
R4167 T6787 T6785 pobj exon,to
R4168 T6788 T6787 amod common,exon
R4169 T6789 T6787 amod upstream,exon
R4170 T6790 T6791 punct [,27
R4171 T6791 T6773 parataxis 27,cluster
R4172 T6792 T6791 nummod 26,27
R4173 T6793 T6791 punct ",",27
R4174 T6794 T6791 punct ],27
R4175 T6795 T6759 cc and,by
R4176 T6796 T6759 conj by,by
R4177 T6797 T6798 det the,idea
R4178 T6798 T6796 pobj idea,by
R4179 T6799 T6800 mark that,achieved
R4180 T6800 T6798 acl achieved,idea
R4181 T6801 T6802 amod olfactory,receptor
R4182 T6802 T6803 nmod receptor,control
R4183 T6803 T6800 nsubjpass control,achieved
R4184 T6804 T6803 amod transcriptional,control
R4185 T6805 T6800 aux could,achieved
R4186 T6806 T6800 auxpass be,achieved
R4187 T6807 T6800 prep by,achieved
R4188 T6808 T6809 compound DNA,mechanisms
R4189 T6809 T6807 pobj mechanisms,by
R4190 T6810 T6809 compound recombination,mechanisms
R4191 T6811 T6800 punct ", ",achieved
R4192 T6812 T6813 advmod perhaps,with
R4193 T6813 T6800 prep with,achieved
R4194 T6814 T6815 det the,result
R4195 T6815 T6813 pobj result,with
R4196 T6816 T6817 mark that,contain
R4197 T6817 T6815 acl contain,result
R4198 T6818 T6817 nsubj transcripts,contain
R4199 T6819 T6817 aux would,contain
R4200 T6820 T6821 det some,sequence
R4201 T6821 T6817 dobj sequence,contain
R4202 T6822 T6821 prep from,sequence
R4203 T6823 T6824 det another,locus
R4204 T6824 T6822 pobj locus,from
R4205 T6825 T6756 punct .,intrigued
R4206 T6827 T6828 nsubj We,verified
R4207 T6829 T6828 advmod therefore,verified
R4208 T6830 T6831 mark that,matches
R4209 T6831 T6828 ccomp matches,verified
R4210 T6832 T6833 det the,sequence
R4211 T6833 T6831 nsubj sequence,matches
R4212 T6834 T6833 amod entire,sequence
R4213 T6835 T6833 prep of,sequence
R4214 T6836 T6837 det each,EST
R4215 T6837 T6835 pobj EST,of
R4216 T6838 T6839 amod olfactory,receptor
R4217 T6839 T6837 compound receptor,EST
R4218 T6840 T6841 det the,gene
R4219 T6841 T6843 poss gene,territory
R4220 T6842 T6841 amod corresponding,gene
R4221 T6843 T6831 dobj territory,matches
R4222 T6844 T6841 case 's,gene
R4223 T6845 T6843 amod genomic,territory
R4224 T6846 T6843 punct ',territory
R4225 T6847 T6843 punct ',territory
R4226 T6848 T6843 punct (,territory
R4227 T6849 T6843 acl defined,territory
R4228 T6850 T6849 prep for,defined
R4229 T6851 T6852 det this,purpose
R4230 T6852 T6850 pobj purpose,for
R4231 T6853 T6849 prep as,defined
R4232 T6854 T6853 prep from,as
R4233 T6855 T6856 nummod 1,kb
R4234 T6856 T6857 npadvmod kb,after
R4235 T6857 T6854 prep after,from
R4236 T6858 T6859 det the,gene
R4237 T6859 T6857 pobj gene,after
R4238 T6860 T6859 amod preceding,gene
R4239 T6861 T6854 prep to,from
R4240 T6862 T6863 nummod 1,kb
R4241 T6863 T6864 npadvmod kb,after
R4242 T6864 T6861 prep after,to
R4243 T6865 T6866 det the,gene
R4244 T6866 T6867 poss gene,codon
R4245 T6867 T6864 pobj codon,after
R4246 T6868 T6866 case 's,gene
R4247 T6869 T6867 compound stop,codon
R4248 T6870 T6828 punct ),verified
R4249 T6871 T6828 punct .,verified
R4250 T6873 T6874 nsubj We,found
R4251 T6875 T6876 det no,cDNAs
R4252 T6876 T6874 dobj cDNAs,found
R4253 T6877 T6878 advmod where,encompassed
R4254 T6878 T6876 relcl encompassed,cDNAs
R4255 T6879 T6878 nsubj introns,encompassed
R4256 T6880 T6881 amod other,genes
R4257 T6881 T6878 dobj genes,encompassed
R4258 T6882 T6883 amod olfactory,receptor
R4259 T6883 T6881 compound receptor,genes
R4260 T6884 T6878 punct ", ",encompassed
R4261 T6885 T6886 mark as,reported
R4262 T6886 T6878 advcl reported,encompassed
R4263 T6887 T6886 prep for,reported
R4264 T6888 T6889 amod olfactory,receptors
R4265 T6889 T6887 pobj receptors,for
R4266 T6890 T6889 prep in,receptors
R4267 T6891 T6892 det the,region
R4268 T6892 T6890 pobj region,in
R4269 T6893 T6894 amod human,MHC
R4270 T6894 T6892 compound MHC,region
R4271 T6895 T6896 punct [,27
R4272 T6896 T6886 parataxis 27,reported
R4273 T6897 T6896 nummod 26,27
R4274 T6898 T6896 punct ",",27
R4275 T6899 T6896 punct ],27
R4276 T6900 T6874 punct .,found
R4277 T6902 T6903 nummod Six,cDNAs
R4278 T6903 T6904 nsubj cDNAs,extend
R4279 T6905 T6904 aux do,extend
R4280 T6906 T6904 advmod further,extend
R4281 T6907 T6906 prep than,further
R4282 T6908 T6909 det a,gene
R4283 T6909 T6911 poss gene,territory
R4284 T6910 T6909 amod single,gene
R4285 T6911 T6907 pobj territory,than
R4286 T6912 T6909 case 's,gene
R4287 T6913 T6911 punct ',territory
R4288 T6914 T6904 punct ',extend
R4289 T6915 T6904 cc and,extend
R4290 T6916 T6904 conj appear,extend
R4291 T6917 T6918 neg not,be
R4292 T6918 T6916 xcomp be,appear
R4293 T6919 T6918 aux to,be
R4294 T6920 T6918 attr artifacts,be
R4295 T6921 T6920 prep of,artifacts
R4296 T6922 T6923 det the,process
R4297 T6923 T6921 pobj process,of
R4298 T6924 T6923 nmod sequencing,process
R4299 T6925 T6924 cc or,sequencing
R4300 T6926 T6924 conj analysis,sequencing
R4301 T6927 T6904 punct .,extend
R4302 T6929 T6930 prep In,use
R4303 T6931 T6929 pobj each,In
R4304 T6932 T6931 prep of,each
R4305 T6933 T6934 det these,cases
R4306 T6934 T6932 pobj cases,of
R4307 T6935 T6930 punct ", ",use
R4308 T6936 T6937 det the,clones
R4309 T6937 T6930 nsubj clones,use
R4310 T6938 T6939 compound splice,sites
R4311 T6939 T6930 dobj sites,use
R4312 T6940 T6930 prep within,use
R4313 T6941 T6942 det the,UTR
R4314 T6942 T6940 pobj UTR,within
R4315 T6943 T6942 nummod 3,UTR
R4316 T6944 T6943 punct ',3
R4317 T6945 T6930 cc and,use
R4318 T6946 T6947 advmod thus,extend
R4319 T6947 T6930 conj extend,use
R4320 T6948 T6949 advmod further,downstream
R4321 T6949 T6947 advmod downstream,extend
R4322 T6950 T6948 prep than,further
R4323 T6951 T6952 det the,kb
R4324 T6952 T6950 pobj kb,than
R4325 T6953 T6952 amod arbitrary,kb
R4326 T6954 T6952 nummod 1,kb
R4327 T6955 T6949 prep of,downstream
R4328 T6956 T6957 det the,codon
R4329 T6957 T6955 pobj codon,of
R4330 T6958 T6957 compound stop,codon
R4331 T6959 T6930 punct .,use
R4332 T6961 T6962 nsubj Five,use
R4333 T6963 T6961 prep of,Five
R4334 T6964 T6965 det these,cDNAs
R4335 T6965 T6963 pobj cDNAs,of
R4336 T6966 T6965 nummod six,cDNAs
R4337 T6967 T6962 advmod also,use
R4338 T6968 T6969 compound splice,donor
R4339 T6969 T6971 compound donor,sites
R4340 T6970 T6969 punct -,donor
R4341 T6971 T6962 dobj sites,use
R4342 T6972 T6962 prep within,use
R4343 T6973 T6974 det the,region
R4344 T6974 T6972 pobj region,within
R4345 T6975 T6974 amod coding,region
R4346 T6976 T6962 cc and,use
R4347 T6977 T6962 conj encode,use
R4348 T6978 T6979 amod disrupted,receptors
R4349 T6979 T6977 dobj receptors,encode
R4350 T6980 T6979 amod olfactory,receptors
R4351 T6981 T6982 punct (,Figure
R4352 T6982 T6977 parataxis Figure,encode
R4353 T6983 T6982 nummod 6,Figure
R4354 T6984 T6982 punct ),Figure
R4355 T6985 T6962 punct .,use
R4356 T6987 T6988 prep In,spliced
R4357 T6989 T6990 det the,cDNA
R4358 T6990 T6987 pobj cDNA,In
R4359 T6991 T6990 amod sixth,cDNA
R4360 T6992 T6988 punct ", ",spliced
R4361 T6993 T6994 det a,intron
R4362 T6994 T6988 nsubjpass intron,spliced
R4363 T6995 T6996 nummod 2.6,kb
R4364 T6996 T6994 compound kb,intron
R4365 T6997 T6996 punct -,kb
R4366 T6998 T6988 auxpass is,spliced
R4367 T6999 T6988 prep out,spliced
R4368 T7000 T6999 prep of,out
R4369 T7001 T7002 det the,UTR
R4370 T7002 T7000 pobj UTR,of
R4371 T7003 T7002 nummod 3,UTR
R4372 T7004 T7003 punct ',3
R4373 T7005 T6988 punct ", ",spliced
R4374 T7006 T6988 advcl leaving,spliced
R4375 T7007 T7008 det the,region
R4376 T7008 T7006 dobj region,leaving
R4377 T7009 T7008 amod coding,region
R4378 T7010 T7006 oprd intact,leaving
R4379 T7011 T6988 punct .,spliced
R4380 T7013 T7014 mark If,achieved
R4381 T7014 T7020 advcl achieved,derive
R4382 T7015 T7016 amod olfactory,receptor
R4383 T7016 T7017 nmod receptor,control
R4384 T7017 T7014 nsubjpass control,achieved
R4385 T7018 T7017 amod transcriptional,control
R4386 T7019 T7014 auxpass is,achieved
R4387 T7021 T7014 prep by,achieved
R4388 T7022 T7023 compound DNA,recombination
R4389 T7023 T7021 pobj recombination,by
R4390 T7024 T7020 punct ", ",derive
R4391 T7025 T7026 det the,beginning
R4392 T7026 T7020 nsubj beginning,derive
R4393 T7027 T7026 prep of,beginning
R4394 T7028 T7029 det each,transcript
R4395 T7029 T7027 pobj transcript,of
R4396 T7030 T7020 aux might,derive
R4397 T7031 T7020 prep from,derive
R4398 T7032 T7033 det a,region
R4399 T7033 T7031 pobj region,from
R4400 T7034 T7033 amod donated,region
R4401 T7035 T7033 compound promoter,region
R4402 T7036 T7020 punct ", ",derive
R4403 T7037 T7020 prep with,derive
R4404 T7038 T7039 det the,rest
R4405 T7039 T7040 nsubj rest,coming
R4406 T7040 T7037 pcomp coming,with
R4407 T7041 T7039 prep of,rest
R4408 T7042 T7043 det the,transcript
R4409 T7043 T7041 pobj transcript,of
R4410 T7044 T7040 prep from,coming
R4411 T7045 T7046 det the,locus
R4412 T7046 T7044 pobj locus,from
R4413 T7047 T7046 amod native,locus
R4414 T7048 T7049 npadvmod ORF,containing
R4415 T7049 T7046 amod containing,locus
R4416 T7050 T7049 punct -,containing
R4417 T7051 T7020 punct .,derive
R4418 T7053 T7054 prep In,analyzed
R4419 T7055 T7053 pobj order,In
R4420 T7056 T7057 aux to,examine
R4421 T7057 T7055 acl examine,order
R4422 T7058 T7059 det the,hypothesis
R4423 T7059 T7057 dobj hypothesis,examine
R4424 T7060 T7059 compound recombination,hypothesis
R4425 T7061 T7054 punct ", ",analyzed
R4426 T7062 T7054 nsubj we,analyzed
R4427 T7063 T7064 nummod 115,clones
R4428 T7064 T7054 dobj clones,analyzed
R4429 T7065 T7064 compound cDNA,clones
R4430 T7066 T7067 prep for,failed
R4431 T7067 T7064 relcl failed,clones
R4432 T7068 T7066 pobj which,for
R4433 T7069 T7067 nsubj sim4,failed
R4434 T7070 T7071 aux to,align
R4435 T7071 T7067 xcomp align,failed
R4436 T7072 T7073 nummod 20,bp
R4437 T7073 T7071 dobj bp,align
R4438 T7074 T7075 cc or,more
R4439 T7075 T7073 nummod more,bp
R4440 T7076 T7071 prep to,align
R4441 T7077 T7078 det the,locus
R4442 T7078 T7076 pobj locus,to
R4443 T7079 T7078 amod corresponding,locus
R4444 T7080 T7078 amod genomic,locus
R4445 T7081 T7054 punct .,analyzed
R4446 T7083 T7084 prep In,explained
R4447 T7085 T7086 amod most,cases
R4448 T7086 T7083 pobj cases,In
R4449 T7087 T7084 punct ", ",explained
R4450 T7088 T7089 det the,sequence
R4451 T7089 T7084 nsubjpass sequence,explained
R4452 T7090 T7089 amod missing,sequence
R4453 T7091 T7084 auxpass was,explained
R4454 T7092 T7084 prep by,explained
R4455 T7093 T7092 pobj gaps,by
R4456 T7094 T7093 prep in,gaps
R4457 T7095 T7096 det the,sequence
R4458 T7096 T7094 pobj sequence,in
R4459 T7097 T7096 amod genomic,sequence
R4460 T7098 T7092 cc or,by
R4461 T7099 T7092 conj by,by
R4462 T7100 T7099 pobj matches,by
R4463 T7101 T7102 dep that,fell
R4464 T7102 T7100 relcl fell,matches
R4465 T7103 T7102 prep below,fell
R4466 T7104 T7105 poss our,cutoff
R4467 T7105 T7103 pobj cutoff,below
R4468 T7106 T7107 compound percent,identity
R4469 T7107 T7108 npadvmod identity,based
R4470 T7108 T7105 amod based,cutoff
R4471 T7109 T7108 punct -,based
R4472 T7110 T7105 prep for,cutoff
R4473 T7111 T7110 pcomp reporting,for
R4474 T7112 T7111 dobj matches,reporting
R4475 T7113 T7084 punct .,explained
R4476 T7115 T7116 prep For,found
R4477 T7117 T7118 nummod three,cDNAs
R4478 T7118 T7115 pobj cDNAs,For
R4479 T7119 T7118 punct (,cDNAs
R4480 T7120 T7118 prep from,cDNAs
R4481 T7121 T7122 nummod three,receptors
R4482 T7122 T7120 pobj receptors,from
R4483 T7123 T7122 amod different,receptors
R4484 T7124 T7122 amod olfactory,receptors
R4485 T7125 T7116 punct ),found
R4486 T7126 T7116 punct ", ",found
R4487 T7127 T7116 nsubj we,found
R4488 T7128 T7129 mark that,matched
R4489 T7129 T7116 ccomp matched,found
R4490 T7130 T7131 det the,piece
R4491 T7131 T7129 nsubj piece,matched
R4492 T7132 T7131 amod missing,piece
R4493 T7133 T7131 prep of,piece
R4494 T7134 T7133 pobj sequence,of
R4495 T7135 T7129 advmod elsewhere,matched
R4496 T7136 T7135 prep in,elsewhere
R4497 T7137 T7138 det the,genome
R4498 T7138 T7136 pobj genome,in
R4499 T7139 T7116 punct .,found
R4500 T7141 T7142 nsubj Comparison,confirmed
R4501 T7143 T7141 prep with,Comparison
R4502 T7144 T7145 det the,assembly
R4503 T7145 T7143 pobj assembly,with
R4504 T7146 T7145 amod public,assembly
R4505 T7147 T7148 compound mouse,genome
R4506 T7148 T7145 compound genome,assembly
R4507 T7149 T7150 det the,matches
R4508 T7150 T7142 dobj matches,confirmed
R4509 T7151 T7150 amod distant,matches
R4510 T7152 T7142 punct .,confirmed
R4511 T7154 T7155 prep With,conclude
R4512 T7156 T7157 predet such,number
R4513 T7157 T7160 nsubj number,exhibiting
R4514 T7158 T7157 det a,number
R4515 T7159 T7157 amod small,number
R4516 T7160 T7154 pcomp exhibiting,With
R4517 T7161 T7157 prep of,number
R4518 T7162 T7161 pobj cDNAs,of
R4519 T7163 T7164 det a,sign
R4520 T7164 T7160 dobj sign,exhibiting
R4521 T7165 T7164 amod possible,sign
R4522 T7166 T7164 prep of,sign
R4523 T7167 T7168 compound DNA,recombination
R4524 T7168 T7166 pobj recombination,of
R4525 T7169 T7164 punct (,sign
R4526 T7170 T7171 det a,sign
R4527 T7171 T7164 appos sign,sign
R4528 T7172 T7173 dep that,interpreted
R4529 T7173 T7171 relcl interpreted,sign
R4530 T7174 T7173 aux could,interpreted
R4531 T7175 T7173 advmod also,interpreted
R4532 T7176 T7173 auxpass be,interpreted
R4533 T7177 T7173 prep as,interpreted
R4534 T7178 T7179 amod chimeric,clones
R4535 T7179 T7177 pobj clones,as
R4536 T7180 T7179 compound cDNA,clones
R4537 T7181 T7155 punct ),conclude
R4538 T7182 T7155 punct ", ",conclude
R4539 T7183 T7155 nsubj we,conclude
R4540 T7184 T7185 mark that,is
R4541 T7185 T7155 ccomp is,conclude
R4542 T7186 T7187 amod such,rearrangement
R4543 T7187 T7185 nsubj rearrangement,is
R4544 T7188 T7185 acomp unlikely,is
R4545 T7189 T7190 aux to,occur
R4546 T7190 T7188 xcomp occur,unlikely
R4547 T7191 T7155 punct .,conclude
R4548 T7193 T7194 advmod However,remains
R4549 T7195 T7194 punct ", ",remains
R4550 T7196 T7197 det the,possibility
R4551 T7197 T7194 nsubj possibility,remains
R4552 T7198 T7199 mark that,is
R4553 T7199 T7194 ccomp is,remains
R4554 T7200 T7201 compound DNA,recombination
R4555 T7201 T7199 nsubj recombination,is
R4556 T7202 T7199 acomp responsible,is
R4557 T7203 T7202 prep for,responsible
R4558 T7204 T7205 amod olfactory,receptor
R4559 T7205 T7206 nmod receptor,regulation
R4560 T7206 T7203 pobj regulation,for
R4561 T7207 T7206 amod transcriptional,regulation
R4562 T7208 T7199 punct ", ",is
R4563 T7209 T7199 prep with,is
R4564 T7210 T7211 det the,region
R4565 T7211 T7213 nsubj region,contributing
R4566 T7212 T7211 amod donated,region
R4567 T7213 T7209 pcomp contributing,with
R4568 T7214 T7215 advmod only,sequences
R4569 T7215 T7213 dobj sequences,contributing
R4570 T7216 T7215 compound promoter,sequences
R4571 T7217 T7215 cc but,sequences
R4572 T7218 T7219 det no,part
R4573 T7219 T7215 conj part,sequences
R4574 T7220 T7219 prep of,part
R4575 T7221 T7222 det the,transcript
R4576 T7222 T7220 pobj transcript,of
R4577 T7223 T7194 punct .,remains

craft-ca-core-ex-dev

Below, discontinuous spans are shown in the chain model. You can change it to the bag model.

Id Subject Object Predicate Lexical cue
T6237 12-21 GO:0007608 denotes olfactory
T6238 12-31 GO_EXT:0004984 denotes olfactory receptors
T6239 46-54 GO:0008380 denotes splicing
T6240 72-77 SO_EXT:cDNA denotes cDNAs
T6241 89-98 GO:0007608 denotes olfactory
T6242 89-107 GO_EXT:0004984 denotes olfactory receptor
T6243 108-114 SO_EXT:sequence_cloned_entity denotes clones
T6244 131-140 GO:0007608 denotes olfactory
T6245 131-150 GO_EXT:0004984 denotes olfactory receptors
T6246 159-168 GO:0007608 denotes olfactory
T6247 159-177 GO_EXT:0004984 denotes olfactory receptor
T6248 178-188 SO_EXT:0000336 denotes pseudogene
T6249 197-203 GO:0008380 denotes splice
T6250 197-208 SO_EXT:0000162 denotes splice site
T6251 204-215 _FRAGMENT denotes site within
T6252 228-241 SO_EXT:0000851 denotes coding region
T6253 220-227 CHEBI_PR_EXT:protein denotes protein
T6254 220-234 SO:0000010 denotes protein-coding
T6255 259-264 SO_EXT:0000704 denotes genes
T6256 274-279 SO_EXT:cDNA denotes cDNAs
T6257 306-313 CHEBI_PR_EXT:protein denotes protein
T6258 338-347 GO:0007608 denotes olfactory
T6259 338-356 GO_EXT:0004984 denotes olfactory receptor
T6260 375-386 CHEBI_SO_EXT:amino_acid denotes amino acids
T6261 413-423 CHEBI_SO_EXT:methionine denotes methionine
T6262 443-451 SO_EXT:sequence_upstreamness denotes upstream
T6263 452-456 SO_EXT:0000147 denotes exon
T6264 468-472 SO_EXT:0000704 denotes gene
T6265 514-519 NCBITaxon:9606 denotes human
T6266 520-529 GO:0007608 denotes olfactory
T6267 520-538 GO_EXT:0004984 denotes olfactory receptor
T6268 562-567 NCBITaxon:10088 denotes mouse
T6269 568-573 SO_EXT:0000704 denotes genes
T6270 581-592 SO_EXT:0000318 denotes start codon
T6271 622-633 SO_EXT:0000195 denotes coding exon
T6272 647-655 GO:0008380 denotes splicing
T6273 661-668 SO_EXT:sequence_rescue_process denotes rescues
T6274 709-713 SO_EXT:0000704 denotes gene
T6275 743-748 SO_EXT:cDNA denotes cDNAs
T6276 763-773 SO_EXT:0000673 denotes transcript
T6277 800-806 GO:0008380 denotes splice
T6278 800-811 SO_EXT:alternative_splice_variant denotes splice form
T6279 818-828 SO_EXT:0000673 denotes transcript
T6280 848-854 SO_EXT:sequence_coding_function denotes encode
T6281 868-875 CHEBI_PR_EXT:protein denotes protein
T6282 888-894 GO:0008380 denotes splice
T6283 908-918 GO_SO_EXT:frameshift_entity_or_process denotes frameshift
T6284 930-939 SO_EXT:biological_conservation_process_or_quality denotes conserved
T6285 951-959 CHEBI_EXT:residue denotes residues
T6286 980-986 SO_EXT:sequence_cloned_entity denotes clones
T6287 999-1004 SO_EXT:cDNA denotes cDNAs
T6288 1017-1021 SO_EXT:0000147 denotes exon
T6289 1035-1045 SO_EXT:0000317 denotes cDNA clone
T6290 1085-1092 SO_EXT:0001026 denotes genomic
T6291 1093-1101 SO_EXT:biological_sequence denotes sequence
T6292 1143-1150 SO_EXT:sequence_cloning_process denotes cloning
T6293 1197-1202 SO_EXT:cDNA denotes cDNAs
T6294 1251-1262 SO_EXT:0000673 denotes transcripts
T6295 1300-1304 SO_EXT:cDNA denotes cDNA
T6296 1330-1338 GO:0008380 denotes splicing
T6297 1351-1358 SO_EXT:0001026 denotes genomic
T6298 1413-1422 GO:0007608 denotes olfactory
T6299 1413-1431 GO_EXT:0004984 denotes olfactory receptor
T6300 1432-1442 SO_EXT:0000673 denotes transcript
T6301 1472-1477 SO_EXT:0000704 denotes genes
T6302 1498-1505 GO:0008380 denotes spliced
T6303 1506-1511 SO_EXT:cDNA denotes cDNAs
T6304 1566-1573 SO_EXT:0001060 denotes isoform
T6305 1592-1598 GO:0008380 denotes splice
T6306 1592-1604 SO_EXT:0000162 denotes splice sites
T6307 1599-1611 _FRAGMENT denotes sites within
T6308 1616-1629 SO_EXT:0000851 denotes coding region
T6309 1663-1668 SO_EXT:0000704 denotes genes
T6310 1681-1689 GO:0008380 denotes splicing
T6311 1719-1723 SO_EXT:cDNA denotes cDNA
T6312 1762-1770 SO_EXT:0001060 denotes isoforms
T6313 1831-1839 GO:0008380 denotes splicing
T6314 1843-1848 NCBITaxon:9606 denotes human
T6315 1849-1858 GO:0007608 denotes olfactory
T6316 1849-1868 GO_EXT:0004984 denotes olfactory receptors
T6317 1878-1910 GO:0042611 denotes major histocompatibility complex
T6318 1912-1915 GO:0042611 denotes MHC
T6319 1940-1945 SO_EXT:0000704 denotes genes
T6320 1946-1952 GO:0008380 denotes splice
T6321 1985-1993 SO_EXT:sequence_upstreamness denotes upstream
T6322 1994-1998 SO_EXT:0000147 denotes exon
T6323 2028-2037 GO:0007608 denotes olfactory
T6324 2028-2046 GO_EXT:0004984 denotes olfactory receptor
T6325 2047-2062 GO_EXT:transcription denotes transcriptional
T6326 2063-2070 GO:0065007 denotes control
T6327 2092-2095 CHEBI_SO_EXT:DNA denotes DNA
T6328 2096-2109 GO_SO_EXT:sequence_rearrangement_process denotes recombination
T6329 2151-2162 SO_EXT:0000673 denotes transcripts
T6330 2182-2190 SO_EXT:biological_sequence denotes sequence
T6331 2249-2257 SO_EXT:biological_sequence denotes sequence
T6332 2266-2275 GO:0007608 denotes olfactory
T6333 2266-2284 GO_EXT:0004984 denotes olfactory receptor
T6334 2285-2288 SO_EXT:0000345 denotes EST
T6335 2315-2319 SO_EXT:0000704 denotes gene
T6336 2322-2329 SO_EXT:0001026 denotes genomic
T6337 2379-2380 CHEBI_SO_EXT:base denotes b
T6338 2401-2405 SO_EXT:0000704 denotes gene
T6339 2412-2413 CHEBI_SO_EXT:base denotes b
T6340 2424-2428 SO_EXT:0000704 denotes gene
T6341 2431-2441 SO_EXT:0000319 denotes stop codon
T6342 2456-2461 SO_EXT:cDNA denotes cDNAs
T6343 2468-2475 SO_EXT:0000188 denotes introns
T6344 2494-2503 GO:0007608 denotes olfactory
T6345 2494-2512 GO_EXT:0004984 denotes olfactory receptor
T6346 2513-2518 SO_EXT:0000704 denotes genes
T6347 2536-2545 GO:0007608 denotes olfactory
T6348 2536-2555 GO_EXT:0004984 denotes olfactory receptors
T6349 2563-2568 NCBITaxon:9606 denotes human
T6350 2569-2572 GO:0042611 denotes MHC
T6351 2593-2598 SO_EXT:cDNA denotes cDNAs
T6352 2631-2635 SO_EXT:0000704 denotes gene
T6353 2748-2754 SO_EXT:sequence_cloned_entity denotes clones
T6354 2759-2765 GO:0008380 denotes splice
T6355 2759-2771 SO_EXT:0000162 denotes splice sites
T6356 2766-2778 _FRAGMENT denotes sites within
T6357 2786-2789 SO_EXT:0000837 denotes UTR
T6358 2783-2789 SO_EXT:0000205 denotes 3' UTR
T6359 2836-2837 CHEBI_SO_EXT:base denotes b
T6360 2838-2848 SO_EXT:sequence_downstreamness denotes downstream
T6361 2856-2866 SO_EXT:0000319 denotes stop codon
T6362 2886-2891 SO_EXT:cDNA denotes cDNAs
T6363 2901-2907 GO:0008380 denotes splice
T6364 2901-2919 SO_EXT:splice_donor denotes splice-donor sites
T6365 2914-2926 _FRAGMENT denotes sites within
T6366 2931-2944 SO_EXT:0000851 denotes coding region
T6367 2949-2955 SO_EXT:sequence_coding_function denotes encode
T6368 2966-2975 GO:0007608 denotes olfactory
T6369 2966-2985 GO_EXT:0004984 denotes olfactory receptors
T6370 3011-3015 SO_EXT:cDNA denotes cDNA
T6371 3024-3025 CHEBI_SO_EXT:base denotes b
T6372 3026-3032 SO_EXT:0000188 denotes intron
T6373 3036-3043 GO:0008380 denotes spliced
T6374 3055-3061 SO_EXT:0000205 denotes 3' UTR
T6375 3075-3088 SO_EXT:coding_sequence denotes coding region
T6376 3100-3109 GO:0007608 denotes olfactory
T6377 3100-3118 GO_EXT:0004984 denotes olfactory receptor
T6378 3119-3134 GO_EXT:transcription denotes transcriptional
T6379 3135-3142 GO:0065007 denotes control
T6380 3158-3161 CHEBI_SO_EXT:DNA denotes DNA
T6381 3162-3175 GO_SO_EXT:sequence_rearrangement_process denotes recombination
T6382 3199-3209 SO_EXT:0000673 denotes transcript
T6383 3238-3246 SO_EXT:0000167 denotes promoter
T6384 3276-3286 SO_EXT:0000673 denotes transcript
T6385 3310-3313 SO_EXT:0000236 denotes ORF
T6386 3356-3369 GO_SO_EXT:sequence_rearrangement_process denotes recombination
T6387 3398-3409 SO_EXT:0000317 denotes cDNA clones
T6388 3444-3446 SO_EXT:0000028 denotes bp
T6389 3476-3483 SO_EXT:0001026 denotes genomic
T6390 3518-3526 SO_EXT:biological_sequence denotes sequence
T6391 3544-3548 SO_EXT:0000730 denotes gaps
T6392 3556-3563 SO_EXT:0001026 denotes genomic
T6393 3564-3572 SO_EXT:biological_sequence denotes sequence
T6394 3670-3675 SO_EXT:cDNA denotes cDNAs
T6395 3698-3707 GO:0007608 denotes olfactory
T6396 3698-3717 GO_EXT:0004984 denotes olfactory receptors
T6397 3755-3763 SO_EXT:biological_sequence denotes sequence
T6398 3789-3795 SO_EXT:0001026 denotes genome
T6399 3824-3829 NCBITaxon:10088 denotes mouse
T6400 3830-3836 SO_EXT:0001026 denotes genome
T6401 3837-3845 SO_EXT:sequence_assembly_entity_or_process denotes assembly
T6402 3905-3910 SO_EXT:cDNA denotes cDNAs
T6403 3941-3944 CHEBI_SO_EXT:DNA denotes DNA
T6404 3945-3958 GO_SO_EXT:sequence_rearrangement_process denotes recombination
T6405 4001-4021 SO_EXT:0000810 denotes chimeric cDNA clones
T6406 4046-4059 GO_SO_EXT:sequence_rearrangement_entity_or_process denotes rearrangement
T6407 4120-4123 CHEBI_SO_EXT:DNA denotes DNA
T6408 4124-4137 GO_SO_EXT:sequence_rearrangement_process denotes recombination
T6409 4157-4166 GO:0007608 denotes olfactory
T6410 4157-4175 GO_EXT:0004984 denotes olfactory receptor
T6411 4176-4191 GO_EXT:transcription denotes transcriptional
T6412 4192-4202 GO:0065007 denotes regulation
T6413 4246-4254 SO_EXT:0000167 denotes promoter
T6414 4255-4264 SO_EXT:biological_sequence denotes sequences
T6415 4272-4279 _FRAGMENT denotes part of
T6416 4284-4294 SO_EXT:0000833 denotes transcript
R3817 T6252 T6251 _lexicallyChainedTo coding region,site within
R3818 T6308 T6307 _lexicallyChainedTo coding region,sites within
R3819 T6357 T6356 _lexicallyChainedTo UTR,sites within
R3820 T6366 T6365 _lexicallyChainedTo coding region,sites within
R3821 T6416 T6415 _lexicallyChainedTo transcript,part of

craft-ca-core-dev

Below, discontinuous spans are shown in the chain model. You can change it to the bag model.

Id Subject Object Predicate Lexical cue
T6133 12-21 GO:0007608 denotes olfactory
T6134 46-54 GO:0008380 denotes splicing
T6135 89-98 GO:0007608 denotes olfactory
T6136 131-140 GO:0007608 denotes olfactory
T6137 159-168 GO:0007608 denotes olfactory
T6138 178-188 SO:0000336 denotes pseudogene
T6139 197-203 GO:0008380 denotes splice
T6140 197-208 SO:0000162 denotes splice site
T6141 204-215 _FRAGMENT denotes site within
T6142 228-241 SO:0000851 denotes coding region
T6143 220-234 SO:0000010 denotes protein-coding
T6144 259-264 SO:0000704 denotes genes
T6145 338-347 GO:0007608 denotes olfactory
T6146 452-456 SO:0000147 denotes exon
T6147 468-472 SO:0000704 denotes gene
T6148 514-519 NCBITaxon:9606 denotes human
T6149 520-529 GO:0007608 denotes olfactory
T6150 562-567 NCBITaxon:10088 denotes mouse
T6151 568-573 SO:0000704 denotes genes
T6152 581-592 SO:0000318 denotes start codon
T6153 622-633 SO:0000195 denotes coding exon
T6154 647-655 GO:0008380 denotes splicing
T6155 709-713 SO:0000704 denotes gene
T6156 763-773 SO:0000673 denotes transcript
T6157 800-806 GO:0008380 denotes splice
T6158 818-828 SO:0000673 denotes transcript
T6159 888-894 GO:0008380 denotes splice
T6160 1017-1021 SO:0000147 denotes exon
T6161 1035-1045 SO:0000317 denotes cDNA clone
T6162 1085-1092 SO:0001026 denotes genomic
T6163 1251-1262 SO:0000673 denotes transcripts
T6164 1330-1338 GO:0008380 denotes splicing
T6165 1351-1358 SO:0001026 denotes genomic
T6166 1413-1422 GO:0007608 denotes olfactory
T6167 1432-1442 SO:0000673 denotes transcript
T6168 1472-1477 SO:0000704 denotes genes
T6169 1498-1505 GO:0008380 denotes spliced
T6170 1566-1573 SO:0001060 denotes isoform
T6171 1592-1598 GO:0008380 denotes splice
T6172 1592-1604 SO:0000162 denotes splice sites
T6173 1599-1611 _FRAGMENT denotes sites within
T6174 1616-1629 SO:0000851 denotes coding region
T6175 1663-1668 SO:0000704 denotes genes
T6176 1681-1689 GO:0008380 denotes splicing
T6177 1762-1770 SO:0001060 denotes isoforms
T6178 1831-1839 GO:0008380 denotes splicing
T6179 1843-1848 NCBITaxon:9606 denotes human
T6180 1849-1858 GO:0007608 denotes olfactory
T6181 1878-1910 GO:0042611 denotes major histocompatibility complex
T6182 1912-1915 GO:0042611 denotes MHC
T6183 1940-1945 SO:0000704 denotes genes
T6184 1946-1952 GO:0008380 denotes splice
T6185 1994-1998 SO:0000147 denotes exon
T6186 2028-2037 GO:0007608 denotes olfactory
T6187 2063-2070 GO:0065007 denotes control
T6188 2151-2162 SO:0000673 denotes transcripts
T6189 2266-2275 GO:0007608 denotes olfactory
T6190 2285-2288 SO:0000345 denotes EST
T6191 2315-2319 SO:0000704 denotes gene
T6192 2322-2329 SO:0001026 denotes genomic
T6193 2401-2405 SO:0000704 denotes gene
T6194 2424-2428 SO:0000704 denotes gene
T6195 2431-2441 SO:0000319 denotes stop codon
T6196 2468-2475 SO:0000188 denotes introns
T6197 2494-2503 GO:0007608 denotes olfactory
T6198 2513-2518 SO:0000704 denotes genes
T6199 2536-2545 GO:0007608 denotes olfactory
T6200 2563-2568 NCBITaxon:9606 denotes human
T6201 2569-2572 GO:0042611 denotes MHC
T6202 2631-2635 SO:0000704 denotes gene
T6203 2759-2765 GO:0008380 denotes splice
T6204 2759-2771 SO:0000162 denotes splice sites
T6205 2766-2778 _FRAGMENT denotes sites within
T6206 2786-2789 SO:0000837 denotes UTR
T6207 2783-2789 SO:0000205 denotes 3' UTR
T6208 2856-2866 SO:0000319 denotes stop codon
T6209 2901-2907 GO:0008380 denotes splice
T6210 2914-2926 _FRAGMENT denotes sites within
T6211 2931-2944 SO:0000851 denotes coding region
T6212 2966-2975 GO:0007608 denotes olfactory
T6213 3026-3032 SO:0000188 denotes intron
T6214 3036-3043 GO:0008380 denotes spliced
T6215 3055-3061 SO:0000205 denotes 3' UTR
T6216 3100-3109 GO:0007608 denotes olfactory
T6217 3135-3142 GO:0065007 denotes control
T6218 3199-3209 SO:0000673 denotes transcript
T6219 3238-3246 SO:0000167 denotes promoter
T6220 3276-3286 SO:0000673 denotes transcript
T6221 3310-3313 SO:0000236 denotes ORF
T6222 3398-3409 SO:0000317 denotes cDNA clones
T6223 3444-3446 SO:0000028 denotes bp
T6224 3476-3483 SO:0001026 denotes genomic
T6225 3544-3548 SO:0000730 denotes gaps
T6226 3556-3563 SO:0001026 denotes genomic
T6227 3698-3707 GO:0007608 denotes olfactory
T6228 3789-3795 SO:0001026 denotes genome
T6229 3824-3829 NCBITaxon:10088 denotes mouse
T6230 3830-3836 SO:0001026 denotes genome
T6231 4001-4021 SO:0000810 denotes chimeric cDNA clones
T6232 4157-4166 GO:0007608 denotes olfactory
T6233 4192-4202 GO:0065007 denotes regulation
T6234 4246-4254 SO:0000167 denotes promoter
T6235 4272-4279 _FRAGMENT denotes part of
T6236 4284-4294 SO:0000833 denotes transcript
R3812 T6142 T6141 _lexicallyChainedTo coding region,site within
R3813 T6174 T6173 _lexicallyChainedTo coding region,sites within
R3814 T6206 T6205 _lexicallyChainedTo UTR,sites within
R3815 T6211 T6210 _lexicallyChainedTo coding region,sites within
R3816 T6236 T6235 _lexicallyChainedTo transcript,part of

2_test

Id Subject Object Predicate Lexical cue
14611657-11689484-44684069 540-542 11689484 denotes 25
14611657-11282967-44684070 2000-2002 11282967 denotes 26
14611657-12637542-44684071 2003-2005 12637542 denotes 27
14611657-11282967-44684072 2581-2583 11282967 denotes 26
14611657-12637542-44684073 2584-2586 12637542 denotes 27