Id |
Subject |
Object |
Predicate |
Lexical cue |
T5507 |
0-11 |
JJ |
denotes |
Comparative |
T5508 |
20-29 |
NN |
denotes |
structure |
T5509 |
12-19 |
JJ |
denotes |
genomic |
T5510 |
30-32 |
IN |
denotes |
of |
T5511 |
33-36 |
DT |
denotes |
the |
T5512 |
42-48 |
NN |
denotes |
family |
T5513 |
37-41 |
NN |
denotes |
TACC |
T5514 |
48-304 |
sentence |
denotes |
The genomic DNA sequences corresponding to the orthologous TACC genes of human, mouse, rat, pufferfish, C. intestinalis, D. melanogaster and C. elegans were extracted and analyzed by Genescan and BLAST to determine the genomic structure of each TACC gene. |
T5515 |
49-52 |
DT |
denotes |
The |
T5516 |
65-74 |
NNS |
denotes |
sequences |
T5517 |
53-60 |
JJ |
denotes |
genomic |
T5518 |
61-64 |
NN |
denotes |
DNA |
T5519 |
206-215 |
VBN |
denotes |
extracted |
T5520 |
75-88 |
VBG |
denotes |
corresponding |
T5521 |
89-91 |
IN |
denotes |
to |
T5522 |
92-95 |
DT |
denotes |
the |
T5523 |
113-118 |
NNS |
denotes |
genes |
T5524 |
96-107 |
JJ |
denotes |
orthologous |
T5525 |
108-112 |
NN |
denotes |
TACC |
T5526 |
119-121 |
IN |
denotes |
of |
T5527 |
122-127 |
JJ |
denotes |
human |
T5528 |
127-129 |
, |
denotes |
, |
T5529 |
129-134 |
NN |
denotes |
mouse |
T5530 |
134-136 |
, |
denotes |
, |
T5531 |
136-139 |
NN |
denotes |
rat |
T5532 |
139-141 |
, |
denotes |
, |
T5533 |
141-151 |
NN |
denotes |
pufferfish |
T5534 |
151-153 |
, |
denotes |
, |
T5535 |
153-155 |
NNP |
denotes |
C. |
T5536 |
156-168 |
NNP |
denotes |
intestinalis |
T5537 |
168-170 |
, |
denotes |
, |
T5538 |
170-172 |
NNP |
denotes |
D. |
T5539 |
173-185 |
NNP |
denotes |
melanogaster |
T5540 |
186-189 |
CC |
denotes |
and |
T5541 |
190-192 |
NNP |
denotes |
C. |
T5542 |
193-200 |
NNP |
denotes |
elegans |
T5543 |
201-205 |
VBD |
denotes |
were |
T5544 |
216-219 |
CC |
denotes |
and |
T5545 |
220-228 |
VBN |
denotes |
analyzed |
T5546 |
229-231 |
IN |
denotes |
by |
T5547 |
232-240 |
NNP |
denotes |
Genescan |
T5548 |
241-244 |
CC |
denotes |
and |
T5549 |
245-250 |
NNP |
denotes |
BLAST |
T5550 |
251-253 |
TO |
denotes |
to |
T5551 |
254-263 |
VB |
denotes |
determine |
T5552 |
264-267 |
DT |
denotes |
the |
T5553 |
276-285 |
NN |
denotes |
structure |
T5554 |
268-275 |
JJ |
denotes |
genomic |
T5555 |
286-288 |
IN |
denotes |
of |
T5556 |
289-293 |
DT |
denotes |
each |
T5557 |
299-303 |
NN |
denotes |
gene |
T5558 |
294-298 |
NN |
denotes |
TACC |
T5559 |
303-304 |
. |
denotes |
. |
T5560 |
304-471 |
sentence |
denotes |
In some cases, for rat and pufferfish, exons were added or modified based on the best similarity of translated peptides to the corresponding mouse and human proteins. |
T5561 |
305-307 |
IN |
denotes |
In |
T5562 |
355-360 |
VBN |
denotes |
added |
T5563 |
308-312 |
DT |
denotes |
some |
T5564 |
313-318 |
NNS |
denotes |
cases |
T5565 |
318-320 |
, |
denotes |
, |
T5566 |
320-323 |
IN |
denotes |
for |
T5567 |
324-327 |
NN |
denotes |
rat |
T5568 |
328-331 |
CC |
denotes |
and |
T5569 |
332-342 |
NN |
denotes |
pufferfish |
T5570 |
342-344 |
, |
denotes |
, |
T5571 |
344-349 |
NNS |
denotes |
exons |
T5572 |
350-354 |
VBD |
denotes |
were |
T5573 |
361-363 |
CC |
denotes |
or |
T5574 |
364-372 |
VBN |
denotes |
modified |
T5575 |
373-378 |
VBN |
denotes |
based |
T5576 |
379-381 |
IN |
denotes |
on |
T5577 |
382-385 |
DT |
denotes |
the |
T5578 |
391-401 |
NN |
denotes |
similarity |
T5579 |
386-390 |
JJS |
denotes |
best |
T5580 |
402-404 |
IN |
denotes |
of |
T5581 |
405-415 |
VBN |
denotes |
translated |
T5582 |
416-424 |
NNS |
denotes |
peptides |
T5583 |
425-427 |
IN |
denotes |
to |
T5584 |
428-431 |
DT |
denotes |
the |
T5585 |
462-470 |
NN |
denotes |
proteins |
T5586 |
432-445 |
VBG |
denotes |
corresponding |
T5587 |
446-451 |
NN |
denotes |
mouse |
T5588 |
452-455 |
CC |
denotes |
and |
T5589 |
456-461 |
JJ |
denotes |
human |
T5590 |
470-471 |
. |
denotes |
. |
T5591 |
471-664 |
sentence |
denotes |
For regions with low sequence similarity in T. rubripes, genomic sequences from the fresh water pufferfish, Tetraodon nigroviridis were used as additional means to verify the predicted exons. |
T5592 |
472-475 |
IN |
denotes |
For |
T5593 |
609-613 |
VBN |
denotes |
used |
T5594 |
476-483 |
NNS |
denotes |
regions |
T5595 |
484-488 |
IN |
denotes |
with |
T5596 |
489-492 |
JJ |
denotes |
low |
T5597 |
502-512 |
NN |
denotes |
similarity |
T5598 |
493-501 |
NN |
denotes |
sequence |
T5599 |
513-515 |
IN |
denotes |
in |
T5600 |
516-518 |
NNP |
denotes |
T. |
T5601 |
519-527 |
NNP |
denotes |
rubripes |
T5602 |
527-529 |
, |
denotes |
, |
T5603 |
529-536 |
JJ |
denotes |
genomic |
T5604 |
537-546 |
NNS |
denotes |
sequences |
T5605 |
548-552 |
IN |
denotes |
from |
T5606 |
553-556 |
DT |
denotes |
the |
T5607 |
569-579 |
NN |
denotes |
pufferfish |
T5608 |
557-562 |
JJ |
denotes |
fresh |
T5609 |
563-568 |
NN |
denotes |
water |
T5610 |
579-581 |
, |
denotes |
, |
T5611 |
581-590 |
NNP |
denotes |
Tetraodon |
T5612 |
591-603 |
NNP |
denotes |
nigroviridis |
T5613 |
604-608 |
VBD |
denotes |
were |
T5614 |
614-616 |
IN |
denotes |
as |
T5615 |
617-627 |
JJ |
denotes |
additional |
T5616 |
628-633 |
NNS |
denotes |
means |
T5617 |
634-636 |
TO |
denotes |
to |
T5618 |
637-643 |
VB |
denotes |
verify |
T5619 |
644-647 |
DT |
denotes |
the |
T5620 |
658-663 |
NNS |
denotes |
exons |
T5621 |
648-657 |
VBN |
denotes |
predicted |
T5622 |
663-664 |
. |
denotes |
. |
T5623 |
664-740 |
sentence |
denotes |
The general structure of the TACC genes and proteins is depicted in Fig. 4. |
T5624 |
665-668 |
DT |
denotes |
The |
T5625 |
677-686 |
NN |
denotes |
structure |
T5626 |
669-676 |
JJ |
denotes |
general |
T5627 |
721-729 |
VBN |
denotes |
depicted |
T5628 |
687-689 |
IN |
denotes |
of |
T5629 |
690-693 |
DT |
denotes |
the |
T5630 |
699-704 |
NNS |
denotes |
genes |
T5631 |
694-698 |
NN |
denotes |
TACC |
T5632 |
705-708 |
CC |
denotes |
and |
T5633 |
709-717 |
NN |
denotes |
proteins |
T5634 |
718-720 |
VBZ |
denotes |
is |
T5635 |
730-732 |
IN |
denotes |
in |
T5636 |
733-737 |
NN |
denotes |
Fig. |
T5637 |
738-739 |
CD |
denotes |
4 |
T5638 |
739-740 |
. |
denotes |
. |
T5639 |
740-855 |
sentence |
denotes |
The main conserved feature of the TACC family, the TACC domain, is located at the carboxy terminus of the protein. |
T5640 |
741-744 |
DT |
denotes |
The |
T5641 |
760-767 |
NN |
denotes |
feature |
T5642 |
745-749 |
JJ |
denotes |
main |
T5643 |
750-759 |
VBN |
denotes |
conserved |
T5644 |
808-815 |
VBN |
denotes |
located |
T5645 |
768-770 |
IN |
denotes |
of |
T5646 |
771-774 |
DT |
denotes |
the |
T5647 |
780-786 |
NN |
denotes |
family |
T5648 |
775-779 |
NN |
denotes |
TACC |
T5649 |
786-788 |
, |
denotes |
, |
T5650 |
788-791 |
DT |
denotes |
the |
T5651 |
797-803 |
NN |
denotes |
domain |
T5652 |
792-796 |
NN |
denotes |
TACC |
T5653 |
803-805 |
, |
denotes |
, |
T5654 |
805-807 |
VBZ |
denotes |
is |
T5655 |
816-818 |
IN |
denotes |
at |
T5656 |
819-822 |
DT |
denotes |
the |
T5657 |
831-839 |
NN |
denotes |
terminus |
T5658 |
823-830 |
NN |
denotes |
carboxy |
T5659 |
840-842 |
IN |
denotes |
of |
T5660 |
843-846 |
DT |
denotes |
the |
T5661 |
847-854 |
NN |
denotes |
protein |
T5662 |
854-855 |
. |
denotes |
. |
T5663 |
855-1005 |
sentence |
denotes |
In the case of the C. elegans TAC protein, this structure comprises the majority of the protein and is encoded by two of the three exons of the gene. |
T5664 |
856-858 |
IN |
denotes |
In |
T5665 |
914-923 |
VBZ |
denotes |
comprises |
T5666 |
859-862 |
DT |
denotes |
the |
T5667 |
863-867 |
NN |
denotes |
case |
T5668 |
868-870 |
IN |
denotes |
of |
T5669 |
871-874 |
DT |
denotes |
the |
T5670 |
890-897 |
NN |
denotes |
protein |
T5671 |
875-877 |
NNP |
denotes |
C. |
T5672 |
878-885 |
NNP |
denotes |
elegans |
T5673 |
886-889 |
NN |
denotes |
TAC |
T5674 |
897-899 |
, |
denotes |
, |
T5675 |
899-903 |
DT |
denotes |
this |
T5676 |
904-913 |
NN |
denotes |
structure |
T5677 |
924-927 |
DT |
denotes |
the |
T5678 |
928-936 |
NN |
denotes |
majority |
T5679 |
937-939 |
IN |
denotes |
of |
T5680 |
940-943 |
DT |
denotes |
the |
T5681 |
944-951 |
NN |
denotes |
protein |
T5682 |
952-955 |
CC |
denotes |
and |
T5683 |
956-958 |
VBZ |
denotes |
is |
T5684 |
959-966 |
VBN |
denotes |
encoded |
T5685 |
967-969 |
IN |
denotes |
by |
T5686 |
970-973 |
CD |
denotes |
two |
T5687 |
974-976 |
IN |
denotes |
of |
T5688 |
977-980 |
DT |
denotes |
the |
T5689 |
987-992 |
NNS |
denotes |
exons |
T5690 |
981-986 |
CD |
denotes |
three |
T5691 |
993-995 |
IN |
denotes |
of |
T5692 |
996-999 |
DT |
denotes |
the |
T5693 |
1000-1004 |
NN |
denotes |
gene |
T5694 |
1004-1005 |
. |
denotes |
. |
T5695 |
1005-1215 |
sentence |
denotes |
In the higher organisms, D. melanogaster, and the deuterostomes C. intestinalis to human, this feature is also encoded by the final exons of the gene (five in D. melanogaster, seven in the deuterostome genes). |
T5696 |
1006-1008 |
IN |
denotes |
In |
T5697 |
1117-1124 |
VBN |
denotes |
encoded |
T5698 |
1009-1012 |
DT |
denotes |
the |
T5699 |
1020-1029 |
NNS |
denotes |
organisms |
T5700 |
1013-1019 |
JJR |
denotes |
higher |
T5701 |
1029-1031 |
, |
denotes |
, |
T5702 |
1031-1033 |
NNP |
denotes |
D. |
T5703 |
1034-1046 |
NNP |
denotes |
melanogaster |
T5704 |
1046-1048 |
, |
denotes |
, |
T5705 |
1048-1051 |
CC |
denotes |
and |
T5706 |
1052-1055 |
DT |
denotes |
the |
T5707 |
1073-1085 |
NNP |
denotes |
intestinalis |
T5708 |
1056-1069 |
NNS |
denotes |
deuterostomes |
T5709 |
1070-1072 |
NNP |
denotes |
C. |
T5710 |
1086-1088 |
IN |
denotes |
to |
T5711 |
1089-1094 |
JJ |
denotes |
human |
T5712 |
1094-1096 |
, |
denotes |
, |
T5713 |
1096-1100 |
DT |
denotes |
this |
T5714 |
1101-1108 |
NN |
denotes |
feature |
T5715 |
1109-1111 |
VBZ |
denotes |
is |
T5716 |
1112-1116 |
RB |
denotes |
also |
T5717 |
1125-1127 |
IN |
denotes |
by |
T5718 |
1128-1131 |
DT |
denotes |
the |
T5719 |
1138-1143 |
NNS |
denotes |
exons |
T5720 |
1132-1137 |
JJ |
denotes |
final |
T5721 |
1144-1146 |
IN |
denotes |
of |
T5722 |
1147-1150 |
DT |
denotes |
the |
T5723 |
1151-1155 |
NN |
denotes |
gene |
T5724 |
1156-1157 |
-LRB- |
denotes |
( |
T5725 |
1157-1161 |
CD |
denotes |
five |
T5726 |
1162-1164 |
IN |
denotes |
in |
T5727 |
1165-1167 |
NNP |
denotes |
D. |
T5728 |
1168-1180 |
NNP |
denotes |
melanogaster |
T5729 |
1180-1182 |
, |
denotes |
, |
T5730 |
1182-1187 |
CD |
denotes |
seven |
T5731 |
1188-1190 |
IN |
denotes |
in |
T5732 |
1191-1194 |
DT |
denotes |
the |
T5733 |
1208-1213 |
NNS |
denotes |
genes |
T5734 |
1195-1207 |
NN |
denotes |
deuterostome |
T5735 |
1213-1214 |
-RRB- |
denotes |
) |
T5736 |
1214-1215 |
. |
denotes |
. |
T5737 |
1215-1305 |
sentence |
denotes |
Outside of the TACC domain, however, TACC family members show relatively little homology. |
T5738 |
1216-1223 |
IN |
denotes |
Outside |
T5739 |
1273-1277 |
VBP |
denotes |
show |
T5740 |
1224-1226 |
IN |
denotes |
of |
T5741 |
1227-1230 |
DT |
denotes |
the |
T5742 |
1236-1242 |
NN |
denotes |
domain |
T5743 |
1231-1235 |
NN |
denotes |
TACC |
T5744 |
1242-1244 |
, |
denotes |
, |
T5745 |
1244-1251 |
RB |
denotes |
however |
T5746 |
1251-1253 |
, |
denotes |
, |
T5747 |
1253-1257 |
NN |
denotes |
TACC |
T5748 |
1265-1272 |
NNS |
denotes |
members |
T5749 |
1258-1264 |
NN |
denotes |
family |
T5750 |
1278-1288 |
RB |
denotes |
relatively |
T5751 |
1289-1295 |
JJ |
denotes |
little |
T5752 |
1296-1304 |
NN |
denotes |
homology |
T5753 |
1304-1305 |
. |
denotes |
. |
T5754 |
1305-1523 |
sentence |
denotes |
It is interesting that each TACC gene contains one large exon, which shows considerable variability between TACC orthologues, and constitutes the main difference between the TACC3 genes in the vertebrates (see below). |
T5755 |
1306-1308 |
PRP |
denotes |
It |
T5756 |
1309-1311 |
VBZ |
denotes |
is |
T5757 |
1312-1323 |
JJ |
denotes |
interesting |
T5758 |
1324-1328 |
IN |
denotes |
that |
T5759 |
1344-1352 |
VBZ |
denotes |
contains |
T5760 |
1329-1333 |
DT |
denotes |
each |
T5761 |
1339-1343 |
NN |
denotes |
gene |
T5762 |
1334-1338 |
NN |
denotes |
TACC |
T5763 |
1353-1356 |
CD |
denotes |
one |
T5764 |
1363-1367 |
NN |
denotes |
exon |
T5765 |
1357-1362 |
JJ |
denotes |
large |
T5766 |
1367-1369 |
, |
denotes |
, |
T5767 |
1369-1374 |
WDT |
denotes |
which |
T5768 |
1375-1380 |
VBZ |
denotes |
shows |
T5769 |
1381-1393 |
JJ |
denotes |
considerable |
T5770 |
1394-1405 |
NN |
denotes |
variability |
T5771 |
1406-1413 |
IN |
denotes |
between |
T5772 |
1414-1418 |
NN |
denotes |
TACC |
T5773 |
1419-1430 |
NNS |
denotes |
orthologues |
T5774 |
1430-1432 |
, |
denotes |
, |
T5775 |
1432-1435 |
CC |
denotes |
and |
T5776 |
1436-1447 |
VBZ |
denotes |
constitutes |
T5777 |
1448-1451 |
DT |
denotes |
the |
T5778 |
1457-1467 |
NN |
denotes |
difference |
T5779 |
1452-1456 |
JJ |
denotes |
main |
T5780 |
1468-1475 |
IN |
denotes |
between |
T5781 |
1476-1479 |
DT |
denotes |
the |
T5782 |
1486-1491 |
NNS |
denotes |
genes |
T5783 |
1480-1485 |
NN |
denotes |
TACC3 |
T5784 |
1492-1494 |
IN |
denotes |
in |
T5785 |
1495-1498 |
DT |
denotes |
the |
T5786 |
1499-1510 |
NNS |
denotes |
vertebrates |
T5787 |
1511-1512 |
-LRB- |
denotes |
( |
T5788 |
1512-1515 |
VB |
denotes |
see |
T5789 |
1516-1521 |
RB |
denotes |
below |
T5790 |
1521-1522 |
-RRB- |
denotes |
) |
T5791 |
1522-1523 |
. |
denotes |
. |
T5792 |
1523-1761 |
sentence |
denotes |
In deuterostomes, this exon contains the SDP repeat (or in the case of the murine TACC3's, a rodent-specific 24 amino acid repeat), which is responsible for the binding of the SWI/SNF chromatin remodeling complex component GAS41 [15,16]. |
T5793 |
1524-1526 |
IN |
denotes |
In |
T5794 |
1552-1560 |
VBZ |
denotes |
contains |
T5795 |
1527-1540 |
NNS |
denotes |
deuterostomes |
T5796 |
1540-1542 |
, |
denotes |
, |
T5797 |
1542-1546 |
DT |
denotes |
this |
T5798 |
1547-1551 |
NN |
denotes |
exon |
T5799 |
1561-1564 |
DT |
denotes |
the |
T5800 |
1569-1575 |
NN |
denotes |
repeat |
T5801 |
1565-1568 |
NN |
denotes |
SDP |
T5802 |
1576-1577 |
-LRB- |
denotes |
( |
T5803 |
1577-1579 |
CC |
denotes |
or |
T5804 |
1580-1582 |
IN |
denotes |
in |
T5805 |
1647-1653 |
NN |
denotes |
repeat |
T5806 |
1583-1586 |
DT |
denotes |
the |
T5807 |
1587-1591 |
NN |
denotes |
case |
T5808 |
1592-1594 |
IN |
denotes |
of |
T5809 |
1595-1598 |
DT |
denotes |
the |
T5810 |
1606-1611 |
NN |
denotes |
TACC3 |
T5811 |
1599-1605 |
JJ |
denotes |
murine |
T5812 |
1611-1613 |
POS |
denotes |
's |
T5813 |
1613-1615 |
, |
denotes |
, |
T5814 |
1615-1616 |
DT |
denotes |
a |
T5815 |
1617-1623 |
NN |
denotes |
rodent |
T5816 |
1624-1632 |
JJ |
denotes |
specific |
T5817 |
1623-1624 |
HYPH |
denotes |
- |
T5818 |
1633-1635 |
CD |
denotes |
24 |
T5819 |
1642-1646 |
NN |
denotes |
acid |
T5820 |
1636-1641 |
NN |
denotes |
amino |
T5821 |
1653-1654 |
-RRB- |
denotes |
) |
T5822 |
1654-1656 |
, |
denotes |
, |
T5823 |
1656-1661 |
WDT |
denotes |
which |
T5824 |
1662-1664 |
VBZ |
denotes |
is |
T5825 |
1665-1676 |
JJ |
denotes |
responsible |
T5826 |
1677-1680 |
IN |
denotes |
for |
T5827 |
1681-1684 |
DT |
denotes |
the |
T5828 |
1685-1692 |
NN |
denotes |
binding |
T5829 |
1693-1695 |
IN |
denotes |
of |
T5830 |
1696-1699 |
DT |
denotes |
the |
T5831 |
1737-1746 |
NN |
denotes |
component |
T5832 |
1700-1703 |
NN |
denotes |
SWI |
T5833 |
1704-1707 |
NN |
denotes |
SNF |
T5834 |
1703-1704 |
HYPH |
denotes |
/ |
T5835 |
1708-1717 |
NN |
denotes |
chromatin |
T5836 |
1718-1728 |
NN |
denotes |
remodeling |
T5837 |
1729-1736 |
JJ |
denotes |
complex |
T5838 |
1747-1752 |
NN |
denotes |
GAS41 |
T5839 |
1753-1754 |
-LRB- |
denotes |
[ |
T5840 |
1757-1759 |
CD |
denotes |
16 |
T5841 |
1754-1756 |
CD |
denotes |
15 |
T5842 |
1756-1757 |
, |
denotes |
, |
T5843 |
1759-1760 |
-RRB- |
denotes |
] |
T5844 |
1760-1761 |
. |
denotes |
. |
T5845 |
1761-1979 |
sentence |
denotes |
Of the vertebrate TACC proteins, the TACC3 orthologues show the greatest variability in size and sequence, ranging in size from 599 amino acids for the rat TACC3 protein, to 942 amino acids in the Danio rerio protein. |
T5846 |
1762-1764 |
IN |
denotes |
Of |
T5847 |
1817-1821 |
VBP |
denotes |
show |
T5848 |
1765-1768 |
DT |
denotes |
the |
T5849 |
1785-1793 |
NN |
denotes |
proteins |
T5850 |
1769-1779 |
NN |
denotes |
vertebrate |
T5851 |
1780-1784 |
NN |
denotes |
TACC |
T5852 |
1793-1795 |
, |
denotes |
, |
T5853 |
1795-1798 |
DT |
denotes |
the |
T5854 |
1805-1816 |
NNS |
denotes |
orthologues |
T5855 |
1799-1804 |
NN |
denotes |
TACC3 |
T5856 |
1822-1825 |
DT |
denotes |
the |
T5857 |
1835-1846 |
NN |
denotes |
variability |
T5858 |
1826-1834 |
JJS |
denotes |
greatest |
T5859 |
1847-1849 |
IN |
denotes |
in |
T5860 |
1850-1854 |
NN |
denotes |
size |
T5861 |
1855-1858 |
CC |
denotes |
and |
T5862 |
1859-1867 |
NN |
denotes |
sequence |
T5863 |
1867-1869 |
, |
denotes |
, |
T5864 |
1869-1876 |
VBG |
denotes |
ranging |
T5865 |
1877-1879 |
IN |
denotes |
in |
T5866 |
1880-1884 |
NN |
denotes |
size |
T5867 |
1885-1889 |
IN |
denotes |
from |
T5868 |
1890-1893 |
CD |
denotes |
599 |
T5869 |
1900-1905 |
NNS |
denotes |
acids |
T5870 |
1894-1899 |
NN |
denotes |
amino |
T5871 |
1906-1909 |
IN |
denotes |
for |
T5872 |
1910-1913 |
DT |
denotes |
the |
T5873 |
1924-1931 |
NN |
denotes |
protein |
T5874 |
1914-1917 |
NN |
denotes |
rat |
T5875 |
1918-1923 |
NN |
denotes |
TACC3 |
T5876 |
1931-1933 |
, |
denotes |
, |
T5877 |
1933-1935 |
IN |
denotes |
to |
T5878 |
1936-1939 |
CD |
denotes |
942 |
T5879 |
1946-1951 |
NNS |
denotes |
acids |
T5880 |
1940-1945 |
NN |
denotes |
amino |
T5881 |
1952-1954 |
IN |
denotes |
in |
T5882 |
1955-1958 |
DT |
denotes |
the |
T5883 |
1971-1978 |
NN |
denotes |
protein |
T5884 |
1959-1964 |
NNP |
denotes |
Danio |
T5885 |
1965-1970 |
NNP |
denotes |
rerio |
T5886 |
1978-1979 |
. |
denotes |
. |
T5887 |
1979-2079 |
sentence |
denotes |
The reasons for these differences are apparent from the genomic structure of the TACC3 orthologues. |
T5888 |
1980-1983 |
DT |
denotes |
The |
T5889 |
1984-1991 |
NNS |
denotes |
reasons |
T5890 |
2014-2017 |
VBP |
denotes |
are |
T5891 |
1992-1995 |
IN |
denotes |
for |
T5892 |
1996-2001 |
DT |
denotes |
these |
T5893 |
2002-2013 |
NNS |
denotes |
differences |
T5894 |
2018-2026 |
JJ |
denotes |
apparent |
T5895 |
2027-2031 |
IN |
denotes |
from |
T5896 |
2032-2035 |
DT |
denotes |
the |
T5897 |
2044-2053 |
NN |
denotes |
structure |
T5898 |
2036-2043 |
JJ |
denotes |
genomic |
T5899 |
2054-2056 |
IN |
denotes |
of |
T5900 |
2057-2060 |
DT |
denotes |
the |
T5901 |
2067-2078 |
NNS |
denotes |
orthologues |
T5902 |
2061-2066 |
NN |
denotes |
TACC3 |
T5903 |
2078-2079 |
. |
denotes |
. |
T5904 |
2079-2338 |
sentence |
denotes |
TACC3 can be divided into three sections: a conserved N-terminal region (CNTR) of 108 amino acids, encoded by exons 2 and 3 in each vertebrate TACC3 gene, the conserved TACC domain distributed over the final seven exons, and a highly variable central region. |
T5905 |
2080-2085 |
NN |
denotes |
TACC3 |
T5906 |
2093-2100 |
VBN |
denotes |
divided |
T5907 |
2086-2089 |
MD |
denotes |
can |
T5908 |
2090-2092 |
VB |
denotes |
be |
T5909 |
2101-2105 |
IN |
denotes |
into |
T5910 |
2106-2111 |
CD |
denotes |
three |
T5911 |
2112-2120 |
NNS |
denotes |
sections |
T5912 |
2120-2122 |
: |
denotes |
: |
T5913 |
2122-2123 |
DT |
denotes |
a |
T5914 |
2145-2151 |
NN |
denotes |
region |
T5915 |
2124-2133 |
VBN |
denotes |
conserved |
T5916 |
2134-2135 |
NN |
denotes |
N |
T5917 |
2136-2144 |
JJ |
denotes |
terminal |
T5918 |
2135-2136 |
HYPH |
denotes |
- |
T5919 |
2152-2153 |
-LRB- |
denotes |
( |
T5920 |
2153-2157 |
NN |
denotes |
CNTR |
T5921 |
2157-2158 |
-RRB- |
denotes |
) |
T5922 |
2159-2161 |
IN |
denotes |
of |
T5923 |
2162-2165 |
CD |
denotes |
108 |
T5924 |
2172-2177 |
NNS |
denotes |
acids |
T5925 |
2166-2171 |
NN |
denotes |
amino |
T5926 |
2177-2179 |
, |
denotes |
, |
T5927 |
2179-2186 |
VBN |
denotes |
encoded |
T5928 |
2187-2189 |
IN |
denotes |
by |
T5929 |
2190-2195 |
NNS |
denotes |
exons |
T5930 |
2196-2197 |
CD |
denotes |
2 |
T5931 |
2198-2201 |
CC |
denotes |
and |
T5932 |
2202-2203 |
CD |
denotes |
3 |
T5933 |
2204-2206 |
IN |
denotes |
in |
T5934 |
2207-2211 |
DT |
denotes |
each |
T5935 |
2229-2233 |
NN |
denotes |
gene |
T5936 |
2212-2222 |
NN |
denotes |
vertebrate |
T5937 |
2223-2228 |
NN |
denotes |
TACC3 |
T5938 |
2233-2235 |
, |
denotes |
, |
T5939 |
2235-2238 |
DT |
denotes |
the |
T5940 |
2254-2260 |
NN |
denotes |
domain |
T5941 |
2239-2248 |
VBN |
denotes |
conserved |
T5942 |
2249-2253 |
NN |
denotes |
TACC |
T5943 |
2261-2272 |
VBN |
denotes |
distributed |
T5944 |
2273-2277 |
IN |
denotes |
over |
T5945 |
2278-2281 |
DT |
denotes |
the |
T5946 |
2294-2299 |
NNS |
denotes |
exons |
T5947 |
2282-2287 |
JJ |
denotes |
final |
T5948 |
2288-2293 |
CD |
denotes |
seven |
T5949 |
2299-2301 |
, |
denotes |
, |
T5950 |
2301-2304 |
CC |
denotes |
and |
T5951 |
2305-2306 |
DT |
denotes |
a |
T5952 |
2331-2337 |
NN |
denotes |
region |
T5953 |
2307-2313 |
RB |
denotes |
highly |
T5954 |
2314-2322 |
JJ |
denotes |
variable |
T5955 |
2323-2330 |
JJ |
denotes |
central |
T5956 |
2337-2338 |
. |
denotes |
. |
T5957 |
2338-2553 |
sentence |
denotes |
The lack of conservation in both size and sequence of the central portion of the TACC3 proteins of human and mouse has been previously noted, and accounts for the major difference between these two orthologues [2]. |
T5958 |
2339-2342 |
DT |
denotes |
The |
T5959 |
2343-2347 |
NN |
denotes |
lack |
T5960 |
2474-2479 |
VBN |
denotes |
noted |
T5961 |
2348-2350 |
IN |
denotes |
of |
T5962 |
2351-2363 |
NN |
denotes |
conservation |
T5963 |
2364-2366 |
IN |
denotes |
in |
T5964 |
2367-2371 |
CC |
denotes |
both |
T5965 |
2372-2376 |
NN |
denotes |
size |
T5966 |
2377-2380 |
CC |
denotes |
and |
T5967 |
2381-2389 |
NN |
denotes |
sequence |
T5968 |
2390-2392 |
IN |
denotes |
of |
T5969 |
2393-2396 |
DT |
denotes |
the |
T5970 |
2405-2412 |
NN |
denotes |
portion |
T5971 |
2397-2404 |
JJ |
denotes |
central |
T5972 |
2413-2415 |
IN |
denotes |
of |
T5973 |
2416-2419 |
DT |
denotes |
the |
T5974 |
2426-2434 |
NN |
denotes |
proteins |
T5975 |
2420-2425 |
NN |
denotes |
TACC3 |
T5976 |
2435-2437 |
IN |
denotes |
of |
T5977 |
2438-2443 |
JJ |
denotes |
human |
T5978 |
2448-2453 |
NN |
denotes |
mouse |
T5979 |
2444-2447 |
CC |
denotes |
and |
T5980 |
2454-2457 |
VBZ |
denotes |
has |
T5981 |
2458-2462 |
VBN |
denotes |
been |
T5982 |
2463-2473 |
RB |
denotes |
previously |
T5983 |
2479-2481 |
, |
denotes |
, |
T5984 |
2481-2484 |
CC |
denotes |
and |
T5985 |
2485-2493 |
VBZ |
denotes |
accounts |
T5986 |
2494-2497 |
IN |
denotes |
for |
T5987 |
2498-2501 |
DT |
denotes |
the |
T5988 |
2508-2518 |
NN |
denotes |
difference |
T5989 |
2502-2507 |
JJ |
denotes |
major |
T5990 |
2519-2526 |
IN |
denotes |
between |
T5991 |
2527-2532 |
DT |
denotes |
these |
T5992 |
2537-2548 |
NNS |
denotes |
orthologues |
T5993 |
2533-2536 |
CD |
denotes |
two |
T5994 |
2549-2550 |
-LRB- |
denotes |
[ |
T5995 |
2550-2551 |
CD |
denotes |
2 |
T5996 |
2551-2552 |
-RRB- |
denotes |
] |
T5997 |
2552-2553 |
. |
denotes |
. |
T5998 |
2553-2700 |
sentence |
denotes |
The majority of this central portion, which contains the SDP repeat motifs, is encoded by one exon in human and the pufferfish (emb|CAAB01001184). |
T5999 |
2554-2557 |
DT |
denotes |
The |
T6000 |
2558-2566 |
NN |
denotes |
majority |
T6001 |
2633-2640 |
VBN |
denotes |
encoded |
T6002 |
2567-2569 |
IN |
denotes |
of |
T6003 |
2570-2574 |
DT |
denotes |
this |
T6004 |
2583-2590 |
NN |
denotes |
portion |
T6005 |
2575-2582 |
JJ |
denotes |
central |
T6006 |
2590-2592 |
, |
denotes |
, |
T6007 |
2592-2597 |
WDT |
denotes |
which |
T6008 |
2598-2606 |
VBZ |
denotes |
contains |
T6009 |
2607-2610 |
DT |
denotes |
the |
T6010 |
2622-2628 |
NNS |
denotes |
motifs |
T6011 |
2611-2614 |
NN |
denotes |
SDP |
T6012 |
2615-2621 |
NN |
denotes |
repeat |
T6013 |
2628-2630 |
, |
denotes |
, |
T6014 |
2630-2632 |
VBZ |
denotes |
is |
T6015 |
2641-2643 |
IN |
denotes |
by |
T6016 |
2644-2647 |
CD |
denotes |
one |
T6017 |
2648-2652 |
NN |
denotes |
exon |
T6018 |
2653-2655 |
IN |
denotes |
in |
T6019 |
2656-2661 |
JJ |
denotes |
human |
T6020 |
2662-2665 |
CC |
denotes |
and |
T6021 |
2666-2669 |
DT |
denotes |
the |
T6022 |
2670-2680 |
NN |
denotes |
pufferfish |
T6023 |
2681-2682 |
-LRB- |
denotes |
( |
T6024 |
2682-2698 |
NN |
denotes |
emb|CAAB01001184 |
T6025 |
2698-2699 |
-RRB- |
denotes |
) |
T6026 |
2699-2700 |
. |
denotes |
. |
T6027 |
2700-2862 |
sentence |
denotes |
In rodents, however, this region is almost entirely composed of seven 24 amino acid repeats, which are located in a single exon of the mouse and rat TACC3 genes. |
T6028 |
2701-2703 |
IN |
denotes |
In |
T6029 |
2753-2761 |
VBN |
denotes |
composed |
T6030 |
2704-2711 |
NNS |
denotes |
rodents |
T6031 |
2711-2713 |
, |
denotes |
, |
T6032 |
2713-2720 |
RB |
denotes |
however |
T6033 |
2720-2722 |
, |
denotes |
, |
T6034 |
2722-2726 |
DT |
denotes |
this |
T6035 |
2727-2733 |
NN |
denotes |
region |
T6036 |
2734-2736 |
VBZ |
denotes |
is |
T6037 |
2737-2743 |
RB |
denotes |
almost |
T6038 |
2744-2752 |
RB |
denotes |
entirely |
T6039 |
2762-2764 |
IN |
denotes |
of |
T6040 |
2765-2770 |
CD |
denotes |
seven |
T6041 |
2785-2792 |
NNS |
denotes |
repeats |
T6042 |
2771-2773 |
CD |
denotes |
24 |
T6043 |
2780-2784 |
NN |
denotes |
acid |
T6044 |
2774-2779 |
NN |
denotes |
amino |
T6045 |
2792-2794 |
, |
denotes |
, |
T6046 |
2794-2799 |
WDT |
denotes |
which |
T6047 |
2804-2811 |
VBN |
denotes |
located |
T6048 |
2800-2803 |
VBP |
denotes |
are |
T6049 |
2812-2814 |
IN |
denotes |
in |
T6050 |
2815-2816 |
DT |
denotes |
a |
T6051 |
2824-2828 |
NN |
denotes |
exon |
T6052 |
2817-2823 |
JJ |
denotes |
single |
T6053 |
2829-2831 |
IN |
denotes |
of |
T6054 |
2832-2835 |
DT |
denotes |
the |
T6055 |
2856-2861 |
NNS |
denotes |
genes |
T6056 |
2836-2841 |
NN |
denotes |
mouse |
T6057 |
2842-2845 |
CC |
denotes |
and |
T6058 |
2846-2849 |
NN |
denotes |
rat |
T6059 |
2850-2855 |
NN |
denotes |
TACC3 |
T6060 |
2861-2862 |
. |
denotes |
. |
T6061 |
2862-2995 |
sentence |
denotes |
It has been previously reported that there are four mouse TACC3 splice variants that differ in the number of these repeats [2,7,17]. |
T6062 |
2863-2865 |
PRP |
denotes |
It |
T6063 |
2886-2894 |
VBN |
denotes |
reported |
T6064 |
2866-2869 |
VBZ |
denotes |
has |
T6065 |
2870-2874 |
VBN |
denotes |
been |
T6066 |
2875-2885 |
RB |
denotes |
previously |
T6067 |
2895-2899 |
IN |
denotes |
that |
T6068 |
2906-2909 |
VBP |
denotes |
are |
T6069 |
2900-2905 |
EX |
denotes |
there |
T6070 |
2910-2914 |
CD |
denotes |
four |
T6071 |
2934-2942 |
NNS |
denotes |
variants |
T6072 |
2915-2920 |
NN |
denotes |
mouse |
T6073 |
2921-2926 |
NN |
denotes |
TACC3 |
T6074 |
2927-2933 |
NN |
denotes |
splice |
T6075 |
2943-2947 |
WDT |
denotes |
that |
T6076 |
2948-2954 |
VBP |
denotes |
differ |
T6077 |
2955-2957 |
IN |
denotes |
in |
T6078 |
2958-2961 |
DT |
denotes |
the |
T6079 |
2962-2968 |
NN |
denotes |
number |
T6080 |
2969-2971 |
IN |
denotes |
of |
T6081 |
2972-2977 |
DT |
denotes |
these |
T6082 |
2978-2985 |
NNS |
denotes |
repeats |
T6083 |
2986-2987 |
-LRB- |
denotes |
[ |
T6084 |
2991-2993 |
CD |
denotes |
17 |
T6085 |
2987-2988 |
CD |
denotes |
2 |
T6086 |
2988-2989 |
, |
denotes |
, |
T6087 |
2989-2990 |
CD |
denotes |
7 |
T6088 |
2990-2991 |
, |
denotes |
, |
T6089 |
2993-2994 |
-RRB- |
denotes |
] |
T6090 |
2994-2995 |
. |
denotes |
. |
T6091 |
2995-3216 |
sentence |
denotes |
As these repeats are present in a single exon, it appears likely that these different sequences may be the result of the DNA polymerases used in the cDNA synthesis and/or PCR reaction stuttering through the repeat motif. |
T6092 |
2996-2998 |
IN |
denotes |
As |
T6093 |
3013-3016 |
VBP |
denotes |
are |
T6094 |
2999-3004 |
DT |
denotes |
these |
T6095 |
3005-3012 |
NNS |
denotes |
repeats |
T6096 |
3046-3053 |
VBZ |
denotes |
appears |
T6097 |
3017-3024 |
JJ |
denotes |
present |
T6098 |
3025-3027 |
IN |
denotes |
in |
T6099 |
3028-3029 |
DT |
denotes |
a |
T6100 |
3037-3041 |
NN |
denotes |
exon |
T6101 |
3030-3036 |
JJ |
denotes |
single |
T6102 |
3041-3043 |
, |
denotes |
, |
T6103 |
3043-3045 |
PRP |
denotes |
it |
T6104 |
3054-3060 |
JJ |
denotes |
likely |
T6105 |
3061-3065 |
IN |
denotes |
that |
T6106 |
3096-3098 |
VB |
denotes |
be |
T6107 |
3066-3071 |
DT |
denotes |
these |
T6108 |
3082-3091 |
NNS |
denotes |
sequences |
T6109 |
3072-3081 |
JJ |
denotes |
different |
T6110 |
3092-3095 |
MD |
denotes |
may |
T6111 |
3099-3102 |
DT |
denotes |
the |
T6112 |
3103-3109 |
NN |
denotes |
result |
T6113 |
3110-3112 |
IN |
denotes |
of |
T6114 |
3113-3116 |
DT |
denotes |
the |
T6115 |
3121-3132 |
NNS |
denotes |
polymerases |
T6116 |
3117-3120 |
NN |
denotes |
DNA |
T6117 |
3133-3137 |
VBN |
denotes |
used |
T6118 |
3138-3140 |
IN |
denotes |
in |
T6119 |
3141-3144 |
DT |
denotes |
the |
T6120 |
3150-3159 |
NN |
denotes |
synthesis |
T6121 |
3145-3149 |
NN |
denotes |
cDNA |
T6122 |
3160-3163 |
CC |
denotes |
and |
T6123 |
3163-3164 |
HYPH |
denotes |
/ |
T6124 |
3164-3166 |
CC |
denotes |
or |
T6125 |
3167-3170 |
NN |
denotes |
PCR |
T6126 |
3171-3179 |
NN |
denotes |
reaction |
T6127 |
3180-3190 |
VBG |
denotes |
stuttering |
T6128 |
3191-3198 |
IN |
denotes |
through |
T6129 |
3199-3202 |
DT |
denotes |
the |
T6130 |
3210-3215 |
NN |
denotes |
motif |
T6131 |
3203-3209 |
NN |
denotes |
repeat |
T6132 |
3215-3216 |
. |
denotes |
. |
T6133 |
3216-3327 |
sentence |
denotes |
The correct sequence, reported by Sadek et al [7], is the one used throughout the entirety of this manuscript. |
T6134 |
3217-3220 |
DT |
denotes |
The |
T6135 |
3229-3237 |
NN |
denotes |
sequence |
T6136 |
3221-3228 |
JJ |
denotes |
correct |
T6137 |
3268-3270 |
VBZ |
denotes |
is |
T6138 |
3237-3239 |
, |
denotes |
, |
T6139 |
3239-3247 |
VBN |
denotes |
reported |
T6140 |
3248-3250 |
IN |
denotes |
by |
T6141 |
3251-3256 |
NNP |
denotes |
Sadek |
T6142 |
3257-3259 |
FW |
denotes |
et |
T6143 |
3260-3262 |
FW |
denotes |
al |
T6144 |
3263-3264 |
-LRB- |
denotes |
[ |
T6145 |
3264-3265 |
CD |
denotes |
7 |
T6146 |
3265-3266 |
-RRB- |
denotes |
] |
T6147 |
3266-3268 |
, |
denotes |
, |
T6148 |
3271-3274 |
DT |
denotes |
the |
T6149 |
3275-3278 |
CD |
denotes |
one |
T6150 |
3279-3283 |
VBN |
denotes |
used |
T6151 |
3284-3294 |
IN |
denotes |
throughout |
T6152 |
3295-3298 |
DT |
denotes |
the |
T6153 |
3299-3307 |
NN |
denotes |
entirety |
T6154 |
3308-3310 |
IN |
denotes |
of |
T6155 |
3311-3315 |
DT |
denotes |
this |
T6156 |
3316-3326 |
NN |
denotes |
manuscript |
T6157 |
3326-3327 |
. |
denotes |
. |
T6158 |
3327-3549 |
sentence |
denotes |
These repeats are not evident in the rabbit protein, or any other TACC protein, and may indicate that the rodent TACC3 has evolved distinct functions, as has already been noted for the amphibian Xenopus TACC3, maskin [8]. |
T6159 |
3328-3333 |
DT |
denotes |
These |
T6160 |
3334-3341 |
NNS |
denotes |
repeats |
T6161 |
3342-3345 |
VBP |
denotes |
are |
T6162 |
3346-3349 |
RB |
denotes |
not |
T6163 |
3350-3357 |
JJ |
denotes |
evident |
T6164 |
3358-3360 |
IN |
denotes |
in |
T6165 |
3361-3364 |
DT |
denotes |
the |
T6166 |
3372-3379 |
NN |
denotes |
protein |
T6167 |
3365-3371 |
NN |
denotes |
rabbit |
T6168 |
3379-3381 |
, |
denotes |
, |
T6169 |
3381-3383 |
CC |
denotes |
or |
T6170 |
3384-3387 |
DT |
denotes |
any |
T6171 |
3399-3406 |
NN |
denotes |
protein |
T6172 |
3388-3393 |
JJ |
denotes |
other |
T6173 |
3394-3398 |
NN |
denotes |
TACC |
T6174 |
3406-3408 |
, |
denotes |
, |
T6175 |
3408-3411 |
CC |
denotes |
and |
T6176 |
3412-3415 |
MD |
denotes |
may |
T6177 |
3416-3424 |
VB |
denotes |
indicate |
T6178 |
3425-3429 |
IN |
denotes |
that |
T6179 |
3451-3458 |
VBN |
denotes |
evolved |
T6180 |
3430-3433 |
DT |
denotes |
the |
T6181 |
3441-3446 |
NN |
denotes |
TACC3 |
T6182 |
3434-3440 |
NN |
denotes |
rodent |
T6183 |
3447-3450 |
VBZ |
denotes |
has |
T6184 |
3459-3467 |
JJ |
denotes |
distinct |
T6185 |
3468-3477 |
NNS |
denotes |
functions |
T6186 |
3477-3479 |
, |
denotes |
, |
T6187 |
3479-3481 |
IN |
denotes |
as |
T6188 |
3499-3504 |
VBN |
denotes |
noted |
T6189 |
3482-3485 |
VBZ |
denotes |
has |
T6190 |
3486-3493 |
RB |
denotes |
already |
T6191 |
3494-3498 |
VBN |
denotes |
been |
T6192 |
3505-3508 |
IN |
denotes |
for |
T6193 |
3509-3512 |
DT |
denotes |
the |
T6194 |
3531-3536 |
NN |
denotes |
TACC3 |
T6195 |
3513-3522 |
JJ |
denotes |
amphibian |
T6196 |
3523-3530 |
NNP |
denotes |
Xenopus |
T6197 |
3536-3538 |
, |
denotes |
, |
T6198 |
3538-3544 |
NN |
denotes |
maskin |
T6199 |
3545-3546 |
-LRB- |
denotes |
[ |
T6200 |
3546-3547 |
CD |
denotes |
8 |
T6201 |
3547-3548 |
-RRB- |
denotes |
] |
T6202 |
3548-3549 |
. |
denotes |
. |
R3405 |
T5507 |
T5508 |
amod |
Comparative,structure |
R3406 |
T5509 |
T5508 |
amod |
genomic,structure |
R3407 |
T5510 |
T5508 |
prep |
of,structure |
R3408 |
T5511 |
T5512 |
det |
the,family |
R3409 |
T5512 |
T5510 |
pobj |
family,of |
R3410 |
T5513 |
T5512 |
compound |
TACC,family |
R3411 |
T5515 |
T5516 |
det |
The,sequences |
R3412 |
T5516 |
T5519 |
nsubj |
sequences,extracted |
R3413 |
T5517 |
T5516 |
amod |
genomic,sequences |
R3414 |
T5518 |
T5516 |
compound |
DNA,sequences |
R3415 |
T5520 |
T5516 |
acl |
corresponding,sequences |
R3416 |
T5521 |
T5520 |
prep |
to,corresponding |
R3417 |
T5522 |
T5523 |
det |
the,genes |
R3418 |
T5523 |
T5521 |
pobj |
genes,to |
R3419 |
T5524 |
T5523 |
amod |
orthologous,genes |
R3420 |
T5525 |
T5523 |
compound |
TACC,genes |
R3421 |
T5526 |
T5523 |
prep |
of,genes |
R3422 |
T5527 |
T5526 |
pobj |
human,of |
R3423 |
T5528 |
T5527 |
punct |
", ",human |
R3424 |
T5529 |
T5527 |
conj |
mouse,human |
R3425 |
T5530 |
T5529 |
punct |
", ",mouse |
R3426 |
T5531 |
T5529 |
conj |
rat,mouse |
R3427 |
T5532 |
T5531 |
punct |
", ",rat |
R3428 |
T5533 |
T5531 |
conj |
pufferfish,rat |
R3429 |
T5534 |
T5533 |
punct |
", ",pufferfish |
R3430 |
T5535 |
T5536 |
compound |
C.,intestinalis |
R3431 |
T5536 |
T5533 |
conj |
intestinalis,pufferfish |
R3432 |
T5537 |
T5536 |
punct |
", ",intestinalis |
R3433 |
T5538 |
T5539 |
compound |
D.,melanogaster |
R3434 |
T5539 |
T5536 |
conj |
melanogaster,intestinalis |
R3435 |
T5540 |
T5539 |
cc |
and,melanogaster |
R3436 |
T5541 |
T5542 |
compound |
C.,elegans |
R3437 |
T5542 |
T5539 |
conj |
elegans,melanogaster |
R3438 |
T5543 |
T5519 |
aux |
were,extracted |
R3439 |
T5544 |
T5519 |
cc |
and,extracted |
R3440 |
T5545 |
T5519 |
conj |
analyzed,extracted |
R3441 |
T5546 |
T5545 |
prep |
by,analyzed |
R3442 |
T5547 |
T5546 |
pobj |
Genescan,by |
R3443 |
T5548 |
T5547 |
cc |
and,Genescan |
R3444 |
T5549 |
T5547 |
conj |
BLAST,Genescan |
R3445 |
T5550 |
T5551 |
aux |
to,determine |
R3446 |
T5551 |
T5519 |
advcl |
determine,extracted |
R3447 |
T5552 |
T5553 |
det |
the,structure |
R3448 |
T5553 |
T5551 |
dobj |
structure,determine |
R3449 |
T5554 |
T5553 |
amod |
genomic,structure |
R3450 |
T5555 |
T5553 |
prep |
of,structure |
R3451 |
T5556 |
T5557 |
det |
each,gene |
R3452 |
T5557 |
T5555 |
pobj |
gene,of |
R3453 |
T5558 |
T5557 |
compound |
TACC,gene |
R3454 |
T5559 |
T5519 |
punct |
.,extracted |
R3455 |
T5561 |
T5562 |
prep |
In,added |
R3456 |
T5563 |
T5564 |
det |
some,cases |
R3457 |
T5564 |
T5561 |
pobj |
cases,In |
R3458 |
T5565 |
T5562 |
punct |
", ",added |
R3459 |
T5566 |
T5562 |
prep |
for,added |
R3460 |
T5567 |
T5566 |
pobj |
rat,for |
R3461 |
T5568 |
T5567 |
cc |
and,rat |
R3462 |
T5569 |
T5567 |
conj |
pufferfish,rat |
R3463 |
T5570 |
T5562 |
punct |
", ",added |
R3464 |
T5571 |
T5562 |
nsubjpass |
exons,added |
R3465 |
T5572 |
T5562 |
auxpass |
were,added |
R3466 |
T5573 |
T5562 |
cc |
or,added |
R3467 |
T5574 |
T5562 |
conj |
modified,added |
R3468 |
T5575 |
T5574 |
prep |
based,modified |
R3469 |
T5576 |
T5575 |
prep |
on,based |
R3470 |
T5577 |
T5578 |
det |
the,similarity |
R3471 |
T5578 |
T5576 |
pobj |
similarity,on |
R3472 |
T5579 |
T5578 |
amod |
best,similarity |
R3473 |
T5580 |
T5578 |
prep |
of,similarity |
R3474 |
T5581 |
T5582 |
amod |
translated,peptides |
R3475 |
T5582 |
T5580 |
pobj |
peptides,of |
R3476 |
T5583 |
T5578 |
prep |
to,similarity |
R3477 |
T5584 |
T5585 |
det |
the,proteins |
R3478 |
T5585 |
T5583 |
pobj |
proteins,to |
R3479 |
T5586 |
T5585 |
amod |
corresponding,proteins |
R3480 |
T5587 |
T5585 |
nmod |
mouse,proteins |
R3481 |
T5588 |
T5587 |
cc |
and,mouse |
R3482 |
T5589 |
T5587 |
conj |
human,mouse |
R3483 |
T5590 |
T5562 |
punct |
.,added |
R3484 |
T5592 |
T5593 |
prep |
For,used |
R3485 |
T5594 |
T5592 |
pobj |
regions,For |
R3486 |
T5595 |
T5594 |
prep |
with,regions |
R3487 |
T5596 |
T5597 |
amod |
low,similarity |
R3488 |
T5597 |
T5595 |
pobj |
similarity,with |
R3489 |
T5598 |
T5597 |
compound |
sequence,similarity |
R3490 |
T5599 |
T5594 |
prep |
in,regions |
R3491 |
T5600 |
T5601 |
compound |
T.,rubripes |
R3492 |
T5601 |
T5599 |
pobj |
rubripes,in |
R3493 |
T5602 |
T5593 |
punct |
", ",used |
R3494 |
T5603 |
T5604 |
amod |
genomic,sequences |
R3495 |
T5604 |
T5593 |
nsubjpass |
sequences,used |
R3496 |
T5605 |
T5604 |
prep |
from,sequences |
R3497 |
T5606 |
T5607 |
det |
the,pufferfish |
R3498 |
T5607 |
T5605 |
pobj |
pufferfish,from |
R3499 |
T5608 |
T5609 |
amod |
fresh,water |
R3500 |
T5609 |
T5607 |
compound |
water,pufferfish |
R3501 |
T5610 |
T5607 |
punct |
", ",pufferfish |
R3502 |
T5611 |
T5612 |
compound |
Tetraodon,nigroviridis |
R3503 |
T5612 |
T5607 |
appos |
nigroviridis,pufferfish |
R3504 |
T5613 |
T5593 |
auxpass |
were,used |
R3505 |
T5614 |
T5593 |
prep |
as,used |
R3506 |
T5615 |
T5616 |
amod |
additional,means |
R3507 |
T5616 |
T5614 |
pobj |
means,as |
R3508 |
T5617 |
T5618 |
aux |
to,verify |
R3509 |
T5618 |
T5616 |
advcl |
verify,means |
R3510 |
T5619 |
T5620 |
det |
the,exons |
R3511 |
T5620 |
T5618 |
dobj |
exons,verify |
R3512 |
T5621 |
T5620 |
amod |
predicted,exons |
R3513 |
T5622 |
T5593 |
punct |
.,used |
R3514 |
T5624 |
T5625 |
det |
The,structure |
R3515 |
T5625 |
T5627 |
nsubjpass |
structure,depicted |
R3516 |
T5626 |
T5625 |
amod |
general,structure |
R3517 |
T5628 |
T5625 |
prep |
of,structure |
R3518 |
T5629 |
T5630 |
det |
the,genes |
R3519 |
T5630 |
T5628 |
pobj |
genes,of |
R3520 |
T5631 |
T5630 |
compound |
TACC,genes |
R3521 |
T5632 |
T5630 |
cc |
and,genes |
R3522 |
T5633 |
T5630 |
conj |
proteins,genes |
R3523 |
T5634 |
T5627 |
auxpass |
is,depicted |
R3524 |
T5635 |
T5627 |
prep |
in,depicted |
R3525 |
T5636 |
T5635 |
pobj |
Fig.,in |
R3526 |
T5637 |
T5636 |
nummod |
4,Fig. |
R3527 |
T5638 |
T5627 |
punct |
.,depicted |
R3528 |
T5640 |
T5641 |
det |
The,feature |
R3529 |
T5641 |
T5644 |
nsubjpass |
feature,located |
R3530 |
T5642 |
T5641 |
amod |
main,feature |
R3531 |
T5643 |
T5641 |
amod |
conserved,feature |
R3532 |
T5645 |
T5641 |
prep |
of,feature |
R3533 |
T5646 |
T5647 |
det |
the,family |
R3534 |
T5647 |
T5645 |
pobj |
family,of |
R3535 |
T5648 |
T5647 |
compound |
TACC,family |
R3536 |
T5649 |
T5641 |
punct |
", ",feature |
R3537 |
T5650 |
T5651 |
det |
the,domain |
R3538 |
T5651 |
T5641 |
appos |
domain,feature |
R3539 |
T5652 |
T5651 |
compound |
TACC,domain |
R3540 |
T5653 |
T5644 |
punct |
", ",located |
R3541 |
T5654 |
T5644 |
auxpass |
is,located |
R3542 |
T5655 |
T5644 |
prep |
at,located |
R3543 |
T5656 |
T5657 |
det |
the,terminus |
R3544 |
T5657 |
T5655 |
pobj |
terminus,at |
R3545 |
T5658 |
T5657 |
compound |
carboxy,terminus |
R3546 |
T5659 |
T5657 |
prep |
of,terminus |
R3547 |
T5660 |
T5661 |
det |
the,protein |
R3548 |
T5661 |
T5659 |
pobj |
protein,of |
R3549 |
T5662 |
T5644 |
punct |
.,located |
R3550 |
T5664 |
T5665 |
prep |
In,comprises |
R3551 |
T5666 |
T5667 |
det |
the,case |
R3552 |
T5667 |
T5664 |
pobj |
case,In |
R3553 |
T5668 |
T5667 |
prep |
of,case |
R3554 |
T5669 |
T5670 |
det |
the,protein |
R3555 |
T5670 |
T5668 |
pobj |
protein,of |
R3556 |
T5671 |
T5672 |
compound |
C.,elegans |
R3557 |
T5672 |
T5670 |
compound |
elegans,protein |
R3558 |
T5673 |
T5670 |
compound |
TAC,protein |
R3559 |
T5674 |
T5665 |
punct |
", ",comprises |
R3560 |
T5675 |
T5676 |
det |
this,structure |
R3561 |
T5676 |
T5665 |
nsubj |
structure,comprises |
R3562 |
T5677 |
T5678 |
det |
the,majority |
R3563 |
T5678 |
T5665 |
dobj |
majority,comprises |
R3564 |
T5679 |
T5678 |
prep |
of,majority |
R3565 |
T5680 |
T5681 |
det |
the,protein |
R3566 |
T5681 |
T5679 |
pobj |
protein,of |
R3567 |
T5682 |
T5665 |
cc |
and,comprises |
R3568 |
T5683 |
T5684 |
auxpass |
is,encoded |
R3569 |
T5684 |
T5665 |
conj |
encoded,comprises |
R3570 |
T5685 |
T5684 |
agent |
by,encoded |
R3571 |
T5686 |
T5685 |
pobj |
two,by |
R3572 |
T5687 |
T5686 |
prep |
of,two |
R3573 |
T5688 |
T5689 |
det |
the,exons |
R3574 |
T5689 |
T5687 |
pobj |
exons,of |
R3575 |
T5690 |
T5689 |
nummod |
three,exons |
R3576 |
T5691 |
T5689 |
prep |
of,exons |
R3577 |
T5692 |
T5693 |
det |
the,gene |
R3578 |
T5693 |
T5691 |
pobj |
gene,of |
R3579 |
T5694 |
T5665 |
punct |
.,comprises |
R3580 |
T5696 |
T5697 |
prep |
In,encoded |
R3581 |
T5698 |
T5699 |
det |
the,organisms |
R3582 |
T5699 |
T5696 |
pobj |
organisms,In |
R3583 |
T5700 |
T5699 |
amod |
higher,organisms |
R3584 |
T5701 |
T5699 |
punct |
", ",organisms |
R3585 |
T5702 |
T5703 |
compound |
D.,melanogaster |
R3586 |
T5703 |
T5699 |
appos |
melanogaster,organisms |
R3587 |
T5704 |
T5703 |
punct |
", ",melanogaster |
R3588 |
T5705 |
T5703 |
cc |
and,melanogaster |
R3589 |
T5706 |
T5707 |
det |
the,intestinalis |
R3590 |
T5707 |
T5703 |
conj |
intestinalis,melanogaster |
R3591 |
T5708 |
T5709 |
compound |
deuterostomes,C. |
R3592 |
T5709 |
T5707 |
compound |
C.,intestinalis |
R3593 |
T5710 |
T5707 |
prep |
to,intestinalis |
R3594 |
T5711 |
T5710 |
pobj |
human,to |
R3595 |
T5712 |
T5697 |
punct |
", ",encoded |
R3596 |
T5713 |
T5714 |
det |
this,feature |
R3597 |
T5714 |
T5697 |
nsubjpass |
feature,encoded |
R3598 |
T5715 |
T5697 |
auxpass |
is,encoded |
R3599 |
T5716 |
T5697 |
advmod |
also,encoded |
R3600 |
T5717 |
T5697 |
agent |
by,encoded |
R3601 |
T5718 |
T5719 |
det |
the,exons |
R3602 |
T5719 |
T5717 |
pobj |
exons,by |
R3603 |
T5720 |
T5719 |
amod |
final,exons |
R3604 |
T5721 |
T5719 |
prep |
of,exons |
R3605 |
T5722 |
T5723 |
det |
the,gene |
R3606 |
T5723 |
T5721 |
pobj |
gene,of |
R3607 |
T5724 |
T5725 |
punct |
(,five |
R3608 |
T5725 |
T5697 |
parataxis |
five,encoded |
R3609 |
T5726 |
T5725 |
prep |
in,five |
R3610 |
T5727 |
T5728 |
compound |
D.,melanogaster |
R3611 |
T5728 |
T5726 |
pobj |
melanogaster,in |
R3612 |
T5729 |
T5725 |
punct |
", ",five |
R3613 |
T5730 |
T5725 |
appos |
seven,five |
R3614 |
T5731 |
T5730 |
prep |
in,seven |
R3615 |
T5732 |
T5733 |
det |
the,genes |
R3616 |
T5733 |
T5731 |
pobj |
genes,in |
R3617 |
T5734 |
T5733 |
compound |
deuterostome,genes |
R3618 |
T5735 |
T5725 |
punct |
),five |
R3619 |
T5736 |
T5697 |
punct |
.,encoded |
R3620 |
T5738 |
T5739 |
prep |
Outside,show |
R3621 |
T5740 |
T5738 |
prep |
of,Outside |
R3622 |
T5741 |
T5742 |
det |
the,domain |
R3623 |
T5742 |
T5740 |
pobj |
domain,of |
R3624 |
T5743 |
T5742 |
compound |
TACC,domain |
R3625 |
T5744 |
T5739 |
punct |
", ",show |
R3626 |
T5745 |
T5739 |
advmod |
however,show |
R3627 |
T5746 |
T5739 |
punct |
", ",show |
R3628 |
T5747 |
T5748 |
compound |
TACC,members |
R3629 |
T5748 |
T5739 |
nsubj |
members,show |
R3630 |
T5749 |
T5748 |
compound |
family,members |
R3631 |
T5750 |
T5751 |
advmod |
relatively,little |
R3632 |
T5751 |
T5752 |
amod |
little,homology |
R3633 |
T5752 |
T5739 |
dobj |
homology,show |
R3634 |
T5753 |
T5739 |
punct |
.,show |
R3635 |
T5755 |
T5756 |
nsubj |
It,is |
R3636 |
T5757 |
T5756 |
acomp |
interesting,is |
R3637 |
T5758 |
T5759 |
mark |
that,contains |
R3638 |
T5759 |
T5756 |
ccomp |
contains,is |
R3639 |
T5760 |
T5761 |
det |
each,gene |
R3640 |
T5761 |
T5759 |
nsubj |
gene,contains |
R3641 |
T5762 |
T5761 |
compound |
TACC,gene |
R3642 |
T5763 |
T5764 |
nummod |
one,exon |
R3643 |
T5764 |
T5759 |
dobj |
exon,contains |
R3644 |
T5765 |
T5764 |
amod |
large,exon |
R3645 |
T5766 |
T5764 |
punct |
", ",exon |
R3646 |
T5767 |
T5768 |
dep |
which,shows |
R3647 |
T5768 |
T5764 |
relcl |
shows,exon |
R3648 |
T5769 |
T5770 |
amod |
considerable,variability |
R3649 |
T5770 |
T5768 |
dobj |
variability,shows |
R3650 |
T5771 |
T5768 |
prep |
between,shows |
R3651 |
T5772 |
T5773 |
compound |
TACC,orthologues |
R3652 |
T5773 |
T5771 |
pobj |
orthologues,between |
R3653 |
T5774 |
T5768 |
punct |
", ",shows |
R3654 |
T5775 |
T5768 |
cc |
and,shows |
R3655 |
T5776 |
T5768 |
conj |
constitutes,shows |
R3656 |
T5777 |
T5778 |
det |
the,difference |
R3657 |
T5778 |
T5776 |
dobj |
difference,constitutes |
R3658 |
T5779 |
T5778 |
amod |
main,difference |
R3659 |
T5780 |
T5778 |
prep |
between,difference |
R3660 |
T5781 |
T5782 |
det |
the,genes |
R3661 |
T5782 |
T5780 |
pobj |
genes,between |
R3662 |
T5783 |
T5782 |
compound |
TACC3,genes |
R3663 |
T5784 |
T5776 |
prep |
in,constitutes |
R3664 |
T5785 |
T5786 |
det |
the,vertebrates |
R3665 |
T5786 |
T5784 |
pobj |
vertebrates,in |
R3666 |
T5787 |
T5788 |
punct |
(,see |
R3667 |
T5788 |
T5759 |
parataxis |
see,contains |
R3668 |
T5789 |
T5788 |
advmod |
below,see |
R3669 |
T5790 |
T5788 |
punct |
),see |
R3670 |
T5791 |
T5756 |
punct |
.,is |
R3671 |
T5793 |
T5794 |
prep |
In,contains |
R3672 |
T5795 |
T5793 |
pobj |
deuterostomes,In |
R3673 |
T5796 |
T5794 |
punct |
", ",contains |
R3674 |
T5797 |
T5798 |
det |
this,exon |
R3675 |
T5798 |
T5794 |
nsubj |
exon,contains |
R3676 |
T5799 |
T5800 |
det |
the,repeat |
R3677 |
T5800 |
T5794 |
dobj |
repeat,contains |
R3678 |
T5801 |
T5800 |
compound |
SDP,repeat |
R3679 |
T5802 |
T5800 |
punct |
(,repeat |
R3680 |
T5803 |
T5800 |
cc |
or,repeat |
R3681 |
T5804 |
T5805 |
prep |
in,repeat |
R3682 |
T5805 |
T5800 |
conj |
repeat,repeat |
R3683 |
T5806 |
T5807 |
det |
the,case |
R3684 |
T5807 |
T5804 |
pobj |
case,in |
R3685 |
T5808 |
T5807 |
prep |
of,case |
R3686 |
T5809 |
T5810 |
det |
the,TACC3 |
R3687 |
T5810 |
T5808 |
pobj |
TACC3,of |
R3688 |
T5811 |
T5810 |
amod |
murine,TACC3 |
R3689 |
T5812 |
T5810 |
case |
's,TACC3 |
R3690 |
T5813 |
T5805 |
punct |
", ",repeat |
R3691 |
T5814 |
T5805 |
det |
a,repeat |
R3692 |
T5815 |
T5816 |
npadvmod |
rodent,specific |
R3693 |
T5816 |
T5805 |
amod |
specific,repeat |
R3694 |
T5817 |
T5816 |
punct |
-,specific |
R3695 |
T5818 |
T5819 |
nummod |
24,acid |
R3696 |
T5819 |
T5805 |
compound |
acid,repeat |
R3697 |
T5820 |
T5819 |
compound |
amino,acid |
R3698 |
T5821 |
T5800 |
punct |
),repeat |
R3699 |
T5822 |
T5800 |
punct |
", ",repeat |
R3700 |
T5823 |
T5824 |
dep |
which,is |
R3701 |
T5824 |
T5800 |
relcl |
is,repeat |
R3702 |
T5825 |
T5824 |
acomp |
responsible,is |
R3703 |
T5826 |
T5825 |
prep |
for,responsible |
R3704 |
T5827 |
T5828 |
det |
the,binding |
R3705 |
T5828 |
T5826 |
pobj |
binding,for |
R3706 |
T5829 |
T5828 |
prep |
of,binding |
R3707 |
T5830 |
T5831 |
det |
the,component |
R3708 |
T5831 |
T5829 |
pobj |
component,of |
R3709 |
T5832 |
T5833 |
nmod |
SWI,SNF |
R3710 |
T5833 |
T5831 |
nmod |
SNF,component |
R3711 |
T5834 |
T5833 |
punct |
/,SNF |
R3712 |
T5835 |
T5836 |
compound |
chromatin,remodeling |
R3713 |
T5836 |
T5833 |
appos |
remodeling,SNF |
R3714 |
T5837 |
T5833 |
amod |
complex,SNF |
R3715 |
T5838 |
T5831 |
appos |
GAS41,component |
R3716 |
T5839 |
T5840 |
punct |
[,16 |
R3717 |
T5840 |
T5824 |
parataxis |
16,is |
R3718 |
T5841 |
T5840 |
nummod |
15,16 |
R3719 |
T5842 |
T5840 |
punct |
",",16 |
R3720 |
T5843 |
T5840 |
punct |
],16 |
R3721 |
T5844 |
T5794 |
punct |
.,contains |
R3722 |
T5846 |
T5847 |
prep |
Of,show |
R3723 |
T5848 |
T5849 |
det |
the,proteins |
R3724 |
T5849 |
T5846 |
pobj |
proteins,Of |
R3725 |
T5850 |
T5849 |
compound |
vertebrate,proteins |
R3726 |
T5851 |
T5849 |
compound |
TACC,proteins |
R3727 |
T5852 |
T5847 |
punct |
", ",show |
R3728 |
T5853 |
T5854 |
det |
the,orthologues |
R3729 |
T5854 |
T5847 |
nsubj |
orthologues,show |
R3730 |
T5855 |
T5854 |
compound |
TACC3,orthologues |
R3731 |
T5856 |
T5857 |
det |
the,variability |
R3732 |
T5857 |
T5847 |
dobj |
variability,show |
R3733 |
T5858 |
T5857 |
amod |
greatest,variability |
R3734 |
T5859 |
T5857 |
prep |
in,variability |
R3735 |
T5860 |
T5859 |
pobj |
size,in |
R3736 |
T5861 |
T5860 |
cc |
and,size |
R3737 |
T5862 |
T5860 |
conj |
sequence,size |
R3738 |
T5863 |
T5847 |
punct |
", ",show |
R3739 |
T5864 |
T5847 |
advcl |
ranging,show |
R3740 |
T5865 |
T5864 |
prep |
in,ranging |
R3741 |
T5866 |
T5865 |
pobj |
size,in |
R3742 |
T5867 |
T5864 |
prep |
from,ranging |
R3743 |
T5868 |
T5869 |
nummod |
599,acids |
R3744 |
T5869 |
T5867 |
pobj |
acids,from |
R3745 |
T5870 |
T5869 |
compound |
amino,acids |
R3746 |
T5871 |
T5869 |
prep |
for,acids |
R3747 |
T5872 |
T5873 |
det |
the,protein |
R3748 |
T5873 |
T5871 |
pobj |
protein,for |
R3749 |
T5874 |
T5873 |
compound |
rat,protein |
R3750 |
T5875 |
T5873 |
compound |
TACC3,protein |
R3751 |
T5876 |
T5867 |
punct |
", ",from |
R3752 |
T5877 |
T5867 |
prep |
to,from |
R3753 |
T5878 |
T5879 |
nummod |
942,acids |
R3754 |
T5879 |
T5877 |
pobj |
acids,to |
R3755 |
T5880 |
T5879 |
compound |
amino,acids |
R3756 |
T5881 |
T5879 |
prep |
in,acids |
R3757 |
T5882 |
T5883 |
det |
the,protein |
R3758 |
T5883 |
T5881 |
pobj |
protein,in |
R3759 |
T5884 |
T5883 |
compound |
Danio,protein |
R3760 |
T5885 |
T5883 |
compound |
rerio,protein |
R3761 |
T5886 |
T5847 |
punct |
.,show |
R3762 |
T5888 |
T5889 |
det |
The,reasons |
R3763 |
T5889 |
T5890 |
nsubj |
reasons,are |
R3764 |
T5891 |
T5889 |
prep |
for,reasons |
R3765 |
T5892 |
T5893 |
det |
these,differences |
R3766 |
T5893 |
T5891 |
pobj |
differences,for |
R3767 |
T5894 |
T5890 |
acomp |
apparent,are |
R3768 |
T5895 |
T5890 |
prep |
from,are |
R3769 |
T5896 |
T5897 |
det |
the,structure |
R3770 |
T5897 |
T5895 |
pobj |
structure,from |
R3771 |
T5898 |
T5897 |
amod |
genomic,structure |
R3772 |
T5899 |
T5897 |
prep |
of,structure |
R3773 |
T5900 |
T5901 |
det |
the,orthologues |
R3774 |
T5901 |
T5899 |
pobj |
orthologues,of |
R3775 |
T5902 |
T5901 |
compound |
TACC3,orthologues |
R3776 |
T5903 |
T5890 |
punct |
.,are |
R3777 |
T5905 |
T5906 |
nsubjpass |
TACC3,divided |
R3778 |
T5907 |
T5906 |
aux |
can,divided |
R3779 |
T5908 |
T5906 |
auxpass |
be,divided |
R3780 |
T5909 |
T5906 |
prep |
into,divided |
R3781 |
T5910 |
T5911 |
nummod |
three,sections |
R3782 |
T5911 |
T5909 |
pobj |
sections,into |
R3783 |
T5912 |
T5911 |
punct |
: ,sections |
R3784 |
T5913 |
T5914 |
det |
a,region |
R3785 |
T5914 |
T5911 |
appos |
region,sections |
R3786 |
T5915 |
T5914 |
amod |
conserved,region |
R3787 |
T5916 |
T5917 |
npadvmod |
N,terminal |
R3788 |
T5917 |
T5914 |
amod |
terminal,region |
R3789 |
T5918 |
T5917 |
punct |
-,terminal |
R3790 |
T5919 |
T5914 |
punct |
(,region |
R3791 |
T5920 |
T5914 |
appos |
CNTR,region |
R3792 |
T5921 |
T5914 |
punct |
),region |
R3793 |
T5922 |
T5914 |
prep |
of,region |
R3794 |
T5923 |
T5924 |
nummod |
108,acids |
R3795 |
T5924 |
T5922 |
pobj |
acids,of |
R3796 |
T5925 |
T5924 |
compound |
amino,acids |
R3797 |
T5926 |
T5924 |
punct |
", ",acids |
R3798 |
T5927 |
T5924 |
acl |
encoded,acids |
R3799 |
T5928 |
T5927 |
agent |
by,encoded |
R3800 |
T5929 |
T5930 |
nmod |
exons,2 |
R3801 |
T5930 |
T5928 |
pobj |
2,by |
R3802 |
T5931 |
T5930 |
cc |
and,2 |
R3803 |
T5932 |
T5930 |
conj |
3,2 |
R3804 |
T5933 |
T5927 |
prep |
in,encoded |
R3805 |
T5934 |
T5935 |
det |
each,gene |
R3806 |
T5935 |
T5933 |
pobj |
gene,in |
R3807 |
T5936 |
T5935 |
compound |
vertebrate,gene |
R3808 |
T5937 |
T5935 |
compound |
TACC3,gene |
R3809 |
T5938 |
T5914 |
punct |
", ",region |
R3810 |
T5939 |
T5940 |
det |
the,domain |
R3811 |
T5940 |
T5914 |
conj |
domain,region |
R3812 |
T5941 |
T5940 |
amod |
conserved,domain |
R3813 |
T5942 |
T5940 |
compound |
TACC,domain |
R3814 |
T5943 |
T5940 |
acl |
distributed,domain |
R3815 |
T5944 |
T5943 |
prep |
over,distributed |
R3816 |
T5945 |
T5946 |
det |
the,exons |
R3817 |
T5946 |
T5944 |
pobj |
exons,over |
R3818 |
T5947 |
T5946 |
amod |
final,exons |
R3819 |
T5948 |
T5946 |
nummod |
seven,exons |
R3820 |
T5949 |
T5940 |
punct |
", ",domain |
R3821 |
T5950 |
T5940 |
cc |
and,domain |
R3822 |
T5951 |
T5952 |
det |
a,region |
R3823 |
T5952 |
T5940 |
conj |
region,domain |
R3824 |
T5953 |
T5954 |
advmod |
highly,variable |
R3825 |
T5954 |
T5952 |
amod |
variable,region |
R3826 |
T5955 |
T5952 |
amod |
central,region |
R3827 |
T5956 |
T5906 |
punct |
.,divided |
R3828 |
T5958 |
T5959 |
det |
The,lack |
R3829 |
T5959 |
T5960 |
nsubjpass |
lack,noted |
R3830 |
T5961 |
T5959 |
prep |
of,lack |
R3831 |
T5962 |
T5961 |
pobj |
conservation,of |
R3832 |
T5963 |
T5962 |
prep |
in,conservation |
R3833 |
T5964 |
T5965 |
preconj |
both,size |
R3834 |
T5965 |
T5963 |
pobj |
size,in |
R3835 |
T5966 |
T5965 |
cc |
and,size |
R3836 |
T5967 |
T5965 |
conj |
sequence,size |
R3837 |
T5968 |
T5965 |
prep |
of,size |
R3838 |
T5969 |
T5970 |
det |
the,portion |
R3839 |
T5970 |
T5968 |
pobj |
portion,of |
R3840 |
T5971 |
T5970 |
amod |
central,portion |
R3841 |
T5972 |
T5970 |
prep |
of,portion |
R3842 |
T5973 |
T5974 |
det |
the,proteins |
R3843 |
T5974 |
T5972 |
pobj |
proteins,of |
R3844 |
T5975 |
T5974 |
compound |
TACC3,proteins |
R3845 |
T5976 |
T5974 |
prep |
of,proteins |
R3846 |
T5977 |
T5978 |
amod |
human,mouse |
R3847 |
T5978 |
T5976 |
pobj |
mouse,of |
R3848 |
T5979 |
T5978 |
cc |
and,mouse |
R3849 |
T5980 |
T5960 |
aux |
has,noted |
R3850 |
T5981 |
T5960 |
auxpass |
been,noted |
R3851 |
T5982 |
T5960 |
advmod |
previously,noted |
R3852 |
T5983 |
T5960 |
punct |
", ",noted |
R3853 |
T5984 |
T5960 |
cc |
and,noted |
R3854 |
T5985 |
T5960 |
conj |
accounts,noted |
R3855 |
T5986 |
T5985 |
prep |
for,accounts |
R3856 |
T5987 |
T5988 |
det |
the,difference |
R3857 |
T5988 |
T5986 |
pobj |
difference,for |
R3858 |
T5989 |
T5988 |
amod |
major,difference |
R3859 |
T5990 |
T5988 |
prep |
between,difference |
R3860 |
T5991 |
T5992 |
det |
these,orthologues |
R3861 |
T5992 |
T5990 |
pobj |
orthologues,between |
R3862 |
T5993 |
T5992 |
nummod |
two,orthologues |
R3863 |
T5994 |
T5995 |
punct |
[,2 |
R3864 |
T5995 |
T5985 |
parataxis |
2,accounts |
R3865 |
T5996 |
T5995 |
punct |
],2 |
R3866 |
T5997 |
T5960 |
punct |
.,noted |
R3867 |
T5999 |
T6000 |
det |
The,majority |
R3868 |
T6000 |
T6001 |
nsubjpass |
majority,encoded |
R3869 |
T6002 |
T6000 |
prep |
of,majority |
R3870 |
T6003 |
T6004 |
det |
this,portion |
R3871 |
T6004 |
T6002 |
pobj |
portion,of |
R3872 |
T6005 |
T6004 |
amod |
central,portion |
R3873 |
T6006 |
T6004 |
punct |
", ",portion |
R3874 |
T6007 |
T6008 |
dep |
which,contains |
R3875 |
T6008 |
T6004 |
relcl |
contains,portion |
R3876 |
T6009 |
T6010 |
det |
the,motifs |
R3877 |
T6010 |
T6008 |
dobj |
motifs,contains |
R3878 |
T6011 |
T6010 |
compound |
SDP,motifs |
R3879 |
T6012 |
T6010 |
compound |
repeat,motifs |
R3880 |
T6013 |
T6001 |
punct |
", ",encoded |
R3881 |
T6014 |
T6001 |
auxpass |
is,encoded |
R3882 |
T6015 |
T6001 |
agent |
by,encoded |
R3883 |
T6016 |
T6017 |
nummod |
one,exon |
R3884 |
T6017 |
T6015 |
pobj |
exon,by |
R3885 |
T6018 |
T6001 |
prep |
in,encoded |
R3886 |
T6019 |
T6018 |
pobj |
human,in |
R3887 |
T6020 |
T6019 |
cc |
and,human |
R3888 |
T6021 |
T6022 |
det |
the,pufferfish |
R3889 |
T6022 |
T6019 |
conj |
pufferfish,human |
R3890 |
T6023 |
T6024 |
punct |
(,emb|CAAB01001184 |
R3891 |
T6024 |
T6001 |
parataxis |
emb|CAAB01001184,encoded |
R3892 |
T6025 |
T6024 |
punct |
),emb|CAAB01001184 |
R3893 |
T6026 |
T6001 |
punct |
.,encoded |
R3894 |
T6028 |
T6029 |
prep |
In,composed |
R3895 |
T6030 |
T6028 |
pobj |
rodents,In |
R3896 |
T6031 |
T6029 |
punct |
", ",composed |
R3897 |
T6032 |
T6029 |
advmod |
however,composed |
R3898 |
T6033 |
T6029 |
punct |
", ",composed |
R3899 |
T6034 |
T6035 |
det |
this,region |
R3900 |
T6035 |
T6029 |
nsubjpass |
region,composed |
R3901 |
T6036 |
T6029 |
auxpass |
is,composed |
R3902 |
T6037 |
T6038 |
advmod |
almost,entirely |
R3903 |
T6038 |
T6029 |
advmod |
entirely,composed |
R3904 |
T6039 |
T6029 |
prep |
of,composed |
R3905 |
T6040 |
T6041 |
nummod |
seven,repeats |
R3906 |
T6041 |
T6039 |
pobj |
repeats,of |
R3907 |
T6042 |
T6043 |
nummod |
24,acid |
R3908 |
T6043 |
T6041 |
compound |
acid,repeats |
R3909 |
T6044 |
T6043 |
compound |
amino,acid |
R3910 |
T6045 |
T6041 |
punct |
", ",repeats |
R3911 |
T6046 |
T6047 |
dep |
which,located |
R3912 |
T6047 |
T6041 |
relcl |
located,repeats |
R3913 |
T6048 |
T6047 |
auxpass |
are,located |
R3914 |
T6049 |
T6047 |
prep |
in,located |
R3915 |
T6050 |
T6051 |
det |
a,exon |
R3916 |
T6051 |
T6049 |
pobj |
exon,in |
R3917 |
T6052 |
T6051 |
amod |
single,exon |
R3918 |
T6053 |
T6051 |
prep |
of,exon |
R3919 |
T6054 |
T6055 |
det |
the,genes |
R3920 |
T6055 |
T6053 |
pobj |
genes,of |
R3921 |
T6056 |
T6055 |
nmod |
mouse,genes |
R3922 |
T6057 |
T6056 |
cc |
and,mouse |
R3923 |
T6058 |
T6056 |
conj |
rat,mouse |
R3924 |
T6059 |
T6055 |
compound |
TACC3,genes |
R3925 |
T6060 |
T6029 |
punct |
.,composed |
R3926 |
T6062 |
T6063 |
nsubjpass |
It,reported |
R3927 |
T6064 |
T6063 |
aux |
has,reported |
R3928 |
T6065 |
T6063 |
auxpass |
been,reported |
R3929 |
T6066 |
T6063 |
advmod |
previously,reported |
R3930 |
T6067 |
T6068 |
mark |
that,are |
R3931 |
T6068 |
T6063 |
ccomp |
are,reported |
R3932 |
T6069 |
T6068 |
expl |
there,are |
R3933 |
T6070 |
T6071 |
nummod |
four,variants |
R3934 |
T6071 |
T6068 |
attr |
variants,are |
R3935 |
T6072 |
T6073 |
compound |
mouse,TACC3 |
R3936 |
T6073 |
T6071 |
compound |
TACC3,variants |
R3937 |
T6074 |
T6071 |
compound |
splice,variants |
R3938 |
T6075 |
T6076 |
dep |
that,differ |
R3939 |
T6076 |
T6071 |
relcl |
differ,variants |
R3940 |
T6077 |
T6076 |
prep |
in,differ |
R3941 |
T6078 |
T6079 |
det |
the,number |
R3942 |
T6079 |
T6077 |
pobj |
number,in |
R3943 |
T6080 |
T6079 |
prep |
of,number |
R3944 |
T6081 |
T6082 |
det |
these,repeats |
R3945 |
T6082 |
T6080 |
pobj |
repeats,of |
R3946 |
T6083 |
T6084 |
punct |
[,17 |
R3947 |
T6084 |
T6063 |
parataxis |
17,reported |
R3948 |
T6085 |
T6084 |
nummod |
2,17 |
R3949 |
T6086 |
T6084 |
punct |
",",17 |
R3950 |
T6087 |
T6084 |
nummod |
7,17 |
R3951 |
T6088 |
T6084 |
punct |
",",17 |
R3952 |
T6089 |
T6084 |
punct |
],17 |
R3953 |
T6090 |
T6063 |
punct |
.,reported |
R3954 |
T6092 |
T6093 |
mark |
As,are |
R3955 |
T6093 |
T6096 |
advcl |
are,appears |
R3956 |
T6094 |
T6095 |
det |
these,repeats |
R3957 |
T6095 |
T6093 |
nsubj |
repeats,are |
R3958 |
T6097 |
T6093 |
acomp |
present,are |
R3959 |
T6098 |
T6093 |
prep |
in,are |
R3960 |
T6099 |
T6100 |
det |
a,exon |
R3961 |
T6100 |
T6098 |
pobj |
exon,in |
R3962 |
T6101 |
T6100 |
amod |
single,exon |
R3963 |
T6102 |
T6096 |
punct |
", ",appears |
R3964 |
T6103 |
T6096 |
nsubj |
it,appears |
R3965 |
T6104 |
T6096 |
oprd |
likely,appears |
R3966 |
T6105 |
T6106 |
mark |
that,be |
R3967 |
T6106 |
T6096 |
ccomp |
be,appears |
R3968 |
T6107 |
T6108 |
det |
these,sequences |
R3969 |
T6108 |
T6106 |
nsubj |
sequences,be |
R3970 |
T6109 |
T6108 |
amod |
different,sequences |
R3971 |
T6110 |
T6106 |
aux |
may,be |
R3972 |
T6111 |
T6112 |
det |
the,result |
R3973 |
T6112 |
T6106 |
attr |
result,be |
R3974 |
T6113 |
T6112 |
prep |
of,result |
R3975 |
T6114 |
T6115 |
det |
the,polymerases |
R3976 |
T6115 |
T6113 |
pobj |
polymerases,of |
R3977 |
T6116 |
T6115 |
compound |
DNA,polymerases |
R3978 |
T6117 |
T6115 |
acl |
used,polymerases |
R3979 |
T6118 |
T6117 |
prep |
in,used |
R3980 |
T6119 |
T6120 |
det |
the,synthesis |
R3981 |
T6120 |
T6118 |
pobj |
synthesis,in |
R3982 |
T6121 |
T6120 |
compound |
cDNA,synthesis |
R3983 |
T6122 |
T6115 |
cc |
and,polymerases |
R3984 |
T6123 |
T6122 |
punct |
/,and |
R3985 |
T6124 |
T6122 |
cc |
or,and |
R3986 |
T6125 |
T6126 |
compound |
PCR,reaction |
R3987 |
T6126 |
T6127 |
nsubj |
reaction,stuttering |
R3988 |
T6127 |
T6115 |
conj |
stuttering,polymerases |
R3989 |
T6128 |
T6127 |
prep |
through,stuttering |
R3990 |
T6129 |
T6130 |
det |
the,motif |
R3991 |
T6130 |
T6128 |
pobj |
motif,through |
R3992 |
T6131 |
T6130 |
compound |
repeat,motif |
R3993 |
T6132 |
T6096 |
punct |
.,appears |
R3994 |
T6134 |
T6135 |
det |
The,sequence |
R3995 |
T6135 |
T6137 |
nsubj |
sequence,is |
R3996 |
T6136 |
T6135 |
amod |
correct,sequence |
R3997 |
T6138 |
T6135 |
punct |
", ",sequence |
R3998 |
T6139 |
T6135 |
acl |
reported,sequence |
R3999 |
T6140 |
T6139 |
agent |
by,reported |
R4000 |
T6141 |
T6140 |
pobj |
Sadek,by |
R4001 |
T6142 |
T6143 |
advmod |
et,al |
R4002 |
T6143 |
T6141 |
advmod |
al,Sadek |
R4003 |
T6144 |
T6145 |
punct |
[,7 |
R4004 |
T6145 |
T6139 |
parataxis |
7,reported |
R4005 |
T6146 |
T6145 |
punct |
],7 |
R4006 |
T6147 |
T6137 |
punct |
", ",is |
R4007 |
T6148 |
T6149 |
det |
the,one |
R4008 |
T6149 |
T6137 |
attr |
one,is |
R4009 |
T6150 |
T6149 |
acl |
used,one |
R4010 |
T6151 |
T6150 |
prep |
throughout,used |
R4011 |
T6152 |
T6153 |
det |
the,entirety |
R4012 |
T6153 |
T6151 |
pobj |
entirety,throughout |
R4013 |
T6154 |
T6153 |
prep |
of,entirety |
R4014 |
T6155 |
T6156 |
det |
this,manuscript |
R4015 |
T6156 |
T6154 |
pobj |
manuscript,of |
R4016 |
T6157 |
T6137 |
punct |
.,is |
R4017 |
T6159 |
T6160 |
det |
These,repeats |
R4018 |
T6160 |
T6161 |
nsubj |
repeats,are |
R4019 |
T6162 |
T6161 |
neg |
not,are |
R4020 |
T6163 |
T6161 |
acomp |
evident,are |
R4021 |
T6164 |
T6161 |
prep |
in,are |
R4022 |
T6165 |
T6166 |
det |
the,protein |
R4023 |
T6166 |
T6164 |
pobj |
protein,in |
R4024 |
T6167 |
T6166 |
compound |
rabbit,protein |
R4025 |
T6168 |
T6166 |
punct |
", ",protein |
R4026 |
T6169 |
T6166 |
cc |
or,protein |
R4027 |
T6170 |
T6171 |
det |
any,protein |
R4028 |
T6171 |
T6166 |
conj |
protein,protein |
R4029 |
T6172 |
T6171 |
amod |
other,protein |
R4030 |
T6173 |
T6171 |
compound |
TACC,protein |
R4031 |
T6174 |
T6161 |
punct |
", ",are |
R4032 |
T6175 |
T6161 |
cc |
and,are |
R4033 |
T6176 |
T6177 |
aux |
may,indicate |
R4034 |
T6177 |
T6161 |
conj |
indicate,are |
R4035 |
T6178 |
T6179 |
mark |
that,evolved |
R4036 |
T6179 |
T6177 |
ccomp |
evolved,indicate |
R4037 |
T6180 |
T6181 |
det |
the,TACC3 |
R4038 |
T6181 |
T6179 |
nsubj |
TACC3,evolved |
R4039 |
T6182 |
T6181 |
compound |
rodent,TACC3 |
R4040 |
T6183 |
T6179 |
aux |
has,evolved |
R4041 |
T6184 |
T6185 |
amod |
distinct,functions |
R4042 |
T6185 |
T6179 |
dobj |
functions,evolved |
R4043 |
T6186 |
T6177 |
punct |
", ",indicate |
R4044 |
T6187 |
T6188 |
mark |
as,noted |
R4045 |
T6188 |
T6177 |
advcl |
noted,indicate |
R4046 |
T6189 |
T6188 |
aux |
has,noted |
R4047 |
T6190 |
T6188 |
advmod |
already,noted |
R4048 |
T6191 |
T6188 |
auxpass |
been,noted |
R4049 |
T6192 |
T6188 |
prep |
for,noted |
R4050 |
T6193 |
T6194 |
det |
the,TACC3 |
R4051 |
T6194 |
T6192 |
pobj |
TACC3,for |
R4052 |
T6195 |
T6194 |
amod |
amphibian,TACC3 |
R4053 |
T6196 |
T6194 |
compound |
Xenopus,TACC3 |
R4054 |
T6197 |
T6194 |
punct |
", ",TACC3 |
R4055 |
T6198 |
T6194 |
appos |
maskin,TACC3 |
R4056 |
T6199 |
T6200 |
punct |
[,8 |
R4057 |
T6200 |
T6188 |
parataxis |
8,noted |
R4058 |
T6201 |
T6200 |
punct |
],8 |
R4059 |
T6202 |
T6161 |
punct |
.,are |