Id |
Subject |
Object |
Predicate |
Lexical cue |
T6310 |
4624-4625 |
. |
denotes |
. |
T6309 |
4619-4624 |
NN |
denotes |
model |
T6308 |
4615-4618 |
DT |
denotes |
the |
T6307 |
4607-4614 |
IN |
denotes |
through |
T6306 |
4592-4600 |
JJ |
denotes |
possible |
T6305 |
4601-4606 |
NNS |
denotes |
paths |
T6304 |
4588-4591 |
DT |
denotes |
all |
T6303 |
4583-4587 |
IN |
denotes |
over |
T6302 |
4576-4582 |
VBD |
denotes |
summed |
T6301 |
4568-4575 |
RB |
denotes |
outside |
T6300 |
4565-4567 |
CC |
denotes |
or |
T6299 |
4563-4565 |
, |
denotes |
, |
T6298 |
4557-4563 |
RB |
denotes |
inside |
T6297 |
4555-4557 |
, |
denotes |
, |
T6296 |
4550-4555 |
NN |
denotes |
helix |
T6295 |
4547-4549 |
IN |
denotes |
in |
T6294 |
4534-4541 |
NN |
denotes |
residue |
T6293 |
4532-4533 |
DT |
denotes |
a |
T6292 |
4542-4546 |
VBZ |
denotes |
sits |
T6291 |
4527-4531 |
IN |
denotes |
that |
T6290 |
4509-4514 |
JJ |
denotes |
total |
T6289 |
4515-4526 |
NN |
denotes |
probability |
T6288 |
4505-4508 |
DT |
denotes |
the |
T6287 |
4493-4504 |
VBG |
denotes |
calculating |
T6286 |
4490-4492 |
IN |
denotes |
by |
T6285 |
4478-4480 |
VBZ |
denotes |
is |
T6284 |
4481-4489 |
VBN |
denotes |
obtained |
T6283 |
4473-4477 |
NN |
denotes |
plot |
T6282 |
4469-4472 |
DT |
denotes |
The |
T6281 |
4468-4625 |
sentence |
denotes |
The plot is obtained by calculating the total probability that a residue sits in helix, inside, or outside summed over all possible paths through the model. |
T6280 |
4467-4468 |
. |
denotes |
. |
T6279 |
4459-4461 |
VBZ |
denotes |
is |
T6278 |
4442-4443 |
HYPH |
denotes |
- |
T6277 |
4443-4447 |
JJS |
denotes |
best |
T6276 |
4441-4442 |
NN |
denotes |
N |
T6275 |
4448-4458 |
NN |
denotes |
prediction |
T6274 |
4437-4440 |
DT |
denotes |
the |
T6273 |
4435-4436 |
-RRB- |
denotes |
) |
T6272 |
4432-4435 |
CD |
denotes |
1.2 |
T6271 |
4428-4431 |
CC |
denotes |
and |
T6270 |
4426-4427 |
CD |
denotes |
1 |
T6269 |
4418-4425 |
IN |
denotes |
between |
T6268 |
4417-4418 |
-LRB- |
denotes |
( |
T6267 |
4412-4416 |
NN |
denotes |
plot |
T6266 |
4408-4411 |
DT |
denotes |
the |
T6265 |
4405-4407 |
IN |
denotes |
of |
T6264 |
4401-4404 |
NN |
denotes |
top |
T6263 |
4397-4400 |
DT |
denotes |
the |
T6262 |
4462-4467 |
VBN |
denotes |
shown |
T6261 |
4394-4396 |
IN |
denotes |
At |
T6260 |
4393-4468 |
sentence |
denotes |
At the top of the plot (between 1 and 1.2) the N-best prediction is shown. |
T6259 |
4392-4393 |
. |
denotes |
. |
T6258 |
4387-4392 |
NN |
denotes |
helix |
T6257 |
4383-4384 |
HYPH |
denotes |
/ |
T6256 |
4376-4383 |
JJ |
denotes |
outside |
T6255 |
4375-4376 |
HYPH |
denotes |
/ |
T6254 |
4384-4386 |
NN |
denotes |
TM |
T6253 |
4368-4374 |
JJ |
denotes |
inside |
T6252 |
4365-4367 |
IN |
denotes |
of |
T6251 |
4341-4350 |
JJ |
denotes |
posterior |
T6250 |
4351-4364 |
NNS |
denotes |
probabilities |
T6249 |
4337-4340 |
DT |
denotes |
the |
T6106 |
2995-2996 |
-LRB- |
denotes |
( |
T6105 |
3006-3008 |
VBZ |
denotes |
is |
T6104 |
2990-2994 |
NN |
denotes |
CaeE |
T6103 |
2989-3063 |
sentence |
denotes |
CaeE (AAK77203) is a hypothetical protein from the Caenohabditis elegans. |
T6102 |
2988-2989 |
. |
denotes |
. |
T6101 |
2987-2988 |
-RRB- |
denotes |
) |
T6100 |
2979-2987 |
NN |
denotes |
EAA01004 |
T6099 |
2978-2979 |
-LRB- |
denotes |
( |
T6098 |
2965-2972 |
NNP |
denotes |
gambiae |
T6097 |
2955-2964 |
NNP |
denotes |
anopheles |
T6096 |
2973-2977 |
NN |
denotes |
str. |
T6095 |
2951-2954 |
DT |
denotes |
the |
T6094 |
2946-2950 |
IN |
denotes |
from |
T6093 |
2938-2945 |
NN |
denotes |
protein |
T6092 |
2936-2937 |
DT |
denotes |
a |
T6091 |
2925-2935 |
VBZ |
denotes |
represents |
T6090 |
2920-2924 |
NN |
denotes |
AnoG |
T6089 |
2919-2989 |
sentence |
denotes |
AnoG represents a protein from the anopheles gambiae str. (EAA01004). |
T6088 |
2918-2919 |
. |
denotes |
. |
T6087 |
2917-2918 |
-RRB- |
denotes |
) |
T6086 |
2903-2909 |
NNP |
denotes |
Genome |
T6085 |
2892-2902 |
NNP |
denotes |
Drosophila |
T6084 |
2910-2917 |
NNP |
denotes |
Project |
T6083 |
2883-2891 |
NNP |
denotes |
Berkeley |
T6082 |
2882-2883 |
-LRB- |
denotes |
( |
T6081 |
2877-2881 |
NN |
denotes |
BDGP |
T6080 |
2874-2876 |
IN |
denotes |
in |
T6079 |
2866-2873 |
NN |
denotes |
CG40084 |
T6078 |
2858-2862 |
NN |
denotes |
gene |
T6077 |
2853-2857 |
DT |
denotes |
this |
T6076 |
2849-2852 |
IN |
denotes |
for |
T6075 |
2863-2865 |
VBZ |
denotes |
is |
T6074 |
2832-2841 |
NN |
denotes |
accession |
T6073 |
2842-2848 |
NN |
denotes |
number |
T6072 |
2828-2831 |
DT |
denotes |
The |
T6071 |
2827-2919 |
sentence |
denotes |
The accession number for this gene is CG40084 in BDGP (Berkeley Drosophila Genome Project). |
T6070 |
2826-2827 |
. |
denotes |
. |
T6069 |
2814-2826 |
NNP |
denotes |
melanogaster |
T6068 |
2811-2813 |
NNP |
denotes |
D. |
T6067 |
2806-2810 |
IN |
denotes |
from |
T6066 |
2793-2797 |
NN |
denotes |
gene |
T6065 |
2798-2805 |
NN |
denotes |
product |
T6064 |
2791-2792 |
DT |
denotes |
a |
T6063 |
2788-2790 |
VBZ |
denotes |
is |
T6062 |
2783-2787 |
NN |
denotes |
DroM |
T6061 |
2782-2827 |
sentence |
denotes |
DroM is a gene product from D. melanogaster. |
T6060 |
2781-2782 |
. |
denotes |
. |
T6059 |
2775-2781 |
NNP |
denotes |
crassa |
T6058 |
2764-2774 |
NNP |
denotes |
Neurospora |
T6057 |
2759-2763 |
IN |
denotes |
from |
T6056 |
2738-2750 |
JJ |
denotes |
hypothetical |
T6055 |
2751-2758 |
NN |
denotes |
protein |
T6054 |
2736-2737 |
DT |
denotes |
a |
T6053 |
2731-2732 |
-RRB- |
denotes |
) |
T6052 |
2723-2731 |
NN |
denotes |
EAA31204 |
T6051 |
2722-2723 |
-LRB- |
denotes |
( |
T6050 |
2733-2735 |
VBZ |
denotes |
is |
T6049 |
2717-2721 |
NN |
denotes |
NeuC |
T6048 |
2716-2782 |
sentence |
denotes |
NeuC (EAA31204) is a hypothetical protein from Neurospora crassa. |
T6047 |
2715-2716 |
. |
denotes |
. |
T6046 |
2714-2715 |
-RRB- |
denotes |
) |
T6045 |
2706-2714 |
NN |
denotes |
AAF33142 |
T6044 |
2705-2706 |
-LRB- |
denotes |
( |
T6043 |
2696-2704 |
NNP |
denotes |
glabrata |
T6042 |
2688-2695 |
NNP |
denotes |
Candida |
T6041 |
2683-2687 |
IN |
denotes |
from |
T6040 |
2675-2682 |
NN |
denotes |
protein |
T6039 |
2673-2674 |
DT |
denotes |
a |
T6038 |
2670-2672 |
VBZ |
denotes |
is |
T6037 |
2665-2669 |
NN |
denotes |
CanG |
T6036 |
2664-2716 |
sentence |
denotes |
CanG is a protein from Candida glabrata (AAF33142). |
T6035 |
2663-2664 |
. |
denotes |
. |
T6034 |
2662-2663 |
-RRB- |
denotes |
) |
T6033 |
2653-2662 |
NN |
denotes |
NP_014581 |
T6032 |
2652-2653 |
-LRB- |
denotes |
( |
T6031 |
2641-2651 |
NNP |
denotes |
cerevisiae |
T6030 |
2627-2640 |
NNP |
denotes |
Saccharomyces |
T6029 |
2622-2626 |
IN |
denotes |
from |
T6028 |
2614-2621 |
NN |
denotes |
protein |
T6027 |
2612-2613 |
DT |
denotes |
a |
T6026 |
2609-2611 |
VBZ |
denotes |
is |
T6025 |
2603-2608 |
NN |
denotes |
Amip3 |
T6024 |
2602-2664 |
sentence |
denotes |
Amip3 is a protein from Saccharomyces cerevisiae (NP_014581). |
T6023 |
2601-2602 |
. |
denotes |
. |
T6022 |
2594-2601 |
NNS |
denotes |
species |
T6021 |
2586-2593 |
JJ |
denotes |
various |
T6020 |
2583-2585 |
IN |
denotes |
in |
T6019 |
2576-2582 |
NN |
denotes |
domain |
T6018 |
2572-2575 |
NN |
denotes |
ACD |
T6017 |
2569-2571 |
IN |
denotes |
of |
T6016 |
2556-2568 |
NN |
denotes |
conservation |
T6015 |
2552-2555 |
DT |
denotes |
the |
T6014 |
2544-2551 |
VBG |
denotes |
showing |
T6013 |
2525-2533 |
NN |
denotes |
sequence |
T6012 |
2534-2543 |
NN |
denotes |
alignment |
T6011 |
2520-2524 |
NN |
denotes |
acid |
T6010 |
2514-2519 |
NN |
denotes |
Amino |
T5926 |
2502-2503 |
. |
denotes |
. |
T5925 |
2493-2502 |
NN |
denotes |
alignment |
T5924 |
2489-2492 |
DT |
denotes |
the |
T5923 |
2485-2488 |
IN |
denotes |
for |
T5922 |
2480-2484 |
NNS |
denotes |
gaps |
T5921 |
2470-2479 |
VBP |
denotes |
represent |
T5920 |
2464-2469 |
NNS |
denotes |
lines |
T5919 |
2460-2463 |
NN |
denotes |
Dot |
T5918 |
2459-2503 |
sentence |
denotes |
Dot lines represent gaps for the alignment. |
T5917 |
2458-2459 |
. |
denotes |
. |
T5916 |
2454-2458 |
JJ |
denotes |
grey |
T5915 |
2442-2446 |
VBD |
denotes |
were |
T5914 |
2433-2441 |
NN |
denotes |
proteins |
T5913 |
2429-2432 |
DT |
denotes |
the |
T5912 |
2426-2428 |
IN |
denotes |
of |
T5911 |
2421-2425 |
JJS |
denotes |
most |
T5910 |
2418-2420 |
IN |
denotes |
in |
T5909 |
2407-2417 |
NNS |
denotes |
homologies |
T5908 |
2400-2406 |
JJ |
denotes |
strong |
T5907 |
2395-2399 |
RB |
denotes |
very |
T5906 |
2390-2394 |
IN |
denotes |
with |
T5905 |
2384-2389 |
NNS |
denotes |
acids |
T5904 |
2378-2383 |
NN |
denotes |
amino |
T5903 |
2375-2377 |
CC |
denotes |
or |
T5902 |
2447-2453 |
VBN |
denotes |
shaded |
T5901 |
2363-2368 |
NN |
denotes |
amino |
T5900 |
2369-2374 |
NNS |
denotes |
acids |
T5899 |
2353-2362 |
JJ |
denotes |
Identical |
T5898 |
2352-2459 |
sentence |
denotes |
Identical amino acids or amino acids with very strong homologies in most of the proteins were shaded grey. |
T5897 |
2351-2352 |
. |
denotes |
. |
T5896 |
2346-2351 |
JJ |
denotes |
black |
T5895 |
2334-2338 |
VBD |
denotes |
were |
T5894 |
2325-2333 |
NN |
denotes |
proteins |
T5893 |
2321-2324 |
DT |
denotes |
all |
T5892 |
2315-2320 |
IN |
denotes |
among |
T5891 |
2304-2314 |
NNS |
denotes |
homologies |
T5873 |
2226-2227 |
-RRB- |
denotes |
) |
T5872 |
2221-2226 |
NN |
denotes |
Acdp3 |
T5871 |
2220-2221 |
-LRB- |
denotes |
( |
T5870 |
2211-2219 |
NN |
denotes |
AF216964 |
T5869 |
2209-2211 |
, |
denotes |
, |
T5868 |
2208-2209 |
-RRB- |
denotes |
) |
T5867 |
2203-2208 |
NN |
denotes |
Acdp2 |
T5866 |
2202-2203 |
-LRB- |
denotes |
( |
T5865 |
2193-2201 |
NN |
denotes |
AF216961 |
T5864 |
2191-2193 |
, |
denotes |
, |
T5863 |
2190-2191 |
-RRB- |
denotes |
) |
T5862 |
2185-2190 |
NN |
denotes |
Acdp1 |
T5861 |
2184-2185 |
-LRB- |
denotes |
( |
T5860 |
2168-2174 |
NN |
denotes |
number |
T5859 |
2175-2183 |
NN |
denotes |
AF202994 |
T5858 |
2158-2167 |
NN |
denotes |
accession |
T5857 |
2152-2157 |
IN |
denotes |
under |
T5856 |
2144-2151 |
NNP |
denotes |
GenBank |
T5855 |
2141-2143 |
IN |
denotes |
in |
T5854 |
2126-2130 |
VBN |
denotes |
been |
T5853 |
2121-2125 |
VBP |
denotes |
have |
T5852 |
2110-2114 |
NN |
denotes |
Acdp |
T5851 |
2115-2120 |
NNS |
denotes |
genes |
T5850 |
2106-2109 |
DT |
denotes |
the |
T5849 |
2102-2105 |
IN |
denotes |
for |
T5848 |
2131-2140 |
VBN |
denotes |
deposited |
T5847 |
2088-2096 |
NN |
denotes |
sequence |
T5846 |
2097-2101 |
NNS |
denotes |
data |
T5845 |
2084-2087 |
DT |
denotes |
The |
T5844 |
2083-2249 |
sentence |
denotes |
The sequence data for the Acdp genes have been deposited in GenBank under accession number AF202994 (Acdp1), AF216961 (Acdp2), AF216964 (Acdp3) and AF216963 (Acdp4). |
T5843 |
2082-2083 |
. |
denotes |
. |
T5842 |
2072-2075 |
NN |
denotes |
ACD |
T5841 |
2076-2082 |
NN |
denotes |
domain |
T5840 |
2068-2071 |
DT |
denotes |
the |
T5839 |
2061-2067 |
IN |
denotes |
within |
T5838 |
2050-2054 |
NN |
denotes |
Acdp |
T5837 |
2046-2049 |
CC |
denotes |
and |
T5836 |
2041-2045 |
NN |
denotes |
ACDP |
T5835 |
2055-2060 |
NNS |
denotes |
genes |
T5834 |
2037-2040 |
DT |
denotes |
the |
T5833 |
2034-2036 |
IN |
denotes |
of |
T5832 |
2030-2033 |
DT |
denotes |
all |
T5831 |
2026-2029 |
IN |
denotes |
for |
T5830 |
2007-2015 |
NN |
denotes |
homology |
T5829 |
2016-2025 |
NN |
denotes |
alignment |
T5828 |
1998-2006 |
NN |
denotes |
sequence |
T5827 |
1993-1997 |
NN |
denotes |
acid |
T5826 |
1987-1992 |
NN |
denotes |
Amino |
T6248 |
4331-4336 |
VBZ |
denotes |
shows |
T6247 |
4326-4330 |
NN |
denotes |
plot |
T6246 |
4322-4325 |
DT |
denotes |
The |
T6245 |
4321-4393 |
sentence |
denotes |
The plot shows the posterior probabilities of inside /outside/TM helix. |
T6244 |
4320-4321 |
. |
denotes |
. |
T6243 |
4306-4311 |
NN |
denotes |
TMHMM |
T6242 |
4312-4319 |
NN |
denotes |
program |
T6241 |
4302-4305 |
DT |
denotes |
the |
T6240 |
4299-4301 |
IN |
denotes |
by |
T6239 |
4284-4288 |
VBD |
denotes |
were |
T6238 |
4289-4298 |
VBN |
denotes |
predicted |
T6237 |
4276-4283 |
NNS |
denotes |
domains |
T6236 |
4262-4275 |
NN |
denotes |
Transmembrane |
T6235 |
4261-4321 |
sentence |
denotes |
Transmembrane domains were predicted by the TMHMM program . |
T6234 |
4260-4261 |
. |
denotes |
. |
T6233 |
4253-4260 |
NN |
denotes |
protein |
T6232 |
4247-4252 |
NN |
denotes |
Acdp4 |
T6231 |
4240-4246 |
IN |
denotes |
within |
T6230 |
4218-4231 |
NN |
denotes |
transmembrane |
T6229 |
4232-4239 |
NNS |
denotes |
domains |
T6228 |
4213-4217 |
CD |
denotes |
Four |
T6201 |
3530-3531 |
. |
denotes |
. |
T6200 |
3529-3530 |
CD |
denotes |
3 |
T6199 |
3522-3528 |
NN |
denotes |
figure |
T6198 |
3519-3521 |
IN |
denotes |
in |
T6197 |
3509-3518 |
VBN |
denotes |
presented |
T6196 |
3506-3508 |
IN |
denotes |
as |
T6195 |
3501-3505 |
JJ |
denotes |
same |
T6194 |
3497-3500 |
DT |
denotes |
the |
T6193 |
3485-3492 |
NN |
denotes |
protein |
T6192 |
3480-3484 |
DT |
denotes |
each |
T6191 |
3476-3479 |
IN |
denotes |
for |
T6190 |
3493-3496 |
VBP |
denotes |
are |
T6189 |
3462-3475 |
NNS |
denotes |
Abbreviations |
T6188 |
3461-3531 |
sentence |
denotes |
Abbreviations for each protein are the same as presented in figure 3. |
T6187 |
3460-3461 |
. |
denotes |
. |
T6186 |
3442-3450 |
JJ |
denotes |
selected |
T6185 |
3451-3460 |
NNS |
denotes |
sequences |
T6184 |
3438-3441 |
DT |
denotes |
the |
T6183 |
3434-3437 |
IN |
denotes |
for |
T6182 |
3423-3427 |
JJS |
denotes |
best |
T6181 |
3428-3433 |
NN |
denotes |
match |
T6180 |
3419-3422 |
DT |
denotes |
the |
T6179 |
3416-3418 |
IN |
denotes |
of |
T6178 |
3404-3415 |
NN |
denotes |
calculation |
T6177 |
3400-3403 |
DT |
denotes |
the |
T6176 |
3397-3399 |
IN |
denotes |
to |
T6175 |
3387-3396 |
VBG |
denotes |
according |
T6174 |
3371-3374 |
VBD |
denotes |
was |
T6173 |
3375-3386 |
VBN |
denotes |
constructed |
T6172 |
3353-3365 |
JJ |
denotes |
phylogenetic |
T6171 |
3366-3370 |
NN |
denotes |
tree |
T6170 |
3349-3352 |
DT |
denotes |
The |
T6169 |
3348-3461 |
sentence |
denotes |
The phylogenetic tree was constructed according to the calculation of the best match for the selected sequences. |
T6168 |
3347-3348 |
. |
denotes |
. |
T6167 |
3346-3347 |
CD |
denotes |
3 |
T6166 |
3342-3345 |
CC |
denotes |
and |
T6165 |
3340-3341 |
CD |
denotes |
2 |
T6164 |
3333-3339 |
NN |
denotes |
figure |
T6163 |
3328-3332 |
IN |
denotes |
from |
T6162 |
3317-3320 |
NN |
denotes |
ACD |
T6161 |
3321-3327 |
NN |
denotes |
domain |
T6160 |
3313-3316 |
DT |
denotes |
the |
T6159 |
3302-3312 |
VBG |
denotes |
containing |
T6158 |
3293-3301 |
NN |
denotes |
proteins |
T6157 |
3287-3292 |
IN |
denotes |
among |
T6156 |
3273-3286 |
NNS |
denotes |
relationships |
T6155 |
3265-3272 |
VBG |
denotes |
showing |
T6154 |
3260-3264 |
NN |
denotes |
tree |
T6153 |
3247-3259 |
JJ |
denotes |
Phylogenetic |
T6145 |
3235-3236 |
. |
denotes |
. |
T6144 |
3234-3235 |
-RRB- |
denotes |
) |
T6143 |
3223-3234 |
NN |
denotes |
ZP_00038107 |
T6142 |
3222-3223 |
-LRB- |
denotes |
( |
T6141 |
3205-3215 |
NNP |
denotes |
fastidiosa |
T6140 |
3197-3204 |
NNP |
denotes |
Xylella |
T6139 |
3216-3221 |
NNP |
denotes |
Dixon |
T6138 |
3193-3196 |
DT |
denotes |
the |
T6137 |
3188-3192 |
IN |
denotes |
from |
T6136 |
3167-3179 |
JJ |
denotes |
hypothetical |
T6135 |
3180-3187 |
NN |
denotes |
protein |
T6134 |
3165-3166 |
DT |
denotes |
a |
T6133 |
3162-3164 |
VBZ |
denotes |
is |
T6132 |
3157-3161 |
NN |
denotes |
XyFD |
T6131 |
3156-3236 |
sentence |
denotes |
XyFD is a hypothetical protein from the Xylella fastidiosa Dixon (ZP_00038107). |
T6130 |
3155-3156 |
. |
denotes |
. |
T6129 |
3134-3144 |
NNP |
denotes |
Shewanella |
T6128 |
3145-3155 |
NNP |
denotes |
oneidensis |
T6127 |
3130-3133 |
DT |
denotes |
the |
T6126 |
3125-3129 |
IN |
denotes |
from |
T6125 |
3110-3116 |
NN |
denotes |
efflux |
T6124 |
3117-3124 |
NN |
denotes |
protein |
T6123 |
3103-3109 |
NN |
denotes |
cobalt |
T6122 |
3099-3102 |
CC |
denotes |
and |
T6121 |
3089-3098 |
NN |
denotes |
magnesium |
T6120 |
3080-3088 |
NNS |
denotes |
bacteria |
T6119 |
3069-3079 |
VBZ |
denotes |
represents |
T6118 |
3064-3068 |
NN |
denotes |
CorC |
T6117 |
3063-3156 |
sentence |
denotes |
CorC represents bacteria magnesium and cobalt efflux protein from the Shewanella oneidensis. |
T6116 |
3062-3063 |
. |
denotes |
. |
T6115 |
3041-3054 |
NNP |
denotes |
Caenohabditis |
T6114 |
3055-3062 |
NNP |
denotes |
elegans |
T6113 |
3037-3040 |
DT |
denotes |
the |
T6112 |
3032-3036 |
IN |
denotes |
from |
T6111 |
3011-3023 |
JJ |
denotes |
hypothetical |
T6110 |
3024-3031 |
NN |
denotes |
protein |
T6109 |
3009-3010 |
DT |
denotes |
a |
T6108 |
3004-3005 |
-RRB- |
denotes |
) |
T6107 |
2996-3004 |
NN |
denotes |
AAK77203 |
T5890 |
2297-2303 |
JJ |
denotes |
strong |
T5889 |
2292-2296 |
RB |
denotes |
very |
T5888 |
2287-2291 |
IN |
denotes |
with |
T5887 |
2281-2286 |
NNS |
denotes |
acids |
T5886 |
2275-2280 |
NN |
denotes |
amino |
T5885 |
2272-2274 |
CC |
denotes |
or |
T5884 |
2339-2345 |
VBN |
denotes |
shaded |
T5883 |
2260-2265 |
NN |
denotes |
amino |
T5882 |
2266-2271 |
NNS |
denotes |
acids |
T5881 |
2250-2259 |
JJ |
denotes |
Identical |
T5880 |
2249-2352 |
sentence |
denotes |
Identical amino acids or amino acids with very strong homologies among all proteins were shaded black. |
T5879 |
2248-2249 |
. |
denotes |
. |
T5878 |
2247-2248 |
-RRB- |
denotes |
) |
T5877 |
2242-2247 |
NN |
denotes |
Acdp4 |
T5876 |
2241-2242 |
-LRB- |
denotes |
( |
T5875 |
2232-2240 |
NN |
denotes |
AF216963 |
T5874 |
2228-2231 |
CC |
denotes |
and |
T2645 |
5200-5201 |
. |
denotes |
. |
T2644 |
5199-5200 |
-RRB- |
denotes |
) |
T2643 |
5193-5195 |
: |
denotes |
: |
T2642 |
5191-5193 |
CD |
denotes |
10 |
T2641 |
5190-5191 |
SYM |
denotes |
– |
T2640 |
5189-5190 |
CD |
denotes |
7 |
T2639 |
5195-5199 |
NN |
denotes |
GGRR |
T2638 |
5188-5189 |
-LRB- |
denotes |
( |
T2637 |
5173-5182 |
NN |
denotes |
amidation |
T2636 |
5183-5187 |
NN |
denotes |
site |
T2635 |
5170-5172 |
DT |
denotes |
an |
T2634 |
5166-5169 |
CC |
denotes |
and |
T2633 |
5164-5165 |
-RRB- |
denotes |
) |
T2632 |
5140-5142 |
: |
denotes |
: |
T2631 |
5137-5140 |
CD |
denotes |
206 |
T2630 |
5136-5137 |
SYM |
denotes |
– |
T2629 |
5133-5136 |
CD |
denotes |
185 |
T2628 |
5142-5164 |
NN |
denotes |
LVMVLLVLSGIFSGLNLGLMAL |
T2627 |
5132-5133 |
-LRB- |
denotes |
( |
T2626 |
5117-5123 |
NN |
denotes |
zipper |
T2625 |
5109-5116 |
NN |
denotes |
leucine |
T2624 |
5124-5131 |
NN |
denotes |
pattern |
T2623 |
5107-5108 |
DT |
denotes |
a |
T2622 |
5098-5106 |
VBZ |
denotes |
contains |
T2621 |
5092-5097 |
NN |
denotes |
Acdp4 |
T2620 |
5091-5201 |
sentence |
denotes |
Acdp4 contains a leucine zipper pattern (185–206: LVMVLLVLSGIFSGLNLGLMAL) and an amidation site (7–10: GGRR). |
T2619 |
5090-5091 |
. |
denotes |
. |
T2618 |
5089-5090 |
-RRB- |
denotes |
) |
T2617 |
5086-5089 |
CD |
denotes |
299 |
T2616 |
5085-5086 |
SYM |
denotes |
– |
T2615 |
5082-5085 |
CD |
denotes |
201 |
T2614 |
5081-5082 |
-LRB- |
denotes |
( |
T2613 |
5068-5069 |
HYPH |
denotes |
- |
T2612 |
5069-5073 |
JJ |
denotes |
rich |
T2611 |
5061-5068 |
NN |
denotes |
leucine |
T2610 |
5055-5060 |
JJ |
denotes |
large |
T2609 |
5074-5080 |
NN |
denotes |
region |
T2608 |
5053-5054 |
DT |
denotes |
a |
T2607 |
5049-5052 |
CC |
denotes |
and |
T2606 |
5047-5048 |
-RRB- |
denotes |
) |
T2605 |
5044-5047 |
CD |
denotes |
261 |
T2604 |
5043-5044 |
SYM |
denotes |
– |
T2603 |
5042-5043 |
CD |
denotes |
2 |
T2602 |
5041-5042 |
-LRB- |
denotes |
( |
T2601 |
5028-5029 |
HYPH |
denotes |
- |
T2600 |
5029-5033 |
JJ |
denotes |
rich |
T2599 |
5021-5028 |
NN |
denotes |
alanine |
T2598 |
5015-5020 |
JJ |
denotes |
large |
T2597 |
5034-5040 |
NN |
denotes |
region |
T2596 |
5013-5014 |
DT |
denotes |
a |
T2595 |
5003-5012 |
VBZ |
denotes |
possesses |
T2594 |
4997-5002 |
NN |
denotes |
Acdp3 |
T2593 |
4996-5091 |
sentence |
denotes |
Acdp3 possesses a large alanine-rich region (2–261) and a large leucine-rich region (201–299). |
T2592 |
4995-4996 |
. |
denotes |
. |
T2591 |
4994-4995 |
-RRB- |
denotes |
) |
T2590 |
4970-4972 |
: |
denotes |
: |
T2589 |
4967-4970 |
CD |
denotes |
222 |
T2588 |
4966-4967 |
SYM |
denotes |
– |
T2587 |
4963-4966 |
CD |
denotes |
201 |
T2586 |
4972-4994 |
NN |
denotes |
GAGGSGSASGTVGGKGGAGVAG |
T2585 |
4962-4963 |
-LRB- |
denotes |
( |
T2584 |
4949-4950 |
HYPH |
denotes |
- |
T2583 |
4950-4954 |
JJ |
denotes |
rich |
T2582 |
4942-4949 |
NN |
denotes |
glycine |
T2581 |
4955-4961 |
NN |
denotes |
region |
T2580 |
4940-4941 |
DT |
denotes |
a |
T2579 |
4936-4939 |
VBZ |
denotes |
has |
T2578 |
4930-4935 |
NN |
denotes |
Acdp2 |
T2577 |
4929-4996 |
sentence |
denotes |
Acdp2 has a glycine-rich region (201–222: GAGGSGSASGTVGGKGGAGVAG). |
T2576 |
4928-4929 |
. |
denotes |
. |
T2575 |
4927-4928 |
-RRB- |
denotes |
) |
T2574 |
4921-4923 |
: |
denotes |
: |
T2573 |
4918-4921 |
CD |
denotes |
929 |
T2572 |
4917-4918 |
SYM |
denotes |
– |
T2571 |
4914-4917 |
CD |
denotes |
926 |
T2570 |
4912-4913 |
: |
denotes |
; |
T2569 |
4906-4908 |
: |
denotes |
: |
T2568 |
4903-4906 |
CD |
denotes |
920 |
T2567 |
4902-4903 |
SYM |
denotes |
– |
T2566 |
4908-4912 |
NN |
denotes |
MGKK |
T2565 |
4899-4902 |
CD |
denotes |
917 |
T2564 |
4923-4927 |
NN |
denotes |
SGRK |
T2563 |
4898-4899 |
-LRB- |
denotes |
( |
T2562 |
4882-4891 |
NN |
denotes |
amidation |
T2561 |
4892-4897 |
NNS |
denotes |
sites |
T2560 |
4878-4881 |
CD |
denotes |
two |
T2559 |
4874-4877 |
CC |
denotes |
and |
T2558 |
4872-4874 |
, |
denotes |
, |
T2557 |
4871-4872 |
-RRB- |
denotes |
) |
T2556 |
4817-4833 |
NN |
denotes |
PGPPVPAAPVPAPSLA |
T2555 |
4815-4817 |
: |
denotes |
: |
T2554 |
4812-4815 |
CD |
denotes |
130 |
T2553 |
4811-4812 |
SYM |
denotes |
– |
T2552 |
4809-4811 |
CD |
denotes |
78 |
T2551 |
4834-4871 |
NN |
denotes |
PGENGTGDWAPRLVFIEEPPGAGGAAPSAVPTRPPGP |
T2550 |
4808-4809 |
-LRB- |
denotes |
( |
T2549 |
4795-4796 |
HYPH |
denotes |
- |
T2548 |
4796-4800 |
JJ |
denotes |
rich |
T2547 |
4788-4795 |
NN |
denotes |
Proline |
T2546 |
4801-4807 |
NN |
denotes |
region |
T2545 |
4786-4787 |
DT |
denotes |
a |
T2544 |
4784-4786 |
, |
denotes |
, |
T2543 |
4783-4784 |
-RRB- |
denotes |
) |
T2542 |
4728-4764 |
NN |
denotes |
LLRVRPRLYGPGGDLLPPAWLRALGALLLLALSALF |
T2541 |
4726-4728 |
: |
denotes |
: |
T2540 |
4723-4726 |
CD |
denotes |
257 |
T2539 |
4722-4723 |
SYM |
denotes |
– |
T2538 |
4719-4722 |
CD |
denotes |
204 |
T2537 |
4765-4783 |
NN |
denotes |
SGLRLSLLSLDPVELRVL |
T2536 |
4718-4719 |
-LRB- |
denotes |
( |
T2535 |
4705-4706 |
HYPH |
denotes |
- |
T2534 |
4706-4710 |
JJ |
denotes |
rich |
T2533 |
4698-4705 |
NN |
denotes |
Leucine |
T2532 |
4711-4717 |
NN |
denotes |
region |
T2531 |
4696-4697 |
DT |
denotes |
a |
T2530 |
4694-4696 |
, |
denotes |
, |
T2529 |
4693-4694 |
-RRB- |
denotes |
) |
T2528 |
4682-4684 |
: |
denotes |
: |
T2527 |
4680-4682 |
CD |
denotes |
10 |
T2526 |
4679-4680 |
SYM |
denotes |
– |
T2525 |
4678-4679 |
CD |
denotes |
2 |
T2524 |
4684-4693 |
NN |
denotes |
AAAAAAAAA |
T2523 |
4677-4678 |
-LRB- |
denotes |
( |
T2522 |
4664-4665 |
HYPH |
denotes |
- |
T2521 |
4665-4669 |
JJ |
denotes |
rich |
T2520 |
4657-4664 |
NN |
denotes |
Alanine |
T2519 |
4670-4676 |
NN |
denotes |
region |
T2518 |
4654-4656 |
DT |
denotes |
an |
T2517 |
4639-4644 |
NN |
denotes |
Acdp1 |
T2516 |
4637-4639 |
, |
denotes |
, |
T2422 |
3782-3784 |
IN |
denotes |
in |
T2421 |
3769-3775 |
RB |
denotes |
mostly |
T2420 |
3765-3768 |
VBP |
denotes |
are |
T2419 |
3776-3781 |
VBN |
denotes |
found |
T2418 |
3760-3764 |
WDT |
denotes |
that |
T2417 |
3738-3751 |
JJ |
denotes |
intracellular |
T2416 |
3752-3759 |
NNS |
denotes |
modules |
T2415 |
3732-3737 |
JJ |
denotes |
small |
T2414 |
3728-3731 |
VBP |
denotes |
are |
T2413 |
3720-3727 |
NNS |
denotes |
domains |
T2412 |
3716-3719 |
NN |
denotes |
CBS |
T2411 |
3715-3819 |
sentence |
denotes |
CBS domains are small intracellular modules that are mostly found in 2 or four copies within a protein. |
T2410 |
3714-3715 |
. |
denotes |
. |
T2409 |
3700-3705 |
NN |
denotes |
Amip3 |
T2408 |
3694-3699 |
NN |
denotes |
yeast |
T2407 |
3690-3693 |
CC |
denotes |
and |
T2406 |
3706-3714 |
NN |
denotes |
proteins |
T2405 |
3685-3689 |
NN |
denotes |
CorC |
T2404 |
3676-3684 |
NNS |
denotes |
bacteria |
T2403 |
3673-3675 |
IN |
denotes |
in |
T2402 |
3663-3666 |
VBP |
denotes |
are |
T2401 |
3667-3672 |
VBN |
denotes |
found |
T2400 |
3658-3662 |
WDT |
denotes |
that |
T2399 |
3645-3650 |
NN |
denotes |
DUF21 |
T2398 |
3651-3657 |
NN |
denotes |
domain |
T2397 |
3643-3644 |
DT |
denotes |
a |
T2396 |
3639-3642 |
CC |
denotes |
and |
T2395 |
3627-3630 |
NN |
denotes |
CBS |
T2394 |
3631-3638 |
NNS |
denotes |
domains |
T2393 |
3623-3626 |
CD |
denotes |
two |
T2392 |
3621-3623 |
, |
denotes |
, |
T2497 |
4123-4124 |
-LRB- |
denotes |
( |
T2496 |
4176-4181 |
VBN |
denotes |
found |
T2495 |
4107-4108 |
HYPH |
denotes |
- |
T2494 |
4108-4115 |
VBG |
denotes |
binding |
T2493 |
4103-4107 |
NN |
denotes |
cNMP |
T2492 |
4116-4122 |
NN |
denotes |
domain |
T2491 |
4101-4102 |
DT |
denotes |
A |
T2490 |
4100-4202 |
sentence |
denotes |
A cNMP-binding domain (cyclic nucleotide-monophosphate-binding domain) was found in all Acdp members. |
T2489 |
4099-4100 |
. |
denotes |
. |
T2488 |
4087-4090 |
NN |
denotes |
CBS |
T2487 |
4073-4086 |
JJ |
denotes |
intracellular |
T2486 |
4091-4098 |
NNS |
denotes |
domains |
T2485 |
4069-4072 |
CD |
denotes |
two |
T2484 |
4066-4068 |
IN |
denotes |
to |
T2483 |
4057-4065 |
JJ |
denotes |
adjacent |
T2482 |
4048-4056 |
NN |
denotes |
proteins |
T2481 |
4044-4047 |
DT |
denotes |
the |
T2480 |
4041-4043 |
IN |
denotes |
of |
T2479 |
4031-4032 |
HYPH |
denotes |
- |
T2478 |
4030-4031 |
NN |
denotes |
N |
T2477 |
4032-4040 |
NN |
denotes |
terminus |
T2476 |
4026-4029 |
DT |
denotes |
the |
T2475 |
4023-4025 |
IN |
denotes |
in |
T2474 |
4012-4014 |
VB |
denotes |
be |
T2473 |
4015-4022 |
VBN |
denotes |
located |
T2472 |
4009-4011 |
TO |
denotes |
to |
T2471 |
4003-4008 |
VBN |
denotes |
found |
T2470 |
3999-4002 |
CC |
denotes |
and |
T2469 |
3978-3991 |
JJ |
denotes |
transmembrane |
T2468 |
3992-3998 |
NN |
denotes |
region |
T2467 |
3976-3977 |
DT |
denotes |
a |
T2466 |
3973-3975 |
VBZ |
denotes |
is |
T2465 |
3966-3972 |
NN |
denotes |
domain |
T2464 |
3961-3965 |
DT |
denotes |
This |
T2463 |
3960-4100 |
sentence |
denotes |
This domain is a transmembrane region and found to be located in the N-terminus of the proteins adjacent to two intracellular CBS domains . |
T2462 |
3959-3960 |
. |
denotes |
. |
T2461 |
3951-3959 |
NN |
denotes |
function |
T2460 |
3943-3950 |
JJ |
denotes |
unknown |
T2459 |
3938-3942 |
IN |
denotes |
with |
T2458 |
3923-3930 |
VBN |
denotes |
defined |
T2457 |
3917-3922 |
RB |
denotes |
newly |
T2456 |
3931-3937 |
NN |
denotes |
domain |
T2455 |
3915-3916 |
DT |
denotes |
a |
T2454 |
3910-3911 |
-RRB- |
denotes |
) |
T2453 |
3897-3899 |
: |
denotes |
: |
T2452 |
3895-3897 |
NN |
denotes |
CD |
T2451 |
3899-3910 |
NN |
denotes |
pfam01959.9 |
T2450 |
3894-3895 |
-LRB- |
denotes |
( |
T2449 |
3912-3914 |
VBZ |
denotes |
is |
T2448 |
3888-3893 |
NN |
denotes |
DUF21 |
T2447 |
3887-3960 |
sentence |
denotes |
DUF21 (CD: pfam01959.9) is a newly defined domain with unknown function. |
T2446 |
3886-3887 |
. |
denotes |
. |
T2445 |
3885-3886 |
-RRB- |
denotes |
] |
T2444 |
3884-3885 |
CD |
denotes |
8 |
T2443 |
3883-3884 |
-LRB- |
denotes |
[ |
T2442 |
3867-3875 |
JJ |
denotes |
globular |
T2441 |
3860-3866 |
JJ |
denotes |
stable |
T2440 |
3876-3882 |
NN |
denotes |
domain |
T2439 |
3858-3859 |
DT |
denotes |
a |
T2438 |
3853-3857 |
VB |
denotes |
form |
T2437 |
3850-3852 |
TO |
denotes |
to |
T2436 |
3833-3840 |
NNS |
denotes |
domains |
T2435 |
3829-3832 |
NN |
denotes |
CBS |
T2434 |
3826-3828 |
IN |
denotes |
of |
T2433 |
3841-3849 |
VBP |
denotes |
dimerise |
T2432 |
3820-3825 |
NNS |
denotes |
Pairs |
T2431 |
3819-3887 |
sentence |
denotes |
Pairs of CBS domains dimerise to form a stable globular domain [8]. |
T2430 |
3818-3819 |
. |
denotes |
. |
T2429 |
3811-3818 |
NN |
denotes |
protein |
T2428 |
3809-3810 |
DT |
denotes |
a |
T2427 |
3802-3808 |
IN |
denotes |
within |
T2426 |
3790-3794 |
CD |
denotes |
four |
T2425 |
3787-3789 |
CC |
denotes |
or |
T2424 |
3795-3801 |
NNS |
denotes |
copies |
T2423 |
3785-3786 |
CD |
denotes |
2 |
T2019 |
0-8 |
NN |
denotes |
Sequence |
T2020 |
9-17 |
NN |
denotes |
homology |
T2021 |
18-21 |
CC |
denotes |
and |
T2022 |
22-31 |
JJ |
denotes |
molecular |
T2023 |
32-47 |
NNS |
denotes |
characteristics |
T2024 |
47-169 |
sentence |
denotes |
The mouse Acdp genes showed very strong homologies of both nucleotide and AA sequences to the human ACDP genes (Table 1). |
T2025 |
48-51 |
DT |
denotes |
The |
T2026 |
63-68 |
NNS |
denotes |
genes |
T2027 |
52-57 |
NN |
denotes |
mouse |
T2028 |
58-62 |
NN |
denotes |
Acdp |
T2029 |
69-75 |
VBD |
denotes |
showed |
T2030 |
76-80 |
RB |
denotes |
very |
T2031 |
81-87 |
JJ |
denotes |
strong |
T2032 |
88-98 |
NNS |
denotes |
homologies |
T2033 |
99-101 |
IN |
denotes |
of |
T2034 |
102-106 |
CC |
denotes |
both |
T2035 |
107-117 |
NN |
denotes |
nucleotide |
T2036 |
125-134 |
NNS |
denotes |
sequences |
T2037 |
118-121 |
CC |
denotes |
and |
T2038 |
122-124 |
NN |
denotes |
AA |
T2039 |
135-137 |
IN |
denotes |
to |
T2040 |
138-141 |
DT |
denotes |
the |
T2041 |
153-158 |
NNS |
denotes |
genes |
T2042 |
142-147 |
JJ |
denotes |
human |
T2043 |
148-152 |
NN |
denotes |
ACDP |
T2044 |
159-160 |
-LRB- |
denotes |
( |
T2045 |
160-165 |
NN |
denotes |
Table |
T2046 |
166-167 |
CD |
denotes |
1 |
T2047 |
167-168 |
-RRB- |
denotes |
) |
T2048 |
168-169 |
. |
denotes |
. |
T2049 |
169-330 |
sentence |
denotes |
The highest homologies were observed between the human ACDP2 and the mouse Acdp2 gene (91% of nucleotide identity, 97% of AA identity and 99.4% of AA homology). |
T2050 |
170-173 |
DT |
denotes |
The |
T2051 |
182-192 |
NNS |
denotes |
homologies |
T2052 |
174-181 |
JJS |
denotes |
highest |
T2053 |
198-206 |
VBN |
denotes |
observed |
T2054 |
193-197 |
VBD |
denotes |
were |
T2055 |
207-214 |
IN |
denotes |
between |
T2056 |
215-218 |
DT |
denotes |
the |
T2057 |
225-230 |
NN |
denotes |
ACDP2 |
T2058 |
219-224 |
JJ |
denotes |
human |
T2059 |
231-234 |
CC |
denotes |
and |
T2060 |
235-238 |
DT |
denotes |
the |
T2061 |
245-250 |
NN |
denotes |
Acdp2 |
T2062 |
239-244 |
NN |
denotes |
mouse |
T2063 |
251-255 |
NN |
denotes |
gene |
T2064 |
256-257 |
-LRB- |
denotes |
( |
T2065 |
259-260 |
NN |
denotes |
% |
T2066 |
257-259 |
CD |
denotes |
91 |
T2067 |
261-263 |
IN |
denotes |
of |
T2068 |
264-274 |
NN |
denotes |
nucleotide |
T2069 |
275-283 |
NN |
denotes |
identity |
T2070 |
283-285 |
, |
denotes |
, |
T2071 |
285-287 |
CD |
denotes |
97 |
T2072 |
287-288 |
NN |
denotes |
% |
T2073 |
289-291 |
IN |
denotes |
of |
T2074 |
292-294 |
NN |
denotes |
AA |
T2075 |
295-303 |
NN |
denotes |
identity |
T2076 |
304-307 |
CC |
denotes |
and |
T2077 |
308-312 |
CD |
denotes |
99.4 |
T2078 |
312-313 |
NN |
denotes |
% |
T2079 |
314-316 |
IN |
denotes |
of |
T2080 |
317-319 |
NN |
denotes |
AA |
T2081 |
320-328 |
NN |
denotes |
homology |
T2082 |
328-329 |
-RRB- |
denotes |
) |
T2083 |
329-330 |
. |
denotes |
. |
T2084 |
330-551 |
sentence |
denotes |
In addition, the 5' UTR nucleotide sequences (20 bp of nucleotides before start codon) also showed high homologies to the human homologs, for example, the Acdp2 5' UTR sequence showed 95% identities to its human homolog. |
T2085 |
331-333 |
IN |
denotes |
In |
T2086 |
423-429 |
VBD |
denotes |
showed |
T2087 |
334-342 |
NN |
denotes |
addition |
T2088 |
342-344 |
, |
denotes |
, |
T2089 |
344-347 |
DT |
denotes |
the |
T2090 |
366-375 |
NNS |
denotes |
sequences |
T2091 |
348-349 |
CD |
denotes |
5 |
T2092 |
351-354 |
NN |
denotes |
UTR |
T2093 |
349-350 |
SYM |
denotes |
' |
T2094 |
355-365 |
NN |
denotes |
nucleotide |
T2095 |
376-377 |
-LRB- |
denotes |
( |
T2096 |
380-382 |
NNS |
denotes |
bp |
T2097 |
377-379 |
CD |
denotes |
20 |
T2098 |
383-385 |
IN |
denotes |
of |
T2099 |
386-397 |
NNS |
denotes |
nucleotides |
T2100 |
398-404 |
IN |
denotes |
before |
T2101 |
405-410 |
NN |
denotes |
start |
T2102 |
411-416 |
NN |
denotes |
codon |
T2103 |
416-417 |
-RRB- |
denotes |
) |
T2104 |
418-422 |
RB |
denotes |
also |
T2105 |
508-514 |
VBD |
denotes |
showed |
T2106 |
430-434 |
JJ |
denotes |
high |
T2107 |
435-445 |
NNS |
denotes |
homologies |
T2108 |
446-448 |
IN |
denotes |
to |
T2109 |
449-452 |
DT |
denotes |
the |
T2110 |
459-467 |
NNS |
denotes |
homologs |
T2111 |
453-458 |
JJ |
denotes |
human |
T2112 |
467-469 |
, |
denotes |
, |
T2113 |
469-472 |
IN |
denotes |
for |
T2114 |
473-480 |
NN |
denotes |
example |
T2115 |
480-482 |
, |
denotes |
, |
T2116 |
482-485 |
DT |
denotes |
the |
T2117 |
499-507 |
NN |
denotes |
sequence |
T2118 |
486-491 |
NN |
denotes |
Acdp2 |
T2119 |
492-493 |
CD |
denotes |
5 |
T2120 |
495-498 |
NN |
denotes |
UTR |
T2121 |
493-494 |
SYM |
denotes |
' |
T2122 |
515-517 |
CD |
denotes |
95 |
T2123 |
517-518 |
NN |
denotes |
% |
T2124 |
519-529 |
NNS |
denotes |
identities |
T2125 |
530-532 |
IN |
denotes |
to |
T2126 |
533-536 |
PRP$ |
denotes |
its |
T2127 |
543-550 |
NN |
denotes |
homolog |
T2128 |
537-542 |
JJ |
denotes |
human |
T2129 |
550-551 |
. |
denotes |
. |
T2130 |
551-733 |
sentence |
denotes |
However, the homologies in the 3' UTR sequences (20 bp of nucleotides after stop codon) were much lower (40–55%) for all Acdp genes except Acdp4 (90% identity to its human homolog). |
T2131 |
552-559 |
RB |
denotes |
However |
T2132 |
640-644 |
VBD |
denotes |
were |
T2133 |
559-561 |
, |
denotes |
, |
T2134 |
561-564 |
DT |
denotes |
the |
T2135 |
565-575 |
NNS |
denotes |
homologies |
T2136 |
576-578 |
IN |
denotes |
in |
T2137 |
579-582 |
DT |
denotes |
the |
T2138 |
590-599 |
NNS |
denotes |
sequences |
T2139 |
583-584 |
CD |
denotes |
3 |
T2140 |
586-589 |
NN |
denotes |
UTR |
T2141 |
584-585 |
SYM |
denotes |
' |
T2142 |
600-601 |
-LRB- |
denotes |
( |
T2143 |
604-606 |
NNS |
denotes |
bp |
T2144 |
601-603 |
CD |
denotes |
20 |
T2145 |
607-609 |
IN |
denotes |
of |
T2146 |
610-621 |
NNS |
denotes |
nucleotides |
T2147 |
622-627 |
IN |
denotes |
after |
T2148 |
628-632 |
NN |
denotes |
stop |
T2149 |
633-638 |
NN |
denotes |
codon |
T2150 |
638-639 |
-RRB- |
denotes |
) |
T2151 |
645-649 |
RB |
denotes |
much |
T2152 |
650-655 |
JJR |
denotes |
lower |
T2153 |
656-657 |
-LRB- |
denotes |
( |
T2154 |
662-663 |
NN |
denotes |
% |
T2155 |
657-659 |
CD |
denotes |
40 |
T2156 |
660-662 |
CD |
denotes |
55 |
T2157 |
659-660 |
SYM |
denotes |
– |
T2158 |
663-664 |
-RRB- |
denotes |
) |
T2159 |
665-668 |
IN |
denotes |
for |
T2160 |
669-672 |
DT |
denotes |
all |
T2161 |
678-683 |
NNS |
denotes |
genes |
T2162 |
673-677 |
NN |
denotes |
Acdp |
T2163 |
684-690 |
IN |
denotes |
except |
T2164 |
691-696 |
NN |
denotes |
Acdp4 |
T2165 |
697-698 |
-LRB- |
denotes |
( |
T2166 |
702-710 |
NN |
denotes |
identity |
T2167 |
698-700 |
CD |
denotes |
90 |
T2168 |
700-701 |
NN |
denotes |
% |
T2169 |
711-713 |
IN |
denotes |
to |
T2170 |
714-717 |
PRP$ |
denotes |
its |
T2171 |
724-731 |
NN |
denotes |
homolog |
T2172 |
718-723 |
JJ |
denotes |
human |
T2173 |
731-732 |
-RRB- |
denotes |
) |
T2174 |
732-733 |
. |
denotes |
. |
T2175 |
733-867 |
sentence |
denotes |
The ancient conserved domain (ACD) has 55.3% of AA identity and 83.3% of homology between all mouse and human ACDP proteins (Fig. 2). |
T2176 |
734-737 |
DT |
denotes |
The |
T2177 |
756-762 |
NN |
denotes |
domain |
T2178 |
738-745 |
JJ |
denotes |
ancient |
T2179 |
746-755 |
VBN |
denotes |
conserved |
T2180 |
769-772 |
VBZ |
denotes |
has |
T2181 |
763-764 |
-LRB- |
denotes |
( |
T2182 |
764-767 |
NN |
denotes |
ACD |
T2183 |
767-768 |
-RRB- |
denotes |
) |
T2184 |
773-777 |
CD |
denotes |
55.3 |
T2185 |
777-778 |
NN |
denotes |
% |
T2186 |
779-781 |
IN |
denotes |
of |
T2187 |
782-784 |
NN |
denotes |
AA |
T2188 |
785-793 |
NN |
denotes |
identity |
T2189 |
794-797 |
CC |
denotes |
and |
T2190 |
798-802 |
CD |
denotes |
83.3 |
T2191 |
802-803 |
NN |
denotes |
% |
T2192 |
804-806 |
IN |
denotes |
of |
T2193 |
807-815 |
NN |
denotes |
homology |
T2194 |
816-823 |
IN |
denotes |
between |
T2195 |
824-827 |
DT |
denotes |
all |
T2196 |
849-857 |
NN |
denotes |
proteins |
T2197 |
828-833 |
NN |
denotes |
mouse |
T2198 |
834-837 |
CC |
denotes |
and |
T2199 |
838-843 |
JJ |
denotes |
human |
T2200 |
844-848 |
NN |
denotes |
ACDP |
T2201 |
858-859 |
-LRB- |
denotes |
( |
T2202 |
859-863 |
NN |
denotes |
Fig. |
T2203 |
864-865 |
CD |
denotes |
2 |
T2204 |
865-866 |
-RRB- |
denotes |
) |
T2205 |
866-867 |
. |
denotes |
. |
T2206 |
867-1015 |
sentence |
denotes |
The ACD domain is evolutionarily conserved in divergent species ranging from bacteria, yeast, C. elegans, D. melanogaster, mouse to human (Fig. 3). |
T2207 |
868-871 |
DT |
denotes |
The |
T2208 |
876-882 |
NN |
denotes |
domain |
T2209 |
872-875 |
NN |
denotes |
ACD |
T2210 |
901-910 |
VBN |
denotes |
conserved |
T2211 |
883-885 |
VBZ |
denotes |
is |
T2212 |
886-900 |
RB |
denotes |
evolutionarily |
T2213 |
911-913 |
IN |
denotes |
in |
T2214 |
914-923 |
JJ |
denotes |
divergent |
T2215 |
924-931 |
NNS |
denotes |
species |
T2216 |
932-939 |
VBG |
denotes |
ranging |
T2217 |
940-944 |
IN |
denotes |
from |
T2218 |
945-953 |
NNS |
denotes |
bacteria |
T2219 |
953-955 |
, |
denotes |
, |
T2220 |
955-960 |
NN |
denotes |
yeast |
T2221 |
960-962 |
, |
denotes |
, |
T2222 |
962-964 |
NNP |
denotes |
C. |
T2223 |
965-972 |
NNP |
denotes |
elegans |
T2224 |
972-974 |
, |
denotes |
, |
T2225 |
974-976 |
NNP |
denotes |
D. |
T2226 |
977-989 |
NNP |
denotes |
melanogaster |
T2227 |
989-991 |
, |
denotes |
, |
T2228 |
991-996 |
NN |
denotes |
mouse |
T2229 |
997-999 |
IN |
denotes |
to |
T2230 |
1000-1005 |
JJ |
denotes |
human |
T2231 |
1006-1007 |
-LRB- |
denotes |
( |
T2232 |
1007-1011 |
NN |
denotes |
Fig. |
T2233 |
1012-1013 |
CD |
denotes |
3 |
T2234 |
1013-1014 |
-RRB- |
denotes |
) |
T2235 |
1014-1015 |
. |
denotes |
. |
T2236 |
1015-1210 |
sentence |
denotes |
Particularly, as shown in Fig. 3, Acdp proteins showed very strong AA homology to bacteria CorC protein (35% AA identity with 55% homology), which is involved in magnesium and cobalt efflux [7]. |
T2237 |
1016-1028 |
RB |
denotes |
Particularly |
T2238 |
1064-1070 |
VBD |
denotes |
showed |
T2239 |
1028-1030 |
, |
denotes |
, |
T2240 |
1030-1032 |
IN |
denotes |
as |
T2241 |
1033-1038 |
VBN |
denotes |
shown |
T2242 |
1039-1041 |
IN |
denotes |
in |
T2243 |
1042-1046 |
NN |
denotes |
Fig. |
T2244 |
1047-1048 |
CD |
denotes |
3 |
T2245 |
1048-1050 |
, |
denotes |
, |
T2246 |
1050-1054 |
NN |
denotes |
Acdp |
T2247 |
1055-1063 |
NN |
denotes |
proteins |
T2248 |
1071-1075 |
RB |
denotes |
very |
T2249 |
1076-1082 |
JJ |
denotes |
strong |
T2250 |
1086-1094 |
NN |
denotes |
homology |
T2251 |
1083-1085 |
NN |
denotes |
AA |
T2252 |
1095-1097 |
IN |
denotes |
to |
T2253 |
1098-1106 |
NNS |
denotes |
bacteria |
T2254 |
1112-1119 |
NN |
denotes |
protein |
T2255 |
1107-1111 |
NN |
denotes |
CorC |
T2256 |
1120-1121 |
-LRB- |
denotes |
( |
T2257 |
1128-1136 |
NN |
denotes |
identity |
T2258 |
1121-1123 |
CD |
denotes |
35 |
T2259 |
1123-1124 |
NN |
denotes |
% |
T2260 |
1125-1127 |
NN |
denotes |
AA |
T2261 |
1137-1141 |
IN |
denotes |
with |
T2262 |
1142-1144 |
CD |
denotes |
55 |
T2263 |
1144-1145 |
NN |
denotes |
% |
T2264 |
1146-1154 |
NN |
denotes |
homology |
T2265 |
1154-1155 |
-RRB- |
denotes |
) |
T2266 |
1155-1157 |
, |
denotes |
, |
T2267 |
1157-1162 |
WDT |
denotes |
which |
T2268 |
1166-1174 |
VBN |
denotes |
involved |
T2269 |
1163-1165 |
VBZ |
denotes |
is |
T2270 |
1175-1177 |
IN |
denotes |
in |
T2271 |
1178-1187 |
NN |
denotes |
magnesium |
T2272 |
1199-1205 |
NN |
denotes |
efflux |
T2273 |
1188-1191 |
CC |
denotes |
and |
T2274 |
1192-1198 |
NN |
denotes |
cobalt |
T2275 |
1206-1207 |
-LRB- |
denotes |
[ |
T2276 |
1207-1208 |
CD |
denotes |
7 |
T2277 |
1208-1209 |
-RRB- |
denotes |
] |
T2278 |
1209-1210 |
. |
denotes |
. |
T2279 |
1210-1336 |
sentence |
denotes |
High AA homology was also observed between the Acdp proteins and the yeast Amip3 protein (35% AA identity with 56% homology). |
T2280 |
1211-1215 |
JJ |
denotes |
High |
T2281 |
1219-1227 |
NN |
denotes |
homology |
T2282 |
1216-1218 |
NN |
denotes |
AA |
T2283 |
1237-1245 |
VBN |
denotes |
observed |
T2284 |
1228-1231 |
VBD |
denotes |
was |
T2285 |
1232-1236 |
RB |
denotes |
also |
T2303 |
1324-1325 |
NN |
denotes |
% |
T2304 |
1334-1335 |
-RRB- |
denotes |
) |
T2305 |
1335-1336 |
. |
denotes |
. |
T2306 |
1336-1405 |
sentence |
denotes |
The Amip3 is likely to be a homologous to the bacteria CorC protein. |
T2307 |
1337-1340 |
DT |
denotes |
The |
T2308 |
1341-1346 |
NN |
denotes |
Amip3 |
T2309 |
1347-1349 |
VBZ |
denotes |
is |
T2310 |
1350-1356 |
JJ |
denotes |
likely |
T2311 |
1357-1359 |
TO |
denotes |
to |
T2312 |
1360-1362 |
VB |
denotes |
be |
T2313 |
1363-1364 |
DT |
denotes |
a |
T2314 |
1365-1375 |
JJ |
denotes |
homologous |
T2315 |
1376-1378 |
IN |
denotes |
to |
T2316 |
1379-1382 |
DT |
denotes |
the |
T2317 |
1397-1404 |
NN |
denotes |
protein |
T2318 |
1383-1391 |
NNS |
denotes |
bacteria |
T2319 |
1392-1396 |
NN |
denotes |
CorC |
T2320 |
1404-1405 |
. |
denotes |
. |
T2321 |
1405-1557 |
sentence |
denotes |
The Amip3 mutants confer resistance to copper toxicity (Personal communication with Dr. V.C. Culotte, John Hopkins Bloomberg, School of Public Health). |
T2322 |
1406-1409 |
DT |
denotes |
The |
T2323 |
1416-1423 |
NNS |
denotes |
mutants |
T2324 |
1410-1415 |
NN |
denotes |
Amip3 |
T2325 |
1424-1430 |
VBP |
denotes |
confer |
T2326 |
1431-1441 |
NN |
denotes |
resistance |
T2327 |
1442-1444 |
IN |
denotes |
to |
T2328 |
1445-1451 |
NN |
denotes |
copper |
T2329 |
1452-1460 |
NN |
denotes |
toxicity |
T2330 |
1461-1462 |
-LRB- |
denotes |
( |
T2331 |
1490-1493 |
NNP |
denotes |
Dr. |
T2332 |
1462-1470 |
JJ |
denotes |
Personal |
T2333 |
1471-1484 |
NN |
denotes |
communication |
T2334 |
1485-1489 |
IN |
denotes |
with |
T2335 |
1494-1498 |
NNP |
denotes |
V.C. |
T2336 |
1499-1506 |
NNP |
denotes |
Culotte |
T2337 |
1506-1508 |
, |
denotes |
, |
T2338 |
1508-1512 |
NNP |
denotes |
John |
T2339 |
1513-1520 |
NNP |
denotes |
Hopkins |
T2340 |
1521-1530 |
NNP |
denotes |
Bloomberg |
T2341 |
1530-1532 |
, |
denotes |
, |
T2342 |
1532-1538 |
NNP |
denotes |
School |
T2343 |
1539-1541 |
IN |
denotes |
of |
T2344 |
1542-1548 |
NNP |
denotes |
Public |
T2345 |
1549-1555 |
NNP |
denotes |
Health |
T2346 |
1555-1556 |
-RRB- |
denotes |
) |
T2347 |
1556-1557 |
. |
denotes |
. |
T2348 |
1557-1707 |
sentence |
denotes |
The evolutionary relationships among those proteins are illustrated by a phylogenetic tree constructed based on the AA homology of proteins (Fig. 4). |
T2349 |
1558-1561 |
DT |
denotes |
The |
T2350 |
1575-1588 |
NNS |
denotes |
relationships |
T2351 |
1562-1574 |
JJ |
denotes |
evolutionary |
T2352 |
1614-1625 |
VBN |
denotes |
illustrated |
T2353 |
1589-1594 |
IN |
denotes |
among |
T2354 |
1595-1600 |
DT |
denotes |
those |
T2355 |
1601-1609 |
NN |
denotes |
proteins |
T2356 |
1610-1613 |
VBP |
denotes |
are |
T2357 |
1626-1628 |
IN |
denotes |
by |
T2358 |
1629-1630 |
DT |
denotes |
a |
T2359 |
1644-1648 |
NN |
denotes |
tree |
T2360 |
1631-1643 |
JJ |
denotes |
phylogenetic |
T2361 |
1649-1660 |
VBN |
denotes |
constructed |
T2362 |
1661-1666 |
VBN |
denotes |
based |
T2363 |
1667-1669 |
IN |
denotes |
on |
T2364 |
1670-1673 |
DT |
denotes |
the |
T2365 |
1677-1685 |
NN |
denotes |
homology |
T2366 |
1674-1676 |
NN |
denotes |
AA |
T2367 |
1686-1688 |
IN |
denotes |
of |
T2368 |
1689-1697 |
NN |
denotes |
proteins |
T2369 |
1698-1699 |
-LRB- |
denotes |
( |
T2370 |
1699-1703 |
NN |
denotes |
Fig. |
T2371 |
1704-1705 |
CD |
denotes |
4 |
T2372 |
1705-1706 |
-RRB- |
denotes |
) |
T2373 |
1706-1707 |
. |
denotes |
. |
T2374 |
1707-1708 |
sentence |
denotes |
|
T6623 |
1717-1727 |
NN |
denotes |
Nucleotide |
T6624 |
1743-1753 |
NNS |
denotes |
homologies |
T6625 |
1728-1731 |
CC |
denotes |
and |
T6626 |
1732-1737 |
NN |
denotes |
amino |
T6627 |
1738-1742 |
NN |
denotes |
acid |
T6628 |
1754-1755 |
-LRB- |
denotes |
( |
T6629 |
1755-1756 |
NN |
denotes |
% |
T6630 |
1756-1757 |
-RRB- |
denotes |
) |
T6631 |
1758-1765 |
IN |
denotes |
between |
T6632 |
1766-1771 |
JJ |
denotes |
human |
T6633 |
1772-1776 |
NN |
denotes |
ACDP |
T6634 |
1792-1799 |
NNS |
denotes |
members |
T6635 |
1777-1780 |
CC |
denotes |
and |
T6636 |
1781-1786 |
NN |
denotes |
mouse |
T6637 |
1787-1791 |
NN |
denotes |
Acdp |
T6638 |
1799-1800 |
. |
denotes |
. |
T2286 |
1246-1253 |
IN |
denotes |
between |
T2287 |
1254-1257 |
DT |
denotes |
the |
T2288 |
1263-1271 |
NN |
denotes |
proteins |
T2289 |
1258-1262 |
NN |
denotes |
Acdp |
T2290 |
1272-1275 |
CC |
denotes |
and |
T2291 |
1276-1279 |
DT |
denotes |
the |
T2292 |
1292-1299 |
NN |
denotes |
protein |
T2293 |
1280-1285 |
NN |
denotes |
yeast |
T2294 |
1286-1291 |
NN |
denotes |
Amip3 |
T2295 |
1300-1301 |
-LRB- |
denotes |
( |
T2296 |
1308-1316 |
NN |
denotes |
identity |
T2297 |
1301-1303 |
CD |
denotes |
35 |
T2298 |
1303-1304 |
NN |
denotes |
% |
T2299 |
1305-1307 |
NN |
denotes |
AA |
T2300 |
1317-1321 |
IN |
denotes |
with |
T2301 |
1322-1324 |
CD |
denotes |
56 |
T2302 |
1326-1334 |
NN |
denotes |
homology |
T2391 |
3620-3621 |
-RRB- |
denotes |
) |
T2390 |
3619-3620 |
CD |
denotes |
5 |
T2389 |
3614-3618 |
NN |
denotes |
Fig. |
T2388 |
3613-3614 |
-LRB- |
denotes |
( |
T2387 |
3591-3604 |
NN |
denotes |
transmembrane |
T2386 |
3582-3590 |
JJ |
denotes |
distinct |
T2385 |
3605-3612 |
NNS |
denotes |
domains |
T2384 |
3577-3581 |
CD |
denotes |
four |
T2383 |
3556-3560 |
NN |
denotes |
Acdp |
T2382 |
3550-3555 |
NN |
denotes |
mouse |
T2381 |
3561-3568 |
NNS |
denotes |
members |
T2380 |
3546-3549 |
DT |
denotes |
all |
T2379 |
3569-3576 |
VBP |
denotes |
contain |
T2378 |
3541-3545 |
IN |
denotes |
that |
T2377 |
3535-3540 |
VBD |
denotes |
found |
T2376 |
3532-3534 |
PRP |
denotes |
We |
T2375 |
3532-3715 |
sentence |
denotes |
We found that all mouse Acdp members contain four distinct transmembrane domains (Fig. 5), two CBS domains and a DUF21 domain that are found in bacteria CorC and yeast Amip3 proteins. |
T2515 |
4629-4637 |
NN |
denotes |
addition |
T2514 |
4645-4653 |
VBZ |
denotes |
contains |
T2513 |
4626-4628 |
IN |
denotes |
In |
T2512 |
4202-4929 |
sentence |
denotes |
Figure 5 Four transmembrane domains within Acdp4 protein. Transmembrane domains were predicted by the TMHMM program . The plot shows the posterior probabilities of inside /outside/TM helix. At the top of the plot (between 1 and 1.2) the N-best prediction is shown. The plot is obtained by calculating the total probability that a residue sits in helix, inside, or outside summed over all possible paths through the model. In addition, Acdp1 contains an Alanine-rich region (2–10: AAAAAAAAA), a Leucine-rich region (204–257: LLRVRPRLYGPGGDLLPPAWLRALGALLLLALSALF SGLRLSLLSLDPVELRVL), a Proline-rich region (78–130: PGPPVPAAPVPAPSLA PGENGTGDWAPRLVFIEEPPGAGGAAPSAVPTRPPGP), and two amidation sites (917–920: MGKK; 926–929: SGRK). |
T2511 |
4201-4202 |
. |
denotes |
. |
T2510 |
4189-4193 |
NN |
denotes |
Acdp |
T2509 |
4194-4201 |
NNS |
denotes |
members |
T2508 |
4185-4188 |
DT |
denotes |
all |
T2507 |
4182-4184 |
IN |
denotes |
in |
T2506 |
4172-4175 |
VBD |
denotes |
was |
T2505 |
4170-4171 |
-RRB- |
denotes |
) |
T2504 |
4156-4163 |
VBG |
denotes |
binding |
T2503 |
4155-4156 |
HYPH |
denotes |
- |
T2502 |
4141-4142 |
HYPH |
denotes |
- |
T2501 |
4131-4141 |
NN |
denotes |
nucleotide |
T2500 |
4142-4155 |
NN |
denotes |
monophosphate |
T2499 |
4124-4130 |
JJ |
denotes |
cyclic |
T2498 |
4164-4170 |
NN |
denotes |
domain |
R987 |
T2019 |
T2020 |
compound |
Sequence,homology |
R988 |
T2021 |
T2020 |
cc |
and,homology |
R989 |
T2022 |
T2023 |
amod |
molecular,characteristics |
R990 |
T2023 |
T2020 |
conj |
characteristics,homology |
R991 |
T2025 |
T2026 |
det |
The,genes |
R992 |
T2026 |
T2029 |
nsubj |
genes,showed |
R993 |
T2027 |
T2026 |
compound |
mouse,genes |
R994 |
T2028 |
T2026 |
compound |
Acdp,genes |
R995 |
T2030 |
T2031 |
advmod |
very,strong |
R996 |
T2031 |
T2032 |
amod |
strong,homologies |
R997 |
T2032 |
T2029 |
dobj |
homologies,showed |
R998 |
T2033 |
T2032 |
prep |
of,homologies |
R999 |
T2034 |
T2035 |
preconj |
both,nucleotide |
R1000 |
T2035 |
T2036 |
nmod |
nucleotide,sequences |
R1001 |
T2036 |
T2033 |
pobj |
sequences,of |
R1002 |
T2037 |
T2035 |
cc |
and,nucleotide |
R1003 |
T2038 |
T2035 |
conj |
AA,nucleotide |
R1004 |
T2039 |
T2032 |
prep |
to,homologies |
R1005 |
T2040 |
T2041 |
det |
the,genes |
R1006 |
T2041 |
T2039 |
pobj |
genes,to |
R1007 |
T2042 |
T2041 |
amod |
human,genes |
R1008 |
T2043 |
T2041 |
compound |
ACDP,genes |
R1009 |
T2044 |
T2045 |
punct |
(,Table |
R1010 |
T2045 |
T2029 |
parataxis |
Table,showed |
R1011 |
T2046 |
T2045 |
nummod |
1,Table |
R1012 |
T2047 |
T2045 |
punct |
),Table |
R1013 |
T2048 |
T2029 |
punct |
.,showed |
R1014 |
T2050 |
T2051 |
det |
The,homologies |
R1015 |
T2051 |
T2053 |
nsubjpass |
homologies,observed |
R1016 |
T2052 |
T2051 |
amod |
highest,homologies |
R1017 |
T2054 |
T2053 |
auxpass |
were,observed |
R1018 |
T2055 |
T2053 |
prep |
between,observed |
R1019 |
T2056 |
T2057 |
det |
the,ACDP2 |
R1020 |
T2057 |
T2055 |
pobj |
ACDP2,between |
R1021 |
T2058 |
T2057 |
amod |
human,ACDP2 |
R1022 |
T2059 |
T2057 |
cc |
and,ACDP2 |
R1023 |
T2060 |
T2061 |
det |
the,Acdp2 |
R1024 |
T2061 |
T2063 |
compound |
Acdp2,gene |
R1025 |
T2062 |
T2061 |
compound |
mouse,Acdp2 |
R1026 |
T2063 |
T2057 |
conj |
gene,ACDP2 |
R1027 |
T2064 |
T2065 |
punct |
(,% |
R1028 |
T2065 |
T2053 |
parataxis |
%,observed |
R1029 |
T2066 |
T2065 |
nummod |
91,% |
R1030 |
T2067 |
T2065 |
prep |
of,% |
R1031 |
T2068 |
T2069 |
compound |
nucleotide,identity |
R1032 |
T2069 |
T2067 |
pobj |
identity,of |
R1033 |
T2070 |
T2065 |
punct |
", ",% |
R1034 |
T2071 |
T2072 |
nummod |
97,% |
R1035 |
T2072 |
T2065 |
conj |
%,% |
R1036 |
T2073 |
T2072 |
prep |
of,% |
R1037 |
T2074 |
T2075 |
compound |
AA,identity |
R1038 |
T2075 |
T2073 |
pobj |
identity,of |
R1039 |
T2076 |
T2072 |
cc |
and,% |
R1040 |
T2077 |
T2078 |
nummod |
99.4,% |
R1041 |
T2078 |
T2072 |
conj |
%,% |
R1042 |
T2079 |
T2078 |
prep |
of,% |
R1043 |
T2080 |
T2081 |
compound |
AA,homology |
R1044 |
T2081 |
T2079 |
pobj |
homology,of |
R1045 |
T2082 |
T2065 |
punct |
),% |
R1046 |
T2083 |
T2053 |
punct |
.,observed |
R1047 |
T2085 |
T2086 |
prep |
In,showed |
R1048 |
T2086 |
T2105 |
ccomp |
showed,showed |
R1049 |
T2087 |
T2085 |
pobj |
addition,In |
R1050 |
T2088 |
T2086 |
punct |
", ",showed |
R1051 |
T2089 |
T2090 |
det |
the,sequences |
R1052 |
T2090 |
T2086 |
nsubj |
sequences,showed |
R1053 |
T2091 |
T2092 |
nummod |
5,UTR |
R1054 |
T2092 |
T2090 |
compound |
UTR,sequences |
R1055 |
T2093 |
T2091 |
punct |
',5 |
R1056 |
T2094 |
T2090 |
compound |
nucleotide,sequences |
R1057 |
T2095 |
T2096 |
punct |
(,bp |
R1058 |
T2096 |
T2090 |
parataxis |
bp,sequences |
R1059 |
T2097 |
T2096 |
nummod |
20,bp |
R1060 |
T2098 |
T2096 |
prep |
of,bp |
R1061 |
T2099 |
T2098 |
pobj |
nucleotides,of |
R1062 |
T2100 |
T2096 |
prep |
before,bp |
R1063 |
T2101 |
T2102 |
compound |
start,codon |
R1064 |
T2102 |
T2100 |
pobj |
codon,before |
R1065 |
T2103 |
T2096 |
punct |
),bp |
R1066 |
T2104 |
T2086 |
advmod |
also,showed |
R1067 |
T2106 |
T2107 |
amod |
high,homologies |
R1068 |
T2107 |
T2086 |
dobj |
homologies,showed |
R1069 |
T2108 |
T2107 |
prep |
to,homologies |
R1070 |
T2109 |
T2110 |
det |
the,homologs |
R1071 |
T2110 |
T2108 |
pobj |
homologs,to |
R1072 |
T2111 |
T2110 |
amod |
human,homologs |
R1073 |
T2112 |
T2105 |
punct |
", ",showed |
R1074 |
T2113 |
T2105 |
prep |
for,showed |
R1075 |
T2114 |
T2113 |
pobj |
example,for |
R1076 |
T2115 |
T2105 |
punct |
", ",showed |
R1077 |
T2116 |
T2117 |
det |
the,sequence |
R1078 |
T2117 |
T2105 |
nsubj |
sequence,showed |
R1079 |
T2118 |
T2117 |
nmod |
Acdp2,sequence |
R1080 |
T2119 |
T2120 |
nummod |
5,UTR |
R1081 |
T2120 |
T2117 |
compound |
UTR,sequence |
R1082 |
T2121 |
T2119 |
punct |
',5 |
R1083 |
T2122 |
T2123 |
nummod |
95,% |
R1084 |
T2123 |
T2124 |
compound |
%,identities |
R1085 |
T2124 |
T2105 |
dobj |
identities,showed |
R1086 |
T2125 |
T2105 |
prep |
to,showed |
R1087 |
T2126 |
T2127 |
poss |
its,homolog |
R1088 |
T2127 |
T2125 |
pobj |
homolog,to |
R1089 |
T2128 |
T2127 |
amod |
human,homolog |
R1090 |
T2129 |
T2105 |
punct |
.,showed |
R1091 |
T2131 |
T2132 |
advmod |
However,were |
R1092 |
T2133 |
T2132 |
punct |
", ",were |
R1093 |
T2134 |
T2135 |
det |
the,homologies |
R1094 |
T2135 |
T2132 |
nsubj |
homologies,were |
R1095 |
T2136 |
T2135 |
prep |
in,homologies |
R1096 |
T2137 |
T2138 |
det |
the,sequences |
R1097 |
T2138 |
T2136 |
pobj |
sequences,in |
R1098 |
T2139 |
T2140 |
nummod |
3,UTR |
R1099 |
T2140 |
T2138 |
compound |
UTR,sequences |
R1100 |
T2141 |
T2139 |
punct |
',3 |
R1101 |
T2142 |
T2143 |
punct |
(,bp |
R1102 |
T2143 |
T2135 |
parataxis |
bp,homologies |
R1103 |
T2144 |
T2143 |
nummod |
20,bp |
R1104 |
T2145 |
T2143 |
prep |
of,bp |
R1105 |
T2146 |
T2145 |
pobj |
nucleotides,of |
R1106 |
T2147 |
T2143 |
prep |
after,bp |
R1107 |
T2148 |
T2149 |
compound |
stop,codon |
R1108 |
T2149 |
T2147 |
pobj |
codon,after |
R1109 |
T2150 |
T2143 |
punct |
),bp |
R1110 |
T2151 |
T2152 |
advmod |
much,lower |
R1111 |
T2152 |
T2132 |
acomp |
lower,were |
R1112 |
T2153 |
T2154 |
punct |
(,% |
R1113 |
T2154 |
T2132 |
parataxis |
%,were |
R1114 |
T2155 |
T2156 |
quantmod |
40,55 |
R1115 |
T2156 |
T2154 |
nummod |
55,% |
R1116 |
T2157 |
T2156 |
punct |
–,55 |
R1117 |
T2158 |
T2154 |
punct |
),% |
R1118 |
T2159 |
T2132 |
prep |
for,were |
R1119 |
T2160 |
T2161 |
det |
all,genes |
R1120 |
T2161 |
T2159 |
pobj |
genes,for |
R1121 |
T2162 |
T2161 |
compound |
Acdp,genes |
R1122 |
T2163 |
T2161 |
prep |
except,genes |
R1123 |
T2164 |
T2163 |
pobj |
Acdp4,except |
R1124 |
T2165 |
T2166 |
punct |
(,identity |
R1125 |
T2166 |
T2164 |
parataxis |
identity,Acdp4 |
R1126 |
T2167 |
T2166 |
nummod |
90,identity |
R1127 |
T2168 |
T2167 |
quantmod |
%,90 |
R1128 |
T2169 |
T2166 |
prep |
to,identity |
R1129 |
T2170 |
T2171 |
poss |
its,homolog |
R1130 |
T2171 |
T2169 |
pobj |
homolog,to |
R1131 |
T2172 |
T2171 |
amod |
human,homolog |
R1132 |
T2173 |
T2166 |
punct |
),identity |
R1133 |
T2174 |
T2132 |
punct |
.,were |
R1134 |
T2176 |
T2177 |
det |
The,domain |
R1135 |
T2177 |
T2180 |
nsubj |
domain,has |
R1136 |
T2178 |
T2177 |
amod |
ancient,domain |
R1137 |
T2179 |
T2177 |
amod |
conserved,domain |
R1138 |
T2181 |
T2177 |
punct |
(,domain |
R1139 |
T2182 |
T2177 |
appos |
ACD,domain |
R1140 |
T2183 |
T2180 |
punct |
),has |
R1141 |
T2184 |
T2185 |
nummod |
55.3,% |
R1142 |
T2185 |
T2180 |
dobj |
%,has |
R1143 |
T2186 |
T2185 |
prep |
of,% |
R1144 |
T2187 |
T2188 |
compound |
AA,identity |
R1145 |
T2188 |
T2186 |
pobj |
identity,of |
R1146 |
T2189 |
T2185 |
cc |
and,% |
R1147 |
T2190 |
T2191 |
nummod |
83.3,% |
R1148 |
T2191 |
T2185 |
conj |
%,% |
R1149 |
T2192 |
T2191 |
prep |
of,% |
R1150 |
T2193 |
T2192 |
pobj |
homology,of |
R1151 |
T2194 |
T2180 |
prep |
between,has |
R1152 |
T2195 |
T2196 |
det |
all,proteins |
R1153 |
T2196 |
T2194 |
pobj |
proteins,between |
R1154 |
T2197 |
T2196 |
nmod |
mouse,proteins |
R1155 |
T2198 |
T2197 |
cc |
and,mouse |
R1156 |
T2199 |
T2197 |
conj |
human,mouse |
R1157 |
T2200 |
T2196 |
compound |
ACDP,proteins |
R1158 |
T2201 |
T2202 |
punct |
(,Fig. |
R1159 |
T2202 |
T2180 |
parataxis |
Fig.,has |
R1160 |
T2203 |
T2202 |
nummod |
2,Fig. |
R1161 |
T2204 |
T2202 |
punct |
),Fig. |
R1162 |
T2205 |
T2180 |
punct |
.,has |
R1163 |
T2207 |
T2208 |
det |
The,domain |
R1164 |
T2208 |
T2210 |
nsubjpass |
domain,conserved |
R1165 |
T2209 |
T2208 |
compound |
ACD,domain |
R1166 |
T2211 |
T2210 |
auxpass |
is,conserved |
R1167 |
T2212 |
T2210 |
advmod |
evolutionarily,conserved |
R1168 |
T2213 |
T2210 |
prep |
in,conserved |
R1169 |
T2214 |
T2215 |
amod |
divergent,species |
R1170 |
T2215 |
T2213 |
pobj |
species,in |
R1171 |
T2216 |
T2215 |
acl |
ranging,species |
R1172 |
T2217 |
T2216 |
prep |
from,ranging |
R1173 |
T2218 |
T2217 |
pobj |
bacteria,from |
R1174 |
T2219 |
T2218 |
punct |
", ",bacteria |
R1175 |
T2220 |
T2218 |
appos |
yeast,bacteria |
R1176 |
T2221 |
T2218 |
punct |
", ",bacteria |
R1177 |
T2222 |
T2223 |
compound |
C.,elegans |
R1178 |
T2223 |
T2218 |
appos |
elegans,bacteria |
R1179 |
T2224 |
T2218 |
punct |
", ",bacteria |
R1180 |
T2225 |
T2226 |
compound |
D.,melanogaster |
R1181 |
T2226 |
T2218 |
appos |
melanogaster,bacteria |
R1182 |
T2227 |
T2218 |
punct |
", ",bacteria |
R1183 |
T2228 |
T2218 |
appos |
mouse,bacteria |
R1184 |
T2229 |
T2217 |
prep |
to,from |
R1185 |
T2230 |
T2229 |
pobj |
human,to |
R1186 |
T2231 |
T2232 |
punct |
(,Fig. |
R1187 |
T2232 |
T2210 |
parataxis |
Fig.,conserved |
R1188 |
T2233 |
T2232 |
nummod |
3,Fig. |
R1189 |
T2234 |
T2232 |
punct |
),Fig. |
R1190 |
T2235 |
T2210 |
punct |
.,conserved |
R1191 |
T2237 |
T2238 |
advmod |
Particularly,showed |
R1192 |
T2239 |
T2238 |
punct |
", ",showed |
R1193 |
T2240 |
T2241 |
mark |
as,shown |
R1194 |
T2241 |
T2238 |
advcl |
shown,showed |
R1195 |
T2242 |
T2241 |
prep |
in,shown |
R1196 |
T2243 |
T2242 |
pobj |
Fig.,in |
R1197 |
T2244 |
T2243 |
nummod |
3,Fig. |
R1198 |
T2245 |
T2238 |
punct |
", ",showed |
R1199 |
T2246 |
T2247 |
compound |
Acdp,proteins |
R1200 |
T2247 |
T2238 |
nsubj |
proteins,showed |
R1201 |
T2248 |
T2249 |
advmod |
very,strong |
R1202 |
T2249 |
T2250 |
amod |
strong,homology |
R1203 |
T2250 |
T2238 |
dobj |
homology,showed |
R1204 |
T2251 |
T2250 |
compound |
AA,homology |
R1205 |
T2252 |
T2250 |
prep |
to,homology |
R1206 |
T2253 |
T2254 |
compound |
bacteria,protein |
R1207 |
T2254 |
T2252 |
pobj |
protein,to |
R1208 |
T2255 |
T2254 |
compound |
CorC,protein |
R1209 |
T2256 |
T2257 |
punct |
(,identity |
R1210 |
T2257 |
T2254 |
parataxis |
identity,protein |
R1211 |
T2258 |
T2259 |
nummod |
35,% |
R1212 |
T2259 |
T2257 |
compound |
%,identity |
R1213 |
T2260 |
T2257 |
compound |
AA,identity |
R1214 |
T2261 |
T2257 |
prep |
with,identity |
R1215 |
T2262 |
T2263 |
nummod |
55,% |
R1216 |
T2263 |
T2264 |
compound |
%,homology |
R1217 |
T2264 |
T2261 |
pobj |
homology,with |
R1218 |
T2265 |
T2257 |
punct |
),identity |
R1219 |
T2266 |
T2254 |
punct |
", ",protein |
R1220 |
T2267 |
T2268 |
dep |
which,involved |
R1221 |
T2268 |
T2254 |
relcl |
involved,protein |
R1222 |
T2269 |
T2268 |
auxpass |
is,involved |
R1223 |
T2270 |
T2268 |
prep |
in,involved |
R1224 |
T2271 |
T2272 |
nmod |
magnesium,efflux |
R1225 |
T2272 |
T2270 |
pobj |
efflux,in |
R1226 |
T2273 |
T2271 |
cc |
and,magnesium |
R1227 |
T2274 |
T2271 |
conj |
cobalt,magnesium |
R1228 |
T2275 |
T2276 |
punct |
[,7 |
R1229 |
T2276 |
T2238 |
parataxis |
7,showed |
R1230 |
T2277 |
T2276 |
punct |
],7 |
R1231 |
T2278 |
T2238 |
punct |
.,showed |
R1232 |
T2280 |
T2281 |
amod |
High,homology |
R1233 |
T2281 |
T2283 |
nsubjpass |
homology,observed |
R1234 |
T2282 |
T2281 |
compound |
AA,homology |
R1235 |
T2284 |
T2283 |
auxpass |
was,observed |
R1236 |
T2285 |
T2283 |
advmod |
also,observed |
R1237 |
T2286 |
T2283 |
prep |
between,observed |
R1238 |
T2287 |
T2288 |
det |
the,proteins |
R1239 |
T2288 |
T2286 |
pobj |
proteins,between |
R1240 |
T2289 |
T2288 |
compound |
Acdp,proteins |
R1241 |
T2290 |
T2288 |
cc |
and,proteins |
R1242 |
T2291 |
T2292 |
det |
the,protein |
R1243 |
T2292 |
T2288 |
conj |
protein,proteins |
R1244 |
T2293 |
T2292 |
compound |
yeast,protein |
R1245 |
T2294 |
T2292 |
compound |
Amip3,protein |
R1246 |
T2295 |
T2296 |
punct |
(,identity |
R1247 |
T2296 |
T2283 |
parataxis |
identity,observed |
R1248 |
T2297 |
T2296 |
nummod |
35,identity |
R1249 |
T2298 |
T2297 |
quantmod |
%,35 |
R1250 |
T2299 |
T2296 |
compound |
AA,identity |
R1251 |
T2300 |
T2296 |
prep |
with,identity |
R1252 |
T2301 |
T2302 |
nummod |
56,homology |
R1253 |
T2302 |
T2300 |
pobj |
homology,with |
R1254 |
T2303 |
T2301 |
quantmod |
%,56 |
R1255 |
T2304 |
T2296 |
punct |
),identity |
R1256 |
T2305 |
T2283 |
punct |
.,observed |
R1257 |
T2307 |
T2308 |
det |
The,Amip3 |
R1258 |
T2308 |
T2309 |
nsubj |
Amip3,is |
R1259 |
T2310 |
T2309 |
acomp |
likely,is |
R1260 |
T2311 |
T2312 |
aux |
to,be |
R1261 |
T2312 |
T2310 |
xcomp |
be,likely |
R1262 |
T2313 |
T2314 |
det |
a,homologous |
R1263 |
T2314 |
T2312 |
attr |
homologous,be |
R1264 |
T2315 |
T2314 |
prep |
to,homologous |
R1265 |
T2316 |
T2317 |
det |
the,protein |
R1266 |
T2317 |
T2315 |
pobj |
protein,to |
R1267 |
T2318 |
T2317 |
compound |
bacteria,protein |
R1268 |
T2319 |
T2317 |
compound |
CorC,protein |
R1269 |
T2320 |
T2309 |
punct |
.,is |
R1270 |
T2322 |
T2323 |
det |
The,mutants |
R1271 |
T2323 |
T2325 |
nsubj |
mutants,confer |
R1272 |
T2324 |
T2323 |
compound |
Amip3,mutants |
R1273 |
T2326 |
T2325 |
dobj |
resistance,confer |
R1274 |
T2327 |
T2326 |
prep |
to,resistance |
R1275 |
T2328 |
T2329 |
compound |
copper,toxicity |
R1276 |
T2329 |
T2327 |
pobj |
toxicity,to |
R1277 |
T2330 |
T2331 |
punct |
(,Dr. |
R1278 |
T2331 |
T2325 |
meta |
Dr.,confer |
R1279 |
T2332 |
T2331 |
amod |
Personal,Dr. |
R1280 |
T2333 |
T2331 |
nmod |
communication,Dr. |
R1281 |
T2334 |
T2331 |
nmod |
with,Dr. |
R1282 |
T2335 |
T2331 |
nmod |
V.C.,Dr. |
R1283 |
T2336 |
T2331 |
nmod |
Culotte,Dr. |
R1284 |
T2337 |
T2331 |
punct |
", ",Dr. |
R1336 |
T2394 |
T2385 |
conj |
domains,domains |
R1338 |
T2396 |
T2394 |
cc |
and,domains |
R1339 |
T2397 |
T2398 |
det |
a,domain |
R1340 |
T2398 |
T2394 |
conj |
domain,domains |
R1341 |
T2399 |
T2398 |
compound |
DUF21,domain |
R1342 |
T2400 |
T2401 |
dep |
that,found |
R1343 |
T2401 |
T2385 |
relcl |
found,domains |
R1344 |
T2402 |
T2401 |
auxpass |
are,found |
R1345 |
T2403 |
T2401 |
prep |
in,found |
R1346 |
T2404 |
T2405 |
nmod |
bacteria,CorC |
R1347 |
T2405 |
T2406 |
nmod |
CorC,proteins |
R1348 |
T2406 |
T2403 |
pobj |
proteins,in |
R1349 |
T2407 |
T2405 |
cc |
and,CorC |
R1350 |
T2408 |
T2409 |
compound |
yeast,Amip3 |
R1351 |
T2409 |
T2405 |
conj |
Amip3,CorC |
R1352 |
T2410 |
T2377 |
punct |
.,found |
R1353 |
T2412 |
T2413 |
compound |
CBS,domains |
R1354 |
T2413 |
T2414 |
nsubj |
domains,are |
R1355 |
T2415 |
T2416 |
amod |
small,modules |
R1356 |
T2416 |
T2414 |
attr |
modules,are |
R1357 |
T2417 |
T2416 |
amod |
intracellular,modules |
R1358 |
T2418 |
T2419 |
dep |
that,found |
R1359 |
T2419 |
T2416 |
relcl |
found,modules |
R1360 |
T2420 |
T2419 |
auxpass |
are,found |
R1361 |
T2421 |
T2419 |
advmod |
mostly,found |
R1362 |
T2422 |
T2419 |
prep |
in,found |
R1363 |
T2423 |
T2424 |
nummod |
2,copies |
R1364 |
T2424 |
T2422 |
pobj |
copies,in |
R1365 |
T2425 |
T2423 |
cc |
or,2 |
R1366 |
T2426 |
T2423 |
conj |
four,2 |
R1367 |
T2427 |
T2419 |
prep |
within,found |
R1368 |
T2428 |
T2429 |
det |
a,protein |
R1369 |
T2429 |
T2427 |
pobj |
protein,within |
R1370 |
T2430 |
T2414 |
punct |
.,are |
R1371 |
T2432 |
T2433 |
nsubj |
Pairs,dimerise |
R1372 |
T2434 |
T2432 |
prep |
of,Pairs |
R1373 |
T2435 |
T2436 |
compound |
CBS,domains |
R1374 |
T2436 |
T2434 |
pobj |
domains,of |
R1375 |
T2437 |
T2438 |
aux |
to,form |
R1376 |
T2438 |
T2433 |
advcl |
form,dimerise |
R1377 |
T2439 |
T2440 |
det |
a,domain |
R1378 |
T2440 |
T2438 |
dobj |
domain,form |
R1379 |
T2441 |
T2440 |
amod |
stable,domain |
R1380 |
T2442 |
T2440 |
amod |
globular,domain |
R1381 |
T2443 |
T2444 |
punct |
[,8 |
R1382 |
T2444 |
T2433 |
parataxis |
8,dimerise |
R1383 |
T2445 |
T2444 |
punct |
],8 |
R1384 |
T2446 |
T2433 |
punct |
.,dimerise |
R1385 |
T2448 |
T2449 |
nsubj |
DUF21,is |
R1386 |
T2450 |
T2451 |
punct |
(,pfam01959.9 |
R1387 |
T2451 |
T2448 |
parataxis |
pfam01959.9,DUF21 |
R1388 |
T2452 |
T2451 |
dep |
CD,pfam01959.9 |
R1389 |
T2453 |
T2451 |
punct |
: ,pfam01959.9 |
R1390 |
T2454 |
T2451 |
punct |
),pfam01959.9 |
R1391 |
T2455 |
T2456 |
det |
a,domain |
R1392 |
T2456 |
T2449 |
attr |
domain,is |
R1393 |
T2457 |
T2458 |
advmod |
newly,defined |
R1394 |
T2458 |
T2456 |
amod |
defined,domain |
R1395 |
T2459 |
T2456 |
prep |
with,domain |
R1396 |
T2460 |
T2461 |
amod |
unknown,function |
R1397 |
T2461 |
T2459 |
pobj |
function,with |
R1398 |
T2462 |
T2449 |
punct |
.,is |
R1399 |
T2464 |
T2465 |
det |
This,domain |
R1400 |
T2465 |
T2466 |
nsubj |
domain,is |
R1401 |
T2467 |
T2468 |
det |
a,region |
R1402 |
T2468 |
T2466 |
attr |
region,is |
R1403 |
T2469 |
T2468 |
amod |
transmembrane,region |
R1404 |
T2470 |
T2468 |
cc |
and,region |
R1405 |
T2471 |
T2468 |
conj |
found,region |
R1406 |
T2472 |
T2473 |
aux |
to,located |
R1407 |
T2473 |
T2471 |
xcomp |
located,found |
R1408 |
T2474 |
T2473 |
auxpass |
be,located |
R1409 |
T2475 |
T2473 |
prep |
in,located |
R1410 |
T2476 |
T2477 |
det |
the,terminus |
R1411 |
T2477 |
T2475 |
pobj |
terminus,in |
R1412 |
T2478 |
T2477 |
compound |
N,terminus |
R1413 |
T2479 |
T2477 |
punct |
-,terminus |
R1414 |
T2480 |
T2477 |
prep |
of,terminus |
R1415 |
T2481 |
T2482 |
det |
the,proteins |
R1416 |
T2482 |
T2480 |
pobj |
proteins,of |
R1417 |
T2483 |
T2482 |
amod |
adjacent,proteins |
R1464 |
T2534 |
T2532 |
amod |
rich,region |
R1465 |
T2535 |
T2534 |
punct |
-,rich |
R1466 |
T2536 |
T2537 |
punct |
(,SGLRLSLLSLDPVELRVL |
R1467 |
T2537 |
T2532 |
parataxis |
SGLRLSLLSLDPVELRVL,region |
R1468 |
T2538 |
T2537 |
dep |
204,SGLRLSLLSLDPVELRVL |
R1469 |
T2539 |
T2540 |
punct |
–,257 |
R1470 |
T2540 |
T2538 |
prep |
257,204 |
R1471 |
T2541 |
T2537 |
punct |
: ,SGLRLSLLSLDPVELRVL |
R1472 |
T2542 |
T2537 |
compound |
LLRVRPRLYGPGGDLLPPAWLRALGALLLLALSALF,SGLRLSLLSLDPVELRVL |
R1473 |
T2543 |
T2537 |
punct |
),SGLRLSLLSLDPVELRVL |
R1474 |
T2544 |
T2532 |
punct |
", ",region |
R1475 |
T2545 |
T2546 |
det |
a,region |
R1476 |
T2546 |
T2532 |
conj |
region,region |
R1477 |
T2547 |
T2548 |
npadvmod |
Proline,rich |
R1478 |
T2548 |
T2546 |
amod |
rich,region |
R1479 |
T2549 |
T2548 |
punct |
-,rich |
R1480 |
T2550 |
T2551 |
punct |
(,PGENGTGDWAPRLVFIEEPPGAGGAAPSAVPTRPPGP |
R1481 |
T2551 |
T2546 |
parataxis |
PGENGTGDWAPRLVFIEEPPGAGGAAPSAVPTRPPGP,region |
R1482 |
T2552 |
T2551 |
dep |
78,PGENGTGDWAPRLVFIEEPPGAGGAAPSAVPTRPPGP |
R1483 |
T2553 |
T2554 |
punct |
–,130 |
R1484 |
T2554 |
T2552 |
prep |
130,78 |
R1485 |
T2555 |
T2551 |
punct |
: ,PGENGTGDWAPRLVFIEEPPGAGGAAPSAVPTRPPGP |
R1486 |
T2556 |
T2551 |
compound |
PGPPVPAAPVPAPSLA,PGENGTGDWAPRLVFIEEPPGAGGAAPSAVPTRPPGP |
R1487 |
T2557 |
T2551 |
punct |
),PGENGTGDWAPRLVFIEEPPGAGGAAPSAVPTRPPGP |
R1488 |
T2558 |
T2546 |
punct |
", ",region |
R1489 |
T2559 |
T2546 |
cc |
and,region |
R1490 |
T2560 |
T2561 |
nummod |
two,sites |
R1491 |
T2561 |
T2546 |
conj |
sites,region |
R1492 |
T2562 |
T2561 |
compound |
amidation,sites |
R1493 |
T2563 |
T2564 |
punct |
(,SGRK |
R1494 |
T2564 |
T2561 |
parataxis |
SGRK,sites |
R1495 |
T2565 |
T2566 |
dep |
917,MGKK |
R1496 |
T2566 |
T2564 |
dep |
MGKK,SGRK |
R1497 |
T2567 |
T2568 |
punct |
–,920 |
R1498 |
T2568 |
T2565 |
prep |
920,917 |
R1499 |
T2569 |
T2566 |
punct |
: ,MGKK |
R1500 |
T2570 |
T2564 |
punct |
;,SGRK |
R1501 |
T2571 |
T2564 |
dep |
926,SGRK |
R1502 |
T2572 |
T2573 |
punct |
–,929 |
R1503 |
T2573 |
T2571 |
prep |
929,926 |
R1504 |
T2574 |
T2564 |
punct |
: ,SGRK |
R1505 |
T2575 |
T2564 |
punct |
),SGRK |
R1506 |
T2576 |
T2514 |
punct |
.,contains |
R1507 |
T2578 |
T2579 |
nsubj |
Acdp2,has |
R1508 |
T2580 |
T2581 |
det |
a,region |
R1509 |
T2581 |
T2579 |
dobj |
region,has |
R1510 |
T2582 |
T2583 |
npadvmod |
glycine,rich |
R1511 |
T2583 |
T2581 |
amod |
rich,region |
R1512 |
T2584 |
T2583 |
punct |
-,rich |
R1513 |
T2585 |
T2586 |
punct |
(,GAGGSGSASGTVGGKGGAGVAG |
R1514 |
T2586 |
T2581 |
parataxis |
GAGGSGSASGTVGGKGGAGVAG,region |
R1515 |
T2587 |
T2586 |
dep |
201,GAGGSGSASGTVGGKGGAGVAG |
R1516 |
T2588 |
T2589 |
punct |
–,222 |
R1517 |
T2589 |
T2587 |
prep |
222,201 |
R1518 |
T2590 |
T2586 |
punct |
: ,GAGGSGSASGTVGGKGGAGVAG |
R1519 |
T2591 |
T2586 |
punct |
),GAGGSGSASGTVGGKGGAGVAG |
R1520 |
T2592 |
T2579 |
punct |
.,has |
R1521 |
T2594 |
T2595 |
nsubj |
Acdp3,possesses |
R1522 |
T2596 |
T2597 |
det |
a,region |
R1523 |
T2597 |
T2595 |
dobj |
region,possesses |
R1524 |
T2598 |
T2597 |
amod |
large,region |
R1525 |
T2599 |
T2600 |
npadvmod |
alanine,rich |
R1526 |
T2600 |
T2597 |
amod |
rich,region |
R1527 |
T2601 |
T2600 |
punct |
-,rich |
R1528 |
T2602 |
T2603 |
punct |
(,2 |
R1529 |
T2603 |
T2597 |
parataxis |
2,region |
R1530 |
T2604 |
T2605 |
punct |
–,261 |
R1531 |
T2605 |
T2603 |
prep |
261,2 |
R1532 |
T2606 |
T2603 |
punct |
),2 |
R1533 |
T2607 |
T2597 |
cc |
and,region |
R1534 |
T2608 |
T2609 |
det |
a,region |
R1535 |
T2609 |
T2597 |
conj |
region,region |
R1536 |
T2610 |
T2609 |
amod |
large,region |
R1537 |
T2611 |
T2612 |
npadvmod |
leucine,rich |
R1538 |
T2612 |
T2609 |
amod |
rich,region |
R1539 |
T2613 |
T2612 |
punct |
-,rich |
R1540 |
T2614 |
T2615 |
punct |
(,201 |
R1541 |
T2615 |
T2609 |
parataxis |
201,region |
R1542 |
T2616 |
T2617 |
punct |
–,299 |
R1543 |
T2617 |
T2615 |
prep |
299,201 |
R1544 |
T2618 |
T2615 |
punct |
),201 |
R1545 |
T2619 |
T2595 |
punct |
.,possesses |
R1546 |
T2621 |
T2622 |
nsubj |
Acdp4,contains |
R1547 |
T2623 |
T2624 |
det |
a,pattern |
R1548 |
T2624 |
T2622 |
dobj |
pattern,contains |
R1549 |
T2625 |
T2624 |
compound |
leucine,pattern |
R1550 |
T2626 |
T2624 |
compound |
zipper,pattern |
R1551 |
T2627 |
T2628 |
punct |
(,LVMVLLVLSGIFSGLNLGLMAL |
R1552 |
T2628 |
T2624 |
parataxis |
LVMVLLVLSGIFSGLNLGLMAL,pattern |
R1553 |
T2629 |
T2628 |
dep |
185,LVMVLLVLSGIFSGLNLGLMAL |
R1554 |
T2630 |
T2631 |
punct |
–,206 |
R1555 |
T2631 |
T2629 |
prep |
206,185 |
R1556 |
T2632 |
T2628 |
punct |
: ,LVMVLLVLSGIFSGLNLGLMAL |
R1557 |
T2633 |
T2628 |
punct |
),LVMVLLVLSGIFSGLNLGLMAL |
R1558 |
T2634 |
T2624 |
cc |
and,pattern |
R1559 |
T2635 |
T2636 |
det |
an,site |
R1560 |
T2636 |
T2624 |
conj |
site,pattern |
R1561 |
T2637 |
T2636 |
compound |
amidation,site |
R1562 |
T2638 |
T2639 |
punct |
(,GGRR |
R1563 |
T2639 |
T2636 |
parataxis |
GGRR,site |
R1564 |
T2640 |
T2639 |
dep |
7,GGRR |
R1565 |
T2641 |
T2642 |
punct |
–,10 |
R1566 |
T2642 |
T2640 |
prep |
10,7 |
R1567 |
T2643 |
T2639 |
punct |
: ,GGRR |
R1568 |
T2644 |
T2639 |
punct |
),GGRR |
R1569 |
T2645 |
T2622 |
punct |
.,contains |
R3572 |
T5826 |
T5827 |
compound |
Amino,acid |
R3573 |
T5827 |
T5828 |
compound |
acid,sequence |
R3574 |
T5828 |
T5829 |
compound |
sequence,alignment |
R3575 |
T5830 |
T5829 |
compound |
homology,alignment |
R3576 |
T5831 |
T5829 |
prep |
for,alignment |
R3577 |
T5832 |
T5831 |
pobj |
all,for |
R3578 |
T5833 |
T5832 |
prep |
of,all |
R3579 |
T5834 |
T5835 |
det |
the,genes |
R3580 |
T5835 |
T5833 |
pobj |
genes,of |
R3581 |
T5836 |
T5835 |
nmod |
ACDP,genes |
R3582 |
T5837 |
T5836 |
cc |
and,ACDP |
R3583 |
T5838 |
T5836 |
conj |
Acdp,ACDP |
R3584 |
T5839 |
T5835 |
prep |
within,genes |
R3585 |
T5840 |
T5841 |
det |
the,domain |
R3586 |
T5841 |
T5839 |
pobj |
domain,within |
R3587 |
T5842 |
T5841 |
compound |
ACD,domain |
R3588 |
T5843 |
T5829 |
punct |
.,alignment |
R3589 |
T5845 |
T5846 |
det |
The,data |
R3590 |
T5846 |
T5848 |
nsubjpass |
data,deposited |
R3591 |
T5847 |
T5846 |
compound |
sequence,data |
R3592 |
T5849 |
T5846 |
prep |
for,data |
R3593 |
T5850 |
T5851 |
det |
the,genes |
R3594 |
T5851 |
T5849 |
pobj |
genes,for |
R3595 |
T5852 |
T5851 |
compound |
Acdp,genes |
R3596 |
T5853 |
T5848 |
aux |
have,deposited |
R3597 |
T5854 |
T5848 |
auxpass |
been,deposited |
R3598 |
T5855 |
T5848 |
prep |
in,deposited |
R3599 |
T5856 |
T5855 |
pobj |
GenBank,in |
R3600 |
T5857 |
T5848 |
prep |
under,deposited |
R3601 |
T5858 |
T5859 |
compound |
accession,AF202994 |
R3602 |
T5859 |
T5857 |
pobj |
AF202994,under |
R3603 |
T5860 |
T5859 |
compound |
number,AF202994 |
R3604 |
T5861 |
T5859 |
punct |
(,AF202994 |
R3605 |
T5862 |
T5859 |
appos |
Acdp1,AF202994 |
R3606 |
T5863 |
T5859 |
punct |
),AF202994 |
R3607 |
T5864 |
T5859 |
punct |
", ",AF202994 |
R3608 |
T5865 |
T5859 |
conj |
AF216961,AF202994 |
R3609 |
T5866 |
T5865 |
punct |
(,AF216961 |
R3610 |
T5867 |
T5865 |
appos |
Acdp2,AF216961 |
R3611 |
T5868 |
T5865 |
punct |
),AF216961 |
R3612 |
T5869 |
T5865 |
punct |
", ",AF216961 |
R3613 |
T5870 |
T5865 |
conj |
AF216964,AF216961 |
R3614 |
T5871 |
T5870 |
punct |
(,AF216964 |
R3615 |
T5872 |
T5870 |
appos |
Acdp3,AF216964 |
R3616 |
T5873 |
T5870 |
punct |
),AF216964 |
R3617 |
T5874 |
T5870 |
cc |
and,AF216964 |
R3618 |
T5875 |
T5870 |
conj |
AF216963,AF216964 |
R3619 |
T5876 |
T5875 |
punct |
(,AF216963 |
R3620 |
T5877 |
T5875 |
appos |
Acdp4,AF216963 |
R3621 |
T5878 |
T5859 |
punct |
),AF202994 |
R3622 |
T5879 |
T5848 |
punct |
.,deposited |
R3623 |
T5881 |
T5882 |
amod |
Identical,acids |
R3624 |
T5882 |
T5884 |
nsubjpass |
acids,shaded |
R3625 |
T5883 |
T5882 |
compound |
amino,acids |
R3626 |
T5885 |
T5882 |
cc |
or,acids |
R3627 |
T5886 |
T5887 |
compound |
amino,acids |
R3628 |
T5887 |
T5882 |
conj |
acids,acids |
R3629 |
T5888 |
T5887 |
prep |
with,acids |
R3630 |
T5889 |
T5890 |
advmod |
very,strong |
R3631 |
T5890 |
T5891 |
amod |
strong,homologies |
R3632 |
T5891 |
T5888 |
pobj |
homologies,with |
R3633 |
T5892 |
T5891 |
prep |
among,homologies |
R3634 |
T5893 |
T5894 |
det |
all,proteins |
R3635 |
T5894 |
T5892 |
pobj |
proteins,among |
R3636 |
T5895 |
T5884 |
auxpass |
were,shaded |
R3637 |
T5896 |
T5884 |
oprd |
black,shaded |
R3638 |
T5897 |
T5884 |
punct |
.,shaded |
R3639 |
T5899 |
T5900 |
amod |
Identical,acids |
R3640 |
T5900 |
T5902 |
nsubjpass |
acids,shaded |
R3641 |
T5901 |
T5900 |
compound |
amino,acids |
R3642 |
T5903 |
T5900 |
cc |
or,acids |
R3643 |
T5904 |
T5905 |
compound |
amino,acids |
R3644 |
T5905 |
T5900 |
conj |
acids,acids |
R3645 |
T5906 |
T5905 |
prep |
with,acids |
R3646 |
T5907 |
T5908 |
advmod |
very,strong |
R3647 |
T5908 |
T5909 |
amod |
strong,homologies |
R3648 |
T5909 |
T5906 |
pobj |
homologies,with |
R3649 |
T5910 |
T5909 |
prep |
in,homologies |
R3650 |
T5911 |
T5910 |
pobj |
most,in |
R3651 |
T5912 |
T5911 |
prep |
of,most |
R3652 |
T5913 |
T5914 |
det |
the,proteins |
R3653 |
T5914 |
T5912 |
pobj |
proteins,of |
R3654 |
T5915 |
T5902 |
auxpass |
were,shaded |
R3655 |
T5916 |
T5902 |
oprd |
grey,shaded |
R3656 |
T5917 |
T5902 |
punct |
.,shaded |
R3657 |
T5919 |
T5920 |
compound |
Dot,lines |
R3658 |
T5920 |
T5921 |
nsubj |
lines,represent |
R3659 |
T5922 |
T5921 |
dobj |
gaps,represent |
R3660 |
T5923 |
T5922 |
prep |
for,gaps |
R3661 |
T5924 |
T5925 |
det |
the,alignment |
R3662 |
T5925 |
T5923 |
pobj |
alignment,for |
R3663 |
T5926 |
T5921 |
punct |
.,represent |
R3664 |
T6010 |
T6011 |
compound |
Amino,acid |
R3665 |
T6011 |
T6012 |
compound |
acid,alignment |
R3666 |
T6013 |
T6012 |
compound |
sequence,alignment |
R3667 |
T6014 |
T6012 |
acl |
showing,alignment |
R3668 |
T6015 |
T6016 |
det |
the,conservation |
R3669 |
T6016 |
T6014 |
dobj |
conservation,showing |
R3670 |
T6017 |
T6016 |
prep |
of,conservation |
R3671 |
T6018 |
T6019 |
compound |
ACD,domain |
R3672 |
T6019 |
T6017 |
pobj |
domain,of |
R3673 |
T6020 |
T6014 |
prep |
in,showing |
R3674 |
T6021 |
T6022 |
amod |
various,species |
R3675 |
T6022 |
T6020 |
pobj |
species,in |
R3676 |
T6023 |
T6012 |
punct |
.,alignment |
R3677 |
T6025 |
T6026 |
nsubj |
Amip3,is |
R3678 |
T6027 |
T6028 |
det |
a,protein |
R3679 |
T6028 |
T6026 |
attr |
protein,is |
R3680 |
T6029 |
T6028 |
prep |
from,protein |
R3681 |
T6030 |
T6031 |
compound |
Saccharomyces,cerevisiae |
R3682 |
T6031 |
T6029 |
pobj |
cerevisiae,from |
R3683 |
T6032 |
T6028 |
punct |
(,protein |
R3684 |
T6033 |
T6028 |
appos |
NP_014581,protein |
R3685 |
T6034 |
T6026 |
punct |
),is |
R3686 |
T6035 |
T6026 |
punct |
.,is |
R3687 |
T6037 |
T6038 |
nsubj |
CanG,is |
R3688 |
T6039 |
T6040 |
det |
a,protein |
R3689 |
T6040 |
T6038 |
attr |
protein,is |
R3690 |
T6041 |
T6040 |
prep |
from,protein |
R3691 |
T6042 |
T6043 |
compound |
Candida,glabrata |
R3692 |
T6043 |
T6041 |
pobj |
glabrata,from |
R3693 |
T6044 |
T6040 |
punct |
(,protein |
R3694 |
T6045 |
T6040 |
appos |
AAF33142,protein |
R3695 |
T6046 |
T6038 |
punct |
),is |
R3696 |
T6047 |
T6038 |
punct |
.,is |
R3697 |
T6049 |
T6050 |
nsubj |
NeuC,is |
R3698 |
T6051 |
T6049 |
punct |
(,NeuC |
R3699 |
T6052 |
T6049 |
appos |
EAA31204,NeuC |
R3700 |
T6053 |
T6050 |
punct |
),is |
R3701 |
T6054 |
T6055 |
det |
a,protein |
R3702 |
T6055 |
T6050 |
attr |
protein,is |
R3703 |
T6056 |
T6055 |
amod |
hypothetical,protein |
R3704 |
T6057 |
T6055 |
prep |
from,protein |
R3705 |
T6058 |
T6059 |
compound |
Neurospora,crassa |
R3706 |
T6059 |
T6057 |
pobj |
crassa,from |
R3707 |
T6060 |
T6050 |
punct |
.,is |
R3708 |
T6062 |
T6063 |
nsubj |
DroM,is |
R3709 |
T6064 |
T6065 |
det |
a,product |
R3710 |
T6065 |
T6063 |
attr |
product,is |
R3711 |
T6066 |
T6065 |
compound |
gene,product |
R3712 |
T6067 |
T6065 |
prep |
from,product |
R3713 |
T6068 |
T6069 |
compound |
D.,melanogaster |
R3714 |
T6069 |
T6067 |
pobj |
melanogaster,from |
R3715 |
T6070 |
T6063 |
punct |
.,is |
R3716 |
T6072 |
T6073 |
det |
The,number |
R3717 |
T6073 |
T6075 |
nsubj |
number,is |
R3718 |
T6074 |
T6073 |
compound |
accession,number |
R3719 |
T6076 |
T6073 |
prep |
for,number |
R3720 |
T6077 |
T6078 |
det |
this,gene |
R3721 |
T6078 |
T6076 |
pobj |
gene,for |
R3722 |
T6079 |
T6075 |
attr |
CG40084,is |
R3723 |
T6080 |
T6075 |
prep |
in,is |
R3724 |
T6081 |
T6080 |
pobj |
BDGP,in |
R3725 |
T6082 |
T6081 |
punct |
(,BDGP |
R3726 |
T6083 |
T6084 |
compound |
Berkeley,Project |
R3727 |
T6084 |
T6081 |
appos |
Project,BDGP |
R3728 |
T6085 |
T6084 |
compound |
Drosophila,Project |
R3729 |
T6086 |
T6084 |
compound |
Genome,Project |
R3730 |
T6087 |
T6075 |
punct |
),is |
R3731 |
T6088 |
T6075 |
punct |
.,is |
R3732 |
T6090 |
T6091 |
nsubj |
AnoG,represents |
R3733 |
T6092 |
T6093 |
det |
a,protein |
R3734 |
T6093 |
T6091 |
dobj |
protein,represents |
R3735 |
T6094 |
T6093 |
prep |
from,protein |
R3736 |
T6095 |
T6096 |
det |
the,str. |
R3737 |
T6096 |
T6094 |
pobj |
str.,from |
R3738 |
T6097 |
T6096 |
compound |
anopheles,str. |
R3739 |
T6098 |
T6096 |
compound |
gambiae,str. |
R3740 |
T6099 |
T6100 |
punct |
(,EAA01004 |
R3741 |
T6100 |
T6094 |
parataxis |
EAA01004,from |
R3742 |
T6101 |
T6100 |
punct |
),EAA01004 |
R3743 |
T6102 |
T6091 |
punct |
.,represents |
R3744 |
T6104 |
T6105 |
nsubj |
CaeE,is |
R3745 |
T6106 |
T6104 |
punct |
(,CaeE |
R3746 |
T6107 |
T6104 |
appos |
AAK77203,CaeE |
R3747 |
T6108 |
T6105 |
punct |
),is |
R3748 |
T6109 |
T6110 |
det |
a,protein |
R3749 |
T6110 |
T6105 |
attr |
protein,is |
R3750 |
T6111 |
T6110 |
amod |
hypothetical,protein |
R3751 |
T6112 |
T6110 |
prep |
from,protein |
R3752 |
T6113 |
T6114 |
det |
the,elegans |
R3753 |
T6114 |
T6112 |
pobj |
elegans,from |
R3754 |
T6115 |
T6114 |
compound |
Caenohabditis,elegans |
R3755 |
T6116 |
T6105 |
punct |
.,is |
R3756 |
T6118 |
T6119 |
nsubj |
CorC,represents |
R3757 |
T6120 |
T6121 |
compound |
bacteria,magnesium |
R3758 |
T6121 |
T6119 |
dobj |
magnesium,represents |
R3759 |
T6122 |
T6121 |
cc |
and,magnesium |
R3760 |
T6123 |
T6124 |
compound |
cobalt,protein |
R3761 |
T6124 |
T6121 |
conj |
protein,magnesium |
R3762 |
T6125 |
T6124 |
compound |
efflux,protein |
R3763 |
T6126 |
T6121 |
prep |
from,magnesium |
R3764 |
T6127 |
T6128 |
det |
the,oneidensis |
R3765 |
T6128 |
T6126 |
pobj |
oneidensis,from |
R3766 |
T6129 |
T6128 |
compound |
Shewanella,oneidensis |
R3767 |
T6130 |
T6119 |
punct |
.,represents |
R3768 |
T6132 |
T6133 |
nsubj |
XyFD,is |
R3769 |
T6134 |
T6135 |
det |
a,protein |
R3770 |
T6135 |
T6133 |
attr |
protein,is |
R3771 |
T6136 |
T6135 |
amod |
hypothetical,protein |
R3772 |
T6137 |
T6135 |
prep |
from,protein |
R3773 |
T6138 |
T6139 |
det |
the,Dixon |
R3774 |
T6139 |
T6137 |
pobj |
Dixon,from |
R3775 |
T6140 |
T6139 |
compound |
Xylella,Dixon |
R3776 |
T6141 |
T6139 |
compound |
fastidiosa,Dixon |
R3777 |
T6142 |
T6135 |
punct |
(,protein |
R3778 |
T6143 |
T6135 |
appos |
ZP_00038107,protein |
R3779 |
T6144 |
T6133 |
punct |
),is |
R3780 |
T6145 |
T6133 |
punct |
.,is |
R3781 |
T6153 |
T6154 |
amod |
Phylogenetic,tree |
R3782 |
T6155 |
T6154 |
acl |
showing,tree |
R3783 |
T6156 |
T6155 |
dobj |
relationships,showing |
R3784 |
T6157 |
T6156 |
prep |
among,relationships |
R3785 |
T6158 |
T6157 |
pobj |
proteins,among |
R3786 |
T6159 |
T6158 |
acl |
containing,proteins |
R3787 |
T6160 |
T6161 |
det |
the,domain |
R3788 |
T6161 |
T6159 |
dobj |
domain,containing |
R3789 |
T6162 |
T6161 |
compound |
ACD,domain |
R3790 |
T6163 |
T6161 |
prep |
from,domain |
R3791 |
T6164 |
T6165 |
nmod |
figure,2 |
R3792 |
T6165 |
T6163 |
pobj |
2,from |
R3793 |
T6166 |
T6165 |
cc |
and,2 |
R3794 |
T6167 |
T6165 |
conj |
3,2 |
R3795 |
T6168 |
T6154 |
punct |
.,tree |
R3796 |
T6170 |
T6171 |
det |
The,tree |
R3797 |
T6171 |
T6173 |
nsubjpass |
tree,constructed |
R3798 |
T6172 |
T6171 |
amod |
phylogenetic,tree |
R3799 |
T6174 |
T6173 |
auxpass |
was,constructed |
R3800 |
T6175 |
T6173 |
prep |
according,constructed |
R3801 |
T6176 |
T6175 |
prep |
to,according |
R3802 |
T6177 |
T6178 |
det |
the,calculation |
R3803 |
T6178 |
T6176 |
pobj |
calculation,to |
R3804 |
T6179 |
T6178 |
prep |
of,calculation |
R3805 |
T6180 |
T6181 |
det |
the,match |
R3806 |
T6181 |
T6179 |
pobj |
match,of |
R3807 |
T6182 |
T6181 |
amod |
best,match |
R3808 |
T6183 |
T6181 |
prep |
for,match |
R3809 |
T6184 |
T6185 |
det |
the,sequences |
R3810 |
T6185 |
T6183 |
pobj |
sequences,for |
R3811 |
T6186 |
T6185 |
amod |
selected,sequences |
R3812 |
T6187 |
T6173 |
punct |
.,constructed |
R3813 |
T6189 |
T6190 |
nsubj |
Abbreviations,are |
R3814 |
T6191 |
T6189 |
prep |
for,Abbreviations |
R3815 |
T6192 |
T6193 |
det |
each,protein |
R3816 |
T6193 |
T6191 |
pobj |
protein,for |
R3817 |
T6194 |
T6195 |
det |
the,same |
R3818 |
T6195 |
T6190 |
attr |
same,are |
R3819 |
T6196 |
T6197 |
mark |
as,presented |
R3820 |
T6197 |
T6195 |
advcl |
presented,same |
R3821 |
T6198 |
T6197 |
prep |
in,presented |
R3822 |
T6199 |
T6198 |
pobj |
figure,in |
R3823 |
T6200 |
T6199 |
nummod |
3,figure |
R3824 |
T6201 |
T6190 |
punct |
.,are |
R3825 |
T6228 |
T6229 |
nummod |
Four,domains |
R3826 |
T6230 |
T6229 |
compound |
transmembrane,domains |
R3827 |
T6231 |
T6229 |
prep |
within,domains |
R3828 |
T6232 |
T6233 |
compound |
Acdp4,protein |
R3829 |
T6233 |
T6231 |
pobj |
protein,within |
R3830 |
T6234 |
T6229 |
punct |
.,domains |
R3831 |
T6236 |
T6237 |
compound |
Transmembrane,domains |
R3832 |
T6237 |
T6238 |
nsubjpass |
domains,predicted |
R3833 |
T6239 |
T6238 |
auxpass |
were,predicted |
R3834 |
T6240 |
T6238 |
prep |
by,predicted |
R3835 |
T6241 |
T6242 |
det |
the,program |
R3836 |
T6242 |
T6240 |
pobj |
program,by |
R3837 |
T6243 |
T6242 |
compound |
TMHMM,program |
R3838 |
T6244 |
T6238 |
punct |
.,predicted |
R3839 |
T6246 |
T6247 |
det |
The,plot |
R3840 |
T6247 |
T6248 |
nsubj |
plot,shows |
R3841 |
T6249 |
T6250 |
det |
the,probabilities |
R3842 |
T6250 |
T6248 |
dobj |
probabilities,shows |
R3843 |
T6251 |
T6250 |
amod |
posterior,probabilities |
R3844 |
T6252 |
T6250 |
prep |
of,probabilities |
R3845 |
T6253 |
T6254 |
amod |
inside,TM |
R3846 |
T6254 |
T6258 |
compound |
TM,helix |
R3847 |
T6255 |
T6254 |
punct |
/,TM |
R3848 |
T6256 |
T6254 |
amod |
outside,TM |
R3849 |
T6257 |
T6254 |
punct |
/,TM |
R3850 |
T6258 |
T6252 |
pobj |
helix,of |
R3851 |
T6259 |
T6248 |
punct |
.,shows |
R3852 |
T6261 |
T6262 |
prep |
At,shown |
R3853 |
T6263 |
T6264 |
det |
the,top |
R3854 |
T6264 |
T6261 |
pobj |
top,At |
R3855 |
T6265 |
T6264 |
prep |
of,top |
R3856 |
T6266 |
T6267 |
det |
the,plot |
R3857 |
T6267 |
T6265 |
pobj |
plot,of |
R3858 |
T6268 |
T6261 |
punct |
(,At |
R3859 |
T6269 |
T6261 |
prep |
between,At |
R3860 |
T6270 |
T6269 |
pobj |
1,between |
R3861 |
T6271 |
T6270 |
cc |
and,1 |
R3862 |
T6272 |
T6270 |
conj |
1.2,1 |
R3863 |
T6273 |
T6262 |
punct |
),shown |
R3864 |
T6274 |
T6275 |
det |
the,prediction |
R3865 |
T6275 |
T6262 |
nsubjpass |
prediction,shown |
R3866 |
T6276 |
T6277 |
npadvmod |
N,best |
R3867 |
T6277 |
T6275 |
amod |
best,prediction |
R3868 |
T6278 |
T6277 |
punct |
-,best |
R3869 |
T6279 |
T6262 |
auxpass |
is,shown |
R3870 |
T6280 |
T6262 |
punct |
.,shown |
R3871 |
T6282 |
T6283 |
det |
The,plot |
R3872 |
T6283 |
T6284 |
nsubjpass |
plot,obtained |
R3873 |
T6285 |
T6284 |
auxpass |
is,obtained |
R3874 |
T6286 |
T6284 |
prep |
by,obtained |
R3875 |
T6287 |
T6286 |
pcomp |
calculating,by |
R3876 |
T6288 |
T6289 |
det |
the,probability |
R3877 |
T6289 |
T6287 |
dobj |
probability,calculating |
R3878 |
T6290 |
T6289 |
amod |
total,probability |
R3879 |
T6291 |
T6292 |
mark |
that,sits |
R3880 |
T6292 |
T6289 |
acl |
sits,probability |
R3881 |
T6293 |
T6294 |
det |
a,residue |
R3882 |
T6294 |
T6292 |
nsubj |
residue,sits |
R3883 |
T6295 |
T6292 |
prep |
in,sits |
R3884 |
T6296 |
T6295 |
pobj |
helix,in |
R3885 |
T6297 |
T6292 |
punct |
", ",sits |
R3886 |
T6298 |
T6292 |
advmod |
inside,sits |
R3887 |
T6299 |
T6298 |
punct |
", ",inside |
R3888 |
T6300 |
T6298 |
cc |
or,inside |
R3889 |
T6301 |
T6298 |
conj |
outside,inside |
R3890 |
T6302 |
T6289 |
acl |
summed,probability |
R3891 |
T6303 |
T6302 |
prep |
over,summed |
R3892 |
T6304 |
T6305 |
det |
all,paths |
R3893 |
T6305 |
T6303 |
pobj |
paths,over |
R3894 |
T6306 |
T6305 |
amod |
possible,paths |
R3895 |
T6307 |
T6305 |
prep |
through,paths |
R3896 |
T6308 |
T6309 |
det |
the,model |
R3897 |
T6309 |
T6307 |
pobj |
model,through |
R3898 |
T6310 |
T6284 |
punct |
.,obtained |
R4094 |
T6626 |
T6627 |
compound |
amino,acid |
R4095 |
T6627 |
T6623 |
conj |
acid,Nucleotide |
R4096 |
T6628 |
T6629 |
punct |
(,% |
R4097 |
T6629 |
T6624 |
parataxis |
%,homologies |
R4098 |
T6630 |
T6629 |
punct |
),% |
R4099 |
T6631 |
T6624 |
prep |
between,homologies |
R4100 |
T6632 |
T6633 |
amod |
human,ACDP |
R4101 |
T6633 |
T6634 |
nmod |
ACDP,members |
R4102 |
T6634 |
T6631 |
pobj |
members,between |
R4103 |
T6635 |
T6633 |
cc |
and,ACDP |
R4104 |
T6636 |
T6637 |
compound |
mouse,Acdp |
R4105 |
T6637 |
T6633 |
conj |
Acdp,ACDP |
R4106 |
T6638 |
T6624 |
punct |
.,homologies |
R1312 |
T2367 |
T2365 |
prep |
of,homology |
R1313 |
T2368 |
T2367 |
pobj |
proteins,of |
R1314 |
T2369 |
T2370 |
punct |
(,Fig. |
R4092 |
T6623 |
T6624 |
nmod |
Nucleotide,homologies |
R4093 |
T6625 |
T6623 |
cc |
and,Nucleotide |
R1285 |
T2338 |
T2331 |
nmod |
John,Dr. |
R1286 |
T2339 |
T2331 |
nmod |
Hopkins,Dr. |
R1287 |
T2340 |
T2331 |
nmod |
Bloomberg,Dr. |
R1288 |
T2341 |
T2331 |
punct |
", ",Dr. |
R1289 |
T2342 |
T2331 |
nmod |
School,Dr. |
R1290 |
T2343 |
T2331 |
nmod |
of,Dr. |
R1291 |
T2344 |
T2331 |
nmod |
Public,Dr. |
R1292 |
T2345 |
T2331 |
nmod |
Health,Dr. |
R1293 |
T2346 |
T2331 |
punct |
),Dr. |
R1294 |
T2347 |
T2325 |
punct |
.,confer |
R1295 |
T2349 |
T2350 |
det |
The,relationships |
R1296 |
T2350 |
T2352 |
nsubjpass |
relationships,illustrated |
R1297 |
T2351 |
T2350 |
amod |
evolutionary,relationships |
R1298 |
T2353 |
T2350 |
prep |
among,relationships |
R1299 |
T2354 |
T2355 |
det |
those,proteins |
R1300 |
T2355 |
T2353 |
pobj |
proteins,among |
R1301 |
T2356 |
T2352 |
auxpass |
are,illustrated |
R1302 |
T2357 |
T2352 |
prep |
by,illustrated |
R1303 |
T2358 |
T2359 |
det |
a,tree |
R1304 |
T2359 |
T2357 |
pobj |
tree,by |
R1305 |
T2360 |
T2359 |
amod |
phylogenetic,tree |
R1306 |
T2361 |
T2359 |
acl |
constructed,tree |
R1307 |
T2362 |
T2361 |
prep |
based,constructed |
R1308 |
T2363 |
T2362 |
prep |
on,based |
R1309 |
T2364 |
T2365 |
det |
the,homology |
R1310 |
T2365 |
T2363 |
pobj |
homology,on |
R1311 |
T2366 |
T2365 |
compound |
AA,homology |
R1315 |
T2370 |
T2352 |
parataxis |
Fig.,illustrated |
R1316 |
T2371 |
T2370 |
nummod |
4,Fig. |
R1317 |
T2372 |
T2370 |
punct |
),Fig. |
R1318 |
T2373 |
T2352 |
punct |
.,illustrated |
R1319 |
T2376 |
T2377 |
nsubj |
We,found |
R1320 |
T2378 |
T2379 |
mark |
that,contain |
R1321 |
T2379 |
T2377 |
ccomp |
contain,found |
R1322 |
T2380 |
T2381 |
det |
all,members |
R1323 |
T2381 |
T2379 |
nsubj |
members,contain |
R1324 |
T2382 |
T2381 |
compound |
mouse,members |
R1325 |
T2383 |
T2381 |
compound |
Acdp,members |
R1326 |
T2384 |
T2385 |
nummod |
four,domains |
R1327 |
T2385 |
T2379 |
dobj |
domains,contain |
R1328 |
T2386 |
T2385 |
amod |
distinct,domains |
R1329 |
T2387 |
T2385 |
compound |
transmembrane,domains |
R1330 |
T2388 |
T2389 |
punct |
(,Fig. |
R1331 |
T2389 |
T2385 |
parataxis |
Fig.,domains |
R1332 |
T2390 |
T2389 |
nummod |
5,Fig. |
R1333 |
T2391 |
T2389 |
punct |
),Fig. |
R1334 |
T2392 |
T2385 |
punct |
", ",domains |
R1335 |
T2393 |
T2394 |
nummod |
two,domains |
R1337 |
T2395 |
T2394 |
compound |
CBS,domains |
R1418 |
T2484 |
T2483 |
prep |
to,adjacent |
R1419 |
T2485 |
T2486 |
nummod |
two,domains |
R1420 |
T2486 |
T2484 |
pobj |
domains,to |
R1421 |
T2487 |
T2486 |
amod |
intracellular,domains |
R1422 |
T2488 |
T2486 |
compound |
CBS,domains |
R1423 |
T2489 |
T2466 |
punct |
.,is |
R1424 |
T2491 |
T2492 |
det |
A,domain |
R1425 |
T2492 |
T2496 |
nsubjpass |
domain,found |
R1426 |
T2493 |
T2494 |
npadvmod |
cNMP,binding |
R1427 |
T2494 |
T2492 |
amod |
binding,domain |
R1428 |
T2495 |
T2494 |
punct |
-,binding |
R1429 |
T2497 |
T2498 |
punct |
(,domain |
R1430 |
T2498 |
T2492 |
parataxis |
domain,domain |
R1431 |
T2499 |
T2500 |
amod |
cyclic,monophosphate |
R1432 |
T2500 |
T2498 |
nmod |
monophosphate,domain |
R1433 |
T2501 |
T2500 |
nmod |
nucleotide,monophosphate |
R1434 |
T2502 |
T2500 |
punct |
-,monophosphate |
R1435 |
T2503 |
T2500 |
punct |
-,monophosphate |
R1436 |
T2504 |
T2500 |
amod |
binding,monophosphate |
R1437 |
T2505 |
T2498 |
punct |
),domain |
R1438 |
T2506 |
T2496 |
auxpass |
was,found |
R1439 |
T2507 |
T2496 |
prep |
in,found |
R1440 |
T2508 |
T2509 |
det |
all,members |
R1441 |
T2509 |
T2507 |
pobj |
members,in |
R1442 |
T2510 |
T2509 |
compound |
Acdp,members |
R1443 |
T2511 |
T2496 |
punct |
.,found |
R1444 |
T2513 |
T2514 |
prep |
In,contains |
R1445 |
T2515 |
T2513 |
pobj |
addition,In |
R1446 |
T2516 |
T2514 |
punct |
", ",contains |
R1447 |
T2517 |
T2514 |
nsubj |
Acdp1,contains |
R1448 |
T2518 |
T2519 |
det |
an,region |
R1449 |
T2519 |
T2514 |
dobj |
region,contains |
R1450 |
T2520 |
T2521 |
npadvmod |
Alanine,rich |
R1451 |
T2521 |
T2519 |
amod |
rich,region |
R1452 |
T2522 |
T2521 |
punct |
-,rich |
R1453 |
T2523 |
T2524 |
punct |
(,AAAAAAAAA |
R1454 |
T2524 |
T2519 |
parataxis |
AAAAAAAAA,region |
R1455 |
T2525 |
T2524 |
dep |
2,AAAAAAAAA |
R1456 |
T2526 |
T2527 |
punct |
–,10 |
R1457 |
T2527 |
T2525 |
prep |
10,2 |
R1458 |
T2528 |
T2524 |
punct |
: ,AAAAAAAAA |
R1459 |
T2529 |
T2524 |
punct |
),AAAAAAAAA |
R1460 |
T2530 |
T2519 |
punct |
", ",region |
R1461 |
T2531 |
T2532 |
det |
a,region |
R1462 |
T2532 |
T2519 |
conj |
region,region |
R1463 |
T2533 |
T2534 |
npadvmod |
Leucine,rich |