PubMed@Masakazu SAGA:23622113
Annotations
{"target":"https://pubannotation.org/docs/sourcedb/PubMed@Masakazu%20SAGA/sourceid/23622113","sourcedb":"PubMed@Masakazu SAGA","sourceid":"23622113","text":"Unscrambling butterfly oogenesis\r\n\r\nBackground\r\n\r\nButterflies are popular model organisms to study physiological mechanisms underlying variability in oogenesis and egg provisioning in response to environmental conditions. Nothing is known, however, about the developmental mechanisms governing butterfly oogenesis, how polarity in the oocyte is established, or which particular maternal effect genes regulate early embryogenesis. To gain insights into these developmental mechanisms and to identify the conserved and divergent aspects of butterfly oogenesis, we analysed a de novo ovarian transcriptome of the Speckled Wood butterfly Pararge aegeria (L.), and compared the results with known model organisms such as Drosophila melanogaster and Bombyx mori.\r\n\r\nResults\r\n\r\nA total of 17306 contigs were annotated, with 30% possibly novel or highly divergent sequences observed. Pararge aegeria females expressed 74.5% of the genes that are known to be essential for D. melanogaster oogenesis. We discuss in great detail the genes involved in all aspects of oogenesis, including vitellogenesis and choriogenesis, plus those implicated in hormonal control of oogenesis and transgenerational hormonal effects. Compared to other insects, a number of significant differences were observed in the genes involved in stem cell maintenance and differentiation in the germarium, establishment of oocyte polarity, and in several aspects of maternal regulation of zygotic development.\r\n\r\nConclusions\r\n\r\nThis study provides valuable resources to investigate a number of divergent aspects of butterfly oogenesis requiring further research. 
In order to fully unscramble butterfly oogenesis, we now also have the resources to investigate expression patterns of oogenesis genes under a range of environmental conditions, and to establish their function.\r\n\r\nBackground\r\n\r\nSuccessful development relies heavily on parental contribution over and above the direct effect of maternal and paternal genes. For example, maternal effect genes, which have been particularly well studied in Drosophila melanogaster, are involved in setting up: 1) the location of the germ plasm and subsequent germ cell line development in the offspring, and 2) a basic framework of positional information, which is interpreted by the embryo’s own genetic program. Furthermore, insect embryos rely on nutrients for growth derived from the mother in the form of yolk deposited in the egg. The investigation of insect egg production (i.e. oogenesis) is thus not only crucial in understanding reproductive, and consequently fitness, variation; it is also a popular model system for studying epigenetic programming, the apoptotic pathway, stem cell behaviour, cell cycle regulation and developmental patterning mechanisms in general.\r\n\r\nResearch into the physiological mechanisms underlying insect oogenesis and egg provisioning has a rich history, particularly in moths and butterflies (Lepidoptera). However, to date, developmental genetic data sufficiently detailed to allow us to comprehensively understand the gene regulatory mechanisms underlying oogenesis and maternal effect gene expression controlling early embryogenesis exist only for the model organism D. melanogaster. Developmental genetic studies focussing on species other than D. melanogaster provide us with the opportunity to investigate how the Gene Regulatory Networks (GRNs) underlying insect oogenesis might have evolved.\r\n\r\nMaternal effects can have consequences that extend well beyond embryonic or juvenile development, affecting offspring fertility and longevity. 
The exact nature of the maternal effects and thus the contribution of a female to the phenotype (and fitness) of her offspring are not static, however, but to a large extent depend on her own internal state, resource availability and in general the environmental conditions she experienced during her life (both biotic and abiotic). As such maternal effects constitute a form of non-genetic transmission of environmental conditions across generations. This means that elements of the regulatory states from the oogenesis GRN of a mother can be passed on to the next generation. There is thus a developmental framework in place with mothers having the possibility to influence the fecundity and survival of their offspring in response to their own environment, thereby providing an alternative system of inheritance with profound consequences for phenotypic evolution. However, much of life history theory has been developed without regard to the actual developmental genetic basis of the variation in the traits being investigated, such as reproductive output and maternal effects. What has been lacking is a powerful model system to study the developmental genetics of insect reproduction in an evolutionary ecological context. 
Lepidoptera are ideal candidates to undertake such ecological evolutionary developmental (eco-evo-devo) studies given the vast amount of physiological data on oogenesis, as well as very detailed information, for butterflies in particular, on reproductive variability in relation to environmental variability.\r\n\r\nRecently, valuable functional genomic tools have been developed for butterflies; for example, for Melitaea cinxia to study life history variation, Bicyclus anynana to study wing colour patterning, the monarch butterfly Danaus plexippus to study long-distance migration, Heliconius species to study mimicry and for both Erynnis propertius and Papilio zelicaon to study variability among populations in response to environmental heterogeneity and climate change. The information that has been missing so far in butterflies is a detailed description of the ovarian transcriptome, including maternal regulation of patterning the embryo along its axes and mRNA contributed maternally to eggs. In fact, in Lepidoptera, there is a distinct lack of such developmental studies; only in the silkmoth Bombyx mori have a number of recent studies on candidate genes in maternal regulation of early embryogenesis (e.g. establishing positional information) been undertaken.\r\n\r\nThe Speckled Wood butterfly Pararge aegeria (L.), a temperate zone species, is a popular model species for evolutionary ecology studies, for example on plasticity in female reproduction. Female P. aegeria mate soon after emergence and usually mate only once. At eclosion they have no or just a few mature oocytes and if mated on the day of emergence, usually they start ovipositing 48 hrs later on the third day of their life. In female P. aegeria resources for reproduction are, to a significant degree, obtained during the larval stage and there is little opportunity to obtain more nitrogenous resources for reproduction through adult feeding or nuptial gifts. Like many other butterflies, P. 
aegeria has meroistic ovaries with 8 ovarioles. Each ovariole consists of a germarium (i.e. stem cell region), previtellogenic primary oocytes, vitellogenic eggs and mature chorionated eggs (Figure 1). A total of seven nurse cells transfer maternal proteins and mRNA of maternal effect genes into developing oocytes, whilst the somatic follicle cells surrounding the oocyte are involved in choriogenesis and vitellogenesis, as well as oocyte patterning.\r\n\r\nOverview of the ovarian morphology of the Speckled Wood butterfly\nPararge aegeria. (A) Female P.\naegeria laying an egg. (B) Complete meroistic P.\naegeria ovary, consisting of a total of 8 ovarioles. Two times 4\novarioles are attached to each other in the germarium region. Ovary in photo\nis still attached to the oviduct and part of the ovipositor. Only the\novaries were used for sequencing in this study. (C) Detail of\nprevitellogenic eggs, with nurse and follicle cells visible.\r\n\r\nIn this paper, we present a comprehensive study of the genes expressed during oogenesis for the butterfly P. aegeria, using de novo transcriptome sequencing and qPCR. Given the wealth of data on reproductive physiology in Lepidoptera, the genes implicated in hormonal control of reproduction will be investigated in particular detail in this study. Furthermore, as a first step in determining the conserved and divergent elements of the butterfly oogenesis GRN (including maternal regulation of zygotic gene expression and embryonic patterning), we investigated which of the genes known to play an essential role in D. melanogaster or B. mori oogenesis were also transcribed by P. aegeria.\r\n\r\nAlthough the number of ovarioles differs among D. melanogaster, P. aegeria and B. mori, these species have similar organisation of their meroistic ovaries, making for an ideal comparison. Furthermore, within Lepidoptera, the silkmoth B. mori and butterflies (including P. 
aegeria) belong to the more derived division Ditrysia within the infraorder Heteroneura and thus are likely to share developmental characteristics. Many aspects of maternal regulation of early D. melanogaster embryogenesis can be explained by the fact that it is a long germ band insect. Within the order Lepidoptera there is a transition from a short germ in the more ancestral species to something more similar to long germ in the more derived species, such as those belonging to Ditrysia. This fact, again, makes for an interesting comparison between the three species.\r\n\r\nWe describe particular features of the P. aegeria ovarian transcriptome that were revealed during assembly and annotation, including orthologs of genes involved in several major conserved signaling pathways, maternal regulation of early embryogenesis, vitellogenesis and choriogenesis. We observed that P. aegeria differed most significantly from D. melanogaster (and many other insect species) in terms of stem cell maintenance in the germarium, EGF signalling in establishing oocyte polarity along the anterior-posterior (AP) and dorsal-ventral (DV) axes, and the signalling mechanisms used at the termini of the oocyte. Furthermore, we observed a high proportion of apparently unique sequences in the transcriptome, and we discuss how future exploration of the function and expression patterns of these unique sequences will undoubtedly provide valuable insights into the evolution of insect oogenesis.\r\n\r\nResults\r\n\r\nThe main aim of this study was to identify the genes expressed in the ovaries involved in oocyte formation, establishing oocyte polarities and the RNA transcripts transferred into the eggs by the mother, which either regulate early embryogenesis or are needed during early embryogenesis. Drosophila melanogaster is arguably the best studied insect species in terms of ovarian gene expression and maternal effect gene function. 
Additional file 1 contains an extensively referenced list of the key essential oogenesis genes. FlyBase and SilkBase were used as a starting point to conduct the comprehensive literature search. The vast majority of papers thus mainly concern the model species D. melanogaster and B. mori. Furthermore, for D. melanogaster genes, a high-throughput developmental time series database was consulted for FPKM (Fragments Per Kilobase of exon per Million fragments mapped)-based gene expression levels (see also Methods), as well as an in-situ database for maternal transcript contribution to the oocyte. The oogenesis genes discussed in this paper have been classified into functional groupings and were identified predominantly from D. melanogaster studies (and to a lesser extent B. mori studies). Studies on D. melanogaster oogenesis are too numerous to list exhaustively, but key relevant papers (and references therein) have been cited to enable the reader to explore the role of each particular gene during oogenesis further. It should of course be noted that quite a number of genes are expressed in different functional contexts during oogenesis, such as genes encoding the components of various signalling pathways or a gene such as cornichon, which is involved in setting up both AP and DV axis polarity as well as oocyte nucleus localisation in D. melanogaster. Such genes occur only once in Additional file 1 and the tables presented in this paper, but the references to and discussion of such genes will highlight their pleiotropic functions.\r\n\r\nAnnotation and verification of expression by means of qPCR\r\n\r\nPararge aegeria egg and ovary RNA was sequenced using Illumina short read RNA-Seq technology. Of the 25266 contigs, 17306 contigs were of sufficient quality and length to be annotated (both automatically and manually), with 30%, possibly novel or highly divergent, remaining uncharacterised (Table 1; Additional file 2; see Methods). The presence or absence of P. 
aegeria orthologs in the transcriptome data of 1035 essential oogenesis genes listed in Additional file 1 was verified manually; 833 were found, which is 80.5%. A total of 994 genes out of the 1035 had been identified in D. melanogaster studies. Pararge aegeria expressed 741 of these, which is 74.5%. A further 56 genes were found to be expressed for which functionality during oogenesis can be inferred, but which have not been verified experimentally. Specific genes will be discussed elsewhere in this paper. A large number of these genes are not only transcribed during oogenesis to produce an oocyte, but maternal transcripts were also found to be present in the oocyte itself (Additional file 2; Figure 2). Exceptions include genes encoding chorion proteins as well as yolk and associated proteins. Large amounts of transcripts of these genes are found in the ovaries only (Additional file 2; Table 2). A number of contigs appeared to have relatively high transcript abundance (measured by means of FPKM values; see Methods) in the oocytes compared to the ovaries, suggesting that these transcripts are important as maternal effect transcripts incorporated into the oocytes in relatively large concentrations (Table 2 and Figure 2). An example of this is the gene encoding a signal transducing adaptor molecule (STAM; Table 2 and Additional file 2), which in D. melanogaster is expressed throughout oogenesis, but of which transcripts are detected in very high levels in early embryogenesis. 
On the basis of the GO terms, the 838 gene orthologs appear to be representative of the annotated genes in the transcriptome as a whole (Figures 2 and 3).\r\n\r\nTranscript abundance\r\n\r\nOvary/Egg LOG2 fold change\tEgg/Ovary LOG2 fold change\tFPKM - value\t \tspherulin-2A\tsignal transducing adapter molecule 1\tribosomal protein LP2\t \tPACG20471\tnucleolar GTP-binding protein 2\t40S ribosomal protein S6\t \tchorion class A precursor family 5\tubiquitin-conjugating enzyme E2 S\tribosomal protein L39\t \tBmtitin1\tSLIT-ROBO Rho GTPase-activating protein\tcytochrome oxidase subunit 3\t \tEgg protein 80\tmo-molybdopterin cofactor sulfurase\tBmtitin1\t \tVitellogenin\tpoly U binding factor 68kD\tribosomal protein L32\t \tchorion class A precursor family 3\tNADH dehydrogenase subunit 6\t40S ribosomal protein S28\t \tchorion class A precursor family 4\tPACG6651\tubiquitin\t \tPACG21670\tchromatin regulatory protein sir2\tFerritin 2 – light chain homolog\t \tchorion class C precursor family 2\tPACG13792\tBmBR-C gene for Broad-Complex isoform Z2\t \tputative uncharacterized protein DDB\tDNA repair protein complementing XP-A cells homolog\tpolyubiquitin\t \tPACG20450\tdisulfide oxidoreductase\tribosomal protein L27\t \tPACG21661\tPACG710\t60S ribosomal protein L28\t \tPACG24051\tsimilar to phosphinothricin acetyltransferase gene\tPACG20761\t \tchorion class B precursor family 1\tPACG5386\t60S ribosomal protein L18\t \tchorion protein-like\tabhydrolase domain-containing protein 1\ttranslationally controlled tumor protein\t \tendonuclease-reverse transcriptase\tRAD51C protein\tribosomal protein S3A\t \tspec2\tPACG18339\t60S ribosomal protein L38\t \tPACG19208\tPACG19350\tribosomal protein L7A\t \tPACG20509\tSLIT-ROBO Rho GTPase-activating protein 1-like\theat shock protein cognate 3\t \t\r\n\r\nTranscript abundance (on the basis of FPKM values) in the Pararge\naegeria ovarian/oocyte transcriptome (see also Additional\nfile 2). 
The first column lists the\ntranscripts that were most abundant in the ovaries compared to\nthose present as maternal transcripts in the oocytes. These are\ngenes highly expressed in the ovaries, but with few to no maternal\ntranscripts in the oocyte. The genes listed in column 2 have\nrelatively high FPKM values in the oocytes compared to the ovaries,\nindicating large concentrations of transcripts (see also Additional\nfile 2). The third column lists the genes\nmost transcribed during P. aegeria oogenesis. Each column lists\nthe top 20 genes, from high to low.\r\n\r\nGene Ontology manually annotated genes. The presence or absence of\northologs of essential oogenesis genes listed in Additional file 1 has been manually verified. The Gene Ontologies\n(GO) of genes that were present were determined by BLAST2GO and GO terms\nwere subsequently condensed using the generic GO Slim subset. The\nhistogram details the number of manually\nverified Pararge aegeria contigs (note, as has been observed for many de novo\nassemblies, for some genes multiple contigs were present in the\ntranscriptome) for each GO term. FPKM estimates were used to compare the\nlevels of transcripts found in the ovaries and as maternal transcripts\nin the egg. Using a Log2 fold change threshold of 1, genes were\nclassified in the histogram as present in similar amounts in the egg and\novarian transcriptome (labelled Ubiquitous), used predominantly\nduring oogenesis to make an egg, but not or hardly used as a maternal\ntranscript (labelled Ovary), or highly concentrated in the egg\nas maternal transcripts (labelled Egg).\r\n\r\nSequencing and annotation summary\r\n\r\nLocation/Feature\tContigs annotated\tManually curated\tAv. Contig (bp)\tAv. CDS (bp)\tAv. 5' UTR (bp)\tAv. 
3' UTR (bp)\t \tGenomic\t16919\t1564\t625.99\t459.89\t69.61\t75.17\t \tComplete CDS\t4530\t473\t1022.96\t667.12\t142.07\t210.79\t \tHomology\t3055\t466\t1196.75\t855.06\t124.15\t214.53\t \tNovel\t1475\t7\t663.02\t277.87\t179.18\t203.02\t \tPartial CDS\t11842\t992\t485.34\t393.59\t45.11\t26.77\t \tHomology\t8054\t975\t521.96\t454.21\t51.65\t12.67\t \tNovel\t3788\t17\t407.48\t264.69\t31.20\t56.73\t \tPartial mRNA\t547\t99\t383.36\t179.24\t0.00\t0.00\t \tMitochondrion\t387\t11\t728.64\t563.20\t83.18\t75.32\t \tComplete CDS\t177\t7\t996.59\t719.80\t115.86\t157.94\t \tPartial CDS\t201\t3\t510.06\t443.30\t58.13\t5.95\t \tPartial mRNA\t9\t1\t340.67\t161.22\t0.00\t0.00\t \tGrand Total\t17306\t1575\t628.28\t462.20\t69.91\t75.18\t \t\r\n\r\nA total of 17306 sequences have been submitted. The sequences are\nclassified below according to their location (i.e. nuclear or\nmitochondrial genome), completeness and annotation status (i.e.\nwhether orthologous sequences could be found in other Metazoa or\nnot). Characteristics of the contigs are listed, such as average\nsize of the contig, coding sequence and the 3’ and 5’\nUTRs (all in base-pairs, bp).\r\n\r\nGene Ontology total transcriptome. The Gene Ontologies (GO) of\nsuccessfully annotated genes in the total transcriptome were determined\nby BLAST2GO and GO terms were subsequently condensed using the generic\nGO Slim subset. The histogram details the number of Pararge\naegeria contigs (note, for some genes multiple contigs were\npresent in the transcriptome) for each GO term. FPKM estimates were used\nto compare the levels of transcripts found in the ovaries and as\nmaternal transcripts in the egg. 
Using a Log2 fold change threshold of\n1, genes were classified in the histogram as present in similar amounts\nin the egg and ovarian transcriptome (labelled Ubiquitous),\nused predominantly during oogenesis to make an egg, but not or hardly\nused as a maternal transcript (labelled Ovary), or highly\nconcentrated in the egg as maternal transcripts (labelled\nEgg).\r\n\r\nFor a subset of 17 genes, sampled across the functional groups identified in Additional file 1, the expression in the ovarioles and the presence of transcripts in the oocyte were confirmed further by means of RT-qPCR. These genes were: argonaute 2 (AGO2), caudal (cad), decapentaplegic (dpp), egalitarian (egl), exuperantia (exu), Fragile X mental retardation 1 (Fmr1), nanos-like (nos-like), nanos-M (nos-M), nanos-O (nos-O), ornithine decarboxylase antizyme (Oda), anterior open (aop), par-1, piwi, chorion b-ZIP transcription factor (CbZ), staufen (stau), vitellogenin receptor yolkless (yl; VgR) and vitellogenin (Vtg/Vg). Two further genes, which have not been explicitly studied in the context of oogenesis (references in Additional file 1), were investigated: embryonic lethal abnormal vision (elav) and minibrain (mnb). Furthermore, 3 housekeeping genes were selected to be used as reference genes: RNA polymerase II 215 KD subunit (RPII215), TATA binding protein (Tbp) and zwischenferment (zw, G6PDH) (Additional file 3).\r\n\r\nThe qPCR results were used to confirm the presence of expression as well as the levels of expression (as indicated by means of FPKM values) in the transcriptome dataset (Figure 4; Additional files 4, 5, and 6). Transcripts of vitellogenin were not transferred into the oocytes and very few dpp transcripts were transferred into the egg (Figure 4). All of the other oogenesis genes investigated by means of qPCR were included as maternal effect gene transcripts in the oocytes (see also Additional file 2). 
Specific qPCR results will be discussed in the remainder of the paper.\r\n\r\nDiscussion\r\n\r\nGerm-line and ovarian stem cells\r\n\r\nIn D. melanogaster three major signalling pathways play a significant role in cystoblast differentiation, and the maintenance and division of germ-line and ovarian stem cells: TGF-beta, Wnt and hedgehog signalling. Components of all three signalling pathways have been identified for P. aegeria (Table 3 and Additional file 1). However, it is not clear to what extent these signalling pathways are essential in the lepidopteran germarium, as they were not identified as such in B. mori using SAGE analyses. Rather than signalling, for example, a previously unidentified non-coding RNA appears to regulate cystoblast differentiation in B. mori.\r\n\r\nqPCR results. Normalised relative abundance of transcripts for 19\ngenes of interest. Data above the midline (the median gene expression\nlevel set at 1) indicate a relatively high number of transcripts in the\noocyte compared with the ovary. Boxes represent the interquartile range.\nWhiskers represent the minimum and maximum observations. 
Note\nVtg/Vg transcripts were not found in the\noocyte.\r\n\r\nMaintenance and division of germ-line and ovarian somatic stem\ncells\r\n\r\n\t \tarmadillo (arm)\tY\tshutdown (shu)\tY\t \taxin; axis inhibition protein (axn)\tY\tFK506-binding protein (FKBP59)\tY\t \tdishevelled (dsh)\tY\tvasa; vasa-like gene (vasa homolog in\t \tLepidoptera) (vas; vlg)\tY\t \tshaggy; gsk-3 (sgg; Zw3)\tY\toutstretched (upd; sisc)\tN\t \tsugarless; UDP glucose6 dehydrogenase (sgl;\t \tUDPGDH)\tY\tbag of marbles (bam\tN\t \tlegless (lgs; BCL9)\tY\tmei-p26 (mei-p26)\tN\t \tpygopus (pygo; gam)\tY\tbrain tumor (brat)\tY\t \twingless (wg)\tY?\tbenign gonial cell neoplasm (bgcn)\tN\t \twntless; evenness interrupted (wls; Evi)\tY\twithin bgcn (wibg; pym)\tY\t \thedgehog (hh)\tY\tdecapentaplegic (dpp)\tY\t \tshifted; wnt inhibitory factor 1 precursor\t \t(shf; wif1)\tY\tkekkon5 (kek5 )\tN\t \tcosta (cos2)\tN\tMothers against dpp (Mad)\tY\t \tskinny hedgehog; hedgehog acyltransferase; CG32281\t \t(ski)\tY\tSmad on X (Smad2; Smox)\tY\t \troadkill; similar to speckle-type POZ\t \tprotein (rdx)\tY\tsaxophone (type I Dpp receptor) (sax)\tN\t \tpatched (ptc)\tN\tthickveins (type I Dpp receptor) (tkv)\tY\t \tsmoothened (smo)\tY\tpunt (type II Dpp receptor) (pnt)\tN\t \tcubitus interruptus (ci)\tY\tmedea (med; SMAD4)\tN\t \tengrailed (en)\tN\tDaughters against dpp (Dad)\tN\t \tpangolin (pan; Tcf/LEF)\tY\tglass bottom boat (gbb)\tY\t \twnt oncogene analog 4 (wnt4)\tN\tdullard (dd)\tY\t \tdicer-1 (dcr-1)\tY\tquo vadis; schnurri (quo; shn)\tN\t \tloquacious (loqs)\tY\tlethal with a checkpoint kinase (smurf;\t \tlack)\tY\t \tmir-184 (mir-184)\tN\tsupernumerary limbs (slimb)\tY\t \teffete (eff; UbcD1)\tY\tstarry night; flamingo (stan; fmi)\tN\t \tfs(1)Yb (Yb)\tN\troughened; similar to ras-related protein\t \trap-1a; enhancer of faf; similar to Bombyx mori\t \tras3 (r; rap1; dras3)\tY\t \tfused; similar to serine/threonine kinase\t \t36 (fu)\tY\tras-associated protein 2-like; ras-related protein 2\t 
\t(rap2l)\tY\t \tSuppressor of fused (Su(fu))\tY\tfruitless isoform a (fru)\tY\t \tbicaudal (bic)\tY\tfruitless isoform k (fru)\tY\t \totefin (ote)\tN\tfruitless (fru)\tY\t \tpiwi (piwi)\tY\tsex-lethal (sxl)\tN\t \tpelota (pelo)\tY\tpre-mRNA-splicing regulator wtap; similar to\t \tfemale lethal d; CG6315 (fl(2)d )\tN\t \tpumillio (pum)\tY\tmaleless; ATP-dependent RNA helicase\t \ta-like (mle; dhx9; nap)\tY\t \tpenguin (pen)\tY\tlamin c (lamc)\tY\t \tsans fille; U1 small nuclear ribonucleoprotein A;\t \tfs(1)1621 (snf)\tY\tclift; eyes absent (cli; eya)\tY\t \tbric a brac (bab)\tN\tslowmo (slmo)\tY\t \t\r\n\r\nGenes identified mainly from the Drosophila melanogaster\nliterature as functioning early for the maintenance and division of\ngerm-line and ovarian somatic stem cells. Presence (Y), possible\npresence (Y?) or absence (N) of orthologous transcripts in the\nPararge aegeria transcriptome is indicated.\r\n\r\nThe TGF-beta ligands glass bottom boat (gbb) and dpp were expressed in P. aegeria ovarioles (qPCR results; Table 3). The type I TGF-beta receptors used were thickveins (tkv) and an activin type 1 receptor similar to baboon (ATR1) (Additional files 1 and 2), the latter of which is present in the D. melanogaster oocyte as a maternal transcript necessary for early embryogenesis. No evidence, however, could be found for an ortholog of activin type I receptor saxophone (sax) (Table 3). No ortholog of the activin type II receptor punt (pnt) was found, although PACG16964 was found to be a type II BMP receptor (Additional file 2). The P. aegeria transcriptome contained orthologs of two SMAD family genes; Mothers against dpp (Mad) and Smad on X (Smox), but not of medea nor of the anti-SMAD Daughters against decapentaplegic (Dad), which have been shown to be of importance in D. melanogaster germline stemcell maintenance. Furthermore, the negative regulator of Dpp signalling dullard (dd) was found to be expressed in P. aegeria ovaries. In D. 
melanogaster this gene plays a role in wing vein formation, and although it has been found to be maternally deposited, its role in oogenesis has not been verified. Another negative regulator of Dpp signalling, brinker (brk), which plays a role in eggshell patterning in D. melanogaster, was also expressed by P. aegeria. In D. melanogaster, bag of marbles (bam) interacts with Dpp signalling to regulate stem cell maintenance and differentiation in the germarium. However, bam is a Drosophila-specific gene and is not found in P. aegeria.\r\n\r\nDuring oogenesis P. aegeria females express two Wnt receptors, which show orthology to frizzled-2 and frizzled-7 (Table 4 and Additional file 1). Furthermore, they express the Wnt receptor l(2)43Ea (boca), which plays a role in D. melanogaster vitellogenesis, as well as dishevelled (dsh), which is part of the Wnt receptor complex (Table 3 and Additional files 1 and 2). Other components of the Wnt pathway expressed include armadillo (arm), pangolin (Tcf/LEF), groucho (gro), axin (axn), sugarless (sgl), legless (lgs), pygopus (pygo) and shaggy (sgg; Zw3), as well as wntless (wls) (Table 3 and Additional file 2; references in Additional file 1). Maternal transcripts of each of these genes were found in the oocyte (Table 3; Additional files 1 and 2), with the exception of sgl. Asymmetric localisation of maternal axn RNA has been shown to be involved in AP axis formation in Tribolium castaneum. Rather interestingly, the ligand wingless (wg) was not found in the assembled transcriptome (Table 3 and Additional file 2). However, 201 ovary and 100 oocyte raw RNA-seq reads mapped against the complete wg CDS from our unpublished P. aegeria genome (approximately between 3.2× and 6.5× coverage, displaying a discontinuous transcript with a number of gaps not covered by reads; Additional file 7). In D. 
melanogaster, transcripts of wg are not found in the oocyte and although Wnt signaling has been established as present during oogenesis, expression levels of wg are extremely low, making it hard to detect the transcripts. It is clear that in P. aegeria there is strong maternal contribution to zygotic Wnt signaling (Additional file 2), but whether Wnt signaling plays a role during oogenesis needs to be further investigated.\r\n\r\nCytoskeleton and actomyosin contractile ring assembly\r\n\r\n\t \tabnormal spindle (a microtubule-associated protein)\t \t(asp)\tN\tdedicator of cytokinesis 6,7; similar to CG11376\t \t(dock6; dock7)\tY\t \tjavelin-like (microtubule-associated protein);\t \tsimilar to CG3563 (jvl)\tY\tmyoblast city; dedicator of cytokinesis 1 (mbc;\t \tdock180)\tY\t \tmini spindles (microtubule-associated protein;\t \tbelongs to xmap215/tog family of genes) (msps;\t \txmap215)\tY\tspaghetti squash; myosin light polypeptide 9; myosin\t \tregulatory light chain 9 (sqh; mrlc)\tY\t \ta-kinase anchor protein 200 (akap200)\tN\tnonmuscle myosin essential light chain; myosin II\t \tessential light chain (mlc-c)\tY\t \tcapulet; act up, bcDNA:ld24380, CG5061\t \t(capt)\tN\tmyosin regulatory light chain interacting protein\t \t(mylip)\tY\t \tcdc42 (cdc42)\tY\tgenghis kahn; cdc42 binding protein kinase alpha or\t \tbeta (gek; cdc42bpb)\tY\t \tBombyx mori cdc42 small effector 2-like protein\t \t(LOC692865) (cdc42-sep2; spec2)\tY\tjaguar/myosin VI (jar; mhc95f; myo6)\tY\t \tp21/cdc42/rac1 activated kinase (pak)\tY\tmyosin heavy chain (similar to CG17927)\t \t(mhc)\tY\t \trac1; ras-related c3 botulinum toxin substrate 1\t \t(rac1)\tY\tmyosin heavy chain 2; zipper (zip; mhc2)\tY\t \tspecifically Rac1 associated protein; Fmr1-interacting\t \tprotein (sra-1; cyfip)\tY\tmyosin light chain kinase; bent; titin-like\t \t(bt)\tY\t \tengulfment and cell motility protein; ced-12\t \thomolog (ced-12; elmo)\tY\tmyosin 1 light chain; myosin alkali light chain 1\t \t(mlc)\tY\t 
\tcentrosomin (cnn)\tY\tmyosin 1; myosin 61f (myo1b)\tY\t \taurora-a (aur)\tY\tdilute class unconventional myosin; myosin V;\t \tmyosin-Va (myoV; myo-Va; didum)\tY\t \tchickadee (homolog of profilin)\t \t(chic)\tY\tunconventional myosin class XV (myo10a)\tY\t \tcitron; sticky (sti; dck)\tN\tmyosin heavy chain like (mhcl)\tY\t \tfocal adhesion kinase-like; fak56(D)\t \t(fak56D)\tY\tCG17293; WD40 protein type (wdr82)\tY\t \tdiaphanous (dia)\tY\twashout (wash; p63; p65)\tN\t \tfrizzled; frizzled-7-like (fz7-l)\tY\tjames bond (bond)\tN\t \tfrizzled; frizzled-2-like (fz2-l)\tY\tkette; hem-protein; similar to\t \tmembrane-associated protein hem (dhem-2);\t \tsimilar to membrane-associated protein gex-3\t \t(hem; kte; nap1; dhem2)\tY\t \tchromosome bows; mast; orbit; clasp (chb)\tN\tshort stop; kakapo; similar to bullous\t \tpemphigoid antigen 1 (Homo sapiens); microtubule-actin\t \tcross linking factor 1 (shot)\tY\t \tshotgun; E-Cadherin (shg; E-Cad)\tY\tvacuolar protein sorting 35 (vps35)\tY\t \tmushroom body defect (mud)\tN\trotund; racGTPase-activating protein; roughened eye\t \t(rn; roe; rnracgap)\tY\t \tdishevelled associated activator of morphogenesis-1\t \t(daam-1)\tY\ttwinstar; actin-depolymerizing factor 1 cofilin\t \t(tsr)\tY\t \tkarst (also known as betaheavy spectrin)\t \t(kst)\tY\tslingshot (mkp; ssh)\tY\t \tflightless I (fliI)\tY\tsubito; double or nothing; Bombyx mori kinesin-like\t \tprotein c (sub)\tY\t \tklarsicht (klar; marb)\tY\tIplI-aurora-like kinase; aurora b (kinase)\t \t(aurb)\tY\t \tmuscle-specific protein 300 (msp-300)\tY\ttumbleweed; racGAP50c; similar to\t \tracGTPase-activating protein (tum;\t \tracGAP)\tY\t \tlissencephaly-1 (lis-1)\tY\tarp2; actin-related protein 14d (arp2;\t \tarp14d)\tY\t \tcortactin(−like) (cortactin)\tY\tarp3; actin-related protein 66b (arp3;\t \tarp66b)\tY\t \tsrc oncogene at 42a (src42a)\tY\tsuppressor of profilin 2 (also known as\t \tarpc1) (sop2; arpc1; arc41)\tY\t \tsrc oncogene 1 (src64b)\tY\tarp2/3 complex 
subunit p34; arpc2 (arpc2;\t \tarc-p34)\tY\t \tα actinin (actn)\tY\tarp2/3 complex 21kD subunit p21; arpc3b (arpc3;\t \tarpc3b)\tY\t \tovarian tumor; fs(1)m101; fs(1)231 (otu)\tN\tarp2/3 complex subunit p20; arpc4 (arpc4;\t \tarc-p20)\tY\t \tGuanyl cyclase at 32e (Gyc32e)\tN\tarp2/3 complex 16kD subunit p16; arpc5 (arpc5;\t \tp16-arc)\tY\t \tGuanylyl cyclase at 76c; receptor-type Guanylate\t \tcyclase (Gyc76c)\tY\tkinesin associated protein 3 (kap3; kap)\tY\t \tstand still (stil)\tN\tkinesin-like protein at 68d; kinesin II; kinesin-2\t \t(klp5; klp68d)\tY\t \thold up (hup)\tN\tkinesin-like protein at 64d; kinesin family member\t \t3a (klp64d; kif3a)\tY\t \tdicephalic (dic)\tN\tpericentrin-like protein (cp309) (cp309)\tN\t \tkelch (kel)\tY\trho-type Guanine exchange factor; pak-interacting\t \texchange factor; AGAP007877 (rtgef; dpix)\tY\t \tsimilar to kelch domain containing 4\t \t(klhdcp)\tY\tSCAR; actin binding protein; (in vertebrates)\t \twiskott-aldrich syndrome protein family member 2; wasp\t \tfamily protein member 2 (SCAR; wave)\tY\t \tcullin 3 (cul3)\tY\tquail; villin (qua)\tY\t \t\r\n\r\nGenes identified mainly from the Drosophila melanogaster\nliterature as important in cytoskeleton and actomyosin contractile\nring assembly. Presence (Y) or absence (N) of orthologous\ntranscripts in the Pararge aegeria transcriptome is\nindicated.\r\n\r\nNo ortholog of Drosophila wnt4 (a vertebrate wnt9 ortholog), which in D. melanogaster is involved in regulating cell movement during ovarian morphogenesis, was found (Table 3). Finally, transcripts of an ortholog of shifted (shf) were present both in the ovary and oocyte in P. aegeria (Table 3 and Additional file 2). This gene encodes an EGF-like protein acting as a Wnt inhibitory factor 1, which in D. melanogaster stabilises hedgehog signalling and transcripts of which are deposited in the oocyte. 
Hedgehog (hh) itself, as well as components of the pathway including smoothened (smo), fused (fu), Suppressor of fused (Sufu), and cubitus interruptus (ci) were all found to be expressed and maternal transcripts of all were present in the oocyte (Table 3; Additional files 1 and 2). Neither costa (cos2) nor the receptor patched (ptc) was expressed during oogenesis by P. aegeria (Table 3; Additional file 1). Although Ptc protein has been detected in the D. melanogaster germarium, detecting ptc transcripts may prove more difficult because ptc appears to be transcribed in very low amounts, and it is possible that this is why ptc transcripts were also not found in P. aegeria. As has been observed for Wnt signalling, there is a maternal contribution to zygotic Hh signalling, but presently it is not clear whether this signalling pathway plays a significant role during P. aegeria oogenesis.\r\n\r\nCytoskeleton and actomyosin contractile ring assembly\r\n\r\nOrthologs of the vast majority of genes that have been described as affecting the cytoskeleton and actomyosin contractile ring during D. melanogaster oogenesis were expressed in P. aegeria (Table 4). One of the genes not found is ovarian tumor (otu), which plays a crucial role during D. melanogaster oogenesis. Otu is involved in cytoskeletal formation, cyst formation in germ-line cells, nurse cell chromosome dispersion and gurken (grk) mRNA localisation. For 14 genes no P. aegeria orthologs could be found in the dataset (Table 4). For a number of these, this is not surprising, as it has generally proven difficult to find orthologs outside the genus Drosophila; for example dicephalic (dic), mushroom body defect (mud), hold up (hup) and stand still (stil) (references in Additional file 1).\r\n\r\nPararge aegeria females were found to express E-Cadherin (Table 4). 
E-Cadherin-dependent adhesion underlies the positioning of the oocyte at the posterior of the cyst, which in turn plays a role in establishing the AP polarity in D. melanogaster during very early oogenesis.\r\n\r\nOocyte determination (including fusome formation) and formation of the anterior-posterior polarity during the early stages of oogenesis\r\n\r\nThree genes have been described in the literature as important in D. melanogaster follicle ring canal formation; visgun (vsg), nasrat (fs(1)N) and scraps (scra). Only fs(1)N was not transcribed by P. aegeria females (Additional file 1). Fusomes, regions of spectrin-rich cytoplasm, are essential in D. melanogaster to establish a system of directional transport between cystocytes underpinning oocyte determination and subsequent oocyte polarity. The majority of genes that are expressed early in D. melanogaster oogenesis regulating the formation of the fusome (e.g. alpha and beta spectrin and hu-li tai shao) were also transcribed by P. aegeria, as well as the genes involved in establishing initial AP polarity, including par-1 and egalitarian (egl) (Figure 4 qPCR results and Table 5; references in Additional file 1). Par-1 in particular is essential in D. melanogaster for both oocyte determination and for establishing AP polarity through its effects on the organisation of the microtubule cytoskeleton in conjunction with a number of other proteins. Among the proteins with which Par-1 interacts in establishing AP polarity are Bazooka (Baz/Par3), Bicaudal D (BicD), Lkb1/Par4, Egl, 14-3-3epsilon, and Dynein proteins (references in Additional file 1). The genes encoding these proteins were all expressed by P. aegeria (Table 5). 
Transcripts of both par-1 and egl were also present in the oocyte (Figure 4 qPCR results and Additional file 2).\r\n\r\nOocyte determination, fusome and AP polarity\r\n\r\n\t \ttransitional endoplasmic reticulum ATPase; ter94\t \t(ter94)\tY\tatypical protein kinase c; CG10261 (apkc)\tN\t \tcapping protein alpha (cpa)\tY\ttypical protein kinase c (pkc)\tY\t \tleonardo (14-3-3zeta; leo)\tY\tprotein kinase c inhibitor; similar to CG2862\t \t(pkc inhibitor)\tY\t \tbazooka (baz; par3)\tY\trab-protein 6; small (monomeric) GTPase\t \t(rab6)\tY\t \tbicaudal C (bicC)\tY\trhino (rhi)\tN\t \tbicaudal D (bicD)\tY\tß1 tubulin 1 (tub1)\tY\t \tbicaudal D-related (CG32137)\tY\tß1 tubulin 2 (tub2)\tY\t \tglued; dynactin (gl)\tY\tβ-tubulin at 60d (tub3; betatub60d)\tY\t \tegalitarian; 3'-5' exonuclease domain-like-containing\t \tprotein (egl)\tY\tβ-tubulin at 56d (betatub56d)\tY\t \tstonewall; fs(3)02024 (stwl)\tN\thomologous to Drosophila γ-tubulin at 37c; gamma\t \ttubulin (in general) (gammatub37c;\t \tgamma tub 1)\tY\t \tegghead; zeste-white 4;\t \tbeta-1,4-mannosyltransferase (egh; zw4;\t \tbre3)\tY\tgamma-tubulin complex component 3; lethal (1) discs\t \tdegenerate 4 (tubgcp3; gcp3; dgrip91)\tY\t \t4ehp (4ehp)\tN\tgamma-tubulin complex component 2; gamma-tubulin ring\t \tprotein 84 (Drosophila) (tubgcp2; gcp2;\t \tdgrip84)\tY\t \tpipsqueak (BTB/POZ containing gene) (psq)\tN\talpha tubulin tua1; similar to Drosophila\t \talpha-tubulin at 84b (atub; tua1)\tY\t \tBTB/POZ domain containing gene (BTB-POZ)\tY\talpha tubulin tua2; similar to Drosophila\t \talpha-tubulin at 84b (atub; tua2)\tY\t \tBTB domain containing protein 2 (BTBd2)\tY\tdeadlock (del)\tN\t \tspindle c (spnc)\tN\tmo25; calcium-binding protein 39 (mo25)\tY\t \tcoracle; band 4.1-like protein (cora)\tY\t14-3-3ϵ (14-3-3epsilon)\tY\t \talpha spectrin (alpha-spec)\tY\tpar-1; map/microtubule affinity-regulating kinase\t \t(par-1)\tY\t \tbeta spectrin (beta-spec)\tY\tserine/threonine kinase lkb1; partitioning defective\t 
\t4 (lkb1; par4; stk11)\tY\t \thu-li tai shao (hts)\tY\tpartitioning defective 6 (par-6)\tN\t \tankyrin; similar to ankyrin 2,3/unc44;\t \tAGAP002272-PA (ank)\tY\tcombgap (cg; mig)\tY\t \tneuroglian (ceb; nrg)\tY\tdynein heavy chain 64C; cytoplasmic dynein heavy\t \tchain (dhc64c; dhc)\tY\t \tinscuteable (insc)\tN\tcut up (ddlc-1; cdlc1; dynein light chain)\tY\t \tsec61 alpha (sec61 alpha)\tY\tkinesin heavy chain (khc)\tY\t \tsec61 gamma (sec61 gamma)\tY\tkinesin light chain (klc)\tY\t \tsec63 (sec63)\tY\trhomboid-2; stem cell tumor; brother of rhomboid\t \t(stet; rho-2)\tN\t \ttropomodulin (tmod)\tY\tensconsin (ens)\tY\t \tp38 MAPK (p38MAPK)\tY\thelicase at 25e; ATP-dependent RNA helicase;\t \tddx39 (in vertebrates) (hel25E;\t \tddx39)\tY\t \tprotein kinase a; cAMP-dependent protein kinase 1; dc0,\t \tpka (pka-c1)\tY\tlicorne; similar to dual specificity\t \tmitogen-activated protein kinase kinase 3; similar\t \tto dual specificity mitogen-activated protein kinase\t \tkinase (in Nasonia); dual specificity\t \tmitogen-activated protein kinase kinase 6 (mainly\t \tin vertebrates) (lic; MAPKK; mek3)\tY\t \tcAMP-dependent protein kinase r1 (pka-r1)\tY\tprotein tyrosine phosphatase 10D (ptp10D)\tY\t \tcAMP-dependent protein kinase r2 (pka-r2)\tY\tprotein tyrosine phosphatase 4E; similar to\t \tprotein tyrosine phosphatase 10D\t \t(ptp4E)\tY\t \t\r\n\r\nGenes identified mainly from the Drosophila melanogaster\nliterature acting early in the egg for oocyte determination\n(including fusome formation) and formation of the anterior-posterior\n(AP) axis. Presence (Y) or absence (N) of orthologous transcripts in\nthe Pararge aegeria transcriptome is indicated.\r\n\r\nSoon after the posterior localisation of the oocyte in the D. melanogaster cyst, EGF signalling takes place in the posterior between the oocyte (Grk ligand) and the overlying follicle cells (Torpedo receptor), further consolidating AP polarity. 
Orthologs of the fast-evolving grk are difficult to find outside the genus Drosophila. Two genes encoding EGF ligands and likely to be paralogs of grk, spitz (spi) and keren (krn), are involved in the regulation of border cell migration in D. melanogaster. A single spi/krn-like EGF ligand has been found in the genomes of N. vitripennis and T. castaneum, and has been argued to be functionally similar to grk in DV patterning in these species. Pararge aegeria females expressed an ortholog of this single spi/krn-like EGF ligand, with the sequence displaying significant similarity to Harpegnathos saltator spi (Additional file 2; Table 6). Large amounts of these transcripts were detected in the P. aegeria oocyte (Additional file 2), suggesting a significant role during early embryogenesis, as observed in D. melanogaster. Given the expression of a spi/krn-like ligand in P. aegeria and the significance of EGF signalling in insect oogenesis in general, and in establishing oocyte polarity in particular, it is very surprising that only weak evidence was found for expression of egfr, the gene encoding the EGF receptor, in P. aegeria ovaries (Table 6). None of the contigs in our de novo assembly could be clearly identified as an egfr transcript. However, 780 raw RNA-seq reads did map against the complete egfr CDS from our unpublished P. aegeria genome (approximately 7.1× coverage, displaying a discontinuous transcript with a number of gaps not covered by reads; Additional file 7). Intriguingly, all of the raw reads that mapped successfully came from the ovariole transcriptome, not the oocyte transcriptome, consistent with the importance of EGF signalling during oogenesis itself. Transcript levels of egfr are low to moderate in D. melanogaster ovaries, and thus there is always the possibility, as was suggested for the absence of ptc transcripts in our study, that P. aegeria egfr transcript levels were not high enough to be accurately detected. 
However, it is intriguing that P. aegeria also did not transcribe a number of other components of the EGF pathway involved in DV patterning in D. melanogaster, for example rho, during oogenesis (Table 6). Spatial restriction dorsally of rhomboid (rho), encoding a ligand-processing protease in the EGFR pathway, is necessary in D. melanogaster both for DV axis formation as well as for correct patterning of the eggshell (further references in Additional file 1). Although further study is required, at present it thus seems that EGF signalling either does not play a significant role during P. aegeria oogenesis or plays a highly divergent one. This will be discussed further in the next section.\r\n\r\nFollicle cell gene expression and border cell migration\r\n\r\n\t \tcapping protein beta (cpb)\tY\tinnexin 3 (inx3)\tY\t \thepatocyte growth factor regulated tyrosine kinase\t \tsubstrate (hrs)\tY\tzero population growth (inx4; zpg)\tY\t \tCalpain-B (CalpB)\tN\tcrumbs (crb)\tY\t \tbig brain (bib)\tN\tstardust; weakly similar to maguk p55 subfamily\t \tmember 5 (sdt; std)\tY\t \tbrainiac (brn)\tY\tquit (qui)\tN\t \tmastermind (mam)\tN\tdual-specificity a-kinase anchor protein spoonbill;\t \tCG3249; homologous to akap149\t \t(spoon; yu)\tN\t \tneuralized (neur)\tY\tlethal (2) giant larvae (lgl)\tY\t \tderailed (drl; lio)\tN\tmyosin light chain 2; similar to Bombyx mori\t \tmyosin regulatory light chain 2 (mlc-2)\tY\t \tdelta (dl)\tY\tdeep orange; Vacuolar sorting protein 18 (dor;\t \tVps18)\tY\t \tnotch; abruptex (ax), split (spl) (N)\tY\tVacuolar protein sorting 9; sprint; rab GDP/GTP exchange\t \tfactor (gef) (Vps9; spri)\tY\t \tpresenilin (psn)\tY\ttwinfilin (twf)\tY\t \tnicastrin (nct)\tY\ttoucan (toc)\tY\t \tgamma-secretase subunit aph-1; anterior pharynx\t \tdefective 1; presenilin-stabilization factor\t \t(aph1)\tY\tabrupt (ab)\tN\t \tpresenilin enhancer (pen-2)\tY\ttaiman/ p160 coactivator fisc (DAIB1; tai)\tY\t \tstrawberry notch (sno)\tY\tpuckered; hearty; 
similar to dual specificity\t \tphosphatase 10 (puc; hrt)\tN\t \tnotchless (nle)\tY\tmisshapen; traf2 and nck interacting kinase;\t \thomolog of serine/threonine-protein kinase mig-15 (c.\t \telegans) (msn; tnik)\tY\t \tcut; similar to CCAAT displacement\t \tprotein; similar to homeobox protein cut\t \t(ct; cux)\tN\tfusilli; e(cacte10)7 (fus)\tY\t \tfringe (fng)\tY\tdribble; krr1 small subunit processome component\t \thomolog (dbe)\tY\t \tbunched; shortsighted (bun)\tY\tkuzbanian; similar to disintegrin and\t \tmetalloproteinase domain-containing protein 10\t \t(kuz)\tY\t \tdodo; similar to Bombyx mori rotamase pin1\t \t(dod)\tY\ttie; tie-like receptor tyrosine kinase\t \t(tie)\tN\t \tBroad-Complex core protein isoform 6 (br;\t \tBr-C)\tY\tfk506-binding protein (fkbp13) (fkbp13)\tY\t \tzinc finger and BTB domain-containing protein weak\t \thomology to Broad-Complex core protein isoforms 1, 2, 3,\t \t4, 5 (br; Br-C)\tY\tm6; myelin protolipid (m6)\tY\t \tdaughterless (da)\tY\ttanc2-like rolling pebbles; antisocial (ants;\t \trols)\tY\t \tets at 97D; tiny eggs (ets97D; tny)\tN\tamphiphysin; bridging integrator (damph)\tY\t \tpointed; similar to protein c-ets1\t \t(pnt; D-ets-1)\tN\tfasciclin II (fas2)\tN\t \tdystroglycan (dg)\tY\tsemaphorin; fasciclin-IV (fas4; sema-1a)\tY\t \tdiscs lost; tight junction pdz protein patj\t \t(dlt)\tY\tkayak (kay; fos)\tY\t \tfilamin; cheerio (fln; cher)\tY\tsrc homology 2, ankyrin repeat, tyrosine kinase\t \t(shark)\tY\t \tjitterbug; filamin-related (jbug)\tY\tbullwinkle (bwk)\tN\t \tleukocyte-antigen-related-like; tyrosine-protein\t \tphosphatase lar (lar)\tN\tbasket; jun amino terminal kinase (djnk); c-jun\t \tnh2-terminal kinase (bsk)\tY\t \tdiscs large (dlg1)\tY\tCad74A (Cad74A)\tN\t \tscribble(d) (scrib)\tY\tlocomotion defects; regulator of g protein signaling\t \t(rgs) (loco)\tY\t \tsinged (sn)\tY\tblistered; serum response factor; pruned (bs;\t \tserf)\tN\t \tslow border cells; homologous to Bombyx\t \tC/EBP (slbo; 
bmC/EBP)\tY\tcalmodulin-binding protein related to a rab3 gdp/gtp\t \texchange protein; weakly similar to denn\t \tdomain-containing protein 4c (crag)\tY\t \tmidline fasciclin (mfas)\tN\tG protein-coupled receptor kinase 1; similar to\t \tbeta-adrenergic receptor kinase 2\t \t(Gprk1)\tY\t \tbrinker (brk)\tY\tG protein-coupled receptor kinase 2; similar to\t \tbeta-adrenergic receptor kinase 1\t \t(Gprk2)\tY\t \tegf-r; torpedo; der (egfr; der)\tY?\trutabaga; similar to\t \tca(2+)/calmodulin-responsive adenylate cyclase;\t \tsimilar to adenylate cyclase 1 (rut)\tY\t \trhomboid-1; rhomboid; veinlet (rho)\tN\tdunce; cAMP-specific 3',5'-cyclic phosphodiesterase\t \t(dnc)\tY\t \tspitz (spi); spitz/keren-like\tY\tjun related antigen (jra)\tY\t \tovarian serine protease encoding nudel\t \t(ndl)\tY\tmyocardin-related transcription factor\t \t(mrtf)\tY\t \tkekkon-1 (kek1)\tN\tsimilar to rolling stone (rost)\tY\t \tvein (similar to a vertebrate neuregulin)\t \t(vn)\tN\tjing (jing)\tN\t \targos (aos)\tY\tyan; anterior open; similar to ets DNA-binding\t \tprotein pokkuri (aop)\tY\t \t18 wheeler (18w)\tY\tadherens junction protein p120; armadillo repeat\t \tprotein; catenin delta; CG17484 (p120ctn)\tY\t \thopscotch (hop; jak)\tN\tG protein sα 60a; G protein alpha s subunit GS1\t \t(Bombyx mori) (G-salpha60a)\tN\t \tstar; asteroid (S)\tN\tprotein tyrosine phosphatase 99a (ptp99a)\tN\t \tran-binding protein m (ranbpm )\tY\tdiacyl glycerol kinase ϵ\t \t(dgkϵ)\tN\t \tPDGF- and VEGF-receptor related (PVR)\tY\tovary protein-29kD (op29)\tN\t \tinnexin 2 (inx2)\tY\t \t \t \t\r\n\r\nGenes identified mainly from the Drosophila melanogaster\nliterature acting in follicle cells early and late and promoting\ntheir motility such as border cell migration (and in\nDrosophila important for choriogenesis and dorsal\nappendage formation). Presence (Y), possible presence (Y?) 
or\nabsence (N) of orthologous transcripts in the Pararge\naegeria transcriptome is indicated.\r\n\r\nGenes acting early in the ovariole to establish dorsal-ventral polarity and genes promoting follicle cell motility such as border cell migration\r\n\r\nQuite a number of genes involved in establishing DV polarity in the oocyte are also important for choriogenesis and dorsal appendage formation in D. melanogaster (references in Additional file 1). Apart from the aforementioned grk, pipe was also not expressed by P. aegeria. Pipe plays an essential role in establishing DV polarity in D. melanogaster oocytes, with its expression being confined to ventral follicle cells as a result of localised EGF signalling. Recently, however, it has been proposed that pipe is not necessary in a number of insect species studied and even in D. melanogaster there appears to be a second mechanism for establishing DV polarity that may involve delayed induction by graded maternal Dpp signalling in the perivitelline space. Whatever the mechanism employed by Lepidoptera, it is clear from B. mori research that the factors determining DV polarity are associated with the egg cortex.\r\n\r\nDespite significant differences found in expression patterns of genes involved in EGF signalling in a number of insects, this pathway has been argued to be the ancient mechanism for establishing DV polarity in insect eggs. Transcription factors that have been discussed as mediators of EGF signalling include pointed (pnt), aop and capicua (cic). Only the latter two were expressed by P. aegeria and present as maternal transcripts, but whether they play a role in establishing DV polarity remains to be investigated (Tables 6 and 7, and Additional file 2; qPCR results). The ETS transcription factor Aop also plays a role in border cell migration and does not receive input exclusively from EGF, but from a number of signalling pathways including Notch. 
All components of the Notch signalling pathway were expressed in the ovarioles, with only Notch (N) itself not being present as maternal transcripts in the oocyte (Table 6 and Additional file 2). Maternal N transcripts are also not found in D. melanogaster.\r\n\r\nDorsal ventral polarity\r\n\r\n\t \tcappuccino; formin 1/2 (capu)\tY\tmaelstrom (mael)\tY\t \tspire (spir)\tY\tpipe (encoding a sulfotransferase)\t \t(pip)\tN\t \tcornichon (cni)\tY\tokra (a spindle gene); rad54; rad54-like (okr;\t \trad54)\tY\t \tfs(1)k10 (fs(1)k10)\tN\tspindle B (spnB)\tN\t \tsec61 beta (sec61 beta)\tY\tspindle D (spnD)\tN\t \tmirror; iroquois-class homeodomain protein irx\t \t(mirr)\tN\torb; oo18 RNA-binding protein (orb)\tN\t \tgroucho; Enhancer of split m9/10 (gro;\t \tE(spl)m9/10)\tY\theterogeneous nuclear RNA-binding protein 40; squid\t \t(sqd; hrp40)\tY\t \tcapicua (cic)\tY\theterogeneous nuclear ribonucleoprotein at 27c;\t \tsimilar to Bombyx mori hnrnpa/b-like 28 (hrp48;\t \thrb27c; hnrnpa/b-like 28)\tY\t \tgurken (grk)\tN\theterogeneous nuclear ribonucleoprotein at 87f;\t \tsimilar to Bombyx mori heterogeneous nuclear\t \tribonucleoprotein a1 (hrp36; p11)\tY\t \ttrailer hitch (tral)\tN\ttransportin; importin 3, karyopherin beta 2b\t \t(impβ2)\tY\t \t\r\n\r\nGenes identified mainly from the Drosophila melanogaster\nliterature acting early in the egg to establish dorsal-ventral (DV)\npolarity. Presence (Y) or absence (N) of orthologous transcripts in\nthe Pararge aegeria transcriptome is indicated.\r\n\r\nThe Notch pathway interacts with the EGF pathway in establishing oocyte polarity in D. melanogaster, in particular through its effects on follicle cell differentiation at both termini of the oocyte. As has been established in this study, there is only weak evidence at present for the use of the EGF pathway during P. aegeria oogenesis, and it is striking that the iroquois-class homeodomain protein Mirror is not expressed by P. aegeria (Table 7). 
This protein appears essential in D. melanogaster in integrating EGF and Notch signalling in follicle differentiation and thus establishing AP and DV polarity. Apart from the EGF pathway, Notch interacts with a number of other proteins in patterning the follicle cells surrounding the oocyte, including Toucan and Daughterless (references in Additional file 1). These were expressed by P. aegeria (Table 6), suggesting that the Notch pathway is essential for correct patterning of the follicle cells and possibly oocyte polarity, but in P. aegeria it may not require an interaction with the EGF pathway. Further studies are required to establish whether butterflies have dispensed with EGF signalling and localised pipe expression in establishing oocyte polarity and instead rely on, for example, the Notch and Dpp pathways.\r\n\r\nAnterior and posterior system genes\r\n\r\nThe lepidopteran Bombyx mori displays features of both short and long germ band type insects, in which orthodenticle (otd) and cad maternal mRNA are localised to establish the embryonic AP-axis. Both were expressed during P. aegeria oogenesis (Table 8) and indeed were present as mRNA in the oocytes (Additional file 2; Figure 4 qPCR results for cad). Bicoid (bcd) is Drosophila-specific and although no ortholog was found to be expressed, the genes that are involved in bcd localisation were, including exu and stau, but not swallow (swa) (Table 8; Figure 4 qPCR results). As observed in D. melanogaster, transcripts for both exu and stau were also present in significant amounts in P. aegeria oocytes (Figure 4 qPCR results; Additional file 2). The use of bcd in translational repression of cad is unique to Drosophila. It is very likely that the ancestral mechanism for translational repression of cad is by means of the KH-domain containing protein encoded by mex-3. Pararge aegeria females expressed an ortholog of mex-3 (Table 8). Furthermore, in D. 
melanogaster, bcd interacts with genes such as bicoid interacting protein 3 (bin3), eIF4E, larp1, polyA binding protein (pAbp) and AGO2 in order to repress cad translation. All of these were found to be expressed in P. aegeria, and similarly to D. melanogaster, present as maternal transcripts in the oocytes (Tables 8 and 9, and Additional file 2; Figure 4 qPCR results for AGO2).\r\n\r\nMaternal specification of embryonic anterior-posterior axis\r\n\r\n\t \tbicoid (bcd)\tN\tbicoid-interacting protein 3 (bin3)\tY\t \torthodenticle; Drosophila ocelliless (oc;\t \totd)\tY\tlarp1 (larp1)\tY\t \texuperantia (exu)\tY\tEukaryotic initiation factor 4E; similar to\t \tBombyx mori Eukaryotic initiation factor 4E-2\t \t(eIF4E)\tY\t \tswallow; fs(1)1502 (swa)\tN\targonaute 2 (AGO2)\tY\t \tmaternal expression at 31B (me31B)\tY\tcaudal (cad)\tY\t \tstaufen (stau)\tY\thunchback (hb)\tN\t \tmuscle excess 3 (mex-3)\tY\t \t \t \t\r\n\r\nGenes in anterior-posterior axis specification, identified from a\nwide variety of insects. 
Presence (Y) or absence (N) of orthologous\ntranscripts in the Pararge aegeria transcriptome is\nindicated.\r\n\r\nMaternal specification of embryonic posterior\r\n\r\n\t \tapontic (apt)\tN\tmago nashi (mago)\tY\t \tnanos; nanos-like (LOC100125608)\t \t(nos-like)\tY\ttsunagi/y14 (tsu/y14)\tY\t \tnanos-M (nos-M)\tY\transhi; similar to zinc finger protein\t \t195; CG9793 (ranshi)\tY\t \tnanos-P (nos-P)\tN\tglorund (glo; p67)\tN\t \tnanos-O (nos-O)\tY\tsmaug (smg)\tY\t \tshavenbaby; ovo (ovo)\tY\ttwin; CCR4 (part of CCR4-Not complex)\t \t(twin; CCR4)\tN\t \tarmitage (armi)\tY\tnot1 (part of CCR4-Not complex)\t \t(Not1)\tY\t \tarrest (also known as bruno)\t \t(aret/bru)\tY\tnot2 (part of CCR4-Not complex); Regena\t \t(Not2; Rga)\tY\t \tlasp (lasp)\tY\tnot3 (part of CCR4-Not complex); l(2)nc136\t \t(Not3)\tY\t \toskar (osk)\tN\tchromatin assembly factor 1 (part of CCR4-Not\t \tcomplex); similar to CG4236 (caf1)\tY\t \tpoly(a)-binding protein (pAbp)\tY\tPop2; similar to CG5684; CCR4-Not transcription\t \tcomplex subunit 7 (Pop2)\tY\t \tEukaryotic translation initiation factor 4AIII\t \t(eIF4AIII)\tY\thiiragi (Poly A Polymerase) (hrg; PAP)\tY\t \tbarentsz; eIF4aIII binding protein; weak localizer\t \t(wkl; btz)\tY\trabenosyn-5; rabenosyn (rbsn-5)\tY\t \tsyntaxin 1a (syx1a)\tY\typsilon schachtel (Bombyx mori Y-box protein)\t \t(yps; ybp)\tY\t \tmoesin-like; dmoesin (ezrin, radixin, moesin gene)\t \t(moe; ERM1)\tY\tubiquitin specific protease 9; fat facets\t \t(faf)\tY\t \tEukaryotic translation initiation factor 4e\t \ttransporter similar to cup (cup;\t \tfs(2)cup; fs(1)cup)\tY\thephaestus; polypyrimidine tract-binding protein;\t \theterogeneous nuclear ribonucleoprotein I\t \t(heph; ptb; hnrnp I)\tY\t \tEukaryotic translation initiation factor 2α\t \t(eIF2alpha)\tY\tsynaptotagmin (syt 1; syt)\tY\t \tmiranda (mira)\tN\tsynaptotagmin; similar to Drosophila\t \tmelanogaster extended synaptotagmin 2\t \t(esyt2)\tY\t \t\r\n\r\nGenes identified mainly from the Drosophila 
melanogaster\nliterature involved in posterior pole specification. Presence (Y) or\nabsence (N) of orthologous transcripts in the Pararge\naegeria transcriptome is indicated.\r\n\r\nDrosophila melanogaster deposits maternal hunchback (hb) transcripts in the egg, the protein product of which forms an AP gradient during early embryogenesis and cooperates with Bcd to specify the anterior of the embryo, whilst being repressed at the posterior by Nos. Although there is variation between insect species as to whether maternal hb RNA or protein is transferred to the egg, as well as in the significance of the maternal contribution to the Hb gradient for AP patterning, the transcription of hb during oogenesis appears conserved. For example, although only zygotic Hb is necessary for AP patterning in the grasshopper Schistocerca americana embryo, maternal hb transcripts appear to be involved in distinguishing embryonic from extra-embryonic cells along the AP axis, whilst in D. melanogaster maternal and zygotic Hb are redundant for AP patterning of the embryo. In B. mori, the hb transcripts detected appear to be transcribed by the zygote, not the mother. Pararge aegeria also did not express hb during oogenesis (Table 8), suggesting that Lepidoptera, or at least Ditrysia, may have dispensed with a maternal contribution to the Hb gradient in the embryo.\r\n\r\nNanos is involved in both the differentiation of the germ plasm and posterior patterning in D. melanogaster, although these two functions can be mechanistically uncoupled. Lepidopteran primordial germ cells (PGCs) develop in a midventral position and in the germ disk after blastoderm formation, not posteriorly before the blastoderm is formed as in D. melanogaster. It is therefore unlikely in Lepidoptera that the genes involved in setting up the embryonic posterior will interact with and be dependent on the genes involved in the localisation of germline determinants, as shown to occur in D. melanogaster. 
Bombyx mori contains a number of nos paralogs (nos-M, -O, -P and –like (also called –N)), which indeed appear to have divided up these functions. Although it has been argued that B. mori does not have a germ plasm, the location of maternal B. mori nos-O transcripts in the embryo seems to correspond with where the PGCs will form. These nos paralogs, with the exception of nos-P, are expressed during oogenesis in both B. mori and P. aegeria, with maternal transcripts detectable in P. aegeria eggs (Figure 4 qPCR results; Additional file 2 and Table 9). Nanos-P is primarily zygotically expressed during embryogenesis in B. mori and may be implicated in stabilising the embryonic AP-axis. The nos paralogs have also been found in the monarch butterfly (D. plexippus) genome and phylogenetic analysis of nos sequences shows nos-P to be quite different from the other paralogs (Additional file 8), suggesting it may have a different functional role.\r\n\r\nTranslational repression of D. melanogaster nos RNA is accomplished during oogenesis by proteins encoded by glorund (glo) and in the early embryo by smaug (smg). Transcripts of both are found in D. melanogaster oocytes. A P. aegeria ortholog of smg was found, which was present as RNA in the oocyte, but not of glo (Table 9 and Additional file 2). Furthermore, Smg protein bound to the nos 3’ UTR recruits the deadenylation complex CCR4-NOT in D. melanogaster. Rapid deadenylation leads to decay of nos RNA, which is essential in establishing the AP gradient of nos RNA. Although it has been argued above that Lepidoptera in all likelihood do not use nos paralogs during oogenesis in establishing the posterior, P. aegeria does express all the genes that encode proteins that form this complex, despite the absence of an obvious ortholog for twin/CCR4 (Table 9). In D. melanogaster it is the germ plasm protein Oskar (Osk) that prevents rapid deadenylation at the posterior pole, establishing nos as a posterior defining gene. 
Ditrysia appear not to possess an osk ortholog, which could be another reason why the identified nos paralogs may not be involved in AP axis formation during oogenesis. Indeed, P. aegeria also does not possess an ortholog of osk (Table 9; unpublished P. aegeria genome).\r\n\r\nGerm plasm, polar granules, nuage and p-bodies\r\n\r\nAlthough a germ plasm type structure has been identified cytologically in the moth Pectinophora gossypiella, it is not clear whether Lepidoptera possess a proper germ plasm as they lack osk, which has been argued to have been co-opted as the essential gene in germ plasm formation in holometabolous insects. Pararge aegeria may not possess an osk ortholog, but it does express two genes that in D. melanogaster silence osk translationally during oogenesis; bruno and cup (Table 9 and Additional file 1). It should be noted, however, that these genes are expressed in a number of functional contexts during oogenesis in D. melanogaster (e.g. cell cycle regulation; references in Additional file 1). As part of the germ plasm, Oskar induces polar (or germ) granule formation and in doing so interacts with a number of genes that characterise these polar granules, in particular tudor (tud), vasa (vas) and valois (vls). Only valois (vls) could not be found in the P. 
aegeria transcriptome (Tables 9 and 10).\r\n\r\nOvarian nuage and piRNA pathway\r\n\r\n\t \tcapsuléen; Arginine n-methyltransferase 5\t \t(csul; prmt5)\tY\ttejas; similar to tudor domain containing\t \t5 (tej; TDRD5)\tY\t \tvalois (vls)\tN\tvreteno; similar to CG4771 (vret)\tN\t \taubergine (related to eIF2c; a piwi protein)\t \t(aub)\tY\tsimilar to tudor domain containing CG9925 and CG9684\t \t(TDRD1)\tY\t \tATP-dependent helicase; cap; belle (cap;\t \tbel)\tY\tsimilar to CG8920; similar to tudor domain\t \tcontaining 7 (TDRD7)\tY\t \tcutoff (cuff)\tN\thomeless; fs(3); spindle E; similar to tudor\t \tdomain containing 9 (hls; spnE; TDRD9)\tY\t \tsquash (squ)\tN\tCG14303; similar to tudor domain containing\t \t4 (TDRD4)\tN\t \tpiwi-like protein; argonaute 3 (AGO3;\t \tsiwi)\tY\ttudor-SN (tudor-SN)\tY\t \tzucchini (zuc)\tN\tBrother of Yb; CG11133 (BoYb)\tN\t \ttudor; similar to tudor domain containing\t \t6 (tud)\tY\tSister of Yb; CG31755 (SoYb)\tN\t \tkrimper (mtc; krimp)\tN\t \t \t \t\r\n\r\nOvarian nuage and piRNA pathway genes identified mainly from the\nDrosophila melanogaster literature. Presence (Y) or\nabsence (N) of orthologous transcripts in the Pararge\naegeria transcriptome is indicated.\r\n\r\nBoth the ovarian nuage, an electron-dense perinuclear structure found predominantly in nurse cells, and polar granules are characterised by a number of the same genes, including tud, vas and vls (references in Additional file 1). The nuage appears not only to play a role in protecting germline cells against the expression of selfish genetic elements in the majority of animals, but also in establishing the polar granules in D. melanogaster. It is therefore not surprising that PIWI proteins and their bound PIWI-interacting RNAs (piRNAs) have been identified as important for both nuage and polar granule formation. Many of these genes encode TUDOR-domain containing proteins and seem to evolve rapidly making it difficult to find orthologs outside Drosophila; e.g. 
vreteno (vret), Brother of Yb (BoYb) and Sister of Yb (SoYb). Indeed, no orthologs of these genes could be found in the P. aegeria transcriptome (Table 10). Other genes encoding TUDOR-domain containing proteins seem more conserved, such as TDRD1, tejas (TDRD5), TDRD7 and spindle E/homeless (TDRD9), and these were expressed by P. aegeria (Table 10). What is interesting about TDRD7 is that it shares the OST-HTH/LOTUS functional domain with osk. It is likely that this domain is involved in RNA binding and thus in regulating mRNA translation and/or localisation in germ cell development.\r\n\r\nThere are three genes that encode PIWI proteins: piwi, aubergine (aub) and argonaute 3 (AGO3). All three were expressed during oogenesis by P. aegeria (Figure 4 qPCR results; Tables 1 and 10). Piwi also plays an essential role in the D. melanogaster germarium and is thus involved in the establishment, maintenance and renewal of germline stem cells. Furthermore, mutations in D. melanogaster piRNA (Piwi-interacting RNA) pathway genes often disrupt the axes of the developing oocyte, through their effects on the microtubule cytoskeleton; for example maelstrom (mael), zucchini (zuc) and squash (squ) affect DV polarity. The latter two also interact with aub in D. melanogaster in silencing osk translation during oogenesis. Similarly, the RNAi pathway gene armitage (armi) affects axis formation and is involved in osk translational silencing in D. melanogaster. Neither zuc nor squ was found in the P. aegeria transcriptome, but mael and armi were (Tables 7 and 10).\r\n\r\nOvarian processing bodies (i.e. P-bodies) are aggregates of translationally inactive ribonucleoproteins (RNPs). In D. melanogaster these can be found in nurse cells, but also appear to be involved in compartmentalisation of mRNA decay and translation repression, for example of osk. With the exception of EDC4/Ge-1 and pacman (pcm), genes that encode the essential components of P-bodies were expressed in P. 
aegeria (described in the context of oogenesis or otherwise, Table 11 and references in Additional file 1). RNAs of P-body components, for example Dcp1, are also transferred to oocytes during D. melanogaster oogenesis and are necessary for early embryogenesis. This was also observed in P. aegeria (Additional file 2).\r\n\r\nOvarian processing bodies\r\n\r\n\t \tNonsense-mediated mRNA 3 (Nmd3)\tY\ttelomerase-binding protein est1a; similar to\t \tsmg6 homolog, nonsense mediated mRNA decay\t \tfactor (smg6)\tY\t \tregulator of nonsense transcripts 1; nonsense mRNA\t \treducing factor 1; up-frameshift suppressor 1\t \thomolog (rent1; norf1; Upf1)\tY\tdecapping protein 1 (Dcp1)\tY\t \tsimilar to Upf2 regulator of nonsense transcripts\t \thomolog (Upf2)\tY\tdecapping protein 2 (Dcp2)\tY\t \tsimilar to Bombyx mori Upf3 regulator of nonsense\t \ttranscripts-like protein B (Upf3)\tY\tpacman; 5'-3' exoribonuclease 1 (XRN1;\t \tpcm)\tN\t \tno-on-and-no-off-transient C (smg1)\tY\tEDC4; Ge-1 (Ge-1)\tN\t \tsmg5 (smg5)\tY\t \t \t \t\r\n\r\nOvarian processing bodies genes identified mainly from the\nDrosophila melanogaster literature. Presence (Y) or\nabsence (N) of orthologous transcripts in the Pararge\naegeria transcriptome is indicated.\r\n\r\nOnce the germ plasm has been established at the posterior in D. melanogaster, a number of (late-acting) maternal-effect genes are essential in germline formation during early embryogenesis (further references in Additional file 1). Pararge aegeria females do express similar genes to the fruit fly, including genes associated traditionally with D. melanogaster pole plasm, such as arrest/bruno (aret) and imp. However, there are some notable exceptions, the most significant of which are germ cell-less (gcl) and polar granule component (pgc) (Tables 12 and 13, and Additional file 1). These genes are essential in D. melanogaster, but there are no known pgc orthologs outside the genus Drosophila. 
Although orthologs can be found for gcl even in vertebrates, none can be found in genomic databases for the Lepidoptera, including the new data presented here. The gene wunen (wun) is involved in germ cell migration in D. melanogaster embryos (references in Additional file 1). Pararge aegeria females also include wun transcripts in the oocyte (Table 13 and Additional file 1).\r\n\r\nGerm plasm formation and germline viability\r\n\r\n\t \trab-protein 11 (rab11)\tY\tgerm cell-less (gcl)\tN\t \trab-protein 5 (rab5)\tY\tstambha a; CG8739; protein efr3 homolog b;\t \trolling blackout (cmp44e ; stma)\tY\t \tskittles; pip5k (type 1) (pip5k)\tY\tmyoglianin (myo; myg )\tN\t \trap1 GTPase activating protein (rapgap)\tY\tmitochondrial small ribosomal RNA (mtsrRNA; 12s\t \trRNA)\tN\t \t\r\n\r\nGenes identified mainly from the Drosophila melanogaster\nliterature involved in germ plasm (i.e. pole plasm in D.\nmelanogaster) formation - Control of endocytosis in\ngermline and germline viability. Presence (Y) or absence (N) of\northologous transcripts in the Pararge aegeria\ntranscriptome is indicated.\r\n\r\nMaternal effect genes\r\n\r\n\t \tabstrakt (abs)\tY\tjafrac1; thioredoxin peroxidase 1; thiol\t \tperoxiredoxin (jafrac1; dpx-4783)\tY\t \tterribly reduced optic lobes; perlecan; zeste-white\t \t1 (trol; pcan; zw1)\tY\tdeadhead; thioredoxin (trx-1; trx)\tN\t \tTBC1 domain family member 1; weakly similar to\t \tDrosophila melanogaster pollux (plx)\tY\tthioredoxin-like; similar to Bombyx mori\t \tthioredoxin (trxl)\tY\t \tout at first (oaf)\tY\tthioredoxin-2; similar to Bombyx mori\t \tthioredoxin-like (trx2)\tY\t \textra macrochaetae (emc)\tY\tyema gene 2.8 (yemg2.8)\tN\t \twings up a; troponin 1 (tn1; tpn1; wupa)\tY\tyema gene 3.4 (yemg3.4)\tN\t \ttroponin c (tpnc; tnc47d)\tY\tyema gene 3a (yemg3a)\tN\t \ttroponin t; wings up b; upheld (tpnt;\t \twupb)\tY\tyema gene 3b (yemg3b)\tN\t \ttropomyosin 1 or 2 (tm1; tm2)\tY\tyema gene 3c (yemg3c)\tN\t \talcohol dehydrogenase 
(adh)\tY\tyema gene 4 (yemg4)\tN\t \tpolar granule component (pgc)\tN\tyema gene 9.5 (yemg9.5)\tN\t \ttype III alcohol dehydrogenase; iron-containing\t \tdehydrogenase (t3dh; adhfe1)\tY\tyemanuclein α; similar to ubinuclein\t \t(yemalpha)\tY\t \tplutonium (plu)\tN\twings down; pourquoi-pas; serendipity-cognate\t \t(pqp; wdn; sry-h1)\tY\t \tpan gu (png)\tN\tserendipity delta; serendipity δ\t \t(sry-delta)\tY\t \tgiant nuclei (gnu)\tN\tserendipity α (sry-alpha)\tY\t \tgerm cell guidance factor wunen; phosphatidate\t \tphosphatase (wun)\tY\theat shock RNA ω (hsr-omega)\tN\t \treceptor for activated protein kinase c rack 1\t \t(rack1)\tY\ttiovivo; nebbish; kinesin-like protein at 38b\t \t(klp38b; tio; neb)\tN\t \tshuttle craft; transcriptional repressor nf-x1\t \t(stc)\tY\tGTP-binding protein alpha-subunit; G protein α\t \t73b (Galpha73b)\tN\t \tmuscleblind (mbl)\tY\tGuanine nucleotide-binding protein G(I) subunit\t \t(GalphaI)\tN\t \tgrainyhead (NTF-1; grh)\tY\tG protein β-subunit 13f; heterotrimeric guanine\t \tnucleotide-binding protein beta subunit (Bombyx\t \tmori) (Gbeta13f)\tY\t \tdorsal (Drosophila); embryonic polarity protein dorsal\t \t(Bombyx - 2 isoforms) (dl)\tY\tG protein γ 1; CG8261 (Ggamma1; bro4\t \t)\tY\t \tdorsal switch protein (dsp1; ssrp2)\tY\tprotein tyrosine phosphatase 69d (ptp69d)\tN\t \ttosca; exonuclease 1 (tos)\tY\tsimilar to serine/threonine kinase pelle; homologous\t \tto irak-4 (pll)\tY\t \tDarkener of apricot; dual specificity protein kinase\t \tclk2 (Doa)\tY\tgastrulation-defective (gd)\tY\t \tclipper; cleavage and polyadenylation specific factor\t \t4 (clp; cpsf30)\tY\tshort gastrulation (sog )\tN\t \tvrille (vri; jf23)\tY\ttube (tub)\tY\t \tabsent md neurons and olfactory sensilla\t \t(amos)\tN\tsimilar to Bombyx mori spätzle 1 (spz)\tY\t \tbaboon; activin receptor type 1 (ATR1)\tY\tweckle (wek)\tN\t \teyelid; osa (eld; osa)\tY\tcactus (cact)\tY\t \tgonadal (gdl)\tY\tBzArgOEtase (Bombyx mori); similar to easter;\t \tclip-domain serine 
protease subfamily B\t \t(ea)\tY\t \téclair; transmembrane emp24 protein transport\t \tdomain containing 9 (eca)\tY\tsimilar to snake (Drosophila melanogaster); similar\t \tto serine protease 21 (Manduca sexta); clip-domain\t \tserine protease subfamily c (snk)\tY\t \tbaiser; transmembrane trafficking protein\t \t(bai)\tY\ttoll (tl)\tN\t \tlogjam (loj)\tY\tsimilar to Bombyx mori calpain; weakly similar to\t \tDrosophila melanogaster Calpain-A\t \t(CalpA)\tY\t \tbancal; (similar to) heterogeneous nuclear\t \tribonucleoprotein K (hrb57A; q18)\tY\tsimilar to brokenheart; similar to G protein\t \toalpha 47A; Guanine nucleotide-binding protein G(o)\t \tsubunit alpha; G protein alpha subunit go\t \t(G-olpha47A)\tY\t \tmaternal transcript 89BA (mat89BA)\tN\tconcertina; Guanine nucleotide-binding protein subunit\t \talpha-13 (conc)\tN\t \tasunder; maternal transcript 89BB (mat89BB;\t \tasun)\tY\tSNF1A/AMP-activated protein kinase - alpha subunit\t \t(SNF1-AMPK-alpha subunit)\tY\t \tdiadenosine tetraphosphatase; similar to\t \tbis(5-nucleosyl)-tetraphosphatase\t \t(datp)\tY\tSNF1A/AMP-activated protein kinase - beta subunit\t \t(SNF1-AMPK-beta subunit)\tY\t \tdopa decarboxylase; aromatic-l-amino-acid\t \tdecarboxylase (ddc)\tY\tSNF1A/AMP-activated protein kinase - gamma subunit\t \t(SNF1-AMPK-gamma subunit)\tY\t \thairless (h)\tN\tIGF-II mRNA-binding protein (imp; MRE11)\tY\t \tsuppressor of hairless; j kappa-recombination\t \tsignal-binding protein (su(h))\tY\tsimilar to G protein alpha q; G protein α49b\t \t(Gαq; Galpha49b)\tY\t \ttranscription termination factor lodestar; horka\t \t(horka; ids)\tY\tmap kinase activated protein-kinase-2 (mk2;\t \tMAPK-ak2)\tY\t \traspberry; inosine monophosphate dehydrogenase\t \t(ras)\tY\tptb-associated splicing factor; weakly similar to\t \tDrosophila no on or off transient a\t \t(psf)\tY\t \tmisato (mst; lb20)\tY\tpalmitoyl-protein thioesterase 1 (ppt1)\tY\t \tpeanut; similar to septin 7\t \t(pnut)\tY\tabl tyrosine kinase (abl)\tY\t 
\tseptin 1; innocent bystander (sep-1; iby)\tY\tAbelson interacting protein (Abi)\tY\t \tseptin 2 (sep-2)\tY\twing blister; homologous to laminin alpha 2\t \t(merosin) (wb)\tN\t \tseptin and tuftelin interacting protein; elongator\t \tcomplex protein 2; septin interacting protein 1\t \t(stip)\tY\tsupervillin; CG33232 (svil)\tY\t \tkurz; similar to ATP-dependent RNA helicase\t \tdhx37 (kz)\tY\tcyclope; cytochrome c oxidase subunit vic\t \t(cype)\tY\t \tpebble (pbl)\tY\tla autoantigen-like (la)\tY\t \tnumb (numb; nb)\tY\ttramtrack (ttk; ttk69)\tY\t \tcatalase (cat)\tY\thigh mobility group protein b1; dorsal switch protein\t \t1 (HMGb1; dsp1; ssrp2)\tY\t \tsuperoxide dismutase (sod1; csod;\t \tcu/znsod)\tY\tzinc finger protein 43c (az2)\tN\t \tdisc proliferation abnormal (mcm4; dpa)\tY\tmaverick (mav)\tN\t \tFragile x mental retardation 1 (Fmr1)\tY\tshibire; dynamin (shi; dyn)\tY\t \tfemale sterile (2) ketel; karyopherin beta 1; importin\t \tβ (ketel; imp-beta)\tY\tprotein o-fucosyltransferase 1; similar to\t \tBombyx mori fut12 gene (pofut1)\tY\t \tkaryopherin beta 3 (karyβ3)\tY\tprotein o-fucosyltransferase 2; similar to\t \tBombyx mori fut13 gene (pofut2)\tY\t \tcas/cse1 segregation protein; export karyopherin\t \tcas/cse1p (cas)\tY\tsimilar to bloated tubules; sodium/chloride dependent\t \ttransporter (blot)\tY\t \timportin alpha 1; karyopherin α1 (imp\t \talpha 1)\tY\tgastrulation defective protein 1 homolog;\t \tCG5543; similar to WD repeat-containing 70\t \tprotein (CG5543)\tY\t \timportin alpha 2; karyopherin α2; pendulin\t \t(imp alpha 2)\tY\thigh mobility group protein 20a (HMG20a)\tY\t \timportin alpha 3; karyopherin α3 (imp\t \talpha 3)\tY\thigh mobility group box-containing protein 4; hmg-box\t \tprotein hmg2l1 (HMGx4)\tY\t \timaginal disc growth factor 1 (idgf;\t \tidgf1)\tY\tcalcium atpase at 60a; sarcoplasmic/endoplasmic\t \treticulum calcium atpase (serca; kum; dserca;\t \tcap60a)\tY\t \timaginal disc growth factor 2 (idgf2)\tN\tdacapo (chakra; 
dap)\tN\t \timaginal disc growth factor 3 (idgf3)\tN\tliprin-α (liprin-a)\tN\t \timaginal disc growth factor 4 (idgf4)\tN\tmitochondrial acyl carrier protein 1; nadh-ubiquinone\t \toxidoreductase acyl carrier protein\t \t(mtacp1)\tN\t \tkinesin-like protein at 61f; urchin; kinesin-like\t \tprotein klp2 (in Bombyx mori) (klp61f;\t \tklp2)\tY\tmitochondrial assembly regulatory factor; mitofusin\t \t(marf; mfn; mfn2)\tY\t \tpuromycin sensitive aminopeptidase (psa)\tY\tripped pocket; gonad-specific amiloride-sensitive sodium\t \tchannel 1 (rpk; gnac1)\tN\t \tcask ortholog; calmodulin-dependent kinase\t \t(caki; cmg; camguk)\tY\tkurtz; similar to beta-arrestin 1\t \t(krz)\tY\t \tsignal transducing adaptor molecule (stam)\tY\tubiquitin carboxy-terminal hydrolase; CG4265\t \t(uch)\tY\t \thistone acetyltransferase kat2b; histone\t \tacetyltransferase pcaf; general control of amino acid\t \tsynthesis protein 5-like 2 (pcaf; gcn5)\tY\tlark (lark)\tY\t \tada2b (ada2b)\tY\tsemaphorin-5c (sema-5c)\tN\t \ts-adenosyl-methyl transferase mraw; CG14683\t \t(mraw)\tY\tsemaphorin 1b (sema-1b )\tN\t \tc-terminal binding protein; hairy-interacting\t \tprotein; similar to 2-hydroxyacid\t \tdehydrogenase (ctbp)\tY\tselenophosphate synthetase 1; selenide, water\t \tdikinase (sps1 )\tY\t \treticulated (ret)\tN\tsodium/potassium exchanging and transporting ATPase\t \tsubunit beta 1 nervana 1 (nrv1)\tY\t \tfurin 1; similar to convertase\t \tsubtilisin/kexin; similar to furin-like\t \tconvetase (fur1 )\tN\tsodium/potassium exchanging and transporting ATPase\t \tsubunit beta 2 nervana 2 (nrv2)\tY\t \twindbeutel; thioredoxin-like motif containing gene\t \t(wbl)\tY\t \t \t \t\r\n\r\nMaternal transcripts; Maternal effect genes identified mainly from\nthe Drosophila melanogaster literature encoding various\ntypes of proteins, including enzymes, needed for early embryogenesis\nand germ cell formation. 
Presence (Y) or absence (N) of orthologous\ntranscripts in the Pararge aegeria transcriptome is\nindicated.\r\n\r\nMaternal transcripts involved in regulating early embryogenesis – dorsal-ventral patterning of the embryo and early neurogenesis\r\n\r\nDrosophila melanogaster uses an elaborate network of genes to pattern the DV axis during embryogenesis on the basis of the oocyte polarity established during oogenesis (further references in Additional file 1). As discussed elsewhere in this paper, the two genes essential for establishing DV polarity in D. melanogaster oocytes, grk and pipe (the latter of which is repressed dorsally), were absent from the P. aegeria transcriptome. The genes that are subsequently involved in establishing the ventral side of the D. melanogaster embryo are co-opted from the Toll innate immune defense pathway (including a serine protease cascade). A similar cascade has been described in T. castaneum, but at present it is not known whether it is restricted to the ventral perivitelline space. This protease cascade and associated (ventral) genes were also expressed in P. aegeria, but at present it is unclear in which functional context they are used. These genes include windbeutel (wind), nudel (ndl), gastrulation defective (gd), snake (snk), easter (ea), spn27A, spz, tube (tub) and pelle (pll) (Tables 7 and 13; Additional files 1 and 2). No orthologs for the zinc-finger gene weckle (wek) have yet been found outside Drosophila, and wek was also not found in P. aegeria (Table 13). In D. melanogaster, Toll receptor protein accumulates during the embryonic syncytial stage prior to nuclear migration, and is activated ventrally as the result of a serine protease cascade (references in Additional file 1). The Toll-like receptor expressed by P. aegeria during oogenesis was found to be an ortholog of 18 wheeler (18w), rather than toll (tl) (Tables 6 and 13). In D. 
melanogaster 18w is involved in dorsal appendage formation and follicle cell migration, and DV patterning. While P. aegeria eggs do not have dorsal appendages, 18w may be involved in DV patterning. In D. melanogaster 18w expression in relation to eggshell patterning, and thus DV polarity, is dependent on input from Dpp and EGF signalling pathways. As discussed elsewhere in the paper, there is not much evidence for EGF signalling in P. aegeria oogenesis, but there is for Dpp signalling (e.g. Figure 4 qPCR results). Furthermore, analyses of Toll receptors have shown that B. mori tl and 18w sequences were more similar to each other than to D. melanogaster toll. It thus remains to be investigated exactly which functional role 18w fulfils during oogenesis in Lepidoptera.\r\n\r\nPararge aegeria did express cactus (cact) and dorsal (dl) (Table 13). Dorsal protein is distributed evenly in a D. melanogaster embryo, but a gradient in the uptake of Dorsal protein into the nucleus (high on the ventral side) is essential for subsequent DV patterning in the D. melanogaster embryo. Dorsal protein activates some genes, whilst repressing others along the DV axis. While there are some differences in detail, the gene regulatory network underlying embryonic DV patterning is largely conserved in all insects. The Dorsal protein represses dpp ventrally and the protein encoded by grainyhead (NTF-1/grh) acts as co-repressor. RNA of grh is deposited maternally into the oocyte to be translated and used ventrally during embryogenesis. Repression of dpp by a Dorsal gradient does not, however, occur in T. castaneum. A high concentration of Dpp will eventually be restricted to the dorsal side of the D. melanogaster embryo and its concentration is further restricted ventro-laterally by Short gastrulation (Sog), which in D. melanogaster may also be maternally provided. 
Rather interestingly, this antagonistic interaction between Dpp and Sog may already be employed during oogenesis for the establishment of DV polarity in the oocyte. The vrille (vri) gene encodes a bZIP transcription factor that interacts in D. melanogaster with Dpp signalling, acting as a dominant maternal enhancer of embryonic DV patterning defects caused by ea and dpp mutations. Two P24 proteins encoded by eclair (eca) and baiser (bai) are essential for the activity of maternal Tkv, a type I Dpp receptor. Pararge aegeria females did transfer maternal transcripts of grh, dpp, tkv, eca, bai and vri into the oocyte, but did not express sog maternally (Figure 4 qPCR results; Tables 3 and 13; Additional files 1 and 2).\r\n\r\nDrosophila melanogaster females express a group of genes called the yema genes (yema 2.8, 3.4, 3a, 3b, 3c, 4 and 9.5) during oogenesis, with most of them displaying strict maternal expression. This may be of importance in the development of the central nervous system of the embryo. However, the exact functional roles of the yema genes are not known and there are no orthologs outside Drosophila. No orthologs were found for these genes in the P. aegeria transcriptome (Table 13 and Additional file 1). Pararge aegeria females did, however, express a number of other genes that are implicated in embryonic brain development or in general in the nervous system; e.g. neuralized (neu), elav, brainiac (brn), Fmr1, brain tumor (brat), mnb, and terribly reduced optic lobes (trol) (Tables 3, 6 and 13; Additional file 1). Of these, mnb and elav have not been explicitly studied in the context of oogenesis (references in Additional file 1). Although maternal transcripts of these genes may play a role in embryonic neural development in D. melanogaster, these genes appear to be important in establishing polarity of the oocyte and its differentiation during oogenesis (references in Additional file 1). 
The expression of three of these was further investigated by means of qPCR: elav, Fmr1 and the serine/threonine kinase-encoding mnb (Figure 4 qPCR results). To date, of these three, only Fmr1 has been described as present in D. melanogaster oocytes, but elav, Fmr1 and mnb were all found in P. aegeria oocytes (Figure 4 qPCR results). Compared to the ovaries, the amount of elav and Fmr1 transcripts in the oocytes was quite low (Figure 4 qPCR results; Additional file 2), suggesting they are important during oogenesis. Whether these genes play a role of significance in establishing oocyte polarity in P. aegeria needs to be investigated.\r\n\r\nTerminal genes\r\n\r\nThe Torso receptor tyrosine kinase (RTK) pathway has been implicated in a number of different processes during D. melanogaster oogenesis, including vitelline membrane (or envelope) biogenesis and in particular terminal region specification. The maternal-effect gene torso (tor) encodes a receptor whose ligand is most probably encoded by trunk (trk). Furthermore, the protein encoded by torsolike (tsl) plays a role upstream of trk in activating the Tor receptor in a localised manner, and is thought to be essential for terminal specification. Although both tor and tsl are involved in terminal specification in T. castaneum, different tissues are patterned and Torso signalling plays a role in defining the posterior growth zone during embryogenesis in this short germband insect. Torso signalling is by no means the default mechanism for terminal specification, as the honey bee (Apis mellifera) has the gene tsl, but not tor and trk in its genome. The honey bee seems to rely on other mechanisms for terminal specification. Pararge aegeria does not express clear orthologs of either tor or trk during oogenesis, but does express tsl (Table 14). Bombyx mori does have an RTK in its genome (BGIBMGA003976), which shows similarity to torso, as well as to tie-like and Cad96Ca. 
Pararge aegeria did not express tie-like (Table 6), but did express Cad96Ca (PACG18092; Additional file 2). This transcript was not present in oocytes and was found only in the ovarioles (Additional file 2). Furthermore, a TBLASTN of the putative B. mori tor against the P. aegeria transcriptome showed that transcript PACG7078 (complete CDS; Additional file 2) was similar (E-value = 5.0E-50), although it had greater similarity to the receptor tyrosine kinase Fps85D than to tor. This transcript is present in both P. aegeria oocytes and ovarioles, but its role in oogenesis has not been described in the literature. It is clear that P. aegeria uses RTK signalling during oogenesis and that the sequences of its ligands and receptors have diverged from those of other insects. However, at present it is unclear in which functional context RTK signalling takes place.\r\n\r\nTerminal specification\r\n\r\n\t \tcorkscrew; similar to protein tyrosine\t \tphosphatase, non-receptor type 11 (csw;\t \tptpn11)\tY\traf; raf1; pole hole; raf kinase; effector of ras\t \t(raf; raf1; phl)\tY\t \tdead ringer (dri)\tY\tsignal transducer and activator (stat) (stat;\t \tstat92e)\tY\t \ttorso (tor)\tN\trolled; map kinase (MAPK) (rl; MAPK; erk)\tY\t \ttorsolike (tsl)\tY\tdownstream of raf1 (dsor1)\tN\t \ttrunk (trk)\tN\themipterous; mitogen-activated protein kinase\t \tkinase (hep; MAPKK; mkk7)\tY\t \tfemale sterile (1) homeotic; fragile-chorion membrane\t \tprotein (fs(1)h)\tY\tgrowth arrest and DNA-damage inducible 45\t \t(gadd45)\tN\t \tras1 (ras1; ras85d)\tY\tshc-adaptor protein; shc-transforming protein 1; src\t \thomology 2 domain containing; CG3715 (shc)\tN\t \t\r\n\r\nGenes identified mainly from the Drosophila melanogaster\nliterature involved in terminal specification. 
Presence (Y) or\nabsence (N) of orthologous transcripts in the Pararge\naegeria transcriptome is indicated.\r\n\r\nChromatin regulation during oogenesis, DNA replication, general transcription and maternal regulation of zygotic transcription in general\r\n\r\nIn general, the genes that encode proteins involved in chromatin remodelling, DNA replication and transcription are highly conserved across insects and often across the Metazoa in general (references in Additional file 1). A large number of these genes have been studied specifically in the context of oogenesis in D. melanogaster (Table 15; references in Additional file 1). Pararge aegeria was found to express orthologs of a number of these genes (Table 15 and Additional file 1). The genes not expressed by P. aegeria seem to either have no clear insect orthologs outside Drosophila, or no such orthologs have been reported in Lepidoptera, such as B. mori. Genes not expressed by P. aegeria, but for which Lepidopteran orthologs exist, include TATA box binding protein-related factor 2 (Trf2), sex combs on midleg (scm), and Arginine methyltransferase 1 and 8 (DART1 and DART8; Table 15 and Additional file 1). The gene scm is a member of the polycomb group (PcG) and similar to the D. melanogaster polyhomeotic (ph-p) gene. Both play versatile and important roles in D. melanogaster oogenesis, particularly in ovarian follicle formation. Pararge aegeria females did express and transfer orthologs of other PcG genes into the oocyte. These include the polycomb repressive complex 1 (PRC1) genes sex combs extra (sce), polycomb (ph) and posterior sex combs (psc), the PRC2 genes extra sex combs (esc) and Enhancer of zeste (E(z)), and the polycomb-related genes Enhancer of polycomb (E(pc)) and additional sex combs (asx) (Table 15, Additional files 1 and 2; references therein). Recently these genes have also been identified in B. mori embryogenesis. 
These genes encode proteins that regulate DNA and histone methylation patterns and general chromatin remodelling. However, they also appear to be important specifically during oogenesis and embryogenesis and may be implicated in transferring gene regulatory states from one generation to the next, being regarded as candidate genes in epigenetic processes, with possible involvement in transgenerational effects in relation to environmental heterogeneity.\r\n\r\nRegulation of transcription and chromatin structure\r\n\r\n\t \tDNA polymerase α 180KD; DNA polymerase alpha\t \tcatalytic subunit (DNApol-α180)\tY\thomolog of regulator of chromatin condensation 2;\t \tsimilar to CG9135 (rcc2)\tY\t \tRNA polymerase II transcriptional coactivator single\t \tstranded-binding protein c31a (ssb-c31a)\tY\tDNA polymerase interacting tpr containing protein\t \t(dpit47)\tY\t \tpolyadenylate-binding protein 2 (rox2;\t \tpabp2)\tY\tDNA polymerase α (180kD)\t \t(DNApol-α180; pola)\tY\t \thigh mobility group protein; structure specific\t \trecognition protein. 
fact complex subunit ssrp1\t \t(ssrp; ssrp1)\tY\tDNA polymerase delta (DNApol-delta)\tY\t \tsimilar to Drosophila melanogaster high mobility group\t \tprotein d; similar to Bombyx mori high mobility\t \tgroup protein 1b (HMGd; HMG1b)\tY\tDNA polymerase ϵ (DNApol-ϵ;\t \tpole)\tY\t \tdomina; jumeau (jumu/dom)\tY\tsimilar to DNA polymerase ϵ subunit 2\t \t(DNApol-ϵ; pole2)\tY\t \tmodulo (mod)\tN\tsimilar to DNA polymerase ϵ subunit 3\t \t(DNApol-ϵ; pole3)\tY\t \tlysine-specific histone demethylase 1; suppressor of\t \tvariegation 3–3 (suv3-3; su(var)3-3;\t \tlsd1)\tY\tDNA polymerase eta (DNApol-eta; drad30a)\tY\t \thistone methyltransferase 4–20; suppressor of\t \tvariegation 4–20 (suv4-20;\t \tsu(var)4-20)\tY\tDNA polymerase iota (drad30b; DNApol-iota)\tY\t \tDrosophila melanogaster suppressor of variegation\t \t3–9 (suv3-9; su(var)3-9)\tY\tDNA polymerase zeta; similar to\t \tmutagen-sensitive 205; rev3-like\t \t(DNApol-zeta; mus205)\tY\t \tpitkin(dominant) (ptn(d))\tN\treplication protein a1 (rpa1)\tY\t \tEukaryotic translation initiation factor 2 gamma\t \tsubunit (eIF2g)\tY\treplication protein a2 (rpa2)\tY\t \tsuppressor of variegation 2–10; protein inhibitor\t \tof activated stat (su(var)2-10; pias; zimp;\t \tzimpb;)\tY\treplication protein a3 (rpa3)\tY\t \teggless (egg; SETDB1)\tY\treplication factor c 38kD subunit (rfc38)\tY\t \thistone h3k9 methyltransferase dg9A (g9A)\tN\t(Bombyx mori) replication factor c subunit 2; rfc40\t \t(rfc40; bm- rfc2)\tY\t \tmodifier of mdg4 (mod(mdg4); e(var)3-93d)\tY\t(Bombyx mori) replication factor c4; CG8142\t \t(bm-rfc4)\tY\t \tsuppressor of hairy wing (su(hw))\tY\t(Bombyx mori) replication factor c (activator 1) 5;\t \tDrosophila replication factor c subunit 3\t \t(rfc3)\tY\t \ttrithorax-like (trl; GAGA; gaf; e(var)3;\t \te(var)62)\tN\tgerm line transcription factor 1; replication factor\t \t1 (rfc1; gnf1)\tY\t \tbrahma; SWI/SNF-related matrix-associated\t \tactin-dependent regulator of chromatin subfamily A\t \tmember; 
transcription activator brg1 (smarca4;\t \tbrm)\tY\trecombination repair protein 1 (rrp1)\tY\t \tmarcal1; SWI/SNF-related matrix-associated\t \tactin-dependent regulator of chromatin subfamily A\t \tmember (marcal1; smarcal1)\tY\trev7 (rev7)\tN\t \tsnf5-related 1; SWI/SNF-related matrix-associated\t \tactin-dependent regulator of chromatin subfamily B\t \tmember 1 (snr1; bap45)\tY\ttrf4-1; sigma DNA polymerase (trf4-1)\tY\t \tbrg-1 associated factor; SWI/SNF-related\t \tmatrix-associated actin-dependent regulator of chromatin\t \tsubfamily d member 1; brahma associated protein\t \t60kD (bap60)\tY\ttopoisomerase 1; topoisomerase i (top1)\tY\t \tdalao; brahma-associated protein 111kD; SWI/SNF-related\t \tmatrix-associated actin-dependent regulator of chromatin\t \tsubfamily E (bap111; dalao)\tY\ttopoisomerase 2; topoisomerase II (top2;\t \ttopII)\tY\t \tmoira (mor; bap155)\tY\ttopoisomerase 3 alpha; topoisomerase III aplha\t \t(topIII-alpha)\tY\t \timitation swi (dnurf; iswi; dchrac)\tY\ttopoisomerase 3 beta; topoisomerase III beta\t \t(topIII-beta)\tY\t \tBrahma associated protein 170kD (bap170)\tY\tminichromosome maintenance 3 (mcm3)\tY\t \tBrahma associated protein 55kD (bap55)\tY\tminichromosome maintenance 5 (mcm5)\tY\t \thelicase domino (dom)\tY\tminichromosome maintenance 6; fs(1)k1214\t \t(mcm6)\tY\t \tetl1 homologue; SWI/SNF-related matrix-associated\t \tactin-dependent regulator of chromatin subfamily A\t \tcontaining dead/h box 1 (etl1; smarcad)\tY\tminichromosome maintenance 7 (mcm7)\tY\t \tEnhancer of zeste (E(z))\tY\tminichromosome maintenance 8;\t \trecombination-defective (mcm8; rec)\tY\t \textra sex combs (esc)\tY\tDNA methyltransferase 2 (mt2)\tY\t \tadditional sex combs (asx)\tY\tpoly-(adp-ribose) polymerase (parp)\tY\t \tsex comb on midleg (scm)\tN\tTATA box binding protein-related factor 2\t \t(Trf2; tlf)\tN\t \tmulti sex combs (mxc)\tN\tTATA box binding protein (Tbp)\tY\t \tpolyhomeotic (ph-p)\tN\ttbp-associated factor 250kD (taf250; taf1)\tY\t 
\tsex combs extra; similar to E3\t \tubiquitin-protein ligase ring1 (Bombyx mori)\t \t(sce; dring)\tY\ttrithorax-related (trr)\tY\t \tpolycomb (ph)\tY\tsupercoiling factor (scf; dcb-45)\tY\t \tEnhancer of polycomb (E(pc))\tY\tbx42; ski-interacting protein (skip)\tY\t \tposterior sex combs (psc)\tY\tboundary element-associated factor of 32KD\t \t(beaf32)\tN\t \tlethal (3) 73ah; similar to polycomb group ring\t \tfinger protein 3 (l(3)73ah)\tY\tHistone h4 (H4)\tY\t \tactivating transcription factor; homologous to\t \tBombyx activating transcription factor of\t \tchaperone (atf-2)\tY\tHistone h3.3 (H3.3)\tY\t \tcyclic-amp response element binding protein\t \t(1,2,3)(creb; dcreba)\tY\tHistone h2a (H2a)\tY\t \tcreb binding protein; similar to nejire\t \t(crebbp(a))\tY\tHistone h2a variant (H2a.v)\tY\t \tretinoblastoma binding protein (rbp)\tY\tmutagen-sensitive 308 (PolQ; mus308 )\tY\t \tretinoblastoma binding protein 2 (jumonji/arid domain\t \tcontaining); little imaginal discs (rbp2;\t \tlid)\tY\trpd3 (hdac1; rpd3; hdac)\tY\t \tsimilar to retinoblastoma binding protein 6\t \t(rbp6)\tY\tmbd-like (mbd2/3; mbd-like)\tY\t \ttousled-like kinase (tlk)\tY\tmediator complex subunit 6 (med6 )\tY\t \tno child left behind; similar to wd repeat\t \tprotein (nclb)\tY\tmitochondrial single stranded DNA-binding protein\t \t(mtssb)\tY\t \tArginine methyltransferase 1; Arginine\t \tn-methyltransferase 1 (DART1; prmt1)\tN\thomolog of recq (recq5)\tY\t \tArginine methyltransferase 2; Arginine\t \tn-methyltransferase 2 (DART2; prmt2)\tN\then1 (dmhen1; pimet)\tY\t \tArginine methyltransferase 3; Arginine\t \tn-methyltransferase 3 (DART3; prmt3)\tY\tEukaryotic translation initiation factor 4G\t \t(eIF4G)\tY\t \tArginine methyltransferase 4; histone-Arginine\t \tmethyltransferase carm 1 (DART4; prmt4)\tY\tEukaryotic translation initiation factor 4A\t \t(eIF4A)\tY\t \tArginine methyltransferase 6; Arginine\t \tn-methyltransferase 6 (DART6; prmt6)\tN\tEukaryotic translation initiation factor 
5\t \t(eIF5)\tY\t \tArginine methyltransferase 7; Arginine\t \tn-methyltransferase 7 (DART7; prmt7)\tY\tretrotransposon gypsy\\envelope (gypsy\\env)\tN\t \tArginine methyltransferase 8; Arginine\t \tn-methyltransferase 8 (DART8; prmt8)\tN\tjim (ovk; ovfc.k; jim)\tY\t \tArginine methyltransferase 9; Arginine\t \tn-methyltransferase 9 (DART9; prmt9)\tN\tzelda; vielfaltig (vfl; zld)\tN\t \tabsent, small, or homeotic discs 1 (ash-1; ash;\t \tdash)\tY\tFcp1 RNA polymerase II CTD phosphatase; CG12252\t \t(fcp1)\tY\t \tbj1 protein; homolog of regulator of chromatin\t \tcondensation 1 (rangef; rcc1 )\tY\t \t \t \t\r\n\r\nGenes identified mainly from the Drosophila melanogaster\nliterature involved in regulation of chromatin structure during\noogenesis, DNA replication, general transcription and maternal\nregulation of zygotic transcription in general. Presence (Y) or\nabsence (N) of orthologous transcripts in the Pararge\naegeria transcriptome is indicated.\r\n\r\nGenes influencing the cell cycle: regulators of mitosis and meiosis\r\n\r\nA large number of genes that regulate mitosis have been studied in a reproductive context in D. melanogaster. These genes are involved not only in stem cell maintenance and differentiation in the germarium, but also in endocycling in nurse cells and in the selective amplification of genes (such as chorion genes) important in oocyte production (further references in Additional file 1). As before, the genes that were not expressed by P. aegeria in a mitotic context either seemed to have no clear insect orthologs outside Drosophila, or have no orthologs reported in Lepidoptera such as B. mori (Table 16). Among these are dacapo (dap), matrimony (mtrm), microcephalin (MCPH1) and chiffon (chif) (Additional file 1). The full list of genes in Table 16 contains a large number of cyclins, which regulate cyclin-dependent kinases (CDKs). Orthologs of two common cyclins could not be found in the P. 
aegeria transcriptome: cyclin E and cyclin J (see the discussion on choriogenesis elsewhere in this paper).\r\n\r\nCell cycle regulation during mitosis and meiosis\r\n\r\n\t \tarchipelago; WD repeat domain containing 7\t \t(ago)\tN\tmyb transforming protein; similar to CG6905\t \t(mybtp)\tY\t \tdacapo (dap)\tN\tpitchoune (pit)\tY\t \tcoiled coil domain containing protein 25\t \t(ccdc25)\tY\trad51(−like); spindle A (rad51;\t \tspna)\tY\t \tbreast cancer 2, early onset homolog\t \t(brca2)\tY\ttribbles (trbl)\tY\t \tchiffon (chif)\tN\tfizzy; cdc20 (fzy; cdc20)\tY\t \tcyclin-dependent kinase 1; cell division cycle 2\t \t(cdk1; cdc2)\tY\tmeiotic 41 (which is the Drosophila atm/atr\t \thomolog) (mei-41; fs(1)m37)\tN\t \tcyclin-dependent kinase 2 (cdk2)\tY\tmeiotic from via Salaria 332 (mei-S332)\tN\t \tcyclin-dependent kinase 4 (cdk4)\tY\tmei-4 (Forkhead domain containing)\t \t(mei4)\tY\t \tcyclin-dependent kinase 5 (cdk5)\tY\tmei-W68 (mei-W68)\tN\t \tcyclin-dependent kinase 7 (cdk7; mo15)\tY\tcortex (cort)\tY\t \tcyclin-dependent kinase 8 (cdk8)\tY\tgrauzone (grau)\tN\t \tcyclin-dependent kinase 9 (cdk9)\tY\tCG1647; zinc-finger protein (CG1647)\tY\t \tcyclin-dependent kinase 10 homolog; cdc2-related\t \tkinase (cdk10)\tY\tbtk family kinase at 29a (btk29a; tec29a)\tY\t \tcyclin A (cycA)\tY\tmutator 2 (mu2)\tN\t \tcyclin B (cycB)\tY\tmyelin transcription factor 1 (myt1)\tN\t \tcyclin B3; l(3)l6540 (cycB3)\tY\torientation disrupter (ord)\tN\t \tcyclin C (cycC)\tY\tmei-218 (mei-218)\tN\t \tcyclin D (cycD)\tY\taltered disjunction; mps1 (a kinetochore-associated\t \tprotein kinase) (ald; mps1)\tN\t \tcyclin E (cycE)\tN\tno distributive disjunction (nod )\tN\t \tCOP9 complex homolog subunit 5 (csn5)\tY\tsarah; nebula (sra; nla)\tY\t \tCOP9 complex subunit 3 (csn3; dch3)\tY\tcalcineurin a (cana)\tY\t \tCOP9 complex subunit 4 (csn4; dch4)\tY\tcalcineurin b (canb)\tY\t \tCOP9 complex subunit 6 (csn6)\tY\tmei-38 (mei38)\tN\t \tCOP9 complex subunit 7 (csn7)\tY\tubiquitin conjugating 
enzyme E2 rad6 (ubcd6;\t \trad6)\tY\t \tCOP9 complex subunit 8 (csn8)\tY\talpha-endosulfine (endos)\tY\t \tcyclin H (cycH)\tY\tearly girl; CG17033 (elgi)\tY\t \tcyclin J (cycJ)\tN\tencore (enc)\tN\t \tcyclin K (cycK)\tY\tcullin 1 (cul1; lin19)\tY\t \tcyclin L1; CG16903 (cycL1)\tY\tcullin 2 (cul2)\tN\t \tcyclin T (cycT)\tY\tcullin 4 (a and b) (cul4)\tY\t \tcyclin fold protein; cyclin Y (cycfp;\t \tcycY)\tY\tdouble parked (dup)\tY\t \tcyclin M2 (cycM2; cnnM2)\tY\tcullin 5 (cul5)\tY\t \tcyclin-dependent kinase subunit 30a\t \t(cks30a)\tY\tgustavus; Bombyx sequence BHIBMGA008896-PA\t \thomologous to spry domain-containing socs box protein 4\t \t(ssb4) (gus; ssb4)\tY\t \tcyclin-dependent kinase subunit 85a\t \t(cks85a)\tY\tubiquitin conjugating enzyme 2; l(2)k13206\t \t(ubcd2)\tY\t \tdiminutive; dmyc (dm)\tY\tubiquitin conjugating enzyme e2 d4 (ubcd4)\tY\t \te2f1 (e2f1)\tY\torigin recognition complex subunit 1\t \t(ORC1)\tY\t \te2f5 (e2f5)\tN\torigin recognition complex subunit 2; l(3)88ab\t \t(ORC2)\tY\t \tdp; e2f dimerization partner 2 (dp; tfdp2)\tY\torigin recognition complex subunit 5; l(2)34df\t \t(ORC5)\tY\t \tsin3a (sin3a)\tY\tachintya (zaa)\tY\t \tgeminin (geminin)\tY\tvismay (vis)\tN\t \tmatrimony (mtrm; d52)\tN\tminichromosome maintenance 2 protein\t \t(mcm2)\tY\t \timaginal discs arrested (ida)\tN\tretinoblastoma-family protein 1 (rbf1;\t \trb1)\tN\t \ttwine (twe)\tN\tgrapes; serine/threonine-protein kinase chk1\t \t(chk1; lemp; grp)\tN\t \tstring; cdc25 phosphatase (stg)\tN\tmissing oocyte (mio)\tN\t \tmicrocephalin (MCPH1)\tN\tmegator (mtor)\tY\t \tinducer of meiosis 4; mta70 homologue\t \t(ime4)\tY\tnucleoporin 44a; similar to sec13-like\t \tprotein (seh1; nup44a)\tY\t \tgreatwall; mast-like (gwl)\tY\tnucleoporin 154; tulipano (nup154; zk; nup32d;\t \ttlp)\tY\t \tpolo (kinase); l(3)01673 (polo)\tY\tkinesin-like protein ncd; non-claret disjunctional;\t \tclaret segregational (ncd)\tY\t \tloki; checkpoint kinase 2 (lok; chk2)\tY\tkinesin-13 motor; 
kinesin-like protein 10a; kinesin-like\t \tprotein a (in Bombyx mori)(klp10a;\t \tklpa)\tY\t \talways early; a lin9 homolog\t \t(aly)\tY\tsimilar to Bombyx mori kinesin-like protein b\t \t(klpb)\tY\t \tpavarotti; kinesin family member 23 (kif23;\t \tpav)\tY\tcrossover suppressor on 2 of Manheim (mei-910;\t \tc(2)M)\tN\t \tmorula (anaphase-promoting complex subunit)\t \t(mr)\tY\tcrossover suppressor on 3 of Gowen (c(3)G)\tN\t \tproliferating cell nuclear antigen (mutagen-sensitive\t \t209) (mus209; pcna)\tY\tcorona (cona)\tN\t \tmutagen-sensitive 304 (atrip; mus304)\tN\tnipped-B (nipped-B)\tY\t \tmyb oncogene-like (myb)\tY\tpch2 (pch2)\tN\t \tthe myb-muvb complex subunit lin-52\t \t(lin-52)\tY\tGuanylate kinase-associated protein mars; hurp\t \t(hurp; dhrp/Gkap; mars)\tY\t \t\r\n\r\nGenes identified mainly from the Drosophila melanogaster\nliterature that influence the cell cycle - regulators of mitosis\n(e.g. endocycling and selective amplification of chorion genes) and\nmeiosis. Presence (Y) or absence (N) of orthologous transcripts in\nthe Pararge aegeria transcriptome is indicated.\r\n\r\nIn the majority of metazoans, the oocyte cell cycle becomes arrested in meiotic prophase I. This arrest is initiated during the first stages of oogenesis in region 2 of the D. melanogaster germarium. Intriguingly, the gene bruno is not only essential in regulating the translation of a number of genes during oocyte differentiation, but also appears to be involved in silencing Cdk1 activity in order to achieve primary meiotic arrest. It should be noted that oocyte AP and DV polarity is established during primary meiotic arrest, and this arrest is broken only once the oocyte is properly patterned by stage 14. As indicated before, bruno was expressed by P. aegeria females (Table 9).\r\n\r\nMeiosis during butterfly and moth oogenesis is characterised by the absence of both crossing over and chiasma formation. 
Cytological studies have established that female Lepidoptera may form synaptonemal complexes (SC) in early meiotic prophase I, but no recombination nodules (RN) are formed subsequently. Instead, a structure called elimination chromatin is formed. Usually, chiasmata are formed from retained pieces of the SC in which an RN is, or has been, present. The formation of the chiasmata takes place in the cell destined to become the oocyte in the D. melanogaster germarium. Four genes appear essential in D. melanogaster for SC formation and thus possibly chiasmata formation: crossover suppressor on 2 of Manheim (c(2)M), crossover suppressor on 3 of Gowen (c(3)G), corona (cona) and nipped-B (references in Additional file 1). No genes specific for RN alone could be identified on FlyBase. Pararge aegeria females only express nipped-B (Table 16 and Additional file 1), which is involved in a number of cellular processes in D. melanogaster, including mitosis. It is also the only one of the four SC genes for which orthologs outside Drosophila can be identified. Rather interestingly, a large proportion of the genes involved in D. melanogaster meiotic chromosome cohesion and segregation also appeared to be Drosophila or Diptera specific and were not identified in the P. aegeria transcriptome. These include grauzone (grau), corona (cona), orientation disrupter (ord) and mei-S332 (Table 16; references in Additional file 1). A number of genes are, however, highly conserved, and orthologs have been found in Lepidoptera, as males do display crossing over. These include mei-W68 and mei-218 and, in particular, the essential meiotic checkpoint gene pch2 (references in Additional file 1). Female P. aegeria did not express any of these genes (Table 16 and Additional file 1). The P. 
aegeria oogenesis transcriptome described here is thus in accordance with the previous observations made during cytological studies on female Lepidoptera.\r\n\r\nVitellogenesis and lipid storage\r\n\r\nNot only is cell cycle regulation coordinated with oocyte differentiation in D. melanogaster, but also with resource provisioning of the oocyte. The gene greatwall (gwl), for example, is both essential in D. melanogaster for maternal provisioning of the egg during vitellogenesis and to ensure secondary meiotic arrest by stage 14 of oogenesis in metaphase I. It is a highly conserved gene in Metazoa and P. aegeria females did express this gene during oogenesis (Table 16 and Additional file 1). Furthermore, gwl (antagonistically) interacts with polo kinase (polo) in mitotic regulation particularly during early embryogenesis, and is maternally provided (references in Additional file 1). Transcripts of both were detected in P. aegeria oocytes (Table 16; Additional files 1 and 2).\r\n\r\nVitellogenesis during insect oogenesis is characterised by the accumulation in the developing oocytes of large lipid transfer proteins (LLTPs; i.e. yolk protein precursors), such as Vitellogenin (Vtg/Vg) and Apolipophorins (ApoLPs). Predominantly, LLTPs are produced in the fat bodies and secreted into the hemolymph, but not all yolk proteins are extraovarian. Follicle cells not only allow extraovarian yolk protein to reach the oocytes, they also produce significant amounts of LLTPs themselves in a number of insect species, including D. melanogaster. Vitellogenic behaviour of follicle cells is under hormonal control. LLTPs are transported into the oocytes via clathrin-dependent endocytosis mediated by the receptors VgR (in D. melanogaster Yolkless, Yl) and LpR. Nurse cells transport yl/VgR RNA into previtellogenic oocytes, thus preparing the oocyte for Vtg uptake. 
Pararge aegeria females expressed not only Vtg/Vg, apoLp-III, apoLp, their receptors yl/VgR and LpR, but also the genes described in D. melanogaster vitellogenic endocytosis (references in Additional file 1). These genes include clathrin heavy and light chain (chc and clc), sec5, sec6, garnet (G) and jagunal (jagn) (Figure 4 qPCR results; Tables 2 and 17; further references in Additional file 1).\r\n\r\nReproductive physiology and vitellogenesis\r\n\r\n\t \tapolipophorin-III (apoLp-III)\tY\thomologous to Bombyx juvenile hormone epoxide\t \thydrolase-like protein 3 (jheh-lp3)\tY\t \tapolipophorin precursor; Drosophila CG11064\t \t(apoLp; apolp1/2)\tY\thomologous to Bombyx juvenile hormone epoxide\t \thydrolase-like protein 5 (jheh-lp5)\tY\t \tlipophorin receptor (Lpr1/2)\tY\tjuvenile hormone binding protein; homologous to\t \tDrosophila CG1532 (JHbp)\tY\t \tarylphorin (subunit beta); sex-specific storage-protein\t \t2 (hex2; sp2)\tY\tjuvenile hormone binding protein (hemolymph)\t \t(hJHbp)\tY\t \tvitellogenin (protein cleaved into vitellin light\t \tchain (vl), vitellin light chain rare isoform, vitellin\t \theavy chain rare isoform and vitellin heavy chain (vh))\t \t(Vg; Vtg)\tY\tcytosolic juvenile hormone binding protein 36 KDa\t \tsubunit (cJHbp)\tY\t \tvitellogenin receptor; yolkless (yl; VgR)\tY\ttakeout (to)\tY\t \tspherulin-2a (similar to Plodia interpunctella\t \typ4)(yp4)\tY\tsimilar to niemann-pick type c-2; ecdysteroid-regulated\t \t16 kDa protein precursor (npc2a; esr16)\tY\t \tchico (chico; IRS)\tY\tecdysone-induced protein 63e (Eip63E;\t \tcdc2-63E)\tN\t \tBombyxin genes(bbxA1; bbxA3)\tY\tsimilar to sgt1 protein homolog ecdysoneless\t \t(ecd)\tY\t \tinsulin-like receptor (InR)\tY\tcytochrome p450 (E-class, group I) protein\t \tdisembodied (dib; cyp302a1)\tN\t \tribosomal protein l10a (rpl10ab )\tY\thalfway; singed wings (hfw; swi)\tY\t \t60s ribosomal protein l10; qm protein homolog\t \t(qm)\tY\tclathrin light chain (chc)\tY\t \tstring of pearls; 
ribosomal protein s2 (sop;\t \trp2)\tY\tclathrin heavy chain (clc)\tY\t \tresistance to juvenile hormone; methoprene-tolerant\t \t(met)\tY\tced-6 (ced-6)\tY\t \tultraspiracle; rxr type hormone receptor (usp;\t \tcf1)\tY\twnt receptor l(2)43Ea boca (boca)\tY\t \tecdysone receptor (EcR)\tY\tjagunal (jagn)\tY\t \tstart1 (start1)\tY\texocyst complex component sec5 (sec5)\tY\t \tdefective in the avoidance of repellents dare;\t \tadrenodoxin reductase (dare)\tY\texocyst complex component sec6 (sec6)\tY\t \tecdysone-induced protein 74 (E74)\tN\tprotein phosphatase 2a regulatory subunit b’;\t \twiderborst (wdb; PP2Ab’)\tY\t \tecdysone-induced protein 75b (75a,b,c and d)\t \t(E75)\tY\tprotein phosphatase 2a regulatory subunit b 55kDa;\t \ttwins (PP2Ab55kDa)\tY\t \thomologous to Bombyx mori c-cbl-associated protein (cap)\t \ttranscript variant a (bmcap-a)\tY\tprotein phosphatase 2a regulatory subunit b gamma\t \t(PP2Agamma)\tY\t \tfollicle specific protein (fsp-I)\tN\tprotein phosphatase 2a regulatory subunit a (65\t \tkDa); homologous to Drosophila protein\t \tphosphatase 2a at 29b (PP2Aa)\tY\t \tsimilar to Bombyx mori egg-specific protein\t \t(LOC693022) (ESP)\tN\tmicrotubule star; protein phosphatase 2a catalytic\t \tsubunit c (mts; PP2Ac)\tY\t \tcalmodulin (cam)\tY\tlipid storage droplet 1; perilipin 1 (lsd1;\t \tplin-1; plin1)\tY\t \tcalmodulin-binding protein (striatin); weak\t \thomology to CG7392 (striatin)\tY\tlipid storage droplet 2 (lsd2)\tY\t \tcalmodulin dependent protein kinase (camk)\tY\tlipase-1 (lip-1)\tY\t \thormone receptor 3; Drosophila hormone receptor-like in\t \t46 (hr3; hr46)\tY\tserine/threonine protein kinase akt (akt;\t \takt1)\tY\t \thepatocyte nuclear factor 4 isoform a\t \t(hnf-4a)\tY\tliquid facets-related (lqfr)\tY\t \thepatocyte nuclear factor 4 isoform b\t \t(hnf-4b)\tY\tliquid facets (lqf)\tY\t \tjuvenile hormone esterase (jhe)\tN\tgarnet (g)\tY\t \tjuvenile hormone esterase binding protein; weak\t \thomology to Drosophila CG3776 (JHEbp;\t 
\tDmP29)\tY\tcationic amino acid transporter; slimfast\t \t(slif)\tY\t \tjuvenile hormone epoxide hydrolase (JHEH)\tY\tornithine decarboxylase (odc)\tY\t \thomologous to Bombyx juvenile hormone epoxide\t \thydrolase-like protein 1 (jheh-lp1)\tY\tornithine decarboxylase antizyme; gutfeeling\t \t(guf; Oda; az)\tY\t \t\r\n\r\nGenes identified mainly from the Drosophila melanogaster and\nBombyx mori literature involved in vitellogenesis,\nlipid storage, ovarian maturation and hormonal regulation of\noogenesis. Presence (Y) or absence (N) of orthologous transcripts in\nthe Pararge aegeria transcriptome is indicated.\r\n\r\nThe major yolk proteins, such as vitellogenins, share sequence similarities with lipases. Although not catalytically active, the vitellogenin region with sequence similarity to lipases is argued to be involved in steroid hormone binding, thus providing a possibility for a direct interaction with the hormones that regulate their production. For example, maternal ecdysteroids are bound as ecdysteroid-phosphates to the Vtg cleaved product Vitellin (Vn) in yolk granules in B. mori and released as ecdysteroids during yolk uptake in the embryo as a result of dephosphorylation by ecdysteroid-phosphate phosphatase (EPPase). Pararge aegeria did express EPPase (Table 18). Furthermore, a significant component of yolk in a B. mori egg is the ovarian egg-specific protein ESP, a minor yolk protein. The gene encoding ESP is intriguing, as convincing orthologs for minor yolk proteins outside the moths Galleria mellonella (yolk protein/yolk polypeptide 2) and Samia cynthia (ESP) had not been found. More recently, however, a further two sequences with strong sequence similarity to G. mellonella yolk protein 2 have been discovered in D. plexippus and Plodia interpunctella, whilst ESP does show significant sequence similarity with genes encoding the KK-42 binding proteins in Antheraea moth species (Additional file 9). 
Sharing the same ABhydrolase lipase region, the KK-42 binding proteins and the minor yolk proteins also show strong sequence similarity to lipases identified in species such as D. melanogaster, in particular lipase-1 and lipase-3 (lip-1 and lip-3). Lepidoptera may have evolved to use paralogs of these genes in yolk formation. Rather interestingly, although not functioning as a yolk protein, lip-1, but not lip-3, is expressed in vitellogenic follicles in D. melanogaster. An ortholog of lip-1, and possibly of lip-3 (very short partial contig), was expressed by P. aegeria, whilst no clear ortholog of a minor yolk protein was found (Table 17; Additional files 2 and 9).\r\n\r\nYolk consumption\r\n\r\n\t \tcathepsin l-like cysteine protease; Bombyx cysteine\t \tprotease; cysteine proteinase-1 (bcp; cl;\t \tcp1)\tY\tvacuolar proton atpase; vacuolar h+ atpase subunit\t \t100–2 (vha100-2)\tY\t \tcathepsin b; cathepsin b-like cysteine proteinase\t \t(catb)\tY\th+ transporting atpase v0 subunit d; vacuolar h+ atpase\t \tsubunit ac39-1 (vhaac39-1)\tY\t \tcathepsin d; aspartic protease (catd)\tY\tvacuolar atp synthase subunit d; vacuolar h+ atpase\t \tsubunit 36–1 (mvd; vha36-1)\tY\t \tcathepsin f-like cysteine protease; CG12163\t \t(catf)\tY\tCG7899; acid phosphatase 1 (acph-1; ap)\tN\t \tecdysteroid-phosphate phosphatase (EPPase)\tY\tprimo-1; acid phosphatase isoenzyme\t \t(primo-1)\tY\t \tvacuolar proton atpase; vacuolar h+ atpase subunit\t \t100–1 (mva; v100; vha100-1)\tY\t \t \t \t\r\n\r\nMaternal effect genes, identified mainly from the Drosophila\nmelanogaster and Bombyx mori literature, involved\nin facilitating yolk consumption by the developing embryos. Presence\n(Y) or absence (N) of orthologous transcripts in the Pararge\naegeria transcriptome is indicated.\r\n\r\nAmong the most highly transcribed genes in P. aegeria ovarioles is an ortholog of the slime mold Physarum polycephalum gene spherulin-2A. No transcripts were found for this gene in eggs (Table 2 and Additional file 2). 
Lepidopteran orthologs of the protein encoded by this gene have been shown to function as a subunit (Yp4) of the follicular epithelium yolk protein produced by follicle cells.\r\n\r\nYolk is a food source for the developing embryo and a number of genes encoding Cathepsins and Vacuolar Proton ATP-ases are maternally expressed during oogenesis to facilitate yolk uptake in the embryos (references in Additional file 1). Pararge aegeria females were found to express all described yolk uptake genes, with the exception of the acid phosphatase 1 gene (acph-1) (Table 18 and Additional file 1).\r\n\r\nPhysiology of oogenesis\r\n\r\nReproductive output depends on female nutritional status, which not only affects the rate and duration of oogenesis significantly, but also determines whether previtellogenic egg chambers will enter the vitellogenic stage or apoptose. Two signalling systems are involved: insulin signalling and hormone signalling. In D. melanogaster, for example, absence of the insulin receptor substrate (IRS) Chico precludes vitellogenesis, whilst a sharp increase in 20-hydroxy-ecdysone (20E) relative to juvenile hormone (JH) results in apoptosis of the egg chamber before vitellogenesis is initiated or completed. Although the two signalling systems operate simultaneously and interact, both have been shown to be able to independently terminate egg chamber progression before vitellogenesis takes place in D. melanogaster. Furthermore, the Lepidoptera express a set of unique genes encoding insulin-like peptides, the Bombyxins (Bbx). The bbx genes are expressed predominantly in the brain, but some may also be expressed in ovaries. Moths, in particular B. mori, possess a large number of bbx-like genes in their genome, but the genome of the butterfly D. plexippus appears to have only three such genes. Orthologs of two of these three (bbxA1-like and bbxA3-like) were transcribed in P. 
aegeria ovarioles, whilst a third partial IRS transcript showed more sequence similarity to chico than to any bbx-like gene (Table 17 and Additional file 1). The insulin-like receptor (InR) was also expressed by P. aegeria during oogenesis (Table 17 and Additional file 1). Furthermore, P. aegeria expressed a large number of downstream target genes of insulin signalling including genes encoding the serine/threonine protein kinase Akt, the various protein phosphatase 2A subunits (PP2A, e.g. Widerborst) and the lipid storage droplet proteins 1 and 2 (Lsd1 and Lsd2). Please refer to Table 17 and references in Additional file 1 for additional details.\r\n\r\nApart from nutritional status, environmental factors such as temperature can affect hormone concentrations, providing a possibility for environmental control of reproductive output. The interplay between 20E and JH is dynamic and complex, as both 20E and JH also play a role in regulating choriogenesis. Both hormones have a range of pleiotropic effects during oogenesis and their exact developmental role is not only titre related, but also dependent on the dynamic spatio-temporal expression patterns of the receptors and modulators of hormone signalling.\r\n\r\nThere has been extensive investigation of JH signalling, but the signal transduction pathway, including the JH receptor, remains poorly understood. The most likely candidate gene for the JH receptor proposed to date is the basic helix–loop–helix (bHLH)/Per-Arnt-Sim (PAS) domain gene methoprene-tolerant (met). It may form a homodimer, or possibly may form a JH-dependent transcriptionally active complex with another member of the bHLH-PAS family. The most likely candidate for the complex is the steroid co-activator NCoA-1/p160 FISC, encoded by the gene taiman (tai) in D. melanogaster. 
The tai gene was originally discovered as a gene that was expressed in follicle cells in the functional context of border cell migration and was described as an ecdysone co-receptor (Table 6; references in Additional file 1). Pararge aegeria females expressed both met and tai (Tables 6 and 17 and S2; contigs for tai PACG7006 and PACG13674 in Additional file 2). An ortholog for tai (UNIPROT: G6DPV9) can also be found in the genome of D. plexippus.\r\n\r\nNot much is known about which genes are transcriptionally regulated by the JH-activated receptor complex. The gene kruppel-homolog 1 (krh1) has been described as a JH response gene, inhibiting 20E-induced broad (br) expression in D. melanogaster, but not in the specific context of oogenesis. Both krh1 and br were expressed by P. aegeria females (Additional file 1). Furthermore, JH may either directly or indirectly upregulate ornithine decarboxylase (odc), which regulates polyamine biosynthesis and appears to be essential for vitellogenesis. Both odc and its antagonist gutfeeling (oda), also a mitotic cell-cycle regulator, were expressed in P. aegeria. Maternal transcripts of odc and oda were found in eggs (Figure 4 qPCR results; Table 17, Additional files 1 and 2).\r\n\r\nIn order to regulate the precise amount of JH in both hemolymph and organs, two sets of enzymes are involved in JH degradation: the JH epoxide hydrolases (JHEHs) and the JH esterases (JHEs). JHEs function predominantly in the hemolymph and degradation is reversible, whilst JHEHs regulate the amount of JH in organs and degradation is irreversible. Apart from JHEH, five recently discovered JHEH-like protein genes have been characterised in B. mori. In addition to JHEH, P. aegeria expressed orthologs of three of these: jheh-lp1, jheh-lp3 and jheh-lp5 (Table 17 and Additional file 1). With the exception of jheh-lp5, moderate amounts of transcripts of JHEHs were found in the eggs (Additional file 2). 
The females did not express a clear ortholog of jhe, but did express an ortholog of a gene encoding an intracellular binding protein of JHE presumed to be involved in its transport (JHEbp or DmP29, Drosophila mitochondrial protein 29, Table 17). Significant amounts of maternal JHEbp transcripts were found in P. aegeria eggs (Additional file 2).\r\n\r\nJuvenile hormone itself may be bound by JH binding proteins (JHbp) to enable immobilisation, regulate degradation or enable transport. Four complete JHbp CDSs were identified in P. aegeria ovaries; JHbp, cytosolic JHbp (cJHbp), hemolymph JHbp (hJHbp) and a sequence showing strong orthology to takeout (to) identified in D. melanogaster as involved in JH binding (Table 17). Transcripts of both cJHbp and to were transferred to the eggs by P. aegeria (Additional file 2). Given that JH itself can be transferred maternally into eggs in Lepidoptera, it has been argued that JH binding proteins such as cJHbp will protect the developing embryo against the teratogenic effects of any excess JH transferred from the mother.\r\n\r\nThere is a significant amount of life-history variation among insects and consequently in the relative importance of 20E and JH on oogenesis, even within Lepidoptera. Lepidoptera have been categorised into four (physiological) groups based on the hormones used to initiate vitellogenesis, choriogenesis and thus the timing of mature egg production. Nymphalids, like P. aegeria, have been argued to best match the criteria for group 4 where JH is the essential gonadotropic hormone. Juvenile hormone in this group is necessary for: a) synthesis of Vtg in the fat body and possibly the ovary (results supporting the latter in this study); b) inducing patency of ovarioles; c) uptake of Vtg by the oocyte (follicle cells deform to facilitate this uptake and this deformation is under JH control) and d) choriogenesis by the follicle cells. 
Whilst 20E modulates JH signalling in Nymphalids, it plays a more significant role in vitellogenesis and choriogenesis regulation in B. mori and D. melanogaster.\r\n\r\nEcdysone signalling, including its target genes, is in general better understood than JH signalling. Bombyx mori appears to be capable of producing ecdysteroids in the ovaries, as does D. melanogaster. Drosophila melanogaster expresses start1 during oogenesis in significant amounts in nurse cells, most likely in response to ecdysone signalling. The cholesterol transporter Start1 may in turn facilitate ecdysteroid production from cholesterol-based precursors. Another gene expressed in the nurse cells essential during D. melanogaster cholesterol conversion in the ovaries is defective in the avoidance of repellents (dare), which encodes an Adrenodoxin reductase. Furthermore, in D. melanogaster the SGT1 protein homolog ecdysoneless (ecd) and disembodied (dib) have been described as essential for ecdysone, both for functionality and its production in the ovaries. Maternal transcripts of D. melanogaster start1 are hypothesised to be deposited into the egg to facilitate ecdysteroid signalling in the developing embryo. Rather intriguingly P. aegeria females did not express dib, but did express ecd, start1, and dare. We observed the transfer of transcripts of all three genes into the oocytes (Table 17 and Additional file 2). Start1 has been implicated in ecdysteroid synthesis in the prothoracic gland in B. mori. Further investigation is needed to determine whether ecdysteroids can be produced in P. aegeria ovaries and if the transfer of maternal start1 and dare transcripts is involved in ecdysteroid signalling in early embryos. In common with the majority of insects, P. aegeria females did express ecdysone receptor (EcR) and its partner ultraspiracle (usp; labelled chorion factor 1 (cf1) in B. mori) in the ovaries (Table 17). Although JH may be the gonadotropic hormone in P. 
aegeria, it is clear from the expression results presented here that 20E signalling does play a significant role in vitellogenesis and that there may be maternal regulation of ecdysteroid signalling in early embryos.\r\n\r\nAmong the so-called early genes in the hierarchy of genes up-regulated in response to activation of EcR in B. mori ovaries are the orphan nuclear receptor genes hr3 and E75 (a, b, c and d), the transcription factor gene E74 and the Broad-Complex gene Br-C. The genes encoding the two receptors Hepatocyte nuclear factor 4A and 4B (HNF4A and HNF4B) are up-regulated with a delay in B. mori and their expression increases during vitellogenesis. With the exception of E74, all of these genes were expressed in P. aegeria (Tables 6, 17 and Additional file 1). In B. mori, Hr3 regulates the expression of ESP during vitellogenesis, and it regulates the expression of GATAbeta (i.e. transcription factor BCFI) during choriogenesis. As discussed before, P. aegeria females did not express ESP, but did express the related gene lip-3 (Table 17). 
Furthermore, they also expressed GATAbeta (Table 19 and Additional file 1).\r\n\r\nEggshell formation\r\n\r\n\t \tweak homology to Bombyx mori vitelline membrane\t \tassociated protein p30 (VMP30)\tY\tchorion peroxidase; peroxinectin-related protein\t \t(pxt)\tY\t \tBombyx mori vitelline membrane protein 90\t \t(VMP90)\tN\tgataβ; transcription factor BCFI\t \t(GATAβ)\tY\t \tvitelline membrane 32e (VM32e; VMP32e)\tN\tchorion transcription factor cf2 (cf2)\tY\t \tvitelline membrane 26a (VM26a)\tN\tchorion b-ZIP transcription factor (CbZ)\tY\t \tvitelline membrane 26b (VM26b)\tN\tchorion protein 15 (Drosophila melanogaster);\t \tCG6519 (cp15; s15)\tN\t \tvitelline membrane 26ac (VM26Ac; tu-3)\tN\tchorion protein 16 (Drosophila melanogaster);\t \tCG6533 (cp16; s16)\tN\t \tvitelline membrane 34ca (VM34c)\tN\tchorion protein 18 (Drosophila melanogaster);\t \tCG6517 (cp18; s18)\tN\t \tfemcoat (femcoat)\tN\tchorion protein 19 (Drosophila melanogaster);\t \tCG6524 (cp19; s19)\tN\t \tfollicle cell protein 26Aa; palisade (psd;\t \tfcp26Aa; tu-1)\tN\tchorion protein 36 (Drosophila melanogaster);\t \tCG1478 (cp36; s36)\tN\t \tcad99c (cad99c; ca-10)\tY\tchorion protein 38 (Drosophila melanogaster);\t \tCG11213 (cp38; s38)\tN\t \tcrinkled; myosin-VIIa (ck; myoVIIa)\tY\tchorion protein a at 7f (Drosophila melanogaster);\t \tCG33962 (cp7fa)\tN\t \tvitelline membrane like (vml)\tN\tchorion protein b at 7f (Drosophila melanogaster);\t \tCG15350 (cp7fb)\tN\t \thigh mobility group protein a (HMGa)\tY\tchorion protein c at 7f (Drosophila melanogaster);\t \tCG15351 (cp7fc)\tN\t \tegg protein 80 (EP80)\tY\tdefective chorion 1 (dec1)\tN\t \tfollicle cell protein 3c (fcp3c)\tY\tLepidopteran chorion genes (see Additional file\t \t9)\tY\t \t\r\n\r\nGenes identified mainly from the Drosophila melanogaster and\nBombyx mori literature involved in eggshell formation;\nvitelline membrane formation and choriogenesis. 
Presence (Y) or\nabsence (N) of orthologous transcripts in the Pararge\naegeria transcriptome is indicated. See Additional file\n9 for details on lepidopteran chorion\ngenes.\r\n\r\nVitelline membrane formation and choriogenesis\r\n\r\nVitellogenesis and choriogenesis are carefully coordinated, primarily by hormone signalling. The vitelline membrane (i.e. the inner eggshell layer) is formed halfway through vitellogenesis, for which RTK signalling is necessary as discussed elsewhere in this paper. The formation of the vitelline membrane is of significance in maternal regulation of embryonic AP and DV patterning, as some maternal factors become localised in the perivitelline space in D. melanogaster and interact with localised factors inside the oocyte. This also appears to be the case in B. mori, although the genes involved remain uncharacterised. As discussed before, Ndl protein (also tellingly called ovarian serine protease in B. mori) is expressed in all follicle cells and is essential for DV patterning of the embryo in D. melanogaster. Ndl is an unusual protein in that not only is its structure reminiscent of an extracellular matrix protein, but it also has a catalytically active serine protease domain. As such, it is involved in vitelline membrane formation and also acts as the basis of the ventral serine protease cascade that is essential for the maternally regulated DV patterning of the D. melanogaster embryo. Pararge aegeria females expressed ndl and, as in D. melanogaster, no transcripts were found in the oocyte (Table 6 and Additional file 2). It remains to be seen whether Ndl plays a similar dual role in P. aegeria.\r\n\r\nInsect vitelline membrane protein (VMP) genes show tremendous sequence diversity. For example, no clear orthologs can be found for D. melanogaster VMP genes outside the genus Drosophila. 
The best-characterised VMP gene in Lepidoptera is VMP30, for which orthologs can be found in both moths and butterflies and which was also expressed in P. aegeria ovarioles. Once again, no transcripts were found in the oocyte (Table 19 and Additional file 2).\r\n\r\nAfter the follicle cells have secreted proteins to form the vitelline membrane, endocycling takes place in D. melanogaster and clusters of chorion genes are selectively amplified or expressed at very high levels. Perhaps rather surprisingly, P. aegeria did not express an ortholog of G1/S specific cycE, which in D. melanogaster is essential for chorion gene amplification and endocycling in general (Table 16; further references in Additional file 1). There is a possibility that Lepidoptera do not selectively amplify the chorion genes prior to the onset of choriogenesis, as no evidence was found for this in B. mori. However, nurse cells do become polyploid during B. mori oogenesis. Pararge aegeria females did express the G1/S specific genes cycC and cycD, as well as the S-phase regulators E2f1 and dp (Table 16; further references in Additional file 1).\r\n\r\nChoriogenesis as a whole is coordinated by genes such as chorion peroxidase (pxt) in D. melanogaster, which was also expressed by P. aegeria (Table 19). Furthermore, apart from the aforementioned GATAbeta, a number of specific transcription factors are involved in the critical regulation of the spatio-temporal expression patterns of the various chorion genes in the later stages of oogenesis in Lepidoptera. All chorion genes in B. mori have multiple cis-regulatory binding sites for CCAAT/enhancer binding protein (C/EBP) transcription factors and their expression levels are C/EBP concentration dependent. The D. melanogaster ortholog of C/EBP is slbo, which is also expressed in follicle cells though predominantly involved in border cell migration (references in Additional file 1). High mobility group protein A (HMGA) is essential for B. 
mori choriogenesis as it induces chorion gene promoter bending and recruits C/EBP and GATAbeta. Pararge aegeria expressed C/EBP (i.e. slbo), its negative regulator tribbles (trbl) and HMGa (Tables 6, 16 and 19), but it is not known in which functional context slbo is used. Another transcription factor for which cis-regulatory binding sites have been identified for chorion genes, in both D. melanogaster and B. mori, is the C2H2 zinc finger protein Chorion factor 2 (Cf2). Furthermore, a chorion-specific b-ZIP transcription factor (CbZ) has been described in B. mori and orthologs can be found in butterfly genomes, such as that of D. plexippus. However, the exact function of CbZ during choriogenesis has not been characterised. Both cf2 and CbZ were transcribed by P. aegeria, with transcripts of the latter rather intriguingly found to be present in the oocyte (Figure 4 qPCR results; Table 19).\r\n\r\nChorion protein (cp) genes evolve possibly even faster than vitelline membrane protein genes, and sequence similarity between D. melanogaster cp genes and those identified in Lepidoptera, including P. aegeria, is very low indeed (Table 19; further references in Additional file 1). The infraorder Heteroneura, to which B. mori and butterflies belong, possesses unique helicoidal lamellar chorions, which may provide additional strength. Furthermore, the two species for which chorion genes have been characterised and studied in some detail, Lymantria dispar and B. mori, have an extensively derived chorion in which the helicoidal lamellar framework is modified by expansion and densification. Expression patterns of these chorion genes are also dynamically very complex. Gene families in Lepidoptera encoding the structural chorion proteins are characterised by numerous gene duplications, occasional subsequent gene loss, gene conversion, and in general rapid sequence divergence. 
As a result, determining orthology between individual chorion genes of different species is very difficult and chorion protein phylogenetic trees are characterised by species-specific clusters (i.e. families) of genes. Automatic annotation of butterfly chorion genes in the D. plexippus genome and from our P. aegeria ovarian transcriptome was performed on the basis of the most significant BLAST hit to available moth chorion gene sequences (Additional file 2 and Table 19). It is very doubtful, however, that true orthology has been uncovered in this way, as chorion genes within a species tend to be more similar to each other than to those found in other species. The phylogenetic tree of Lepidopteran chorion genes in Additional file 9 shows distinct clustering between moths and butterflies for each of the chorion gene families. Pararge aegeria chorion genes were highly transcribed during oogenesis (Table 2 and Additional file 1). As well as expressing these chorion gene families, Bombyx mori expresses a gene encoding egg protein 80 (BmEP80), which forms part of the eggshell and is produced by the follicle cells. An ortholog of BmEP80 was also highly transcribed during P. aegeria oogenesis (Tables 2 and 19; Additional file 1).\r\n\r\nApoptosis and autophagy\r\n\r\nProgrammed cell death is an essential process during oogenesis in D. melanogaster and B. mori, with nurse and follicle cells undergoing apoptosis as oogenesis progresses, while complete egg chambers may apoptose in response to environmentally induced hormonal signals such as starvation. Often, apoptosis and autophagy operate synergistically and are to some extent integrated in D. melanogaster ovaries, where the effector caspase Dcp-1 and the inhibitor of apoptosis protein BIR-superfamily domain protein Bruce (also called survivin in B. mori) regulate both autophagy and starvation-induced cell death. Recently, all apoptosis-related genes have been characterised in B. 
mori, and the results of the study by Zhang and co-workers showed that most of these genes are highly conserved. Furthermore, they demonstrated that a number of gene duplications have occurred in the Lepidoptera (e.g. genes encoding BIR-superfamily domain proteins). Many of the known genes involved in autophagy and apoptosis have been studied in a reproductive context in D. melanogaster (references in Additional file 1) and the majority of these were expressed during oogenesis by P. aegeria (Table 20). In particular, P. aegeria expressed buffy, three orthologs of bruce (Additional file 2) and the Lepidopteran ortholog of D. melanogaster dcp1, caspase-1 (Table 20).\r\n\r\nGrowth regulation, apoptosis and autophagy\r\n\r\n\t \tp53 (p53)\tY\tquaking related 54b; sam50 (qkr; sam50)\tY\t \tp35 (p35)\tN\theld out wings (how)\tY\t \tdeath executioner Bcl-2 homologue (debcl)\tN\tspinster (spin)\tY\t \thomologous to bruce and Bombyx bir-superfamily domain\t \tprotein - survivin-1 (bruce; survivin-1)\tY\tdeath executioner caspase related to apopain/yama;\t \tdecay; caspase 3 (decay)\tN\t \tbir-superfamily domain protein - inhibitor of apoptosis\t \t1; thread (iap1; th; diap1)\tY\tdeath caspase 1 (dcp-1)\tN\t \tbir-superfamily domain protein - inhibitor of apoptosis\t \t2 (iap2; diap2)\tY\tdeath related ced-3/nedd2-like protein; dredd/dcp-2\t \t(dredd)\tY\t \tubiquitin conjugation enzyme E2; bendless\t \t(ubc13; ben)\tY\tice; drice; caspase-1 (in Bombyx mori)\t \t(ice)\tY\t \tb-cell lymphoma protein 2 (bcl-2) protein - buffy\t \t(buffy)\tY\tdronc; nedd2-like caspase (dronc; nc)\tY\t \tautophagy-specific gene 1; serine/threonine-protein\t \tkinase unc-51 (atg1)\tY\tdynamin related protein 1 (drp1)\tY\t \tautophagy-specific gene 2 (atg2)\tY\tsimilar to optic atrophy 1-like (opa1-like)\tY\t \tautophagy-specific gene 3 (atg3; aut1)\tY\tresistance to juvenile hormone; methoprene-tolerant\t \t(met)\tY\t \tautophagy-specific gene 4 (atg4)\tY\tdeterin (det )\tN\t \tautophagy-specific 
gene 5 (atg5)\tY\ttao-1 (tao-1)\tY\t \tautophagy-specific gene 6; beclin-1 (atg6)\tY\tmelted (melt)\tN\t \tautophagy-specific gene 7 (atg7)\tY\tmidway (mdy)\tN\t \tautophagy-specific gene 8 (atg8)\tY\tpita (pita)\tY\t \tautophagy-specific gene 12 (atg12)\tY\tplenty of sh3s (posh)\tN\t \tautophagy-specific gene 13 (atg13)\tN\tphosphoinositide-dependent kinase 1 dstpk61\t \t(dstpk61)\tY\t \tphosphotidylinositol 3 kinase 59f (pi3k59f;\t \tvps34)\tY\tdream (strica; dream)\tN\t \tcell death activator-b (cide-b)\tY\ttarget of rapamycin (tor)\tY\t \tcell cycle and apoptosis regulatory protein 1\t \t(ccar1)\tY\tthor (thor)\tN\t \tlongitudinals-lacking (lola)\tY\tdeath associated molecule related to mch2; daydream\t \t(damm)\tN\t \ttranslationally controlled tumour protein\t \t(tctp)\tY\tecdysone-induced protein 28/29kD; methionine-s-sulfoxide\t \treductase (Eip28/29; Eip71CD)\tY\t \tapoptosis linked protein 2 (alg-2)\tY\tmodifier of rpr and grim, ubiquitously expressed;\t \tweak homology to ubiquitin-conjugating enzyme E2 D4\t \t(morgue)\tN\t \t\r\n\r\nGenes identified mainly from the Drosophila melanogaster\nliterature involved in regulation of growth during oogenesis\n(apoptosis, autophagy - response to starvation). Presence (Y) or\nabsence (N) of orthologous transcripts in the Pararge\naegeria transcriptome is indicated.\r\n\r\nGeneral growth regulators (including the Hippo Pathway)\r\n\r\nHippo is a highly conserved serine-threonine kinase 3-like signalling protein (also called STE20). It is essential for regulating tissue size and growth. Hippo signalling interacts with various other cellular processes in this functional context, including programmed cell death and cell cycling. Hippo signalling is, however, required in a wide variety of developmental contexts, not just tissue growth. In D. 
melanogaster oogenesis, for example, it is essential for establishing AP polarity in the oocyte as it regulates the expression of the downstream effector of Notch signalling, the gene hindsight/pebbled (hnt), which is required for posterior follicle cell maturation. Orthologs of all the Hippo signalling related genes (i.e. Hippo signalling components, as well as up- and downstream factors) that have been identified as being essential in D. melanogaster oogenesis (references in Additional file 1) were transcribed by P. aegeria, with possibly two exceptions: merlin (mer; ERM2) and mob as tumor suppressor (mats; mob1) (Table 21). Merlin/ERM2 is a member of the band 4.1 protein superfamily and is characterised by a highly conserved FERM (Four.1 protein, Ezrin, Radixin, Moesin) domain involved in crosslinking the cell membrane and the actin cytoskeleton, and is thus important in localising proteins. Pararge aegeria expressed ERM1, which in P. aegeria shows a highly significant sequence similarity to ERM2 (Table 9). In D. melanogaster ERM1 is important for Osk localisation, but clearly it cannot function in this way in P. aegeria, which lacks Osk. Likewise, P. aegeria appeared to express paralogs that are significantly similar to mob1: mob2 and mob4-like (i.e. preimplantation protein in B. mori) (Table 21). The latter is most likely the Lepidopteran ortholog of D. 
melanogaster mob1.\r\n\r\nGrowth regulation and Hippo pathway\r\n\r\n\t \tserine/threonine kinase 3-like (hippo;\t \tSTE20)(hpo)\tY\texpanded (ex)\tY\t \tsalvador (sav)\tY\tmerlin (mer; ERM2)\tN\t \twarts (wts)\tY\tkibra; CG33967 (kibra)\tY\t \tmob as tumor suppressor (mats; mob1)\tN\tyorkie; yap65-like protein (yki)\tY\t \tmob-2 (mob2)\tY\tphosphatidylinositol 4-kinase alpha\t \t(PI4kIIIalpha)\tY\t \tpreimplantation protein; mps one binder kinase\t \tactivator-like 4 (mob4-like)\tY\tbitesize; synaptotagmin-like (btsz)\tY\t \thindsight; pebbled (hnt)\tY\tpar-domain protein 1; CG17888 (pdp1)\tY\t \t\r\n\r\nGenes identified mainly from the Drosophila melanogaster\nliterature involved in regulation of growth during oogenesis\n(including the Hippo pathway). Presence (Y) or absence (N) of\northologous transcripts in the Pararge aegeria\ntranscriptome is indicated.\r\n\r\nHeat shock proteins and their control of protein abundance during oogenesis\r\n\r\nHeat shock proteins (Hsps) provide a possible mechanism for environmental control of development in ovaries and as maternal effects. The transcription of genes encoding Hsps, or molecular chaperones in general, is not only regulated in response to various environmental factors (e.g. temperature), but is also essential during many developmental processes, including oogenesis. It is thought that Hsps are important for both developmental buffering and differentiation (further references in Additional file 1). The functional contexts in which Hsps operate are incredibly varied. In D. melanogaster, for example, Hsp60C is essential in organising and maintaining cytoskeletal and cell adhesion components and thus for establishing AP and DV oocyte polarity, whilst Hsp70 affects border cell migration through its effects on the actin cytoskeleton. A large number of genes encoding Hsps and related proteins have been described in a functional context during D. 
melanogaster oogenesis (references in Additional file 1) and orthologs of all of these were transcribed in P. aegeria ovarioles, often very abundantly (e.g. heat shock protein cognate 3, hsc3) (Tables 2 and 22; Additional file 2).\r\n\r\nHeat shock proteins\r\n\r\n\t \tsimilar to heat shock factor a2 (Bombyx mori)\t \t(hsf-2a)\tY\theat shock cognate protein 70; heat shock protein\t \tcognate 3 (hsc70; hsc3; hsc70-3)\tY\t \tsimilar to heat shock factor b (Bombyx mori)\t \t(hsfb)\tY\theat shock cognate protein 70cb (hsc70cb)\tY\t \tsimilar to heat shock factor c (Bombyx mori)\t \t(hsfc)\tY\theat shock protein cognate 5 (hsc5;\t \thsp70-5)\tY\t \theat shock factor binding protein 1-like; CG5446\t \t(hsfbp1; hsbpsb)\tY\tsimilar to Bombyx mori heat shock protein 40 homolog\t \tDNAj-1 (hsp40; DNAj)\tY\t \t19.5 kDa heat shock protein (Bombyx mori)\t \t(19.5hsp)\tY\theat shock protein 60 (hsp60)\tY\t \ttrap1 ; hsp90-like (trap1)\tY\tsimilar to heat shock protein 68; heat shock protein\t \t70-like (hsp70)\tY\t \t(Bombyx mori) heat shock protein 1; similar to\t \tDrosophila lethal (2) essential for life and\t \thsp27 (hsp1)\tY\theat shock protein 83; heat shock protein 90\t \t(hsp90)\tY\t \t(Bombyx mori small heat shock protein, shsp) - heat\t \tshock protein 19.9; similar to Drosophila\t \tlethal (2) essential for life (hsp19.9)\tY\tendoplasmin; 94 kDa glucose-regulated protein;\t \tsimilar to Drosophila glycoprotein 93; heat shock\t \tprotein 90 kDa beta member 1 (gp93)\tY\t \t(Bombyx mori small heat shock protein, shsp) - heat\t \tshock protein 20.1; similar to Drosophila\t \tlethal (2) essential for life (hsp20.1)\tY\thsc70/hsp90-organising protein hop (hop)\tY\t \t(Bombyx mori small heat shock protein, shsp) - heat\t \tshock protein 20.4; similar to Drosophila\t \tlethal (2) essential for life (hsp20.4)\tY\tCG11267; heat shock 10kDa protein (CG11267)\tY\t \t(Bombyx mori small heat shock protein, shsp) - heat\t \tshock protein 20.8; similar to Drosophila\t \tlethal 
(2) essential for life (hsp20.8)\tY\tCG1416; activator of 90 kDa heat shock protein ATPase\t \thomolog; Bombyx mori bm44 (bm44)\tY\t \t(Bombyx mori small heat shock protein, shsp) - heat\t \tshock protein 23.7; similar to Drosophila\t \tlethal (2) essential for life (hsp23.7)\tY\tRNA polymerase II 140kD subunit (rpII140)\tY\t \theat shock protein 21.4 (hsp21.4)\tY\tsamui (samui)\tY\t \theat shock cognate protein 70–4; heat shock\t \tprotein cognate 4 (hsc70-4; hsc4)\tY\t \t \t \t\r\n\r\nGenes encoding heat shock proteins (in ovaries and as maternal\neffects) and their control of protein abundance during oogenesis\nidentified mainly from the Drosophila melanogaster\nliterature. Presence (Y) or absence (N) of orthologous transcripts\nin the Pararge aegeria transcriptome is indicated.\r\n\r\nRibosomal machinery needed for increased ovarian protein synthesis and early embryogenesis\r\n\r\nGenes encoding ribosomal proteins, rRNA and other proteins involved in translation (e.g. RpA1) are among the most highly transcribed genes during Metazoan oogenesis, as large amounts of the translation machinery are needed both during oogenesis and by the developing embryo. Just like Hsps, specific ribosomal proteins have been studied in a wide variety of functional contexts during D. melanogaster oogenesis and early embryogenesis (Tables 12 and 18; further references in Additional file 1). Ribosomal genes were also among the most highly transcribed in P. aegeria oogenesis (Table 2; Additional file 2).\r\n\r\nImmune defense and Wolbachia infection\r\n\r\nOrthologs of the majority of the genes identified from the literature as being involved in immune response during oogenesis were also found to be expressed by P. aegeria and present as maternal transcripts in the oocytes (Table 23; Additional files 1 and 2). 
Apart from the aforementioned Toll innate immune defense pathway, which may have been co-opted for DV patterning of the embryo (Table 13), these include a large number of genes encoding Serpins (Table 23). Drosophila melanogaster spn27A (the ortholog of which is called serpin-3 in B. mori), has been implicated in DV axis formation.\r\n\r\nImmune defense\r\n\r\n\t \themolin; p4 (p4)\tY\tMAPKK4 (mkk4; MAPKK4)\tY\t \themolin interacting protein; yippee (yip)\tY\tsimilar to Bombyx mori clip domain serine protease\t \t4; similar to manduca sexta hemolymph\t \tproteinase 17 (bmclip4)\tY\t \tyippee interacting protein 2 (yip2)\tY\tsimilar to Bombyx mori clip domain serine protease\t \t11; similar to manduca sexta serine\t \tproteinase-like protein 1 (bmclip11)\tY\t \tcecropin A (cecA)\tY\ttransferrin (tf; tsf)\tY\t \tweak homology to cecropin B (cecB)\tY\tFerritin 2 – light chain homolog\t \t(FER2-LCH)\tY\t \thomology to Bombyx serpin-1 and Drosophila\t \tspn4/42Da (srp1; spn4/42Da)\tY\tFerritin 1/3 – heavy chain homolog\t \t(FER1/3-HCH)\tY\t \thomology to Bombyx serpin-2 and Drosophila\t \tspn4/42Da (srp2; spn4/42Da)\tY\tFK506-binding protein 2; FK506-binding protein 12 (in\t \tBombyx mori) (FKBP12)\tY\t \thomology to Bombyx serpin-3 and Drosophila\t \tspn27A (srp3; spn27A)\tY\tFK506-binding protein 1 (FKBP39)\tY\t \thomology to Bombyx serpin-4 and Drosophila\t \tspn28D (srp4; spn28D)\tY\tweakly similar to refractory to sigma p\t \t(ref(2)p)\tY\t \thomology to Bombyx serpin-5 and Drosophila\t \tspn77Ba (srp5; spn77Ba)\tY\tsimilar to bmrelish1 and bmrelish2; nuclear factor\t \tnf-kappa-b p110 subunit isoform 1 or 2; weakly\t \tsimilar to Drosophila melanogaster relish\t \t(rel)\tY\t \thomology to Bombyx serpin-6 and Drosophila\t \tspn88Ea (srp6; spn88Ea)\tY\themomucin (rrm5; hmu)\tY\t \thomology to Bombyx serpin-10 and Drosophila\t \tspn100a (srp10; spn100A)\tY\tsmt3 activating enzyme 2 (sae2; sip2;\t \tuba2)\tY\t \thomology to Bombyx serpin-11 and Drosophila\t \tspn100A 
(srp11; spn100A)\tY\tgalactin; galactose specific c-type lectin\t \t(lectin-galc1)\tN\t \thomology to Bombyx serpin-13 and Drosophila\t \tspn28d (srp13; spn28D)\tY\t \t \t \t\r\n\r\nGenes identified mainly from the Drosophila melanogaster\nliterature involved in immune defense during oogenesis. Presence (Y)\nor absence (N) of orthologous transcripts in the Pararge\naegeria transcriptome is indicated.\r\n\r\nThe facultative reproductive parasite Wolbachia sp. is a maternally transmitted endocytosymbiont in many arthropod species that affects oogenesis in a multitude of ways. In D. mauritiana, Wolbachia increases egg production by affecting the maintenance and division of germ-line stem cells, while in the wasp Asobara tabida, Wolbachia confers a reproductive advantage to the females by properly regulating apoptosis during oogenesis via its regulation of iron metabolism and ferritin expression. However, in D. melanogaster highly infected females suffer from a range of oogenesis defects mediated via grk signalling. Pararge aegeria females were also found to be infected with Wolbachia, but how this affects oogenesis in this species is at present not known. We did observe that the gene encoding an ortholog of the Ferritin 2 light chain protein (FER2-LCH) was amongst the most highly transcribed genes during P. aegeria oogenesis (Tables 2 and 23), but at present it is unknown whether this effect is due to Wolbachia or whether elevated expression levels are a normal part of female P. aegeria reproduction.\r\n\r\nEgg activation, ovulation, gene regulation in oviduct upon mating and maternal effect genes involved in fertilisation\r\n\r\nAs discussed elsewhere in this paper, after vitellogenesis both the D. melanogaster and the Lepidopteran oocyte are in a secondary meiotic arrest in metaphase I. Unlike in Lepidoptera, egg activation in D. 
melanogaster is not triggered by the act of fertilisation, but by the mechanical pressure experienced by the oocyte when moving from the ovary into the small and tight oviducts. Egg activation involves eggshell modifications, resumption of meiosis, translation and subsequent degradation of maternal mRNAs, and cytoskeletal changes. A small number of genes have been described as important in D. melanogaster in the latter stages of oogenesis in the general functional context of egg activation (references in Additional file 1). Orthologs for only around half of these were found in the P. aegeria transcriptome (Table 24), which may reflect the observed differences in the mechanism of egg activation between the Lepidoptera and D. melanogaster. Among the genes found in the P. aegeria transcriptome is wispy (fs(1)M19/wisp) (Table 24). In D. melanogaster it is a maternal effect gene that encodes a GLD-2 family protein with polynucleotide adenylyltransferase activity and is essential for the oocyte-to-embryo transition. The D. melanogaster Wisp protein is required for poly(A) tail elongation of bcd, toll, and tor transcripts upon egg activation. It is thus important for proper patterning of the embryo, but is also required to maintain a high level of active (phospho-) mitogen-activated protein kinases (MAPKs). Given that P. 
aegeria females did not express bcd and tor, it remains to be investigated whether wisp is of any importance in patterning of the embryo.\r\n\r\nEgg activation\r\n\r\n\t \tcathepsin l-like cysteine protease; Bombyx cysteine\t \tprotease; cysteine proteinase-1 (bcp; cl;\t \tcp1)\tY\tvacuolar proton atpase; vacuolar h+ atpase subunit\t \t100–2 (vha100-2)\tY\t \tcathepsin b; cathepsin b-like cysteine proteinase\t \t(catb)\tY\th+ transporting atpase v0 subunit d; vacuolar h+ atpase\t \tsubunit ac39-1 (vhaac39-1)\tY\t \tcathepsin d; aspartic protease (catd)\tY\tvacuolar atp synthase subunit d; vacuolar h+ atpase\t \tsubunit 36–1 (mvd; vha36-1)\tY\t \tcathepsin f-like cysteine protease; CG12163\t \t(catf)\tY\tCG7899; acid phosphatase 1 (acph-1; ap)\tN\t \tecdysteroid-phosphate phosphatase (EPPase)\tY\tprimo-1; acid phosphatase isoenzyme\t \t(primo-1)\tY\t \tvacuolar proton atpase; vacuolar h+ atpase subunit\t \t100–1 (mva; v100; vha100-1)\tY\t \t \t \t\r\n\r\nGenes identified mainly from the Drosophila melanogaster\nliterature involved in egg activation, ovulation, gene regulation in\noviduct upon mating and maternal effect genes involved in\nfertilisation. Presence (Y) or absence (N) of orthologous\ntranscripts in the Pararge aegeria transcriptome is\nindicated.\r\n\r\nConclusions\r\n\r\nA large proportion of the genes currently described in the literature as being essential during insect oogenesis (in particular D. melanogaster oogenesis) were transcribed by P. aegeria and transcripts were transferred to the oocytes. As this was an ovarian transcriptome study, the precise functional context in which these genes were transcribed has not been identified. Differences in the functional context in which particular genes are expressed are to be expected compared to model organisms such as D. melanogaster and even B. mori. 
What is perhaps more revealing, however, is the absence of certain transcripts in the database, in particular where these transcripts concern paradigms of maternal regulation for various aspects of early insect embryogenesis. Pararge aegeria differed most significantly from D. melanogaster (and quite a number of other insect species), both in terms of stem cell maintenance or differentiation in the germarium and in establishing (and maintaining) polarity along AP, DV and at the termini of the oocyte. In particular, although Pararge aegeria females expressed an ortholog of a spi/krn-like EGF ligand and possibly its receptor, many components of the EGF pathway involved in patterning of the axes in D. melanogaster embryos, as well as pipe and mirror, were not expressed. This may suggest either that EGF signalling does not play a significant role in establishing P. aegeria oocyte polarity, or that its functional role and the genes involved are divergent from those of other insects. This requires further study, as do the functional role and significance of Dpp and Notch signalling in this context.\r\n\r\nAlthough the more derived species such as B. mori within the Ditrysia are argued to be long germ band-like, it is more appropriate to describe them as intermediate germ band, as they have a very unusual preblastoderm stage. As in D. melanogaster, cleavage in B. mori and the butterfly Pieris rapae is superficial, but nuclear migration to the periphery of the oocyte and subsequent cellularisation occur in an anterior to posterior gradient, after which they display long germ band characteristics. It is very likely that this has a bearing on maternal effect gene expression regulating axes patterning after oocyte polarity has been established during the pre-vitellogenic stages in Ditrysia compared to D. melanogaster, and this could be reflected in the gene expression data presented in this study (e.g. the absence of maternal expression of hb). 
Although progress has been made in investigating B. mori embryonic patterning, how polarity is established during oogenesis in Ditrysia and in the Lepidoptera as a whole is not known. This needs further investigation, and P. aegeria may prove an ideal model for these future studies.\r\n\r\nUnfortunately, maternal effect gene expression and regulation have received significantly less research attention in Lepidoptera compared to vitellogenesis, choriogenesis and reproductive physiology. This is reflected in the discussion of the results in this paper. Although the latter aspects of oogenesis are well suited to studies of reproductive output under a variety of environmental conditions, many of the genes discussed in this study highlight the interconnectedness of all stages during oogenesis, for example eggshell production and oocyte polarity. Furthermore, key candidate genes that have the potential to play an important role in transgenerational maternal effects have been identified. Among these are genes encoding heat shock proteins and proteins involved in chromatin remodelling.\r\n\r\nThis study has taken a much-needed first step in determining the conserved and divergent elements of the butterfly oogenesis GRN (including maternal regulation of embryonic patterning) and establishes P. aegeria as an eco-evo-devo model system for the study of butterfly oogenesis. In order to fully unscramble butterfly oogenesis, an investigation of the spatio-temporal expression patterns of the genes discussed in this study, as well as establishment of their function, is required. 
Further studies are also required to establish the function and expression patterns of the uncharacterised contigs identified in this study, which make up 30% of the total contigs found and undoubtedly include genes that are of high importance in butterfly oogenesis.\r\n\r\nMethods\r\n\r\nButterfly rearing and sample collection\r\n\r\nAs butterflies were used in this study, no ethical approval was required. Eggs were collected from a large outbred laboratory population of P. aegeria (kept at 300–400 individuals per generation). This population originated from a woodland population from the south of Belgium (St. Hubert; established from 50 eggs) and by the time of the experiment, the butterflies had been reared in the laboratory for 10 generations. Newly hatched larvae were placed on potted host plants (4 larvae per plant) of Poa trivialis L. with access to ad libitum food and were reared until eclosion in a climate room under a regime (24±0.3°C, LD 16:8) that promotes direct development (i.e. no diapause). On the day of eclosion (i.e. day −1, between 09.00 and 12.00 h) females from this laboratory stock were placed individually in netted cages (0.5 m³) along with a potted P. trivialis plant for oviposition and an artificial flower containing a 10% honey solution. Later the same day (between 13.00 and 16.00 h) a virgin male was introduced to the cage and the mating pair was left undisturbed for 24 h.\r\n\r\nEggs from 50 mated 4-day old females were collected within 20 minutes of being laid, which is well before the onset of cleavage and thus early embryogenesis in butterflies. The eggs were placed immediately in 1 ml TRI-Reagent (Sigma-Aldrich, Dorset, UK) and homogenised thoroughly. Furthermore, 2 mated females aged 4 days were sacrificed by severing the nerve cord, after which the abdomen was removed and the ovaries dissected out in ice-cold PBS (1×), with dissection taking no longer than 15 minutes to avoid RNA degradation. 
The ovaries were pooled and likewise homogenised immediately in 1ml TRI-Reagent.\r\n\r\nRNA extraction and quality control\r\n\r\nThe homogenate (both of eggs and ovarioles/ovary) was first centrifuged at 13000g for 10min primarily to remove the yolk, after which the supernatant was vortexed with 200μl of chloroform. Phases were separated at 13000g for 15min at room temperature. The aqueous phase was removed and precipitated in 0.5ml isopropanol. The RNA samples were further purified using the RNeasy Mini Kit and re-eluted in 30μl nuclease-free water, following the manufacturer’s instructions (Qiagen, Hilden, Germany). Preliminary yield and quality for each RNA extraction were assayed using a Nanodrop, while RNA integrity was verified using the Agilent BioAnalyzer 2100 PicoRNA Chip (Agilent Technologies, Winnersh, UK) (Additional file 10).\r\n\r\nDe novo transcriptome assembly\r\n\r\nPararge aegeria egg and ovary RNA was sequenced by Source BioScience (Nottingham, UK) using Illumina short read RNA-Seq technology. Both total RNA samples went through polyA selection, fragmentation and double stranded cDNA conversion to produce two separate libraries (300bp insert size) in accordance with the Illumina mRNA-seq library preparation protocol (Illumina, San Diego, USA). Sequencing was performed on the Illumina Genome Analyzer IIx platform with one flowcell lane allocated to each library. A total of 61,400,070 single-reads of 38 base pairs (bp) in length were obtained from the ovary and egg flowcell lanes (31,836,256 and 29,563,814 reads for ovary and egg samples respectively) which were pooled to produce a de novo assembly in CLC Genomics Workbench v4.0 (CLC bio, Aarhus, Denmark) using the default settings for short read data (automatic word and bubble size). 
The assembly generated 25,266 contigs (Additional file 2) with an average length of 535 bp (N50 = 671 bp), 41.06% GC content and an estimated average coverage of 124× per nucleotide.\r\n\r\nThe RNA-seq data were analysed with FastQC on the Galaxy platform. Adaptor dimers or overruns in the reads (stretches of sequence matching the library preparation primers/adaptors) were trimmed from both egg and ovary data sets using CLC Genomics Workbench. Furthermore, the sequences were trimmed down to 25 bp from the 5’ end and sequencing artefacts discarded using the FASTX-Toolkit on Galaxy. Subsequently, the trimmed reads were mapped against the de novo assembly using TopHat with default parameters on the Galaxy server. FPKM values were estimated from the TopHat output using Cufflinks with quartile normalisation and multi-read correction enabled. The estimates were limited to a reference general feature format (GFF) file containing the locations of the predicted coding regions from the automated annotation, where available.\r\n\r\nAnnotation\r\n\r\nThe 25,266 contigs generated by the de novo assembly (Additional file 2) were processed through a similarity-based annotation workflow. Open reading frames (ORFs) over 200 bp were identified and extracted with the EMBOSS tool “getorf” in Galaxy. The GC content increased to 42.23% when limited to possible coding regions. The predicted ORF and contig sequences were then processed through different BLAST strategies to provide the most suitable annotation possible (Additional files 11 and 12). The alpha group compared the predicted ORF sequences against protein databases to identify complete or highly conserved transcripts. The beta group compared the full contigs against protein databases to identify incomplete or out-of-frame transcripts. Sequences not identified in the alpha and beta groups were compared further against nucleic acid coding sequences (delta) and finally the whole nucleotide database (zeta). 
Each search strategy was attributed a different rank, ranging from A to I. Identity was inferred based on similarity to the top ranking hit. Similarity scores (SS) were assigned to each hit based on the bitscore (S’), number of positives in each alignment (P) and original contig length (L). The similarity score was calculated using the formula:\r\n\r\nEffectively this required hits with higher bitscores to also have good query coverage and positive matches. Any hit attaining an SS below 18 (lower SS threshold) was discarded from its rank and replaced by the next best hit (which may be in a lower rank or group) (Additional file 11). Hits were sorted based on group, positives, rank and SS to determine the top hit that would be used to infer the nature of each sequence. Similarity scores also allowed an initial indication of possible homology: SS at or above the upper threshold (≥40) were considered High, those at or above the lower SS threshold (≥18) were considered Mild and any others were considered Low. Any hit with a bitscore below 40 was excluded from inferring any possible identity or homology (Additional files 12 and 13).\r\n\r\nThe output from the automated annotation was checked manually for any errors (Additional file 2). Furthermore, using FlyBase and SilkBase as a starting point, a comprehensive literature search was conducted to identify those genes that have been studied in the context of insect oogenesis and maternal regulation of early embryogenesis (1035 genes, of which 994 have been studied in D. melanogaster; fully referenced in Additional file 1). For a further 56 genes, functionality during oogenesis can be inferred, but their expression during oogenesis has not always been verified experimentally. The presence or absence of orthologous P. 
aegeria transcripts in both the oocyte and the ovarioles was verified for each of the 1091 genes and these transcripts were further annotated manually (indicated as such in Additional file 2).\r\n\r\nThe final BLAST results (1 top hit per sequence) used for annotation, including those genes annotated manually, were used as input for the Blast2GO software and assigned Gene Ontology (GO) terms where possible. To provide an overview of the GO annotation based on the BLAST results, the GO terms were condensed using the generic GO Slim subset.\r\n\r\nTranscript abundance and qPCR of genes involved in oogenesis and maternal regulation of early embryogenesis\r\n\r\nFor a subset of 19 genes, the expression in the ovarioles and the presence of transcripts in the oocyte were confirmed further by means of RT-qPCR (Additional file 3). For both ovary and oocyte, cDNA was generated from 500–1000 ng of RNA using the Verso RT Kit (Thermo Fisher, Surrey, UK). The reverse transcriptions were primed by a 3:1 mix of random hexamers:oligo-dT and took place in 20 μl total volume reactions at 42°C for 30 min after an initial 5 min denaturation step at 70°C. Negative reverse transcription (NRT) controls, from which both the Verso RT enzyme mix and the primers were omitted, were run in parallel. A final heat inactivation step at 95°C for 2 min was also included to inactivate the RT enhancer. The resulting cDNA was stored at −20°C.\r\n\r\nFor the qPCR stage, suitable primer pairs were selected automatically using the online Primer3+ primer design service and tested in silico via the Integrated DNA Technologies online structure prediction package (Oligo Analyzer). Only those primers exhibiting the best stability were selected. Each primer pair was tested on a 3-step 5-fold dilution series of the ovary cDNA in triplicate, which enabled the primer pair efficiencies to be determined using the CFX Manager software (Bio-Rad Laboratories, California, USA). 
Primers with adequate efficiency (\u003e65%) were then used for investigating the transcript abundance in the egg and ovary cDNA (Additional file 3).\r\n\r\nAll qPCR runs were performed on the CFX96 Real-Time PCR Detection System (Bio-Rad) on white 96-well plates in ABsolute Blue qPCR SYBR Green Mastermix (Thermo Fisher, Surrey, UK) with the recommended amount of ROX reference dye (Additional file 14). Test samples were measured in triplicate, while no template controls (NTC) and NRTs were present in duplicate on each plate. The CFX96 data were recorded by the CFX Manager software using automatic threshold determination. The quantification cycle (Cq) values are listed in Additional file 4.\r\n\r\nRelative transcript abundance (i.e. ovary versus egg) was used to reveal whether any individual transcript was used as a maternal effect gene transcript or was merely necessary for oocyte production. Relative transcript abundance in the ovaries and eggs was obtained using the relative expression software tool (REST v2.0.13.0), which used the 3 available reference genes to normalise the measurements obtained from the egg- and ovary-derived cDNA (Additional file 5).\r\n\r\nThe number of reads mapping to a transcript of a particular gene in RNA-seq data has been argued to correlate linearly with the number of transcripts of that gene. Instead of raw read counts, it is considered more appropriate to use a corrected relative value that takes transcript length and the total number of mapped reads into account. Cufflinks generates such corrected values, the FPKM values, which can be used for the reliable determination of transcript abundance for each of the genes discussed in this study (Additional file 2). In fact, for the 22 genes in the P. 
aegeria transcriptome investigated by means of qPCR, transcript abundance calculated from Cq values by means of previously described methods showed a significant positive correlation with FPKM values in the combined oocyte and ovary transcriptome (Pearson correlation, one-sided test with the alternative hypothesis that the correlation is \u003e0: t41 = 2.37, P = 0.011; Additional file 6).\r\n\r\nAnnotated contigs and accession numbers of raw data\r\n\r\nThe sequence read data reported in this manuscript have been deposited in the NCBI Sequence Read Archive and are available under the accession numbers SRR771147 (ovarian reads) and SRR772253 (oocyte reads). Additional file 15 provides the FASTA-format sequences of the assembled contigs, including the suggested annotated names (top BLAST results as well as information on the manual annotation listed in Additional file 2). Additional file 2 provides information on the start and end of the coding regions in the contigs.\r\n\r\nAbbreviations\r\n\r\nGRN: Gene Regulatory Network; eco-evo-devo: Ecological evolutionary development; AP: Anterior-posterior; DV: Dorso-ventral; RNA-seq: RNA-sequencing; RNP: Ribonucleoprotein; RTK: Receptor Tyrosine Kinase; CDK: Cyclin-dependent kinase; SC: Synaptonemal Complex; RN: Recombination Nodules; IRS: Insulin Receptor Substrate; 20E: 20-hydroxy-ecdysone; JH: Juvenile Hormone; FPKM: Fragments Per Kilobase of exon per Million of fragments mapped; ORF: Open Reading Frame; SS: Similarity Score; GO: Gene Ontology; RT-qPCR: Real-time reverse transcription quantitative polymerase chain reaction; NRT: Negative reverse transcription; NTC: No template control\r\n\r\nCompeting interests\r\n\r\nThe authors declare that they have no competing interests.\r\n\r\nAuthors’ contributions\r\n\r\nJMC collected and analysed RT-qPCR data, designed the automatic annotation pipeline, performed bioinformatic analyses, and co-wrote the manuscript. SCB assisted in RT-qPCR study design and data collection. 
RP and DRFC prepared RNA samples for RNA-seq. AC performed phylogenetic analyses of nanos. JT assisted in manual annotation of the transcriptome. MG and CJB designed and supervised the study, performed the manual annotation of the transcriptome, and co-wrote the manuscript. All authors have provided comments on earlier drafts of the manuscript and approved the final version of the manuscript for publication.\r\n\r\nAcknowledgements\r\n\r\nResearch funding for JMC and CJB was provided by the Faculty of Health and Life Sciences, Department of Biological and Medical Sciences, Oxford Brookes University (Jnl no 105595 and 103324) and a NERC studentship quota award. In particular we would like to thank Peter Holland, Laura Ferguson and Ferdinand Marletaz for the collaboration on the Pararge aegeria genome. Furthermore, we would like to thank Alistair McGregor and the two anonymous reviewers for helpful comments on earlier versions of the manuscript, Maarten Hilbrant for discussions on maternal effect genes, Tom Annat for his help with chorion gene phylogenetic analyses, Luca Livraghi for discussions on caudal translational repression, as well as the numerous undergraduate students who have worked in the lab of CJB on butterfly oogenesis.\r\n\r\n\r\n","tracks":[]}