Using phylogenetically-informed annotation (PIA) to search for light-interacting genes in transcriptomes from non-model organisms
| dc.contributor.author | Speiser, Daniel I | |
| dc.contributor.author | Pankey, M S | |
| dc.contributor.author | Zaharoff, Alexander K | |
| dc.contributor.author | Battelle, Barbara A | |
| dc.contributor.author | Bracken-Grissom, Heather D | |
| dc.contributor.author | Breinholt, Jesse W | |
| dc.contributor.author | Bybee, Seth M | |
| dc.contributor.author | Cronin, Thomas W | |
| dc.contributor.author | Garm, Anders | |
| dc.contributor.author | Lindgren, Annie R | |
| dc.contributor.author | Patel, Nipam H | |
| dc.contributor.author | Porter, Megan L | |
| dc.contributor.author | Protas, Meredith E | |
| dc.contributor.author | Rivera, Ajna S | |
| dc.contributor.author | Serb, Jeanne M | |
| dc.contributor.author | Zigler, Kirk S | |
| dc.contributor.author | Crandall, Keith A | |
| dc.contributor.author | Oakley, Todd H | |
| dc.date.accessioned | 2014-12-04T03:59:21Z | |
| dc.date.available | 2014-12-04T03:59:21Z | |
| dc.date.issued | 2014-11-19 | |
| dc.date.updated | 2014-12-04T03:59:25Z | |
| dc.description.abstract | Abstract Background Tools for high throughput sequencing and de novo assembly make the analysis of transcriptomes (i.e. the suite of genes expressed in a tissue) feasible for almost any organism. Yet a challenge for biologists is that it can be difficult to assign identities to gene sequences, especially from non-model organisms. Phylogenetic analyses are one useful method for assigning identities to these sequences, but such methods tend to be time-consuming because of the need to re-calculate trees for every gene of interest and each time a new data set is analyzed. In response, we employed existing tools for phylogenetic analysis to produce a computationally efficient, tree-based approach for annotating transcriptomes or new genomes that we term Phylogenetically-Informed Annotation (PIA), which places uncharacterized genes into pre-calculated phylogenies of gene families. Results We generated maximum likelihood trees for 109 genes from a Light Interaction Toolkit (LIT), a collection of genes that underlie the function or development of light-interacting structures in metazoans. To do so, we searched protein sequences predicted from 29 fully-sequenced genomes and built trees using tools for phylogenetic analysis in the Osiris package of Galaxy (an open-source workflow management system). Next, to rapidly annotate transcriptomes from organisms that lack sequenced genomes, we repurposed a maximum likelihood-based Evolutionary Placement Algorithm (implemented in RAxML) to place sequences of potential LIT genes on to our pre-calculated gene trees. Finally, we implemented PIA in Galaxy and used it to search for LIT genes in 28 newly-sequenced transcriptomes from the light-interacting tissues of a range of cephalopod mollusks, arthropods, and cubozoan cnidarians. Our new trees for LIT genes are available on the Bitbucket public repository (http://bitbucket.org/osiris_phylogenetics/pia/) and we demonstrate PIA on a publicly-accessible web server (http://galaxy-dev.cnsi.ucsb.edu/pia/). Conclusions Our new trees for LIT genes will be a valuable resource for researchers studying the evolution of eyes or other light-interacting structures. We also introduce PIA, a high throughput method for using phylogenetic relationships to identify LIT genes in transcriptomes from non-model organisms. With simple modifications, our methods may be used to search for different sets of genes or to annotate data sets from taxa outside of Metazoa. | |
| dc.description.version | Peer Reviewed | |
| dc.identifier.citation | BMC Bioinformatics. 2014 Nov 19;15(1):350 | |
| dc.identifier.uri | http://dx.doi.org/10.1186/s12859-014-0350-x | |
| dc.identifier.uri | https://hdl.handle.net/2139/39299 | |
| dc.language.rfc3066 | en | |
| dc.rights.holder | Daniel I Speiser et al.; licensee BioMed Central Ltd. | |
| dc.title | Using phylogenetically-informed annotation (PIA) to search for light-interacting genes in transcriptomes from non-model organisms | |
| dc.type | Journal Article |
Files
Original bundle
1 - 5 of 7
Loading...
- Name:
- s12859-014-0350-x.pdf
- Size:
- 924.98 KB
- Format:
- Adobe Portable Document Format
Loading...
- Name:
- s12859-014-0350-x-S2.pdf
- Size:
- 489.62 KB
- Format:
- Adobe Portable Document Format
License bundle
1 - 1 of 1
Loading...
- Name:
- license.txt
- Size:
- 1.71 KB
- Format:
- Item-specific license agreed upon to submission
- Description:
