Abstract Detail

Phylogenomics

Ramanauskas, Karolis [1], Igić, Boris [1].

Extract and annotate genes from raw RNA-seq reads: kakapo.

Studies in many fields of the life sciences increasingly rely on RNA sequencing (RNA-seq) data. As a result, RNA-seq datasets deposited in the NCBI Sequence Read Archive (SRA) are proliferating. In addition to serving as an archive for the original studies, these datasets present an opportunity for novel research. Here, we present Kakapo, a pipeline that allows users to extract and assemble a specified gene or protein family from any number of SRA accessions (or from their own RNA-seq data). Kakapo identifies open reading frames in the assembled transcripts and annotates them using InterProScan. Additionally, raw reads can be filtered to remove ribosomal, plastid, and mitochondrial reads, as well as reads belonging to non-target organisms (viral, bacterial, etc.). We demonstrate the utility of this pipeline with a case study: the identification of a putative self-incompatibility locus in Schlumbergera truncata (Cactaceae).
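
To make the open reading frame (ORF) step concrete, the following is a minimal illustrative sketch of a six-frame ORF scan over an assembled transcript. It is not Kakapo's implementation; the abstract does not describe how Kakapo detects ORFs. The function names, the `min_codons` parameter, and the use of the standard stop codons with ATG as the only start codon are all assumptions made for illustration.

```python
# Illustrative sketch only: a naive six-frame ORF scan.
# Assumptions (not from the abstract): ATG is the only start codon,
# the standard stop codons apply, and ORFs must be closed by a stop.

STOP_CODONS = {"TAA", "TAG", "TGA"}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def find_orfs(seq, min_codons=3):
    """Find ORFs on both strands in all three reading frames.

    Returns a list of (strand, frame, start, end, orf) tuples.
    Coordinates are 0-based on the scanned strand, `end` is exclusive,
    and the stop codon is included in the reported sequence.
    `min_codons` counts codons before the stop (hypothetical parameter).
    """
    seq = seq.upper()
    orfs = []
    for strand, s in (("+", seq), ("-", reverse_complement(seq))):
        for frame in range(3):
            start = None  # position of the first ATG of the current ORF
            for i in range(frame, len(s) - 2, 3):
                codon = s[i:i + 3]
                if start is None and codon == "ATG":
                    start = i
                elif start is not None and codon in STOP_CODONS:
                    if (i - start) // 3 >= min_codons:
                        orfs.append((strand, frame, start, i + 3, s[start:i + 3]))
                    start = None
    return orfs


if __name__ == "__main__":
    # Toy transcript (made up for this example).
    for orf in find_orfs("CCATGAAACCCGGGTAACC"):
        print(orf)
```

In a real workflow, the resulting ORFs would typically be translated and passed to an annotation tool such as InterProScan, as the abstract describes; that step is omitted here.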

Related Links:
kakapo GitHub repository

1 - University of Illinois at Chicago

Keywords:
SRA
RNA-seq
self-incompatibility
gene family evolution
evolution

Presentation Type: Oral Paper
Session: PHYL4, Phylogenomics IV
Location: Virtual/Virtual
Date: Friday, July 31st, 2020
Time: 4:30 PM
Number: PHYL4007
Abstract ID: 865
Candidate for Awards: None