Powerful sequence similarity search methods and in-depth manual analyses can identify remote homologs in many apparently "orphan" viral proteins

D. Kuchibhatla, W. Sherman, B. Chung, S. Cook, G. Schneider, B. Eisenhaber, D. Karlin, Journal of Virology 88 (2014) 10–20.

Download
OA 825.76 KB

Journal Article | Published | English
Author
; ; ; ; ; ;
Department
Abstract
The genome sequences of new viruses often contain many "orphan" or "taxon-specific" proteins apparently lacking homologs. However, because viral proteins evolve very fast, commonly used sequence similarity detection methods such as BLAST may overlook homologs. We analyzed a data set of proteins from RNA viruses characterized as "genus specific" by BLAST. More powerful methods developed recently, such as HHblits or HHpred (available through web-based, user-friendly interfaces), could detect distant homologs of a quarter of these proteins, suggesting that these methods should be used to annotate viral genomes. In-depth manual analyses of a subset of the remaining sequences, guided by contextual information such as taxonomy, gene order, or domain cooccurrence, identified distant homologs of another third. Thus, a combination of powerful automated methods and manual analyses can uncover distant homologs of many proteins thought to be orphans. We expect these methodological results to be also applicable to cellular organisms, since they generally evolve much more slowly than RNA viruses. As an application, we reanalyzed the genome of a bee pathogen, Chronic bee paralysis virus (CBPV). We could identify homologs of most of its proteins thought to be orphans; in each case, identifying homologs provided functional clues. We discovered that CBPV encodes a domain homologous to the Alphavirus methyltransferase-guanylyltransferase; a putative membrane protein, SP24, with homologs in unrelated insect viruses and insect-transmitted plant viruses having different morphologies (cileviruses, higreviruses, blunerviruses, negeviruses); and a putative virion glycoprotein, ORF2, also found in negeviruses. SP24 and ORF2 are probably major structural components of the virionsd.
Publishing Year
Date Published
2014-01-01
Journal Title
Journal of Virology
Volume
88
Issue
1
Page
10 - 20
ISSN
IST-REx-ID

Cite this

Kuchibhatla D, Sherman W, Chung B, et al. Powerful sequence similarity search methods and in-depth manual analyses can identify remote homologs in many apparently “orphan” viral proteins. Journal of Virology. 2014;88(1):10-20. doi:10.1128/JVI.02595-13
Kuchibhatla, D., Sherman, W., Chung, B., Cook, S., Schneider, G., Eisenhaber, B., & Karlin, D. (2014). Powerful sequence similarity search methods and in-depth manual analyses can identify remote homologs in many apparently “orphan” viral proteins. Journal of Virology, 88(1), 10–20. https://doi.org/10.1128/JVI.02595-13
Kuchibhatla, Durga, Westley Sherman, Betty Chung, Shelley Cook, Georg Schneider, Birgit Eisenhaber, and David Karlin. “Powerful Sequence Similarity Search Methods and In-Depth Manual Analyses Can Identify Remote Homologs in Many Apparently ‘Orphan’ Viral Proteins.” Journal of Virology 88, no. 1 (2014): 10–20. https://doi.org/10.1128/JVI.02595-13.
D. Kuchibhatla et al., “Powerful sequence similarity search methods and in-depth manual analyses can identify remote homologs in many apparently ‘orphan’ viral proteins,” Journal of Virology, vol. 88, no. 1, pp. 10–20, 2014.
Kuchibhatla D, Sherman W, Chung B, Cook S, Schneider G, Eisenhaber B, Karlin D. 2014. Powerful sequence similarity search methods and in-depth manual analyses can identify remote homologs in many apparently ‘orphan’ viral proteins. Journal of Virology. 88(1), 10–20.
Kuchibhatla, Durga, et al. “Powerful Sequence Similarity Search Methods and In-Depth Manual Analyses Can Identify Remote Homologs in Many Apparently ‘Orphan’ Viral Proteins.” Journal of Virology, vol. 88, no. 1, ASM, 2014, pp. 10–20, doi:10.1128/JVI.02595-13.
All files available under the following license(s):
Creative Commons License:
CC-BYCreative Commons Attribution 4.0 International Public License (CC-BY 4.0)
Main File(s)
Access Level
OA Open Access
Last Uploaded
2018-12-12T10:13:43Z


Export

Marked Publications

Open Data IST Research Explorer

Search this title in

Google Scholar