A horizontal alignment tool for numerical trend discovery in sequence data: application to protein hydropathy.
Wrabl, James O.
Hilser, Vincent J.
MetadataShow full item record
An algorithm is presented that returns the optimal pairwise gapped alignment of two sets of signed numerical sequence values. One distinguishing feature of this algorithm is a flexible comparison engine (based on both relative shape and absolute similarity measures) that does not rely on explicit gap penalties. Additionally, an empirical probability model is developed to estimate the significance of the returned alignment with respect to randomized data. The algorithm's utility for biological hypothesis formulation is demonstrated with test cases including database search and pairwise alignment of protein hydropathy. However, the algorithm and probability model could possibly be extended to accommodate other diverse types of protein or nucleic acid data, including positional thermodynamic stability and mRNA translation efficiency. The algorithm requires only numerical values as input and will readily compare data other than protein hydropathy. The tool is therefore expected to complement, rather than replace, existing sequence and structure based tools and may inform medical discovery, as exemplified by proposed similarity between a chlamydial ORFan protein and bacterial colicin pore-forming domain. The source code, documentation, and a basic web-server application are available.
Showing items related by title, author, creator and subject.
Greider, Carol W.; Sternglanz, Rolf; Le, Siyuan (American Society for Cell Biology, 2000-03)Telomerase plays a crucial role in telomere maintenance in vivo. To understand telomerase regulation, we have been characterizing components of the enzyme. To date several components of the mammalian telomerase holoenzyme ...
Greider, Carol W.; Opperman, Kay Keyer; Chen, Jiunn-Liang (Oxford University Press, 2002-01-15)Telomerase is an enzyme that maintains telomere length by adding telomeric sequence repeats onto chromosome ends. The telomerase ribonucleoprotein complex consists of two essential components, a reverse transcriptase and ...
Beleva Guthrie, Violeta; 0000-0002-5526-4957 (Johns Hopkins UniversityUSA, 2016-04-20)Proteins often evolve new functions by acquiring a small number of mutations in an ancestral sequence not containing the phenotype. Modeling the functional effect of a mutation is, however, a nontrivial task, due to strong ...