Título: A frequency-based linguistic approach to protein decoding and design: Simple concepts, diverse applications, and the SCS Package
Autores: Motomura, Kenta; University of the Ryukyus
Nakamura, Morikazu; University of the Ryukyus
Otaki, Joji M.; University of the Ryukyus
Fecha: 2013-03-29
Publicador: Computacional and structural biotechnology journal
Fuente:
Tipo: info:eu-repo/semantics/article
info:eu-repo/semantics/publishedVersion
Peer-reviewed Article
Tema: No aplica
Descripción: Protein structure and function information is coded in amino acid sequences. However, the relationship between primary sequences and three-dimensional structures and functions remains enigmatic. Our approach to this fundamental biochemistry problem is based on the frequencies of short constituent sequences (SCSs) or words. A protein amino acid sequence is considered analogous to an English sentence, where SCSs are equivalent to words. Availability scores, which are defined as real SCS frequencies in the non-redundant amino acid database relative to their probabilistically expected frequencies, demonstrate the biological usage bias of SCSs. As a result, this frequency-based linguistic approach is expected to have diverse applications, such as secondary structure specifications by structure-specific SCSs and immunological adjuvants with rare or non-existent SCSs. Linguistic similarities (e.g., wide ranges of scale-free distributions) and dissimilarities (e.g., behaviors of low-rank samples) between proteins and the natural English language have been revealed in the rank-frequency relationships of SCSs or words. We have developed a web server, the SCS Package, which contains five applications for analyzing protein sequences based on the linguistic concept. These tools have the potential to assist researchers in deciphering structurally and functionally important protein sites, species-specific sequences, and functional relationships between SCSs. The SCS Package also provides researchers with a tool to construct amino acid sequences de novo based on the idiomatic usage of SCSs.
Idioma: Inglés

Artículos similares:

Systems biology and metabolic engineering of Arthrospira cell factories por Klanchui, Amornpan,Vorapreeda, Tayvich,Vongsangnak, Wanwipa,Kannapho, Chiraphan,Cheevadhanarak, Supapon,Meechai, Asawin
The Role of INDY in Metabolic Regulation por Willmes, Diana M; Charité University School of Medicine Berlin,Birkenfeld, Andreas L; Charité University School of Medicine Berlin
Structure-based Methods for Computational Protein Functional Site Prediction por KC, Dukka B; North Carolina A&T State University
The Biochemistry of Vitreoscilla hemoglobin por Stark, Benjamin C.; Illinois Institute of Technology,Dikshit, Kanak L.; Institute of Microbial Technology,Pagilla, Krishna R.; Illinois Institute of Technology
Computer-Aided Protein Directed Evolution: a Review of Web Servers, Databases and other Computational Tools for Protein Engineering por Verma, Rajni; Jacobs University Bremen,Schwaneberg, Ulrich; RWTH Aachen University,Roccatano, Danilo; Jacobs University Bremen
A method to predict edge strands in beta-sheets from protein sequences por Guilloux, Antonin,Caudron, Bernard,Jestin, Jean-Luc
MD simulation studies to investigate iso-energetic conformational behaviour of modified nucleosides m2G and m22G present in tRNA por Bavi, Rohit S,Sambhare, Susmit B,Sonawane, Kailas D; Structural Bioinformatics Unit, Department of Biochemistry, Shivaji University, Kolhapur 416 004, Maharashtra (M.S.), India.
Metabolomics in the identification of biomarkers of dietary intake por O’Gorman, Aoife,Gibbons, Helena,Brennan, Lorraine
10