Global Sequence Homology Detection Using Word Conservation Probability

관리자, 2011-10-21 16:58:40

조회 수
1091
추천 수
0

Authors

Jae-Seong Yang1+, Dae-Kyum Kim2+, Jinho Kim and Sanguk Kim1,2,3,*

1School of Interdisciplinary Bioscience and Bioengineering, Pohang University of Science and Technology, Hyoja-dong, Nam-gu, Pohang, Gyungbuk, Republic of Korea
2Division of Molecular and Life Science, Pohang University of Science and Technology, Hyoja-dong, Nam-gu, Pohang, Gyungbuk, Republic of Korea
3Division of IT Convergence Engineering, Pohang University of Science and Technology, Hyoja-dong, Nam-gu, Pohang, Gyungbuk, Republic of Korea

+These authors contributed equally to this work

Synopsis

Protein homology detection is an important issue in comparative genomics. Because of the exponential growth of sequence databases, fast and efficient homology detection tools are urgently needed. Currently, for homology detection, sequence comparison methods using local alignment such as BLAST are generally used as they give a reasonable measure for sequence similarity. However, these methods have drawbacks in offering overall sequence similarity, especially in dealing with eukaryotic genomes that often contain many insertions and duplications on sequences. Also these methods do not provide the explicit models for speciation, thus it is difficult to interpret their similarity measure into homology detection. Here, we present a novel method based on Word Conservation Score (WCS) to address the current limitations of homology detection. Instead of counting each amino acid, we adopted the concept of ‘Word’ to compare sequences. WCS measures overall sequence similarity by comparing word contents, which is much faster than BLAST comparisons. Furthermore, evolutionary distance between homologous sequences could be measured by WCS. Therefore, we expect that sequence comparison with WCS is useful for the multiple-species-comparisons of large genomes. In the performance comparisons on protein structural classifications, our method showed a considerable improvement over BLAST. Our method found bigger micro-syntenic blocks which consist of orthologs with conserved gene order. By testing on various datasets, we showed that WCS gives faster and better overall similarity measure compared to BLAST.

http://www.ibc7.org/article/journal_v.php?sid=270&page=1

0 엮인글

0 댓글

댓글 쓰기

문서 첨부 제한 : 0Byte/ 2.00MB
파일 제한 크기 : 2.00MB (허용 확장자 : *.*)

Board Menu

목록

Page 1 / 3
번호 제목 글쓴이 날짜 조회 수
41

Endocytic Regulation of EGFR Signaling

관리자 2012-04-27 98
40

Classification of HDAC8 Inhibitors and Non-Inhibitors Using Support Vector Machines

관리자 2012-04-13 149
39

Members of Ectocarpus siliculosus F-box Family Are Subjected to Differential Selective Forces

관리자 2012-03-02 377
38

Mitochondrial DNA Mutation and Oxidative Stress

관리자 2012-03-02 373
37

Genetic Function Approximation and Bayesian Models for the Discovery of Future HDAC8 Inhibitors

관리자 2012-03-02 367
36

Global Sequence Homology Detection Using Word Conservation Probability

관리자 2011-10-21 1091
35

Evidence of Sexual Selection for Evening Orientation in Human Males: A Cross Cultural Study in Italy and Sri Lanka

관리자 2011-10-13 1131
34

IntoPub: A Directory Server for Bioinformatics Tools and Databases

관리자 2011-09-20 1181
33

Body Height Effect on Brain Volumes in Youth Decreases in Old Age in Koreans

관리자 2011-08-22 1341
32

Bacterial Hash Function Using DNA-Based XOR Logic Reveals Unexpected Behavior of the LuxR Promoter [Reports on negative result]

관리자 2011-07-28 1548
31

Optimized Entity Attribute Value Model: A Search Efficient Representation of High Dimensional and Sparse Data [Rapid Report]

관리자 2011-07-15 1180
30

G-Networks Based Two Layer Stochastic Modeling of Gene Regulatory Networks with Post-Translational Processes [Rapid Report]

관리자 2011-07-15 1195
29

PubMine: An Ontology-Based Text Mining System for Deducing Relationships among Biological Entities [Full Report]

관리자 2011-07-15 1134
28

DMBase: An Integrated Genetic Information Resource for Diabetes Mellitus [Rapid Report]

관리자 2011-07-15 891
27

Inferring Relative Activity between Pathway and Downstream Genes to Classify Melanoma Cancer Progression [Full Report]

관리자 2011-07-15 1010
26

Target Identification : A Challenging Step in Forward Chemical Genetics [Review]

관리자 2011-07-15 927
25

Field Study on the Mycotoxin Binding Effects of Clay in Oreochromis niloticus Feeds and Their Impacts on the Performance as Well as the Health Status throughout the Culture Season [Full Report]

관리자 2011-07-15 1088
24

Tutorial on Drug Development for Central Nervous System [Tutorial]

관리자 2011-07-15 1015
23

Theoretical Investigations on Structure and Function of Human Homologue hABH4 of E.coli ALKB4 [Full Report]

관리자 2011-07-15 1053
22

StrokePortal: A Complete Stroke Information Resource Based on Oriental and Western Medicine [Rapid Report]

관리자 2011-07-15 953

Board Links

Page Navigation