Please use this identifier to cite or link to this item: http://nopr.niscpr.res.in/handle/123456789/43544
Title: Identification of Protein Coding Region in DNA Sequence Using Novel Adaptive Exon Predictor
Authors: Putluri, S R
Rahman, Md Zia Ur
Keywords: Adaptive Algorithm;Computational Complexity;Deoxyribonucleic Acid;Disease Identification;Exons
Issue Date: Feb-2018
Publisher: NISCAIR-CSIR, India
Abstract: Accurately identifying the exon regions in a deoxyribonucleic acid (DNA) sequence is an important task in bio-informatics. Analysis of regions which code for proteins is a key phenomenon for disease identification and design drugs. Exons are DNA segments contain coding information of proteins. Particularly exons within the genes exhibit three base periodicity (TBP), which forms the basis of all exon identification techniques. Several exon identification techniques have been applied successfully for prediction of exons, but improvement is still needed in this direction. By applying signal processing methods, TBP can be easily determined. Adaptive signal processing techniques found to be promising because of their distinct ability of changing weight coefficients in comparison with several other methods. In this paper, a novel adaptive exon predictor (AEP) is proposed based on these considerations using normalization to increase the tracking ability of the adaptive algorithm for exons. Several AEPs are developed using LMS algorithm and its maximum normalized sign based variants to reduce the computational complexity. Hybrid variants of proposed AEPs include MDNLMS, MDNSRLMS, MDNSLMS and MDNSSLMS algorithms. It was shown that MDNSRLMS is more accurate in exon prediction based on performance measures with Sensitiviy (Sn) 0.7372, Specificity (Sp) 0.7573and Precision (Pr) 0.7122 at a threshold of 0.8 for genomic sequence with Accession AF009962. Finally the exon tracking ability of various AEPs has been assessed through a simulation study and results obtained are compared with existing method using various standard genomic datasets taken from the National Center for Biotechnology Information (NCBI) genomic sequence database.
Page(s): 87-91
ISSN: 0975-1084 (Online); 0022-4456 (Print)
Appears in Collections:JSIR Vol.77(02) [February 2018]

Files in This Item:
File Description SizeFormat 
JSIR 77(2) 87-91.pdf689.87 kBAdobe PDFView/Open


Items in NOPR are protected by copyright, with all rights reserved, unless otherwise indicated.