SEMINAR

 

Bioinformatics: Shannon*s negentropy perspectives

 

Perambur S. Neelakanta, Department of Electrical Engineering, College of Engineering and Computer Science, Florida Atlantic University, Boca Raton, Florida 33431

 

Abstract

The research addresses negentropy aspects of genomic details elucidated in terms of information-theoretic perspectives (due to Claude Shannon).  Information theory is a probability-theoretic approach based on negative-entropy (or non-uncertainty) details buried in a set of entities (or signal elements).The underlying art can be used in rational contexts of perceiving useful or meaningful details from a set of stochastical considerations associated with the subject of interest, such as bioinformatics.    

 

For example, application of information theory (IT) or negentropic concepts to genomic details, concerns with the information stored in nucleotide/DNA sequences and in the translation process leading to molecules of life proteins. The concept of entropy or measure of disorderliness (H), when viewed negatively in  Shannon*s sense leads to the celebrated information metric, I = 每 H = 每k where pi is the probability associated with the set of stochastical entities involved. A companion formulation depicting the relative entropy (or mutual information) between two sets of probability distributions {pi} and {qi} is IM =

Identified here is a gamut of avenues in bioinformatics where the concept of information theory can be fruitfully applied: Examples presented refer to:(i) The informatic structure of DNA sequence 每 Redundancy and noisy considerations; (ii) genetic coding: Discrimination/delineation of crisp and/or fuzzy codon-noncodon boundaries; (iii) gene expression 每 elucidation of specific information such as segregation of promoter section TATA box from putative TATA boxes; (iv) estimating the base compositional symmetry between complementary DNA strands of mitochondrial chromosome; (v) complexity metric versus negentropy 每 application in bioinformatic sequence discrimination efforts; (vi)  Csiszar*s family of (neg)entropy formulations: Applications to bioinformatics; (vii) tracking HMM in IT-domain: Relevant Viterbi/forward algorithms of dynamic programming in biological pursuits;(viii) phylogenetic predictions 每 use of various/unusual statistical distance/divergence measures etc.

 

 

Contact for more details: rnarayan@fau.edu, URL: http://www.science.fau.edu/fbrcf/