Title: RDNAS: a simple DNA sequence squeezer using enhanced run length encoding
Authors: S. Panneer Arokiaraj; L. Robert
Addresses: Periyar EVR College, Trichy – 620 023, TN, India ' Government Arts College, Coimbatore – 641 018, TN, India
Abstract: Collecting and organising the DNA data of all organisms have become the corner stone of the basic biological science. It is indispensable to all those areas utilising this knowledge of DNA sequences such as diagnostic, biotechnology, forensic biology, drug design, etc. This area is developing fast with demand as well as the growth of databases, one inducing the other's growth. Due to this ever increasing demand, the size of the DNA databases are growing in an exponential manner. The scientific community working on data compression has proved that the data can be considerably reduced in their size if the repetitions are supervised and thereby increasing the capacity of the usage of the storage media. To address this issue, an attempt is made to compress the DNA sequences using run length encoding (RLE) in a simplest form and to obtain a better compression ratio and compression gain.
Keywords: data compression; compression gain; compression ratio; run length encoding; RLE; DNA sequences.
International Journal of Advanced Intelligence Paradigms, 2016 Vol.8 No.4, pp.443 - 450
Received: 22 Oct 2014
Accepted: 08 Aug 2015
Published online: 07 Nov 2016 *