Title: RDNAS: a simple DNA sequence squeezer using enhanced run length encoding

Authors: S. Panneer Arokiaraj; L. Robert

Addresses: Periyar EVR College, Trichy – 620 023, TN, India ' Government Arts College, Coimbatore – 641 018, TN, India

Abstract: Collecting and organising the DNA data of all organisms have become the corner stone of the basic biological science. It is indispensable to all those areas utilising this knowledge of DNA sequences such as diagnostic, biotechnology, forensic biology, drug design, etc. This area is developing fast with demand as well as the growth of databases, one inducing the other's growth. Due to this ever increasing demand, the size of the DNA databases are growing in an exponential manner. The scientific community working on data compression has proved that the data can be considerably reduced in their size if the repetitions are supervised and thereby increasing the capacity of the usage of the storage media. To address this issue, an attempt is made to compress the DNA sequences using run length encoding (RLE) in a simplest form and to obtain a better compression ratio and compression gain.

Keywords: data compression; compression gain; compression ratio; run length encoding; RLE; DNA sequences.

DOI: 10.1504/IJAIP.2016.080198

International Journal of Advanced Intelligence Paradigms, 2016 Vol.8 No.4, pp.443 - 450

Received: 22 Oct 2014
Accepted: 08 Aug 2015

Published online: 07 Nov 2016 *

Full-text access for editors Full-text access for subscribers Purchase this article Comment on this article