Concod: an effective integration framework of consensus-based calling deletions from next-generation sequencing data
by Lei Cai; Chong Chu; Xiaodong Zhang; Yufeng Wu; Jingyang Gao
International Journal of Data Mining and Bioinformatics (IJDMB), Vol. 17, No. 2, 2017

Abstract: Detecting structural variations such as deletions from the short reads produced by next-generation sequencing is an important but challenging problem in genome analysis. This paper proposes a conceptual framework to improve deletion calling. Although many detection tools have been developed, no single method clearly outperforms all the others. A widely used approach at present is merging, which combines the features of several methods to call deletions more accurately. However, most existing methods that take the combining approach are heuristic, and the deletions they call still include many false calls. In this paper, we introduce Concod, an effective integration framework that uses machine learning to detect deletions. First, Concod collects candidate deletions from multiple existing deletion detection tools. Then, based on the detection theories behind those tools, it extracts features for each candidate from the sequence data. Finally, a machine learning model is trained on these features to distinguish true candidates from false ones. We test our framework on real data at different coverage levels and compare it with other existing tools, including Pindel, SVseq2, BreakDancer and DELLY. Results show that Concod significantly improves both the precision and the sensitivity of deletion detection.
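
The first two steps described in the abstract (collecting candidates from several tools, then deriving features for each candidate) can be illustrated with a minimal sketch. This is not Concod's implementation: the `Candidate` class, the 50% reciprocal-overlap merging rule, the tuple input format and the particular features (one support flag per tool plus deletion length) are all assumptions chosen for illustration. The resulting feature vectors are the kind of input a classifier would then be trained on.

```python
from dataclasses import dataclass, field


@dataclass
class Candidate:
    """A candidate deletion and the set of tools that reported it (hypothetical type)."""
    chrom: str
    start: int
    end: int
    tools: set = field(default_factory=set)


def reciprocal_overlap(a_start, a_end, b_start, b_end):
    """Return the smaller of the two overlap fractions, or 0.0 if disjoint."""
    overlap = min(a_end, b_end) - max(a_start, b_start)
    if overlap <= 0:
        return 0.0
    return min(overlap / (a_end - a_start), overlap / (b_end - b_start))


def merge_calls(calls_by_tool, min_overlap=0.5):
    """Merge per-tool calls into shared candidates.

    calls_by_tool maps a tool name to a list of (chrom, start, end) tuples.
    A call joins the first existing candidate it reciprocally overlaps by at
    least min_overlap (an assumed threshold); the candidate keeps the
    coordinates of the first call that created it.
    """
    merged = []
    for tool, calls in calls_by_tool.items():
        for chrom, start, end in calls:
            for cand in merged:
                if cand.chrom == chrom and reciprocal_overlap(
                    cand.start, cand.end, start, end
                ) >= min_overlap:
                    cand.tools.add(tool)
                    break
            else:
                # No existing candidate matched, so start a new one.
                merged.append(Candidate(chrom, start, end, {tool}))
    return merged


def build_features(cand, tool_names):
    """Feature vector: one 0/1 support flag per tool, then the deletion length."""
    flags = [1 if name in cand.tools else 0 for name in tool_names]
    return flags + [cand.end - cand.start]


# Made-up calls, for illustration only.
calls = {
    "Pindel": [("chr1", 1000, 2000)],
    "DELLY": [("chr1", 1050, 1980), ("chr2", 500, 900)],
}
candidates = merge_calls(calls)
vectors = [build_features(c, ["Pindel", "DELLY"]) for c in candidates]
```

In a real pipeline the feature set would also draw on evidence from the reads themselves, and labelled candidates would be needed to train the model that separates true from false calls.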

Online publication date: Mon, 22-May-2017
