Smart variant filtering
by Vladimir Kovacevic; Predrag Obradovic
International Journal of Data Mining and Bioinformatics (IJDMB), Vol. 26, No. 3/4, 2021

Abstract: Variant filtering as a part of the genome reconstruction process is used for identifying falsely called variants. Availability of truth set variants published for several human DNA samples enabled the creation of the machine learning-based Smart Variant Filtering tool and framework for filtering germline variants. Conceptually, the framework consists of selecting an optimal machine learning algorithm, configuration, set of features, and producing a model used for filtering novel variants. With direct comparison, we demonstrated that the presented solution outperforms variant filtering currently used within most secondary DNA analyses. Smart Variant Filtering increases the precision of called single nucleotide variants (removes false positives) by up to 0.2% while keeping the overall f-score higher by 0.12-0.27% than in existing solutions. The precision of calling insertions and deletions is increased up to 7.8%, while the f-score increase is in the range of 0.1% to 3.2%.

Online publication date: Tue, 08-Nov-2022

The full text of this article is only available to individual subscribers or to users at subscribing institutions.

 
Existing subscribers:
Go to Inderscience Online Journals to access the Full Text of this article.

Pay per view:
If you are not a subscriber and you just want to read the full contents of this article, buy online access here.

Complimentary Subscribers, Editors or Members of the Editorial Board of the International Journal of Data Mining and Bioinformatics (IJDMB):
Login with your Inderscience username and password:

    Username:        Password:         

Forgotten your password?


Want to subscribe?
A subscription gives you complete access to all articles in the current issue, as well as to all articles in the previous three years (where applicable). See our Orders page to subscribe.

If you still need assistance, please email subs@inderscience.com