Paper ID: 2112.13117

Application of Markov Structure of Genomes to Outlier Identification and Read Classification

Alan F. Karr, Jason Hauzel, Adam A. Porter, Marcel Schaefer

In this paper we apply the structure of genomes as second-order Markov processes specified by the distributions of successive triplets of bases to two bioinformatics problems: identification of outliers in genome databases and read classification in metagenomics, using real coronavirus and adenovirus data.

Submitted: Dec 24, 2021