Incorporating prior knowledge induced from stochastic differential equations in the classification of stochastic observations

Springer Science and Business Media LLC - Tập 2016 - Trang 1-14 - 2016
Amin Zollanvari1, Edward R. Dougherty2
1Department of Electrical and Electronic Engineering, Nazarbayev University, Astana, Kazakhstan
2The Center for Bioinformatics and Genomic Systems Engineering and the Department of Electrical and Computer Engineering, Texas A&M University, College Station, Texas

Tóm tắt

In classification, prior knowledge is incorporated in a Bayesian framework by assuming that the feature-label distribution belongs to an uncertainty class of feature-label distributions governed by a prior distribution. A posterior distribution is then derived from the prior and the sample data. An optimal Bayesian classifier (OBC) minimizes the expected misclassification error relative to the posterior distribution. From an application perspective, prior construction is critical. The prior distribution is formed by mapping a set of mathematical relations among the features and labels, the prior knowledge, into a distribution governing the probability mass across the uncertainty class. In this paper, we consider prior knowledge in the form of stochastic differential equations (SDEs). We consider a vector SDE in integral form involving a drift vector and dispersion matrix. Having constructed the prior, we develop the optimal Bayesian classifier between two models and examine, via synthetic experiments, the effects of uncertainty in the drift vector and dispersion matrix. We apply the theory to a set of SDEs for the purpose of differentiating the evolutionary history between two species.

Tài liệu tham khảo

Braga-Neto, UM, & Dougherty, ER. (2015). Error Estimation for Pattern Recognition. New York: Wiley-IEEE Press.

Kay, S. (1993). Fundamentals of Statistical Signal Processing: Estimation Theory. New Jersey: Prentice-Hall.

Knight, J, Ivanov, I, Dougherty, ER (2014). MCMC implementation of the optimal Bayesian classifier for non-gaussian models: model-based RNA-seq classification. BMC Bioinformatics, 15. doi:10.1186/s12859-014-0401-3.

Esfahani, MS, & Dougherty, ER (2015). An optimization-based framework for the transformation of incomplete biological knowledge into a probabilistic structure and its application to the utilization of gene/protein signaling pathways in discrete phenotype classification. IEEE/ACM Transactions on Computational Biology and Bioinformatics. doi:10.1109/TCBB.2015.2424407.

Jaynes, ET (1968). Prior probabilities. IEEE Transactions on Systems Science and Cybernetics, 4, 227–241.

Arnold, L. (1974). Stochastic Differential Equations: Theory and Applications. New York: Wiley.

Murphy, KP. (2012). Machine Learning: A Probabilistic Perspective. Cambridge: MIT Press.

DeGroot, MH. (1970). Optimal Statistical Decisions. New York: McGrawHill.

Thompson, K, & Kubatko, LS (2013). Using ancestral information to detect and localize quantitative trait loci in genome-wide association studies. BMC Bioinformatics, 14. doi:10.1186/1471-2105-14-200.

Dougherty, ER. (1999). Random Processes for Image and Signal Processing. New York: SPIE Press and IEEE Presses.

Duda, RO, Hart, PE, Stork, DG. (2000). Pattern Classification. New York: Wiley.