Incorporating prior knowledge induced from stochastic differential equations in the classification of stochastic observations

Springer Science and Business Media LLC - Tập 2016 - Trang 1-14 - 2016

Amin Zollanvari¹, Edward R. Dougherty²

¹Department of Electrical and Electronic Engineering, Nazarbayev University, Astana, Kazakhstan

²The Center for Bioinformatics and Genomic Systems Engineering and the Department of Electrical and Computer Engineering, Texas A&M University, College Station, Texas

Tóm tắt

In classification, prior knowledge is incorporated in a Bayesian framework by assuming that the feature-label distribution belongs to an uncertainty class of feature-label distributions governed by a prior distribution. A posterior distribution is then derived from the prior and the sample data. An optimal Bayesian classifier (OBC) minimizes the expected misclassification error relative to the posterior distribution. From an application perspective, prior construction is critical. The prior distribution is formed by mapping a set of mathematical relations among the features and labels, the prior knowledge, into a distribution governing the probability mass across the uncertainty class. In this paper, we consider prior knowledge in the form of stochastic differential equations (SDEs). We consider a vector SDE in integral form involving a drift vector and dispersion matrix. Having constructed the prior, we develop the optimal Bayesian classifier between two models and examine, via synthetic experiments, the effects of uncertainty in the drift vector and dispersion matrix. We apply the theory to a set of SDEs for the purpose of differentiating the evolutionary history between two species.

Tài liệu tham khảo

Braga-Neto, UM, & Dougherty, ER. (2015). Error Estimation for Pattern Recognition. New York: Wiley-IEEE Press.

Kay, S. (1993). Fundamentals of Statistical Signal Processing: Estimation Theory. New Jersey: Prentice-Hall.

Carlin, BP, & Louis, TA. (2008). Bayesian Methods for Data Analysis. Boca Raton: CRC Press.

Dalton, L, & Dougherty, ER (2011). Bayesian minimum mean-square error estimation for classification error–part I: definition and the Bayesian MMSE error estimator for discrete classification. IEEE Transactions on Signal Processing, 59(1), 115–129.

Dalton, L, & Dougherty, ER (2011). Bayesian minimum mean-square error estimation for classification error–part II: linear classification of Gaussian models. IEEE Transactions on Signal Processing, 59(1), 130–144.

Dalton, L, & Dougherty, ER (2013). Optimal classifiers with minimum expected error within a Bayesian framework – part I: discrete and Gaussian models. Pattern Recognition, 46, 1301–1314.

Dalton, L, & Dougherty, ER (2013). Optimal classifiers with minimum expected error within a Bayesian framework – part II: properties and performance analysis. Pattern Recognition, 46, 1288–1300.

Knight, J, Ivanov, I, Dougherty, ER (2014). MCMC implementation of the optimal Bayesian classifier for non-gaussian models: model-based RNA-seq classification. BMC Bioinformatics, 15. doi:10.1186/s12859-014-0401-3.

Esfahani, MS, & Dougherty, ER (2014). Incorporation of biological pathway knowledge in the construction of priors for optimal Bayesian classification. IEEE/ACM Transactions on Computational Biology and Bioinformatics, 11, 202–218.

Esfahani, MS, & Dougherty, ER (2015). An optimization-based framework for the transformation of incomplete biological knowledge into a probabilistic structure and its application to the utilization of gene/protein signaling pathways in discrete phenotype classification. IEEE/ACM Transactions on Computational Biology and Bioinformatics. doi:10.1109/TCBB.2015.2424407.

Jaynes, ET (1968). Prior probabilities. IEEE Transactions on Systems Science and Cybernetics, 4, 227–241.

Kloeden, PE, & Platen, E. (1995). Numerical Solution of Stochastic Differential Equations. New York: Springer.

Arnold, L. (1974). Stochastic Differential Equations: Theory and Applications. New York: Wiley.

Higham, D (2001). An algorithmic introduction to numerical simulation of stochastic differential equations. SIAM Review, 43, 525–546.

Anderson, TW (1951). Classification by multivariate analysis. Psychometrika, 16, 31–50.

Murphy, KP. (2012). Machine Learning: A Probabilistic Perspective. Cambridge: MIT Press.

DeGroot, MH. (1970). Optimal Statistical Decisions. New York: McGrawHill.

Esfahani, MS, & Dougherty, ER (2014). Effect of separate sampling on classification accuracy. Bioinformatics, 30, 242–250.

Braga-Neto, UM, Zollanvari, A, Dougherty, ER (2014). Cross-validation under separate sampling: strong bias and how to correct it. Bioinformatics, 30, 3349–3355.

Hansen, TF (1997). Stabilizing selection and the comparative analysis of adaptation. Evolution, 51, 1341–1351.

Thompson, K, & Kubatko, LS (2013). Using ancestral information to detect and localize quantitative trait loci in genome-wide association studies. BMC Bioinformatics, 14. doi:10.1186/1471-2105-14-200.

Zollanvari, A, & Dougherty, ER (2014). Moments and root-mean-square error of the Bayesian MMSE estimator of classification error in the Gaussian model. Pattern Recognition, 47, 2178–2192.

Dalton, L, & Dougherty, ER (2014). Intrinsically optimal Bayesian robust filtering. IEEE Transactions on Signal Processing, 62(3), 657–670.

Pugachev, VS. (1965). Theory of Random Functions and Its Applications to Control Problems. Oxford: Pergamon.

Dougherty, ER. (1999). Random Processes for Image and Signal Processing. New York: SPIE Press and IEEE Presses.

Higham, DJ (2015). An introduction to multilevel Monte Carlo for option valuation. International Journal of Computer Mathematics, 92(12).

Duda, RO, Hart, PE, Stork, DG. (2000). Pattern Classification. New York: Wiley.

Scholar Hub - Công cụ hỗ trợ trích dẫn và phân tích khoa học Việt Nam

Về chúng tôi

Scholar Hub là công cụ hỗ trợ trích dẫn và phân tích các bài báo, công bố khoa học Việt Nam. Công cụ trợ giúp người nghiên cứu, tạp chí, đơn vị nghiên cứu tra cứu, phân tích và thống kê dữ liệu nghiên cứu khoa học tại Việt Nam và quốc tế.
ScholarHub KHÔNG đăng thông tin tổng hợp, KHÔNG đăng lại nội dung từ các trang báo chí Việt Nam hoặc trang thông tin điện tử khác tại Việt Nam.

Thông tin, cập nhật

Đăng ký Tạp chí tham gia vào Scholar Hub

Phản hồi ý kiến về Scholar Hub

Bài viết, nội dung cập nhật

Chủ đề khoa học

Website liên kết

Phần mềm kiểm tra trùng lặp Kiểm Tra Tài Liệu

Phần mềm xuất bản tạp chí điện tử VOJS

Công cụ kiểm tra chính tả và thể thức Viver

Nền tảng trắc nghiệm và đề thi đa lĩnh vực LetQA