Decomposable Naive Bayes Classifier for Partitioned Data

keywords: Agents, decomposable algorithms, naive bayes classifier, vertical and horizontal partitions
Most learning algorithms are designed to work on a single dataset. However, with the growth of networks, data is increasingly distributed over many databases in many different geographical sites. These databases cannot be moved to other network sites due to security, size, privacy, or data ownership consideration. In this paper, we propose two decomposable versions of Naive Bayes Classifier for horizontally and vertically partitioned data. The goal of our algorithms is to achieve the learning objectives for any data distribution encountered across the network by exchanging minimum local summaries among the participating sites.
mathematics subject classification 2000: 68U99
reference: Vol. 31, 2012, No. 6+, pp. 1511–1531