An Accelerated Chow and Liu Algorithm: Fitting Tree Distributions to High Dimensional Sparse Data
Author(s)
Meila, Marina
DownloadAIM-1652.ps (1.311Mb)
Additional downloads
Metadata
Show full item recordAbstract
Chow and Liu introduced an algorithm for fitting a multivariate distribution with a tree (i.e. a density model that assumes that there are only pairwise dependencies between variables) and that the graph of these dependencies is a spanning tree. The original algorithm is quadratic in the dimesion of the domain, and linear in the number of data points that define the target distribution $P$. This paper shows that for sparse, discrete data, fitting a tree distribution can be done in time and memory that is jointly subquadratic in the number of variables and the size of the data set. The new algorithm, called the acCL algorithm, takes advantage of the sparsity of the data to accelerate the computation of pairwise marginals and the sorting of the resulting mutual informations, achieving speed ups of up to 2-3 orders of magnitude in the experiments.
Date issued
1999-01-01Other identifiers
AIM-1652
Series/Report no.
AIM-1652