Unsupervised Learning on Document Datasets. Encyclopedia of Machine Learning 2010: 1009