Mining Text Outliers in Document Directories