You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@uima.apache.org by Milen <mi...@yahoo.com> on 2010/11/06 14:10:37 UTC

Collecting all documents from a directory

Hello,
I am trying to cluster the files from a directory. 
So i have already implemented the clustering part 
i am using the library Mallet and it works when i cluster 
directly the files. But i have to use UIMA. So i wrote a 
collection reader and i read the files and annotate them 
as documents. But as an input parameter for the cluster i need 
a collection of documents, and when i cant manage to do this. 
With the collection reader the files are collected one by one. 
What should I implement in order to collect all the files in 
an array for example and than be able to use them.
Regards,
Milen