Posted to user@uima.apache.org by Milen <mi...@yahoo.com> on 2010/11/06 14:10:37 UTC
Collecting all documents from a directory
Hello,
I am trying to cluster the files in a directory.
I have already implemented the clustering part using
the Mallet library, and it works when I cluster the
files directly. However, I have to use UIMA, so I wrote a
collection reader that reads the files and annotates them
as documents. The clustering step needs a collection of
documents as its input, and I cannot manage to build one:
the collection reader delivers the files one by one.
What should I implement in order to collect all the files,
for example in an array, and then be able to use them?
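To make the goal concrete, here is a minimal sketch in plain Java of the
behaviour I am after. It uses no UIMA or Mallet classes, and the names
DocumentAccumulator, addDocument and finish are made up for illustration.
Documents arrive one at a time, and the complete list is only handed over
once the last one has been seen.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;

// Hypothetical helper, not part of UIMA or Mallet.
class DocumentAccumulator {
    // Texts gathered so far, in arrival order.
    private final List<String> documents = new ArrayList<>();

    // Called once per document, as each file is delivered on its own.
    void addDocument(String text) {
        documents.add(text);
    }

    // Called once after the last document; returns a read-only copy
    // of everything collected, ready to pass to the clustering step.
    List<String> finish() {
        return Collections.unmodifiableList(new ArrayList<>(documents));
    }
}
```

The clustering code would then take the result of finish() as its input.
What I do not know is which UIMA component should play the role of
addDocument and which hook, if any, corresponds to finish.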
Regards,
Milen