Posted to users@jackrabbit.apache.org by Alexandre Martins <al...@gmail.com> on 2007/01/24 17:39:32 UTC

Indexing PDF Files

Hi,

Does anyone know which version of PDFBox is needed to index PDF files? I
have a problem: PDFBox 0.7.3 is missing some classes, and several
exceptions are thrown (see the log below).

[]s

24.01.2007 13:08:31 *WARN * LazyReader: exception initializing reader org.apache.jackrabbit.core.query.PdfTextFilter$1: java.lang.NoClassDefFoundError: org/fontbox/cmap/CMapParser (LazyReader.java, line 82)
24.01.2007 13:08:31 *WARN * LazyReader: exception initializing reader org.apache.jackrabbit.core.query.PdfTextFilter$1: java.lang.NoClassDefFoundError: org/fontbox/cmap/CMapParser (LazyReader.java, line 82)
24.01.2007 13:08:31 *WARN * LazyReader: exception initializing reader org.apache.jackrabbit.core.query.PdfTextFilter$1: java.lang.NoClassDefFoundError: org/fontbox/cmap/CMapParser (LazyReader.java, line 82)
24.01.2007 13:08:34 *WARN * LazyReader: exception initializing reader org.apache.jackrabbit.core.query.PdfTextFilter$1: java.lang.NoClassDefFoundError: org/fontbox/afm/AFMParser (LazyReader.java, line 82)
24.01.2007 13:33:34 *INFO * ImportContextImpl: Result for IOHandler (org.apache.jackrabbit.server.io.DefaultHandler): OK (DefaultIOListener.java, line 50)
24.01.2007 13:33:39 *WARN * LazyReader: exception initializing reader org.apache.jackrabbit.core.query.PdfTextFilter$1: java.lang.NoClassDefFoundError: org/fontbox/afm/FontMetric (LazyReader.java, line 82)
-- 
Alexandre Costa Martins
CESAR - Recife Center for Advanced Studies and Systems
Software Engineer and Software Reuse Researcher
MSc Candidate at Federal University of Pernambuco
RiSE Member - http://www.rise.com.br

E-mail: alexandre.martins@cesar.org.br
MSN: xandecmartins@hotmail.com
GTalk: alexandremartins@gmail.com
Skype: xandecmartins
Mobile: +55 (81) 9929-9548
Office: +55 (81) 3425-4787
Fax: +55 (81) 3425-4701

RE: Indexing PDF Files

Posted by Patrick Herber <pa...@ticino.com>.
Hi, I think you also need the FontBox JAR (for example
http://repository.aduna-software.org/maven2/fontbox/fontbox/0.1.0-dev/)

Regards,
Patrick
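
A quick way to confirm whether the FontBox classes are visible is to try loading the class names that appear in the NoClassDefFoundError messages. The following is only a minimal sketch, not part of Jackrabbit or PDFBox: the class name FontBoxClasspathCheck and its missing() helper are made up for illustration, and it assumes Java 7 or later plus a run with the same classpath as the repository (otherwise the result says nothing about the running application).

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of Jackrabbit, PDFBox or FontBox.
public class FontBoxClasspathCheck {

    // Class names taken from the NoClassDefFoundError messages in the log.
    static final String[] FONTBOX_CLASSES = {
        "org.fontbox.cmap.CMapParser",
        "org.fontbox.afm.AFMParser",
        "org.fontbox.afm.FontMetric"
    };

    // Returns the subset of classNames that cannot be loaded.
    static List<String> missing(String... classNames) {
        List<String> absent = new ArrayList<>();
        ClassLoader loader = FontBoxClasspathCheck.class.getClassLoader();
        for (String name : classNames) {
            try {
                // initialize=false: locate the class without running static initializers
                Class.forName(name, false, loader);
            } catch (ClassNotFoundException | LinkageError e) {
                absent.add(name);
            }
        }
        return absent;
    }

    public static void main(String[] args) {
        List<String> absent = missing(FONTBOX_CLASSES);
        if (absent.isEmpty()) {
            System.out.println("All FontBox classes named in the log were found.");
        } else {
            System.out.println("Not on the classpath: " + absent);
        }
    }
}
```

If the check reports classes as missing, adding the FontBox JAR next to the PDFBox JAR and running it again should show whether that resolves them.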
