You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pdfbox.apache.org by "Andreas Lehmkühler (JIRA)" <ji...@apache.org> on 2013/10/08 19:20:42 UTC
[jira] [Updated] (PDFBOX-929) Extraction of the Content from CJK
pdf's using PDFBox and indexing the same with LUCENE search in Solaris
fails.
[ https://issues.apache.org/jira/browse/PDFBOX-929?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Andreas Lehmkühler updated PDFBOX-929:
--------------------------------------
Component/s: (was: Lucene)
> Extraction of the Content from CJK pdf's using PDFBox and indexing the same with LUCENE search in Solaris fails.
> ----------------------------------------------------------------------------------------------------------------
>
> Key: PDFBOX-929
> URL: https://issues.apache.org/jira/browse/PDFBOX-929
> Project: PDFBox
> Issue Type: Bug
> Components: PDFReader, Text extraction
> Environment: Solaris
> Reporter: gomathy s
> Original Estimate: 5h
> Remaining Estimate: 5h
>
> In the solaris environment , when we are using the PDFBox ,extracting the content and setting few lines from the PDF as a description and
> indexing the content.In the search we don't get any results when we are searching with the CJK characters but english words it is
> able to retreive results.Am using the correct analyzer both during indexing and searching.This happens only in Solaris , in windows it is working
> fine.Please suggest me guys , this is an major issue for me.
--
This message was sent by Atlassian JIRA
(v6.1#6144)