You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Rajesh Rajamani (JIRA)" <ji...@apache.org> on 2019/03/12 11:05:00 UTC
[jira] [Commented] (TIKA-2362) Skipping Header and Footer data from
documents
[ https://issues.apache.org/jira/browse/TIKA-2362?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16790441#comment-16790441 ]
Rajesh Rajamani commented on TIKA-2362:
---------------------------------------
Hi,
Just wanted to see if can have options to specify x-y co-ordinates similar to poppler PDF extraction utility for omitting headers and footers.
> Skipping Header and Footer data from documents
> ----------------------------------------------
>
> Key: TIKA-2362
> URL: https://issues.apache.org/jira/browse/TIKA-2362
> Project: Tika
> Issue Type: Wish
> Components: general, handler
> Reporter: Mujahid Ateeb Khan
> Assignee: Tim Allison
> Priority: Trivial
>
> Is there any method to skip header and footer data of documents(pdf,docx,doc,odt)?
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)