You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by Thejan Wijesinghe <th...@gmail.com> on 2017/03/11 16:18:52 UTC

Request for the dataset to test OCR

Hello Tim,

I need the OCR dataset to benchmark my new OCR parser with the existing
one. I just saw that you've commented here,
 https://issues.apache.org/jira/browse/TIKA-2262?focusedCommentId=15862781&
page=com.atlassian.jira.plugin.system.issuetabpanels:
comment-tabpanel#comment-15862781

saying that you've got some in the regression corpus. Could you provide me
with those resources? Thank you.