Posted to dev@tika.apache.org by "Tika User (Jira)" <ji...@apache.org> on 2023/01/09 11:45:00 UTC

[jira] [Created] (TIKA-3952) Content mismatch

Tika User created TIKA-3952:
-------------------------------

             Summary: Content mismatch 
                 Key: TIKA-3952
                 URL: https://issues.apache.org/jira/browse/TIKA-3952
             Project: Tika
          Issue Type: Bug
    Affects Versions: 2.6.0
            Reporter: Tika User
         Attachments: download.pdf

While extracting the content of the attached file, we are seeing the content mismatches shown below.



Native file content      : 95 (1972); Erznoznik v. City of Jacksonville

Content we got from Tika : 95 (1972); Erenoznik v. City of Jacksonville

Native file content      : 438 U.S.\n726

Content we got from Tika : 438 U-S.\n726
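To make the two mismatches easier to pin down, here is a minimal sketch in plain Java (no Tika dependency) that reports the first position at which the native text and the extracted text differ. The class name MismatchFinder and the method firstMismatch are hypothetical helpers, not part of Tika, and the strings are copied from the report above with any Jira emphasis markup dropped.

```java
// Hypothetical helper for illustration only; not part of Apache Tika.
class MismatchFinder {

    // Returns the index of the first differing character,
    // or -1 if both strings are identical.
    static int firstMismatch(String expected, String actual) {
        int n = Math.min(expected.length(), actual.length());
        for (int i = 0; i < n; i++) {
            if (expected.charAt(i) != actual.charAt(i)) {
                return i;
            }
        }
        // One string is a prefix of the other, or they are equal.
        return expected.length() == actual.length() ? -1 : n;
    }

    public static void main(String[] args) {
        // Pairs of {native text, text reported as coming from Tika}.
        String[][] cases = {
            {"95 (1972); Erznoznik v. City of Jacksonville",
             "95 (1972); Erenoznik v. City of Jacksonville"},
            {"438 U.S.\n726", "438 U-S.\n726"},
        };
        for (String[] c : cases) {
            int i = firstMismatch(c[0], c[1]);
            if (i < 0) {
                System.out.println("no mismatch");
            } else if (i >= c[0].length() || i >= c[1].length()) {
                System.out.printf("lengths differ from index %d%n", i);
            } else {
                System.out.printf("index %d: expected U+%04X, got U+%04X%n",
                        i, (int) c[0].charAt(i), (int) c[1].charAt(i));
            }
        }
    }
}
```

Printing code points rather than glyphs helps distinguish a genuine character substitution from a look-alike character in the extracted text.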



--
This message was sent by Atlassian Jira
(v8.20.10#820010)