You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Darya Arbuzova (JIRA)" <ji...@apache.org> on 2014/11/18 09:53:34 UTC

[jira] [Updated] (TIKA-1481) TikaJAXRS get metadata calls give different results

     [ https://issues.apache.org/jira/browse/TIKA-1481?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Darya Arbuzova updated TIKA-1481:
---------------------------------
    Description: 
Hello!

I'm trying to use Tika in server mode.
I downloaded tika-server-1.6.jar from http://mirror.vorboss.net/apache/tika/.
I have tried to get file metadata in 2 different ways (as explained here: http://wiki.apache.org/tika/TikaJAXRS):
{{> curl -T sample.csv http://localhost:9998/meta --header "Content-Type: text/csv"}}
{{"Content-Encoding","windows-1252"}}
{{"Content-Type","text/plain; charset=windows-1252"}}
and
{{> curl -X PUT -d @sample.csv http://localhost:9998/meta --header "Content-Type: text/csv"}}
{{"Content-Encoding","ISO-8859-1"}}
{{"Content-Type","text/plain; charset=ISO-8859-1"}}
How come they give different results in encoding if I call the same {{http://localhost:9998/meta}}?
What could the other differences appear and which is the preferable way to get metadata?

Many thanks!

Best regards,
Darya Arbuzova

  was:
Hello!

I'm trying to use Tika in server mode.
I downloaded tika-server-1.6.jar from http://mirror.vorboss.net/apache/tika/.
I have tried to get file metadata in 2 different ways (as explained here: http://wiki.apache.org/tika/TikaJAXRS):
{{> curl -T sample.csv http://localhost:9998/meta --header "Content-Type: text/csv"
"Content-Encoding","windows-1252"
"Content-Type","text/plain; charset=windows-1252"}}
and
{{> curl -X PUT -d @sample.csv http://localhost:9998/meta --header "Content-Type: text/csv"
"Content-Encoding","ISO-8859-1"
"Content-Type","text/plain; charset=ISO-8859-1"}}
How come they give different results in encoding if I call the same {{http://localhost:9998/meta}}?
What could the other differences appear and which is the preferable way to get metadata?

Many thanks!

Best regards,
Darya Arbuzova


> TikaJAXRS get metadata calls give different results
> ---------------------------------------------------
>
>                 Key: TIKA-1481
>                 URL: https://issues.apache.org/jira/browse/TIKA-1481
>             Project: Tika
>          Issue Type: Bug
>          Components: server
>    Affects Versions: 1.6
>         Environment: Windows 8, JDK 1.8
>            Reporter: Darya Arbuzova
>            Priority: Minor
>
> Hello!
> I'm trying to use Tika in server mode.
> I downloaded tika-server-1.6.jar from http://mirror.vorboss.net/apache/tika/.
> I have tried to get file metadata in 2 different ways (as explained here: http://wiki.apache.org/tika/TikaJAXRS):
> {{> curl -T sample.csv http://localhost:9998/meta --header "Content-Type: text/csv"}}
> {{"Content-Encoding","windows-1252"}}
> {{"Content-Type","text/plain; charset=windows-1252"}}
> and
> {{> curl -X PUT -d @sample.csv http://localhost:9998/meta --header "Content-Type: text/csv"}}
> {{"Content-Encoding","ISO-8859-1"}}
> {{"Content-Type","text/plain; charset=ISO-8859-1"}}
> How come they give different results in encoding if I call the same {{http://localhost:9998/meta}}?
> What could the other differences appear and which is the preferable way to get metadata?
> Many thanks!
> Best regards,
> Darya Arbuzova



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)