Posted to user@tika.apache.org by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov> on 2009/12/18 07:02:59 UTC

Re: Tika jar without dependencies

...CC'ing to tika-user@, because this is probably more a question for the
user list...

Hi Kumar,

I think what you want is the tika-core jar. It contains only the core Tika
classes, with no dependencies and no parsing or app code. Give that a try. You
can find that jar in your local Maven (m2) repository, likely under:
$HOME/.m2/repository/org/apache/tika

HTH,
Chris
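
To check which Tika jars are actually present in the local repository, a
minimal Java sketch might look like the following. It assumes the default
repository location mentioned above; the class name FindTikaJars and the
helper findJars are made up for illustration and are not part of Tika or
Maven.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.Collections;
import java.util.List;
import java.util.stream.Collectors;
import java.util.stream.Stream;

public class FindTikaJars {

    /** Returns every *.jar file found under the given directory, sorted by path. */
    public static List<Path> findJars(Path root) throws IOException {
        if (!Files.isDirectory(root)) {
            // Nothing installed here (or the repository lives elsewhere).
            return Collections.emptyList();
        }
        try (Stream<Path> paths = Files.walk(root)) {
            return paths
                    .filter(Files::isRegularFile)
                    .filter(p -> p.getFileName().toString().endsWith(".jar"))
                    .sorted()
                    .collect(Collectors.toList());
        }
    }

    public static void main(String[] args) throws IOException {
        // Assumption: the local repository is in its default place under the
        // user's home directory, i.e. $HOME/.m2/repository.
        Path tikaDir = Paths.get(System.getProperty("user.home"),
                ".m2", "repository", "org", "apache", "tika");
        for (Path jar : findJars(tikaDir)) {
            System.out.println(jar);
        }
    }
}
```

If the repository has been relocated through Maven settings, the directory
passed to findJars would need to change accordingly.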



On 12/17/09 9:58 PM, "Jana, Kumar Raja" <kj...@ptc.com> wrote:

> Hi,
> 
> Is it possible to add one more target in Tika 0.6 build scripts to
> generate a Tika-classes-only jar along with the tika-standalone.jar
> (which contains the Tika classes + all the dependent jars)?
> 
> We already use most of the dependent jars in our application, so adding
> the Tika standalone jar to the classpath duplicates code that is already
> present. This is not a major concern in most cases, but sometimes the
> dependency classes bundled in the Tika standalone jar override those in
> our patched jars (we use patched versions of log4j and a few other
> jars).
> 
> 
> 
> I tried changing the scripts in the tika-app folder and built the Tika 0.5
> source to achieve this. The only change is to remove or comment out the
> <Embed-Dependency> and <Embed-Transitive> tags (lines 55 and 56) in the
> tika-app\pom.xml file. I would love to see this integrated into the new
> release.
> 
> 
> 
> Thanks,
> 
> Kumar
> 
> 
> 
> 


++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Senior Computer Scientist
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 171-266B, Mailstop: 171-246
Email: Chris.Mattmann@jpl.nasa.gov
WWW:   http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Adjunct Assistant Professor, Computer Science Department University of
Southern California, Los Angeles, CA 90089 USA
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++



Re: Tika jar without dependencies

Posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov>.
Hi Kumar,

tika-core only includes the base Tika classes, which is what I thought you
were asking for. Are you looking for a jar that includes tika-parsers and
tika-app along with tika-core?

Cheers,
Chris



On 12/17/09 10:19 PM, "Jana, Kumar Raja" <kj...@ptc.com> wrote:

> Hi Chris,
> The tika-core jar does not contain the classes in tika-parsers.jar or those
> in the tika-app/src folder. Please correct me if I am looking at the wrong
> folder.
> I usually run "mvn dependency:copy-dependencies" after building Tika on my
> machine. In the target/dependency folder, I see that the tika-core and
> tika-parsers jars are dependencies of tika-app.
> Also, tika-app includes tika-cli and tika-gui, which I find very useful.
> 
> Thanks,
> Kumar
> 





RE: Tika jar without dependencies

Posted by "Jana, Kumar Raja" <kj...@ptc.com>.
Hi Chris,
The tika-core jar does not contain the classes in tika-parsers.jar or those
in the tika-app/src folder. Please correct me if I am looking at the wrong
folder.
I usually run "mvn dependency:copy-dependencies" after building Tika on my
machine. In the target/dependency folder, I see that the tika-core and
tika-parsers jars are dependencies of tika-app.
Also, tika-app includes tika-cli and tika-gui, which I find very useful.

Thanks,
Kumar
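
The classpath concern raised in this thread (classes bundled in the
standalone jar shadowing those in patched jars) can be checked with a small
tool. The following is only a sketch under stated assumptions: the class
DuplicateClassFinder and its methods are hypothetical names, not part of
Tika, and it simply reports .class entries that appear in more than one of
the jars given on the command line.

```java
import java.io.IOException;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;
import java.util.jar.JarFile;
import java.util.stream.Collectors;

public class DuplicateClassFinder {

    /** Lists the names of all .class entries in a jar. */
    public static List<String> classEntries(Path jar) throws IOException {
        try (JarFile jf = new JarFile(jar.toFile())) {
            return jf.stream()
                    .map(e -> e.getName())
                    .filter(n -> n.endsWith(".class"))
                    .collect(Collectors.toList());
        }
    }

    /** Maps each class entry found in more than one jar to the jars containing it. */
    public static Map<String, List<String>> findDuplicates(
            Map<String, List<String>> entriesByJar) {
        Map<String, List<String>> jarsByClass = new TreeMap<>();
        for (Map.Entry<String, List<String>> e : entriesByJar.entrySet()) {
            for (String cls : e.getValue()) {
                jarsByClass.computeIfAbsent(cls, k -> new ArrayList<>()).add(e.getKey());
            }
        }
        // Keep only the classes that occur in at least two jars.
        jarsByClass.values().removeIf(jars -> jars.size() < 2);
        return jarsByClass;
    }

    public static void main(String[] args) throws IOException {
        // Each argument is a path to a jar, in classpath order.
        Map<String, List<String>> entriesByJar = new LinkedHashMap<>();
        for (String arg : args) {
            entriesByJar.put(arg, classEntries(Paths.get(arg)));
        }
        findDuplicates(entriesByJar)
                .forEach((cls, jars) -> System.out.println(cls + " -> " + jars));
    }
}
```

One way to use it would be to pass the standalone jar together with the jars
from the target/dependency folder produced by "mvn
dependency:copy-dependencies"; any class listed in the output is present in
more than one jar, so which copy gets loaded depends on classpath order.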


