You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@nutch.apache.org by A Laxmi <a....@gmail.com> on 2013/08/01 15:02:31 UTC

Re: Nutch 1.6 - Parse Meta-tags plugin question

Can Nutch parse meta-tag data from *a URL of a PDF file*? Eg. "
www.domain.com/abc/xyz.pdf"


On Wed, Jul 31, 2013 at 11:01 AM, A Laxmi <a....@gmail.com> wrote:

> Hello,
>
> Can Nutch parse-metatags/index-metatags plugin parse meta-tag data from *a
> URL of a PDF file*?
>
>
> P.S: PDF file URL in this case has the metatag information such as
> keywords, description.
>
> Thanks for your help!
>

Re: Nutch 1.6 - Parse Meta-tags plugin question

Posted by feng lu <am...@gmail.com>.
yes, it can parse meta-tag data from a url of pdf file. you can use this
command to check

bin/nutch plugin


On Thu, Aug 1, 2013 at 9:02 PM, A Laxmi <a....@gmail.com> wrote:

> Can Nutch parse meta-tag data from *a URL of a PDF file*? Eg. "
> www.domain.com/abc/xyz.pdf"
>
>
> On Wed, Jul 31, 2013 at 11:01 AM, A Laxmi <a....@gmail.com> wrote:
>
> > Hello,
> >
> > Can Nutch parse-metatags/index-metatags plugin parse meta-tag data from
> *a
> > URL of a PDF file*?
> >
> >
> > P.S: PDF file URL in this case has the metatag information such as
> > keywords, description.
> >
> > Thanks for your help!
> >
>



-- 
Don't Grow Old, Grow Up... :-)