Posted to user@nutch.apache.org by Michael Lissner <ml...@michaeljaylissner.com> on 2012/01/22 01:01:26 UTC
Support for x-robots-tag
Hi,
I'm doing some research on what technologies various crawlers support
for crawl exclusion. Without installing and figuring out Nutch, I can't
figure out if it supports the x-robots-tag HTTP header?
Does anybody know?
Thanks,
Mike
Re: Support for x-robots-tag
Posted by Michael Lissner <ml...@michaeljaylissner.com>.
OK, issue created: https://issues.apache.org/jira/browse/NUTCH-1257
Thanks again.
Mike
On 01/24/2012 11:44 PM, Markus Jelsma wrote:
> Not that I'm aware of. You can create an issue in Jira if you like and provide
> a patch if possible.
>
>> Thanks for the follow up. Very good to know. Seems like this would be
>> pretty easy to add, given that the robots meta tag is supported.
>>
>> Are there any plans to do so?
>>
>> Mike
>>
>> On Mon 23 Jan 2012 02:27:02 AM PST, Markus Jelsma wrote:
>>> There is currently no built-in support for the x-robots-tag header.
>>>
>>> On Sunday 22 January 2012 01:01:26 Michael Lissner wrote:
>>>> Hi,
>>>>
>>>> I'm doing some research on what technologies various crawlers support
>>>> for crawl exclusion. Without installing and figuring out Nutch, I can't
>>>> figure out if it supports the x-robots-tag HTTP header?
>>>>
>>>> Does anybody know?
>>>>
>>>> Thanks,
>>>>
>>>> Mike
Re: Support for x-robots-tag
Posted by Markus Jelsma <ma...@openindex.io>.
Not that I'm aware of. You can create an issue in Jira if you like and provide
a patch if possible.
> Thanks for the follow up. Very good to know. Seems like this would be
> pretty easy to add, given that the robots meta tag is supported.
>
> Are there any plans to do so?
>
> Mike
>
> On Mon 23 Jan 2012 02:27:02 AM PST, Markus Jelsma wrote:
> > There is currently no built-in support for the x-robots-tag header.
> >
> > On Sunday 22 January 2012 01:01:26 Michael Lissner wrote:
> >> Hi,
> >>
> >> I'm doing some research on what technologies various crawlers support
> >> for crawl exclusion. Without installing and figuring out Nutch, I can't
> >> figure out if it supports the x-robots-tag HTTP header?
> >>
> >> Does anybody know?
> >>
> >> Thanks,
> >>
> >> Mike
Re: Support for x-robots-tag
Posted by Michael Lissner <ml...@michaeljaylissner.com>.
Thanks for the follow up. Very good to know. Seems like this would be
pretty easy to add, given that the robots meta tag is supported.
Are there any plans to do so?
Mike
On Mon 23 Jan 2012 02:27:02 AM PST, Markus Jelsma wrote:
> There is currently no built-in support for the x-robots-tag header.
>
> On Sunday 22 January 2012 01:01:26 Michael Lissner wrote:
>> Hi,
>>
>> I'm doing some research on what technologies various crawlers support
>> for crawl exclusion. Without installing and figuring out Nutch, I can't
>> figure out if it supports the x-robots-tag HTTP header?
>>
>> Does anybody know?
>>
>> Thanks,
>>
>> Mike
>
Re: Support for x-robots-tag
Posted by Markus Jelsma <ma...@openindex.io>.
There is currently no built-in support for the x-robots-tag header.
On Sunday 22 January 2012 01:01:26 Michael Lissner wrote:
> Hi,
>
> I'm doing some research on what technologies various crawlers support
> for crawl exclusion. Without installing and figuring out Nutch, I can't
> figure out if it supports the x-robots-tag HTTP header?
>
> Does anybody know?
>
> Thanks,
>
> Mike
--
Markus Jelsma - CTO - Openindex
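
For anyone picking up NUTCH-1257, here is a rough idea of what honoring the header could involve. This is only a minimal sketch, not Nutch code: the class name, method names, and the agent name "nutch" are hypothetical, and it assumes the common header forms "X-Robots-Tag: noindex, nofollow" and "X-Robots-Tag: somebot: noindex". Date-valued directives such as unavailable_after are deliberately left out.

```java
import java.util.Arrays;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Locale;
import java.util.Set;

// Hypothetical helper, not part of Nutch.
final class XRobotsTagSketch {

    // Collects the directives from all X-Robots-Tag header values that
    // apply to the given crawler name (compared case-insensitively).
    static Set<String> directivesFor(List<String> headerValues, String agentName) {
        Set<String> directives = new LinkedHashSet<>();
        String agent = agentName.trim().toLowerCase(Locale.ROOT);
        for (String value : headerValues) {
            String rules = value.trim().toLowerCase(Locale.ROOT);
            int colon = rules.indexOf(':');
            if (colon >= 0) {
                String prefix = rules.substring(0, colon).trim();
                // Assumption: a comma-free token before the first colon names
                // a user agent. This also skips "unavailable_after: ...".
                if (!prefix.contains(",")) {
                    if (!prefix.equals(agent)) {
                        continue; // rule is addressed to another crawler
                    }
                    rules = rules.substring(colon + 1);
                }
            }
            for (String token : rules.split(",")) {
                String directive = token.trim();
                if (!directive.isEmpty()) {
                    directives.add(directive);
                }
            }
        }
        return directives;
    }

    // "none" is treated as shorthand for "noindex, nofollow".
    static boolean mayIndex(Set<String> directives) {
        return !directives.contains("noindex") && !directives.contains("none");
    }

    static boolean mayFollow(Set<String> directives) {
        return !directives.contains("nofollow") && !directives.contains("none");
    }

    public static void main(String[] args) {
        List<String> headers = Arrays.asList("noindex", "otherbot: nofollow");
        Set<String> directives = directivesFor(headers, "nutch");
        System.out.println("index allowed:  " + mayIndex(directives));
        System.out.println("follow allowed: " + mayFollow(directives));
    }
}
```

A crawler would call something like directivesFor() with every X-Robots-Tag value on the HTTP response, then apply the result the same way it already applies the robots meta tag.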