Posted to solr-user@lucene.apache.org by Adam Estrada <es...@gmail.com> on 2010/12/13 00:49:53 UTC

[pubDate] is not converting correctly

All,

I am having some difficulties parsing the pubDate field that is part of the
RSS spec (I believe). I get the warning that states, "Dec 12, 2010 6:45:26
PM org.apache.solr.handler.dataimport.DateFormatTransformer
 transformRow
WARNING: Could not parse a Date field
java.text.ParseException: Unparseable date: "Thu, 30 Jul 2009 14:41:43
+0000"
        at java.text.DateFormat.parse(Unknown Source)"

Does anyone know how to fix this? I would eventually like to do a date query
but without the ability to properly parse them I don't know if it's going to
work.

Thanks,
Adam
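
The value in the warning uses the RFC 822 style layout that RSS feeds use
for pubDate. Below is a minimal Java sketch of one way such a value could be
parsed and then rendered in the UTC ISO 8601 form that Solr date fields
accept. It assumes an English locale and the pattern
"EEE, dd MMM yyyy HH:mm:ss Z"; the class and method names are hypothetical
and not part of Solr, and the pattern configured in the failing
data-config.xml is not shown in this thread, so this is not a diagnosis.

```java
import java.text.ParseException;
import java.text.SimpleDateFormat;
import java.util.Date;
import java.util.Locale;
import java.util.TimeZone;

// Hypothetical helper for illustration only; not a Solr class.
final class PubDateParseSketch {

    // Assumed pattern for values such as "Thu, 30 Jul 2009 14:41:43 +0000".
    static final String RFC822_PATTERN = "EEE, dd MMM yyyy HH:mm:ss Z";

    // Parses an RFC 822 style date. An explicit English locale is used so
    // that "Thu" and "Jul" are recognised regardless of the JVM default.
    static Date parsePubDate(String value) {
        SimpleDateFormat parser = new SimpleDateFormat(RFC822_PATTERN, Locale.ENGLISH);
        try {
            return parser.parse(value);
        } catch (ParseException e) {
            throw new IllegalArgumentException("Unparseable pubDate: " + value, e);
        }
    }

    // Formats a Date as UTC in the yyyy-MM-dd'T'HH:mm:ss'Z' form.
    static String toSolrDate(Date date) {
        SimpleDateFormat out = new SimpleDateFormat("yyyy-MM-dd'T'HH:mm:ss'Z'", Locale.ROOT);
        out.setTimeZone(TimeZone.getTimeZone("UTC"));
        return out.format(date);
    }

    public static void main(String[] args) {
        Date parsed = parsePubDate("Thu, 30 Jul 2009 14:41:43 +0000");
        System.out.println(toSolrDate(parsed)); // prints 2009-07-30T14:41:43Z
    }
}
```

If the value parses here but not during import, the pattern or locale used
by the import configuration would be the next thing to compare.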

Re: [pubDate] is not converting correctly

Posted by Adam Estrada <es...@gmail.com>.
My first submission ;-)

https://issues.apache.org/jira/browse/SOLR-2286

Adam

On Mon, Dec 13, 2010 at 5:14 PM, Lance Norskog <go...@gmail.com> wrote:

> Create an account at
> https://issues.apache.org/jira/secure/Dashboard.jspa and do 'Create
> New Issue' for the Solr project.

Re: [pubDate] is not converting correctly

Posted by Lance Norskog <go...@gmail.com>.
Create an account at
https://issues.apache.org/jira/secure/Dashboard.jspa and do 'Create
New Issue' for the Solr project.

On Mon, Dec 13, 2010 at 2:13 PM, Lance Norskog <go...@gmail.com> wrote:
> Please file a JIRA requesting this.



-- 
Lance Norskog
goksron@gmail.com

Re: [pubDate] is not converting correctly

Posted by Lance Norskog <go...@gmail.com>.
Please file a JIRA requesting this.

On Mon, Dec 13, 2010 at 6:29 AM, Adam Estrada <es...@gmail.com> wrote:
> +1. If I knew enough about how to do this in Java I would, but I do not,
> so... What is the correct way to add or suggest enhancements to Solr
> core?
>
> Adam



-- 
Lance Norskog
goksron@gmail.com

Re: [pubDate] is not converting correctly

Posted by Adam Estrada <es...@gmail.com>.
+1. If I knew enough about how to do this in Java I would, but I do not,
so... What is the correct way to add or suggest enhancements to Solr
core?

Adam

On Sun, Dec 12, 2010 at 11:38 PM, Lance Norskog <go...@gmail.com> wrote:

> Nice find!  This is Apache 2.0, copyright SUN.
>
> O Great Apache Elders: Is it kosher to add this to the Solr
> distribution? It's not in the JDK and is also com.sun.*

Re: [pubDate] is not converting correctly

Posted by Lance Norskog <go...@gmail.com>.
Nice find!  This is Apache 2.0, copyright SUN.

O Great Apache Elders: Is it kosher to add this to the Solr
distribution? It's not in the JDK and is also com.sun.*

On Sun, Dec 12, 2010 at 5:33 PM, Adam Estrada
<es...@gmail.com> wrote:
> Thanks for the feedback! There are quite a few formats that can be used. I
> am experiencing at least 5 of them. Would something like this work? Note
> that there are 2 different formats separated by a comma.
>
> <field column="pubdate" xpath="/rss/channel/item/pubDate"
> dateTimeFormat="EEE, dd MMM yyyy HH:mm:ss zzz, yyyy-MM-dd'T'HH:mm:ss'Z'" />
>
> I don't suppose it will, because there is already a comma in the first
> format. I guess I am really looking for an all-purpose date-time parser, but
> even if I have that, would I still be able to query *all* fields in the
> index?
>
> Good article:
> http://www.java2s.com/Open-Source/Java-Document/RSS-RDF/Rome/com/sun/syndication/io/impl/DateParser.java.htm
>
> Adam



-- 
Lance Norskog
goksron@gmail.com

Re: [pubDate] is not converting correctly

Posted by Adam Estrada <es...@gmail.com>.
Thanks for the feedback! There are quite a few formats that can be used. I
am experiencing at least 5 of them. Would something like this work? Note
that there are 2 different formats separated by a comma.

<field column="pubdate" xpath="/rss/channel/item/pubDate"
dateTimeFormat="EEE, dd MMM yyyy HH:mm:ss zzz, yyyy-MM-dd'T'HH:mm:ss'Z'" />

I don't suppose it will, because there is already a comma in the first
format. I guess I am really looking for an all-purpose date-time parser, but
even if I have that, would I still be able to query *all* fields in the
index?

Good article:
http://www.java2s.com/Open-Source/Java-Document/RSS-RDF/Rome/com/sun/syndication/io/impl/DateParser.java.htm

Adam
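
The idea of an all-purpose parser can be sketched in plain Java as a list of
patterns tried in order until one succeeds. This is only an illustration of
that fallback approach, assuming the two formats from the field definition
above; the class name, method name, and pattern list are hypothetical, and
nothing here is an existing DataImportHandler feature.

```java
import java.text.ParseException;
import java.text.SimpleDateFormat;
import java.util.Date;
import java.util.Locale;
import java.util.TimeZone;

// Hypothetical helper for illustration only; not a Solr or Rome class.
final class MultiFormatDateSketch {

    // Assumed candidate patterns, tried in this order.
    static final String[] PATTERNS = {
        "EEE, dd MMM yyyy HH:mm:ss Z",   // e.g. Thu, 30 Jul 2009 14:41:43 +0000
        "yyyy-MM-dd'T'HH:mm:ss'Z'"       // e.g. 2009-07-30T14:41:43Z
    };

    // Returns the first successful parse, or null if no pattern matches.
    static Date parseAny(String value) {
        for (String pattern : PATTERNS) {
            SimpleDateFormat parser = new SimpleDateFormat(pattern, Locale.ENGLISH);
            // Applies only when the pattern has no zone field; the literal
            // 'Z' in the second pattern is treated as UTC by this choice.
            parser.setTimeZone(TimeZone.getTimeZone("UTC"));
            try {
                return parser.parse(value);
            } catch (ParseException e) {
                // This pattern did not match; fall through to the next one.
            }
        }
        return null;
    }

    public static void main(String[] args) {
        Date a = parseAny("Thu, 30 Jul 2009 14:41:43 +0000");
        Date b = parseAny("2009-07-30T14:41:43Z");
        System.out.println(a.equals(b));           // same instant in both notations
        System.out.println(parseAny("yesterday")); // no pattern matches
    }
}
```

Keeping the patterns in an array rather than one comma-separated string
avoids the ambiguity with the comma inside the first format.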

On Sun, Dec 12, 2010 at 7:31 PM, Koji Sekiguchi <ko...@r.email.ne.jp> wrote:

> Adam,
>
> What does your data-config.xml look like for that field?
> Have you looked at rss-data-config.xml file
> under example/example-DIH/solr/rss/conf directory?
>
> Koji
> --
> http://www.rondhuit.com/en/
>

Re: [pubDate] is not converting correctly

Posted by Koji Sekiguchi <ko...@r.email.ne.jp>.
(10/12/13 8:49), Adam Estrada wrote:
> All,
>
> I am having some difficulties parsing the pubDate field that is part of the
> RSS spec (I believe). I get the warning that states, "Dec 12, 2010 6:45:26
> PM org.apache.solr.handler.dataimport.DateFormatTransformer
>   transformRow
> WARNING: Could not parse a Date field
> java.text.ParseException: Unparseable date: "Thu, 30 Jul 2009 14:41:43
> +0000"
>          at java.text.DateFormat.parse(Unknown Source)"
>
> Does anyone know how to fix this? I would eventually like to do a date query
> but without the ability to properly parse them I don't know if it's going to
> work.
>
> Thanks,
> Adam

Adam,

What does your data-config.xml look like for that field?
Have you looked at rss-data-config.xml file
under example/example-DIH/solr/rss/conf directory?

Koji
-- 
http://www.rondhuit.com/en/