You are viewing a plain text version of this content. The canonical link for it is here.
Posted to solr-user@lucene.apache.org by "David Smiley @MITRE.org" <DS...@mitre.org> on 2008/10/16 14:44:43 UTC

dataimport, both splitBy and dateTimeFormat

I'm trying out the dataimport capability.  I have a column that is a series
of dates separated by spaces like so:
"1996-00-00 1996-04-00"
And I'm trying to import it like so:
<field column="r_event_date" splitBy=" " dateTimeFormat="yyyy-MM-dd" />

However this fails and the stack trace suggests it is first trying to apply
the dateTimeFormat before splitBy.  I think this is a bug... dataimport
should apply DateFormatTransformer and NumberFormatTransformer last.

~ David Smiley
-- 
View this message in context: http://www.nabble.com/dataimport%2C-both-splitBy-and-dateTimeFormat-tp20013006p20013006.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: dataimport, both splitBy and dateTimeFormat

Posted by Noble Paul നോബിള്‍ नोब्ळ् <no...@gmail.com>.
Thanks David,
I have updated the wiki documentation
http://wiki.apache.org/solr/DataImportHandler#transformer

The default transformers do not have any special privilege it is like
any normal user provided transformer.We just identified some commonly
found usecases and added transformers for that.

 Applying a transformer is not very 'cheap' it has to do extra checks
to know whether to apply or not.

On Fri, Oct 17, 2008 at 12:26 AM, David Smiley @MITRE.org
<DS...@mitre.org> wrote:
>
> The wiki didn't mention I can specify multiple transformers.  BTW, it's
> "transformer" (singular), not "transformers".  I did mean both NFT and DFT
> because I was speaking of the general case, not just mine in particular.  I
> thought that the built-in transformers were always in-effect and so I
> expected NFT,DFT to occur last.  Sorry if I wasn't clear.
>
> Thanks for your help; it worked.
>
> ~ David
>
>
> Shalin Shekhar Mangar wrote:
>>
>> Hi David,
>>
>> I think you meant RegexTransformer instead of NumberFormatTransformer.
>> Anyhow, the order in which the transformers are applied is the same as the
>> order in which you specify them.
>>
>> So make sure your entity has
>> transformers="RegexTransformer,DateFormatTransformer".
>>
>> On Thu, Oct 16, 2008 at 6:14 PM, David Smiley @MITRE.org
>> <DS...@mitre.org>wrote:
>>
>>>
>>> I'm trying out the dataimport capability.  I have a column that is a
>>> series
>>> of dates separated by spaces like so:
>>> "1996-00-00 1996-04-00"
>>> And I'm trying to import it like so:
>>> <field column="r_event_date" splitBy=" " dateTimeFormat="yyyy-MM-dd" />
>>>
>>> However this fails and the stack trace suggests it is first trying to
>>> apply
>>> the dateTimeFormat before splitBy.  I think this is a bug... dataimport
>>> should apply DateFormatTransformer and NumberFormatTransformer last.
>>>
>>> ~ David Smiley
>>> --
>>> View this message in context:
>>> http://www.nabble.com/dataimport%2C-both-splitBy-and-dateTimeFormat-tp20013006p20013006.html
>>> Sent from the Solr - User mailing list archive at Nabble.com.
>>>
>>>
>>
>>
>> --
>> Regards,
>> Shalin Shekhar Mangar.
>>
>>
> --
> View this message in context: http://www.nabble.com/dataimport%2C-both-splitBy-and-dateTimeFormat-tp20013006p20016178.html
> Sent from the Solr - User mailing list archive at Nabble.com.
>
>



-- 
--Noble Paul

Re: dataimport, both splitBy and dateTimeFormat

Posted by "David Smiley @MITRE.org" <DS...@mitre.org>.
The wiki didn't mention I can specify multiple transformers.  BTW, it's
"transformer" (singular), not "transformers".  I did mean both NFT and DFT
because I was speaking of the general case, not just mine in particular.  I
thought that the built-in transformers were always in-effect and so I
expected NFT,DFT to occur last.  Sorry if I wasn't clear.

Thanks for your help; it worked.

~ David


Shalin Shekhar Mangar wrote:
> 
> Hi David,
> 
> I think you meant RegexTransformer instead of NumberFormatTransformer.
> Anyhow, the order in which the transformers are applied is the same as the
> order in which you specify them.
> 
> So make sure your entity has
> transformers="RegexTransformer,DateFormatTransformer".
> 
> On Thu, Oct 16, 2008 at 6:14 PM, David Smiley @MITRE.org
> <DS...@mitre.org>wrote:
> 
>>
>> I'm trying out the dataimport capability.  I have a column that is a
>> series
>> of dates separated by spaces like so:
>> "1996-00-00 1996-04-00"
>> And I'm trying to import it like so:
>> <field column="r_event_date" splitBy=" " dateTimeFormat="yyyy-MM-dd" />
>>
>> However this fails and the stack trace suggests it is first trying to
>> apply
>> the dateTimeFormat before splitBy.  I think this is a bug... dataimport
>> should apply DateFormatTransformer and NumberFormatTransformer last.
>>
>> ~ David Smiley
>> --
>> View this message in context:
>> http://www.nabble.com/dataimport%2C-both-splitBy-and-dateTimeFormat-tp20013006p20013006.html
>> Sent from the Solr - User mailing list archive at Nabble.com.
>>
>>
> 
> 
> -- 
> Regards,
> Shalin Shekhar Mangar.
> 
> 
-- 
View this message in context: http://www.nabble.com/dataimport%2C-both-splitBy-and-dateTimeFormat-tp20013006p20016178.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: dataimport, both splitBy and dateTimeFormat

Posted by Shalin Shekhar Mangar <sh...@gmail.com>.
Hi David,

I think you meant RegexTransformer instead of NumberFormatTransformer.
Anyhow, the order in which the transformers are applied is the same as the
order in which you specify them.

So make sure your entity has
transformers="RegexTransformer,DateFormatTransformer".

On Thu, Oct 16, 2008 at 6:14 PM, David Smiley @MITRE.org
<DS...@mitre.org>wrote:

>
> I'm trying out the dataimport capability.  I have a column that is a series
> of dates separated by spaces like so:
> "1996-00-00 1996-04-00"
> And I'm trying to import it like so:
> <field column="r_event_date" splitBy=" " dateTimeFormat="yyyy-MM-dd" />
>
> However this fails and the stack trace suggests it is first trying to apply
> the dateTimeFormat before splitBy.  I think this is a bug... dataimport
> should apply DateFormatTransformer and NumberFormatTransformer last.
>
> ~ David Smiley
> --
> View this message in context:
> http://www.nabble.com/dataimport%2C-both-splitBy-and-dateTimeFormat-tp20013006p20013006.html
> Sent from the Solr - User mailing list archive at Nabble.com.
>
>


-- 
Regards,
Shalin Shekhar Mangar.