You are viewing a plain text version of this content. The canonical link for it is here.
Posted to solr-user@lucene.apache.org by karthicrnair <ka...@gmail.com> on 2013/03/29 12:42:36 UTC

Need Help in Patching OPENNLP

Hi All, 

am very new to solr and Java technology. I would wonder if some one can
gimme a way out to patch the OpenNLP platform with Solr.

Am simply blocked out at the initial step, applying patch to Solr 4.2. Any
pointer would be highly appreciated.

Thanks,
Karthic 



--
View this message in context: http://lucene.472066.n3.nabble.com/Need-Help-in-Patching-OPENNLP-tp4052362.html
Sent from the Solr - User mailing list archive at Nabble.com.

OPENNLP current patch compiling problem for 4.x branch

Posted by Patrick Mi <pa...@touchpointgroup.com>.
Hi,

I checked out from here
http://svn.apache.org/repos/asf/lucene/dev/tags/lucene_solr_4_3_0 and
downloaded the latest patch LUCENE-2899-current.patch.

Applied the patch ok but when I did 'ant compile' I got the following error:


==
    [javac]
/home/lucene_solr_4_3_0/lucene/analysis/opennlp/src/java/org/apache/lucene/a
nalysis/opennlp/FilterPayloadsFilter.java:43: error
r: cannot find symbol
    [javac]     super(Version.LUCENE_44, input);
    [javac]                  ^
    [javac]   symbol:   variable LUCENE_44
    [javac]   location: class Version
    [javac] 1 error    
==

Compiled it on trunk without problem.

Is this patch supposed to work for 4.X?

Regards,
Patrick 


Re: Need Help in Patching OPENNLP

Posted by Erick Erickson <er...@gmail.com>.
Gora: Thanks for pitching in, I'm on vacation and only sporadically
looking at the lists.

Karthicrnair:
https is the access to the writeable archive, it's been a long enough
since I set things up that I don't remember if you need committer
credentials or not, so try straight http (without the 's') maybe?

Erick

On Tue, Apr 2, 2013 at 1:01 AM, karthicrnair <ka...@gmail.com> wrote:
> Thanks much !!
>
> Explorer -- Internet Explorer :) Sorry for the miscommunication. Yeah let me
> check it once again.
>
> appreciate all the help :)
>
> krn
>
>
>
> --
> View this message in context: http://lucene.472066.n3.nabble.com/Need-Help-in-Patching-OPENNLP-tp4052362p4053094.html
> Sent from the Solr - User mailing list archive at Nabble.com.

Re: Need Help in Patching OPENNLP

Posted by karthicrnair <ka...@gmail.com>.
Thanks much !!

Explorer -- Internet Explorer :) Sorry for the miscommunication. Yeah let me
check it once again.

appreciate all the help :)

krn



--
View this message in context: http://lucene.472066.n3.nabble.com/Need-Help-in-Patching-OPENNLP-tp4052362p4053094.html
Sent from the Solr - User mailing list archive at Nabble.com.

Re: Need Help in Patching OPENNLP

Posted by Gora Mohanty <go...@mimirtech.com>.
On 2 April 2013 11:17, karthicrnair <ka...@gmail.com> wrote:
> Thanks Gora!!
>
> when I tried with ping command all my request got timed out. am able to
> access the svn through my explorer though.
>
> What could be the issue now? :(

Hard to tell. My guess would be that your network is blocking
some things like ICMP. Not sure what Explorer you are referring
to, but if you can access svn.apache.org, "svn co" should work.

Regards,
Gora

Re: Need Help in Patching OPENNLP

Posted by karthicrnair <ka...@gmail.com>.
Thanks Gora!!

when I tried with ping command all my request got timed out. am able to
access the svn through my explorer though.

What could be the issue now? :(

Thanks,
krn



--
View this message in context: http://lucene.472066.n3.nabble.com/Need-Help-in-Patching-OPENNLP-tp4052362p4053092.html
Sent from the Solr - User mailing list archive at Nabble.com.

Re: Need Help in Patching OPENNLP

Posted by Gora Mohanty <go...@mimirtech.com>.
On 2 April 2013 11:08, Gora Mohanty <go...@mimirtech.com> wrote:
> On 2 April 2013 11:00, karthicrnair <ka...@gmail.com> wrote:
>> Hi Erick,
>>
>> thank you so much or the help and support.
>>
>> As you have mentioned I have made the svn set up and while trying to connect
>> using check out option am getting this error;
>>
>> C:\bin>svn co https://svn.apache.org/repos/asf/lucene/dev/
>
> Please read http://wiki.apache.org/solr/HowToContribute#Getting_the_source_code
> carefully. You need to add a branch name to the SVN URL.
> You probably want something like
>   svn co http://svn.apache.org/repos/asf/lucene/dev/trunk

Though "svn co https://svn.apache.org/repos/asf/lucene/dev/"
also works just fine.Are you sure that there is no network issue
at your end? Are you able to ping svn.apache.org?

Regards,
Gora

Re: Need Help in Patching OPENNLP

Posted by Gora Mohanty <go...@mimirtech.com>.
On 2 April 2013 11:00, karthicrnair <ka...@gmail.com> wrote:
> Hi Erick,
>
> thank you so much or the help and support.
>
> As you have mentioned I have made the svn set up and while trying to connect
> using check out option am getting this error;
>
> C:\bin>svn co https://svn.apache.org/repos/asf/lucene/dev/

Please read http://wiki.apache.org/solr/HowToContribute#Getting_the_source_code
carefully. You need to add a branch name to the SVN URL.
You probably want something like
  svn co http://svn.apache.org/repos/asf/lucene/dev/trunk

Regards,
Gora

Re: Need Help in Patching OPENNLP

Posted by karthicrnair <ka...@gmail.com>.
Hi Erick, 

thank you so much or the help and support.

As you have mentioned I have made the svn set up and while trying to connect
using check out option am getting this error;

C:\bin>svn co https://svn.apache.org/repos/asf/lucene/dev/
svn: E175002: Unable to connect to a repository at URL
'https://svn.apache.org/r
epos/asf/lucene/dev'
svn: E175002: OPTIONS of 'https://svn.apache.org/repos/asf/lucene/dev':
could no
t connect to server (https://svn.apache.org)

Is this anything to do with the firewall set up? please advice me on the
further steps.

Thanks,
KRN



--
View this message in context: http://lucene.472066.n3.nabble.com/Need-Help-in-Patching-OPENNLP-tp4052362p4053089.html
Sent from the Solr - User mailing list archive at Nabble.com.

Re: Need Help in Patching OPENNLP

Posted by Erick Erickson <er...@gmail.com>.
Here's the start-up page:
http://wiki.apache.org/solr/HowToContribute

First, just check out the code via svn and build it (see the page
above). That'll tell you if you have all the tools available.
Second, apply the patch to the source. From the root of your source,
'patch -p0 -i <patch name>'
Third, execute "ant example dist" and that should build you source
with the patch in place...

If you get stuck, let us know what problems you are having, specific
errors you're receiving, all that kind of stuff....

Best
Erick

On Fri, Mar 29, 2013 at 7:42 AM, karthicrnair <ka...@gmail.com> wrote:
> Hi All,
>
> am very new to solr and Java technology. I would wonder if some one can
> gimme a way out to patch the OpenNLP platform with Solr.
>
> Am simply blocked out at the initial step, applying patch to Solr 4.2. Any
> pointer would be highly appreciated.
>
> Thanks,
> Karthic
>
>
>
> --
> View this message in context: http://lucene.472066.n3.nabble.com/Need-Help-in-Patching-OPENNLP-tp4052362.html
> Sent from the Solr - User mailing list archive at Nabble.com.

OPENNLP problems

Posted by Patrick Mi <pa...@touchpointgroup.com>.
Hi there,

Checked out branch_4x and applied the latest patch
LUCENE-2899-current.patch however I ran into 2 problems

Followed the wiki page instruction and set up a field with this type aiming
to keep nouns and verbs and do a facet on the field
==
<fieldType name="text_opennlp_nvf" class="solr.TextField"
positionIncrementGap="100">
      <analyzer>
        <tokenizer class="solr.OpenNLPTokenizerFactory"
tokenizerModel="opennlp/en-token.bin"/>
        <filter class="solr.OpenNLPFilterFactory"
posTaggerModel="opennlp/en-pos-maxent.bin"/>
        <filter class="solr.FilterPayloadsFilterFactory"
payloadList="NN,NNS,NNP,NNPS,VB,VBD,VBG,VBN,VBP,VBZ,FW"/>
        <filter class="solr.StripPayloadsFilterFactory"/>
      </analyzer>
    </fieldType>
==

Struggled to get that going until I put the extra parameter
keepPayloads="true" in as below. 
     <filter class="solr.FilterPayloadsFilterFactory" keepPayloads="true"
payloadList="NN,NNS,NNP,NNPS,VB,VBD,VBG,VBN,VBP,VBZ,FW"/>

Question: am I doing the right thing? Is this a mistake on wiki 

Second problem:

Posted the document xml one by one to the solr and the result was what I
expected.

<add>
<doc>
  <field name="id">1</field>
  <field name="text_opennlp_nvf">check in the hotel</field></doc>
</add>

However if I put multiple documents into the same xml file and post it in
one go only the first document gets processed( only 'check' and 'hotel' were
showing in the facet result.) 
 
<add>
<doc>
  <field name="id">1</field>
  <field name="text_opennlp_nvf">check in the hotel</field>
</doc>
<doc>
  <field name="id">2</field>
  <field name="text_opennlp_nvf">removes the payloads</field>
</doc>
<doc>
  <field name="id">3</field>
  <field name="text_opennlp_nvf">retains only nouns and verbs </field>
</doc>
</add>

Same problem when updated the data using csv upload.

Is that a bug or something I did wrong?

Thanks in advance!

Regards,
Patrick



Re: OPENNLP problems

Posted by Lance Norskog <go...@gmail.com>.
I will look at these problems. Thanks for trying it out!

Lance Norskog

On 05/28/2013 10:08 PM, Patrick Mi wrote:
> Hi there,
>
> Checked out branch_4x and applied the latest patch
> LUCENE-2899-current.patch however I ran into 2 problems
>
> Followed the wiki page instruction and set up a field with this type aiming
> to keep nouns and verbs and do a facet on the field
> ==
> <fieldType name="text_opennlp_nvf" class="solr.TextField"
> positionIncrementGap="100">
>        <analyzer>
>          <tokenizer class="solr.OpenNLPTokenizerFactory"
> tokenizerModel="opennlp/en-token.bin"/>
>          <filter class="solr.OpenNLPFilterFactory"
> posTaggerModel="opennlp/en-pos-maxent.bin"/>
>          <filter class="solr.FilterPayloadsFilterFactory"
> payloadList="NN,NNS,NNP,NNPS,VB,VBD,VBG,VBN,VBP,VBZ,FW"/>
>          <filter class="solr.StripPayloadsFilterFactory"/>
>        </analyzer>
>      </fieldType>
> ==
>
> Struggled to get that going until I put the extra parameter
> keepPayloads="true" in as below.
>       <filter class="solr.FilterPayloadsFilterFactory" keepPayloads="true"
> payloadList="NN,NNS,NNP,NNPS,VB,VBD,VBG,VBN,VBP,VBZ,FW"/>
>
> Question: am I doing the right thing? Is this a mistake on wiki
>
> Second problem:
>
> Posted the document xml one by one to the solr and the result was what I
> expected.
>
> <add>
> <doc>
>    <field name="id">1</field>
>    <field name="text_opennlp_nvf">check in the hotel</field></doc>
> </add>
>
> However if I put multiple documents into the same xml file and post it in
> one go only the first document gets processed( only 'check' and 'hotel' were
> showing in the facet result.)
>   
> <add>
> <doc>
>    <field name="id">1</field>
>    <field name="text_opennlp_nvf">check in the hotel</field>
> </doc>
> <doc>
>    <field name="id">2</field>
>    <field name="text_opennlp_nvf">removes the payloads</field>
> </doc>
> <doc>
>    <field name="id">3</field>
>    <field name="text_opennlp_nvf">retains only nouns and verbs </field>
> </doc>
> </add>
>
> Same problem when updated the data using csv upload.
>
> Is that a bug or something I did wrong?
>
> Thanks in advance!
>
> Regards,
> Patrick
>
>


RE: OPENNLP problems

Posted by Patrick Mi <pa...@touchpointgroup.com>.
Hi Lance,

I updated the src from 4.x and applied the latest patch LUCENE-2899-x.patch
uploaded on 6th June but still had the same problem.


Regards,
Patrick

-----Original Message-----
From: Lance Norskog [mailto:goksron@gmail.com] 
Sent: Thursday, 6 June 2013 5:16 p.m.
To: solr-user@lucene.apache.org
Subject: Re: OPENNLP problems

Patrick-
I found the problem with multiple documents. The problem was that the 
API for the life cycle of a Tokenizer changed, and I only noticed part 
of the change. You can now upload multiple documents in one post, and 
the OpenNLPTokenizer will process each document.

You're right, the example on the wiki is wrong. The FilterPayloadsFilter 
default is to remove the given payloads, and needs keepPayloads="true" 
to retain them.

The fixed patch is up as LUCENE-2899-x.patch. Again, thanks for trying it.

Lance

https://issues.apache.org/jira/browse/LUCENE-2899

On 05/28/2013 10:08 PM, Patrick Mi wrote:
> Hi there,
>
> Checked out branch_4x and applied the latest patch
> LUCENE-2899-current.patch however I ran into 2 problems
>
> Followed the wiki page instruction and set up a field with this type
aiming
> to keep nouns and verbs and do a facet on the field
> ==
> <fieldType name="text_opennlp_nvf" class="solr.TextField"
> positionIncrementGap="100">
>        <analyzer>
>          <tokenizer class="solr.OpenNLPTokenizerFactory"
> tokenizerModel="opennlp/en-token.bin"/>
>          <filter class="solr.OpenNLPFilterFactory"
> posTaggerModel="opennlp/en-pos-maxent.bin"/>
>          <filter class="solr.FilterPayloadsFilterFactory"
> payloadList="NN,NNS,NNP,NNPS,VB,VBD,VBG,VBN,VBP,VBZ,FW"/>
>          <filter class="solr.StripPayloadsFilterFactory"/>
>        </analyzer>
>      </fieldType>
> ==
>
> Struggled to get that going until I put the extra parameter
> keepPayloads="true" in as below.
>       <filter class="solr.FilterPayloadsFilterFactory" keepPayloads="true"
> payloadList="NN,NNS,NNP,NNPS,VB,VBD,VBG,VBN,VBP,VBZ,FW"/>
>
> Question: am I doing the right thing? Is this a mistake on wiki
>
> Second problem:
>
> Posted the document xml one by one to the solr and the result was what I
> expected.
>
> <add>
> <doc>
>    <field name="id">1</field>
>    <field name="text_opennlp_nvf">check in the hotel</field></doc>
> </add>
>
> However if I put multiple documents into the same xml file and post it in
> one go only the first document gets processed( only 'check' and 'hotel'
were
> showing in the facet result.)
>   
> <add>
> <doc>
>    <field name="id">1</field>
>    <field name="text_opennlp_nvf">check in the hotel</field>
> </doc>
> <doc>
>    <field name="id">2</field>
>    <field name="text_opennlp_nvf">removes the payloads</field>
> </doc>
> <doc>
>    <field name="id">3</field>
>    <field name="text_opennlp_nvf">retains only nouns and verbs </field>
> </doc>
> </add>
>
> Same problem when updated the data using csv upload.
>
> Is that a bug or something I did wrong?
>
> Thanks in advance!
>
> Regards,
> Patrick
>
>



Re: OPENNLP problems

Posted by Lance Norskog <go...@gmail.com>.
Found the problem. Please see:
https://issues.apache.org/jira/browse/LUCENE-2899?focusedCommentId=13679293&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-13679293
On 06/09/2013 04:38 PM, Patrick Mi wrote:
> Hi Lance,
>
> I updated the src from 4.x and applied the latest patch LUCENE-2899-x.patch
> uploaded on 6th June but still had the same problem.
>
>
> Regards,
> Patrick
>
> -----Original Message-----
> From: Lance Norskog [mailto:goksron@gmail.com]
> Sent: Thursday, 6 June 2013 5:16 p.m.
> To: solr-user@lucene.apache.org
> Subject: Re: OPENNLP problems
>
> Patrick-
> I found the problem with multiple documents. The problem was that the
> API for the life cycle of a Tokenizer changed, and I only noticed part
> of the change. You can now upload multiple documents in one post, and
> the OpenNLPTokenizer will process each document.
>
> You're right, the example on the wiki is wrong. The FilterPayloadsFilter
> default is to remove the given payloads, and needs keepPayloads="true"
> to retain them.
>
> The fixed patch is up as LUCENE-2899-x.patch. Again, thanks for trying it.
>
> Lance
>
> https://issues.apache.org/jira/browse/LUCENE-2899
>
> On 05/28/2013 10:08 PM, Patrick Mi wrote:
>> Hi there,
>>
>> Checked out branch_4x and applied the latest patch
>> LUCENE-2899-current.patch however I ran into 2 problems
>>
>> Followed the wiki page instruction and set up a field with this type
> aiming
>> to keep nouns and verbs and do a facet on the field
>> ==
>> <fieldType name="text_opennlp_nvf" class="solr.TextField"
>> positionIncrementGap="100">
>>         <analyzer>
>>           <tokenizer class="solr.OpenNLPTokenizerFactory"
>> tokenizerModel="opennlp/en-token.bin"/>
>>           <filter class="solr.OpenNLPFilterFactory"
>> posTaggerModel="opennlp/en-pos-maxent.bin"/>
>>           <filter class="solr.FilterPayloadsFilterFactory"
>> payloadList="NN,NNS,NNP,NNPS,VB,VBD,VBG,VBN,VBP,VBZ,FW"/>
>>           <filter class="solr.StripPayloadsFilterFactory"/>
>>         </analyzer>
>>       </fieldType>
>> ==
>>
>> Struggled to get that going until I put the extra parameter
>> keepPayloads="true" in as below.
>>        <filter class="solr.FilterPayloadsFilterFactory" keepPayloads="true"
>> payloadList="NN,NNS,NNP,NNPS,VB,VBD,VBG,VBN,VBP,VBZ,FW"/>
>>
>> Question: am I doing the right thing? Is this a mistake on wiki
>>
>> Second problem:
>>
>> Posted the document xml one by one to the solr and the result was what I
>> expected.
>>
>> <add>
>> <doc>
>>     <field name="id">1</field>
>>     <field name="text_opennlp_nvf">check in the hotel</field></doc>
>> </add>
>>
>> However if I put multiple documents into the same xml file and post it in
>> one go only the first document gets processed( only 'check' and 'hotel'
> were
>> showing in the facet result.)
>>    
>> <add>
>> <doc>
>>     <field name="id">1</field>
>>     <field name="text_opennlp_nvf">check in the hotel</field>
>> </doc>
>> <doc>
>>     <field name="id">2</field>
>>     <field name="text_opennlp_nvf">removes the payloads</field>
>> </doc>
>> <doc>
>>     <field name="id">3</field>
>>     <field name="text_opennlp_nvf">retains only nouns and verbs </field>
>> </doc>
>> </add>
>>
>> Same problem when updated the data using csv upload.
>>
>> Is that a bug or something I did wrong?
>>
>> Thanks in advance!
>>
>> Regards,
>> Patrick
>>
>>
>


Re: OPENNLP problems

Posted by Lance Norskog <go...@gmail.com>.
text_opennlp has the right behavior.
text_opennlp_pos does what you describe.
I'll look some more.

On 06/09/2013 04:38 PM, Patrick Mi wrote:
> Hi Lance,
>
> I updated the src from 4.x and applied the latest patch LUCENE-2899-x.patch
> uploaded on 6th June but still had the same problem.
>
>
> Regards,
> Patrick
>
> -----Original Message-----
> From: Lance Norskog [mailto:goksron@gmail.com]
> Sent: Thursday, 6 June 2013 5:16 p.m.
> To: solr-user@lucene.apache.org
> Subject: Re: OPENNLP problems
>
> Patrick-
> I found the problem with multiple documents. The problem was that the
> API for the life cycle of a Tokenizer changed, and I only noticed part
> of the change. You can now upload multiple documents in one post, and
> the OpenNLPTokenizer will process each document.
>
> You're right, the example on the wiki is wrong. The FilterPayloadsFilter
> default is to remove the given payloads, and needs keepPayloads="true"
> to retain them.
>
> The fixed patch is up as LUCENE-2899-x.patch. Again, thanks for trying it.
>
> Lance
>
> https://issues.apache.org/jira/browse/LUCENE-2899
>
> On 05/28/2013 10:08 PM, Patrick Mi wrote:
>> Hi there,
>>
>> Checked out branch_4x and applied the latest patch
>> LUCENE-2899-current.patch however I ran into 2 problems
>>
>> Followed the wiki page instruction and set up a field with this type
> aiming
>> to keep nouns and verbs and do a facet on the field
>> ==
>> <fieldType name="text_opennlp_nvf" class="solr.TextField"
>> positionIncrementGap="100">
>>         <analyzer>
>>           <tokenizer class="solr.OpenNLPTokenizerFactory"
>> tokenizerModel="opennlp/en-token.bin"/>
>>           <filter class="solr.OpenNLPFilterFactory"
>> posTaggerModel="opennlp/en-pos-maxent.bin"/>
>>           <filter class="solr.FilterPayloadsFilterFactory"
>> payloadList="NN,NNS,NNP,NNPS,VB,VBD,VBG,VBN,VBP,VBZ,FW"/>
>>           <filter class="solr.StripPayloadsFilterFactory"/>
>>         </analyzer>
>>       </fieldType>
>> ==
>>
>> Struggled to get that going until I put the extra parameter
>> keepPayloads="true" in as below.
>>        <filter class="solr.FilterPayloadsFilterFactory" keepPayloads="true"
>> payloadList="NN,NNS,NNP,NNPS,VB,VBD,VBG,VBN,VBP,VBZ,FW"/>
>>
>> Question: am I doing the right thing? Is this a mistake on wiki
>>
>> Second problem:
>>
>> Posted the document xml one by one to the solr and the result was what I
>> expected.
>>
>> <add>
>> <doc>
>>     <field name="id">1</field>
>>     <field name="text_opennlp_nvf">check in the hotel</field></doc>
>> </add>
>>
>> However if I put multiple documents into the same xml file and post it in
>> one go only the first document gets processed( only 'check' and 'hotel'
> were
>> showing in the facet result.)
>>    
>> <add>
>> <doc>
>>     <field name="id">1</field>
>>     <field name="text_opennlp_nvf">check in the hotel</field>
>> </doc>
>> <doc>
>>     <field name="id">2</field>
>>     <field name="text_opennlp_nvf">removes the payloads</field>
>> </doc>
>> <doc>
>>     <field name="id">3</field>
>>     <field name="text_opennlp_nvf">retains only nouns and verbs </field>
>> </doc>
>> </add>
>>
>> Same problem when updated the data using csv upload.
>>
>> Is that a bug or something I did wrong?
>>
>> Thanks in advance!
>>
>> Regards,
>> Patrick
>>
>>
>


Re: OPENNLP problems

Posted by Lance Norskog <go...@gmail.com>.
Patrick-
I found the problem with multiple documents. The problem was that the 
API for the life cycle of a Tokenizer changed, and I only noticed part 
of the change. You can now upload multiple documents in one post, and 
the OpenNLPTokenizer will process each document.

You're right, the example on the wiki is wrong. The FilterPayloadsFilter 
default is to remove the given payloads, and needs keepPayloads="true" 
to retain them.

The fixed patch is up as LUCENE-2899-x.patch. Again, thanks for trying it.

Lance

https://issues.apache.org/jira/browse/LUCENE-2899

On 05/28/2013 10:08 PM, Patrick Mi wrote:
> Hi there,
>
> Checked out branch_4x and applied the latest patch
> LUCENE-2899-current.patch however I ran into 2 problems
>
> Followed the wiki page instruction and set up a field with this type aiming
> to keep nouns and verbs and do a facet on the field
> ==
> <fieldType name="text_opennlp_nvf" class="solr.TextField"
> positionIncrementGap="100">
>        <analyzer>
>          <tokenizer class="solr.OpenNLPTokenizerFactory"
> tokenizerModel="opennlp/en-token.bin"/>
>          <filter class="solr.OpenNLPFilterFactory"
> posTaggerModel="opennlp/en-pos-maxent.bin"/>
>          <filter class="solr.FilterPayloadsFilterFactory"
> payloadList="NN,NNS,NNP,NNPS,VB,VBD,VBG,VBN,VBP,VBZ,FW"/>
>          <filter class="solr.StripPayloadsFilterFactory"/>
>        </analyzer>
>      </fieldType>
> ==
>
> Struggled to get that going until I put the extra parameter
> keepPayloads="true" in as below.
>       <filter class="solr.FilterPayloadsFilterFactory" keepPayloads="true"
> payloadList="NN,NNS,NNP,NNPS,VB,VBD,VBG,VBN,VBP,VBZ,FW"/>
>
> Question: am I doing the right thing? Is this a mistake on wiki
>
> Second problem:
>
> Posted the document xml one by one to the solr and the result was what I
> expected.
>
> <add>
> <doc>
>    <field name="id">1</field>
>    <field name="text_opennlp_nvf">check in the hotel</field></doc>
> </add>
>
> However if I put multiple documents into the same xml file and post it in
> one go only the first document gets processed( only 'check' and 'hotel' were
> showing in the facet result.)
>   
> <add>
> <doc>
>    <field name="id">1</field>
>    <field name="text_opennlp_nvf">check in the hotel</field>
> </doc>
> <doc>
>    <field name="id">2</field>
>    <field name="text_opennlp_nvf">removes the payloads</field>
> </doc>
> <doc>
>    <field name="id">3</field>
>    <field name="text_opennlp_nvf">retains only nouns and verbs </field>
> </doc>
> </add>
>
> Same problem when updated the data using csv upload.
>
> Is that a bug or something I did wrong?
>
> Thanks in advance!
>
> Regards,
> Patrick
>
>


RE: OPENNLP current patch compiling problem for 4.x branch

Posted by Patrick Mi <pa...@touchpointgroup.com>.
Thanks Steve, that worked for branch_4x 

-----Original Message-----
From: Steve Rowe [mailto:sarowe@gmail.com] 
Sent: Friday, 24 May 2013 3:19 a.m.
To: solr-user@lucene.apache.org
Subject: Re: OPENNLP current patch compiling problem for 4.x branch

Hi Patrick,

I think you should check out and apply the patch to branch_4x, rather than
the lucene_solr_4_3_0 tag:

http://svn.apache.org/repos/asf/lucene/dev/branches/branch_4x

Steve

On May 23, 2013, at 2:08 AM, Patrick Mi <pa...@touchpointgroup.com>
wrote:

> Hi,
> 
> I checked out from here
> http://svn.apache.org/repos/asf/lucene/dev/tags/lucene_solr_4_3_0 and
> downloaded the latest patch LUCENE-2899-current.patch.
> 
> Applied the patch ok but when I did 'ant compile' I got the following
error:
> 
> 
> ==
>    [javac]
>
/home/lucene_solr_4_3_0/lucene/analysis/opennlp/src/java/org/apache/lucene/a
> nalysis/opennlp/FilterPayloadsFilter.java:43: error
> r: cannot find symbol
>    [javac]     super(Version.LUCENE_44, input);
>    [javac]                  ^
>    [javac]   symbol:   variable LUCENE_44
>    [javac]   location: class Version
>    [javac] 1 error    
> ==
> 
> Compiled it on trunk without problem.
> 
> Is this patch supposed to work for 4.X?
> 
> Regards,
> Patrick 
> 


Re: OPENNLP current patch compiling problem for 4.x branch

Posted by Steve Rowe <sa...@gmail.com>.
Hi Patrick,

I think you should check out and apply the patch to branch_4x, rather than the lucene_solr_4_3_0 tag:

http://svn.apache.org/repos/asf/lucene/dev/branches/branch_4x

Steve

On May 23, 2013, at 2:08 AM, Patrick Mi <pa...@touchpointgroup.com> wrote:

> Hi,
> 
> I checked out from here
> http://svn.apache.org/repos/asf/lucene/dev/tags/lucene_solr_4_3_0 and
> downloaded the latest patch LUCENE-2899-current.patch.
> 
> Applied the patch ok but when I did 'ant compile' I got the following error:
> 
> 
> ==
>    [javac]
> /home/lucene_solr_4_3_0/lucene/analysis/opennlp/src/java/org/apache/lucene/a
> nalysis/opennlp/FilterPayloadsFilter.java:43: error
> r: cannot find symbol
>    [javac]     super(Version.LUCENE_44, input);
>    [javac]                  ^
>    [javac]   symbol:   variable LUCENE_44
>    [javac]   location: class Version
>    [javac] 1 error    
> ==
> 
> Compiled it on trunk without problem.
> 
> Is this patch supposed to work for 4.X?
> 
> Regards,
> Patrick 
> 


Re: OPENNLP current patch compiling problem for 4.x branch

Posted by Erick Erickson <er...@gmail.com>.
by definition, there is no LUCENE_44 constant in a 4.3
distro! Just change it to LUCENE_43 (or whatever you find
in the Version class that suits your needs) or try this on a
4.x checkout.

Best
Erick

On Thu, May 23, 2013 at 2:08 AM, Patrick Mi
<pa...@touchpointgroup.com> wrote:
> Hi,
>
> I checked out from here
> http://svn.apache.org/repos/asf/lucene/dev/tags/lucene_solr_4_3_0 and
> downloaded the latest patch LUCENE-2899-current.patch.
>
> Applied the patch ok but when I did 'ant compile' I got the following error:
>
>
> ==
>     [javac]
> /home/lucene_solr_4_3_0/lucene/analysis/opennlp/src/java/org/apache/lucene/a
> nalysis/opennlp/FilterPayloadsFilter.java:43: error
> r: cannot find symbol
>     [javac]     super(Version.LUCENE_44, input);
>     [javac]                  ^
>     [javac]   symbol:   variable LUCENE_44
>     [javac]   location: class Version
>     [javac] 1 error
> ==
>
> Compiled it on trunk without problem.
>
> Is this patch supposed to work for 4.X?
>
> Regards,
> Patrick
>