Posted to user@nutch.apache.org by Alan Aguia <aa...@yahoo.com> on 2008/05/13 15:53:42 UTC

max number of plugins

I am having problems with the plugin.includes property in the
nutch-site.xml file. Once the list reaches a certain number of plugins,
the system stops using some of them, even though it still loads them.
I don't really understand what is going on. Is there a limit on the
number of plugins that can be listed in this property? For some reason,
after some number of plugins, the system doesn't use the last one.
   
  For example
  <property>
    <name>plugin.includes</name>
    <value>protocol-http|urlfilter-regex|parse-(text|html|js|pdf|msword|msexcel|mspowerpoint)|index-basic|query-(basic|site|url|extended)|summary-basic|scoring-opic|analysis-es</value>
    <description>
      Regular expression naming plugin directory names to
      include.  Any plugin not matching this expression is excluded.
      In any case you need at least include the nutch-extensionpoints
      plugin. By default Nutch includes crawling just HTML and plain
      text via HTTP, and basic indexing and search plugins.
    </description>
  </property>

  If I include one more plugin, the system loads it but doesn't use it:

  <property>
    <name>plugin.includes</name>
    <value>protocol-http|urlfilter-regex|parse-(text|html|js|pdf|msword|msexcel|mspowerpoint)|index-basic|query-(basic|site|url)|summary-basic|scoring-opic|language-identifier|analysis-es|query-extended</value>
    <description>
      Regular expression naming plugin directory names to
      include.  Any plugin not matching this expression is excluded.
      In any case you need at least include the nutch-extensionpoints
      plugin. By default Nutch includes crawling just HTML and plain
      text via HTTP, and basic indexing and search plugins.
    </description>
  </property>
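
  One way to rule out the expression itself is to test it outside
  Nutch. The following is only a minimal sketch: it assumes that each
  plugin directory name must match the whole plugin.includes expression
  (full-match semantics), which is an assumption here and not something
  confirmed from the Nutch source. The class name PluginIncludesCheck
  and the method isIncluded are made up for illustration, and
  "query-more" is a made-up name that should not match.

```java
import java.util.regex.Pattern;

class PluginIncludesCheck {
    // Second plugin.includes value from the message above, copied verbatim.
    static final String INCLUDES =
        "protocol-http|urlfilter-regex|parse-(text|html|js|pdf|msword|msexcel|mspowerpoint)"
        + "|index-basic|query-(basic|site|url)|summary-basic|scoring-opic"
        + "|language-identifier|analysis-es|query-extended";

    // Assumption: a plugin is included only if its directory name
    // matches the entire expression, not just a part of it.
    static boolean isIncluded(String includes, String pluginId) {
        return Pattern.matches(includes, pluginId);
    }

    public static void main(String[] args) {
        String[] pluginIds = {
            "query-extended", "language-identifier", "analysis-es",
            "parse-pdf", "query-more" // last one is a hypothetical name
        };
        for (String id : pluginIds) {
            System.out.println(id + " -> " + isIncluded(INCLUDES, id));
        }
    }
}
```

  If every plugin in the list reports true under this check, the
  expression is probably not what drops the last plugin, and the cause
  is more likely elsewhere, for example in how the plugin is loaded or
  configured.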
   
Thanks