You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@nutch.apache.org by Brian Ulicny <bu...@alum.mit.edu> on 2006/09/21 15:54:59 UTC

Ontology plugin in 0.8

I'm trying to get the Ontology plugin to work in an 0.8 environment.

I set extension.ontology.extension-name to
org.apache.nutch.ontology.jena.OntologyImpl
This seemed to be the correct thing to do according to nutch-default.xml
which says: 

  <description>Use the specified online ontology extension. If empty,
  the first available extension will be used. The "name" here refers to
  an 'id'
  attribute of the 'implementation' element in the plugin descriptor XML
  file.</description>

The plugin.xml for ontology is:

<plugin
   id="ontology"
   name="Ontology Plug-in"
   version="1.0.0"
   provider-name="nutch.org">


   <runtime>
      <library name="ontology.jar">
         <export name="*"/>
      </library>

      <library name="commons-logging-1.0.3.jar"/>
      <library name="icu4j_2_6_1.jar"/>
      <library name="jena-2.1.jar"/>

   </runtime>

   <requires>
      <import plugin="nutch-extensionpoints"/>
   </requires>

   <!-- attribute "point" is the plugin interface class -->
   <!-- seems kinda redundant to have to define the point here too -->   
   <extension id="org.apache.nutch.ontology.jena"
              name="Ontology Model Loader"
              point="org.apache.nutch.ontology.Ontology">

      <!-- define all the classes that implement the point defined above
      -->
      <implementation id="org.apache.nutch.ontology.jena.OntologyImpl"
                      class="org.apache.nutch.ontology.jena.OntologyImpl">
        <parameter name="pathSuffix" value=""/>
      </implementation>

   </extension>

</plugin>


I then set the extension.ontology.urls property to a valid (external)
OWL url.

However, I get this warning, and no ontologies are loaded:

6 WARN  OntologyFactory - Ontology extension not found:
'org.apache.nutch.ontology.jena.OntologyImpl', trying the default

What am I doing wrong?

B Ulicny
-- 
  Brian Ulicny
  bulicny@alum.mit.edu


Re: Ontology plugin in 0.8

Posted by "csavage@activeathletemedia.com" <cs...@activeathletemedia.com>.
Can anyone give me an opinion on how adding an ontology to their search 
engine helps to improve search results?

Thanks in advance,

Chad

Florian Fricker wrote:
> Yea, after a closer look it seems to be a problem with your xeres 
> library.
>
> To solve this, one needs to update tomcat's xerces library. Here's a 
> reference:
>
> http://jena.sourceforge.net/jena-faq.html#general-
>
> Hope it will help you.
>
> Regards
>
> Florian Fricker wrote:
>

Re: Ontology plugin in 0.8

Posted by Florian Fricker <fl...@wyona.com>.
Yea, after a closer look it seems to be a problem with your xeres library.

To solve this, one needs to update tomcat's xerces library. Here's a 
reference:

http://jena.sourceforge.net/jena-faq.html#general-

Hope it will help you.

Regards

Florian Fricker wrote:
> Hey Brian!
>
> Have you uncomment refine-query-init.jsp and refine-query.jsp in 
> search.jsp?
>
> Can you tell me what version of xeres is in use with your jena and 
> tomcat?
>
> Regards
>
> Brian Ulicny wrote:
>> I have tried it with the sample owl file.  That didn't work, either.
>>
>> On Fri, 22 Sep 2006 17:52:37 +0300, "Sami Siren" <ss...@gmail.com>
>> said:
>>  
>>> try running it first with the submitted .owl 
>>> (src/plugin/ontology/sample/time.owl)
>>> and verify that it works for you, after that proceed to more complex 
>>> ontologies (with what
>>> I have no experience with)
>>>
>>> -- 
>>>  Sami Siren
>>>
>>> Brian Ulicny wrote:
>>>    
>>>> Florian,
>>>> Thanks for replying: my nutch-site.xml under webapps was wrong. 
>>>> Correcting it gets me a little further.  However, whether I point 
>>>> to an
>>>> OWL file locally (with file:) or remotely (with http://), I get the
>>>> following error (under Root Cause):
>>>>
>>>> with
>>>>  <name>plugin.includes</name>
>>>>   
>>>> <value>nutch-extensionpoints|protocol-file|protocol-http|ontology|languageidentifier|analysis-(fr|en|de)|urlfilter-regex|parse-(text|pdf|mspowerpoint|msword|msexcel|rtf|rss|html|zip)|index-basic|query-(basic|site|url)|lib-jakarta-poi|lib-lucene-analyzers|urlfilter-regex|index-basic|query-(basic|site|url)|summary-basic|iblog|scoring-opic</value> 
>>>>
>>>>  
>>>>
>>>> java.lang.NoSuchFieldError: actualValueType
>>>>     at 
>>>> com.hp.hpl.jena.datatypes.xsd.XSDDatatype.convertValidatedDataValue(XSDDatatype.java:371) 
>>>>
>>>>     at 
>>>> com.hp.hpl.jena.datatypes.xsd.XSDDatatype.parse(XSDDatatype.java:250)
>>>>     at 
>>>> com.hp.hpl.jena.graph.impl.LiteralLabel.setValue(LiteralLabel.java:166) 
>>>>
>>>>     at 
>>>> com.hp.hpl.jena.graph.impl.LiteralLabel.<init>(LiteralLabel.java:82)
>>>>     at com.hp.hpl.jena.graph.Node.createLiteral(Node.java:151)
>>>>     at com.hp.hpl.jena.rdf.arp.JenaReader.convert(JenaReader.java:147)
>>>>     at com.hp.hpl.jena.rdf.arp.JenaReader.convert(JenaReader.java:169)
>>>>     at 
>>>> com.hp.hpl.jena.rdf.arp.JenaReader$JRStatementHandler.statement(JenaReader.java:57) 
>>>>
>>>>     at 
>>>> com.hp.hpl.jena.rdf.arp.ARPResource.tripleLit(ARPResource.java:190)
>>>>     at 
>>>> com.hp.hpl.jena.rdf.arp.ARPResource.triple(ARPResource.java:172)
>>>>     at 
>>>> com.hp.hpl.jena.rdf.arp.ARPResource.setPredObject(ARPResource.java:146) 
>>>>
>>>>     at 
>>>> com.hp.hpl.jena.rdf.arp.ARPResource.setPredicateObject(ARPResource.java:127) 
>>>>
>>>>     at 
>>>> com.hp.hpl.jena.rdf.arp.ParserSupport.createTriple(ParserSupport.java:315) 
>>>>
>>>>     at 
>>>> com.hp.hpl.jena.rdf.arp.RDFParser.propertyEltIdAndValue(RDFParser.java:692) 
>>>>
>>>>     at 
>>>> com.hp.hpl.jena.rdf.arp.RDFParser.propertyElt(RDFParser.java:647)
>>>>     at com.hp.hpl.jena.rdf.arp.RDFParser.typedNode(RDFParser.java:984)
>>>>     at 
>>>> com.hp.hpl.jena.rdf.arp.RDFParser.description(RDFParser.java:429)
>>>>     at com.hp.hpl.jena.rdf.arp.RDFParser.obj(RDFParser.java:346)
>>>>     at com.hp.hpl.jena.rdf.arp.RDFParser.rdf(RDFParser.java:326)
>>>>     at 
>>>> com.hp.hpl.jena.rdf.arp.RDFParser.rdfFileSub(RDFParser.java:107)
>>>>     at com.hp.hpl.jena.rdf.arp.RDFParser.rdfFile(RDFParser.java:99)
>>>>     at com.hp.hpl.jena.rdf.arp.ARPFilter.parse(ARPFilter.java:314)
>>>>     at com.hp.hpl.jena.rdf.arp.JenaReader.read(JenaReader.java:221)
>>>>     at com.hp.hpl.jena.rdf.arp.JenaReader.read(JenaReader.java:209)
>>>>     at com.hp.hpl.jena.rdf.arp.JenaReader.read(JenaReader.java:261)
>>>>     at com.hp.hpl.jena.rdf.model.impl.ModelCom.read(ModelCom.java:206)
>>>>     at 
>>>> com.hp.hpl.jena.ontology.OntDocumentManager.read(OntDocumentManager.java:986) 
>>>>
>>>>     at 
>>>> com.hp.hpl.jena.ontology.OntDocumentManager.findMetadata(OntDocumentManager.java:807) 
>>>>
>>>>     at 
>>>> com.hp.hpl.jena.ontology.OntDocumentManager.initialiseMetadata(OntDocumentManager.java:781) 
>>>>
>>>>     at 
>>>> com.hp.hpl.jena.ontology.OntDocumentManager.<init>(OntDocumentManager.java:171) 
>>>>
>>>>     at 
>>>> com.hp.hpl.jena.ontology.OntDocumentManager.<init>(OntDocumentManager.java:155) 
>>>>
>>>>     at 
>>>> com.hp.hpl.jena.ontology.OntDocumentManager.<clinit>(OntDocumentManager.java:108) 
>>>>
>>>>     at 
>>>> com.hp.hpl.jena.ontology.OntModelSpec.getDocumentManager(OntModelSpec.java:247) 
>>>>
>>>>     at 
>>>> com.hp.hpl.jena.ontology.impl.OntModelImpl.getDocumentManager(OntModelImpl.java:174) 
>>>>
>>>>     at 
>>>> com.hp.hpl.jena.ontology.impl.OntModelImpl.<init>(OntModelImpl.java:145) 
>>>>
>>>>     at 
>>>> com.hp.hpl.jena.ontology.impl.OntModelImpl.<init>(OntModelImpl.java:116) 
>>>>
>>>>     at 
>>>> com.hp.hpl.jena.rdf.model.ModelFactory.createOntologyModel(ModelFactory.java:375) 
>>>>
>>>>     at 
>>>> org.apache.nutch.ontology.jena.OntologyImpl.<init>(OntologyImpl.java:81) 
>>>>
>>>>     at 
>>>> sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)
>>>>     at 
>>>> sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:39) 
>>>>
>>>>     at 
>>>> sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:27) 
>>>>
>>>>     at java.lang.reflect.Constructor.newInstance(Constructor.java:494)
>>>>     at java.lang.Class.newInstance0(Class.java:350)
>>>>     at java.lang.Class.newInstance(Class.java:303)
>>>>     at 
>>>> org.apache.nutch.plugin.Extension.getExtensionInstance(Extension.java:151) 
>>>>
>>>>     at 
>>>> org.apache.nutch.ontology.OntologyFactory.getOntology(OntologyFactory.java:64) 
>>>>
>>>>     at org.apache.jsp.search_jsp._jspService(search_jsp.java:118)
>>>>     at 
>>>> org.apache.jasper.runtime.HttpJspBase.service(HttpJspBase.java:92)
>>>>     at javax.servlet.http.HttpServlet.service(HttpServlet.java:809)
>>>>     at 
>>>> org.apache.jasper.servlet.JspServletWrapper.service(JspServletWrapper.java:162) 
>>>>
>>>>     at 
>>>> org.apache.jasper.servlet.JspServlet.serviceJspFile(JspServlet.java:240) 
>>>>
>>>>     at 
>>>> org.apache.jasper.servlet.JspServlet.service(JspServlet.java:187)
>>>>     at javax.servlet.http.HttpServlet.service(HttpServlet.java:809)
>>>>     at 
>>>> org.apache.catalina.core.ApplicationFilterChain.internalDoFilter(ApplicationFilterChain.java:200) 
>>>>
>>>>     at 
>>>> org.apache.catalina.core.ApplicationFilterChain.doFilter(ApplicationFilterChain.java:146) 
>>>>
>>>>     at 
>>>> org.apache.catalina.core.StandardWrapperValve.invoke(StandardWrapperValve.java:209) 
>>>>
>>>>     at 
>>>> org.apache.catalina.core.StandardPipeline$StandardPipelineValveContext.invokeNext(StandardPipeline.java:596) 
>>>>
>>>>     at 
>>>> org.apache.catalina.core.StandardPipeline.invoke(StandardPipeline.java:433) 
>>>>
>>>>     at 
>>>> org.apache.catalina.core.ContainerBase.invoke(ContainerBase.java:948)
>>>>     at 
>>>> org.apache.catalina.core.StandardContextValve.invoke(StandardContextValve.java:144) 
>>>>
>>>>     at 
>>>> org.apache.catalina.core.StandardPipeline$StandardPipelineValveContext.invokeNext(StandardPipeline.java:596) 
>>>>
>>>>     at 
>>>> org.apache.catalina.core.StandardPipeline.invoke(StandardPipeline.java:433) 
>>>>
>>>>     at 
>>>> org.apache.catalina.core.ContainerBase.invoke(ContainerBase.java:948)
>>>>     at 
>>>> org.apache.catalina.core.StandardContext.invoke(StandardContext.java:2358) 
>>>>
>>>>     at 
>>>> org.apache.catalina.core.StandardHostValve.invoke(StandardHostValve.java:133) 
>>>>
>>>>     at 
>>>> org.apache.catalina.core.StandardPipeline$StandardPipelineValveContext.invokeNext(StandardPipeline.java:596) 
>>>>
>>>>     at 
>>>> org.apache.catalina.valves.ErrorDispatcherValve.invoke(ErrorDispatcherValve.java:118) 
>>>>
>>>>     at 
>>>> org.apache.catalina.core.StandardPipeline$StandardPipelineValveContext.invokeNext(StandardPipeline.java:594) 
>>>>
>>>>     at 
>>>> org.apache.catalina.valves.ErrorReportValve.invoke(ErrorReportValve.java:116) 
>>>>
>>>>     at 
>>>> org.apache.catalina.core.StandardPipeline$StandardPipelineValveContext.invokeNext(StandardPipeline.java:594) 
>>>>
>>>>     at 
>>>> org.apache.catalina.core.StandardPipeline.invoke(StandardPipeline.java:433) 
>>>>
>>>>     at 
>>>> org.apache.catalina.core.ContainerBase.invoke(ContainerBase.java:948)
>>>>     at 
>>>> org.apache.catalina.core.StandardEngineValve.invoke(StandardEngineValve.java:127) 
>>>>
>>>>     at 
>>>> org.apache.catalina.core.StandardPipeline$StandardPipelineValveContext.invokeNext(StandardPipeline.java:596) 
>>>>
>>>>     at 
>>>> org.apache.catalina.core.StandardPipeline.invoke(StandardPipeline.java:433) 
>>>>
>>>>     at 
>>>> org.apache.catalina.core.ContainerBase.invoke(ContainerBase.java:948)
>>>>     at 
>>>> org.apache.coyote.tomcat4.CoyoteAdapter.service(CoyoteAdapter.java:152) 
>>>>
>>>>     at 
>>>> org.apache.coyote.http11.Http11Processor.process(Http11Processor.java:799) 
>>>>
>>>>     at 
>>>> org.apache.coyote.http11.Http11Protocol$Http11ConnectionHandler.processConnection(Http11Protocol.java:705) 
>>>>
>>>>     at 
>>>> org.apache.tomcat.util.net.TcpWorkerThread.runIt(PoolTcpEndpoint.java:577) 
>>>>
>>>>     at 
>>>> org.apache.tomcat.util.threads.ThreadPool$ControlRunnable.run(ThreadPool.java:683) 
>>>>
>>>>     at java.lang.Thread.run(Thread.java:595)
>>>>
>>>> Any thoughts from anyone out there?
>>>>
>>>> Brian
>>>>
>>>> On Fri, 22 Sep 2006 09:43:51 +0200, "Florian Fricker"
>>>> <fl...@wyona.com> said:
>>>>        
>>>>> Hey Brian,
>>>>>
>>>>> Your nutch-site.xml has two parts for ontology.
>>>>>
>>>>> First Part:
>>>>>
>>>>> <property>
>>>>>   <name>extension.ontology.extension-name</name>
>>>>>   <value>org.apache.nutch.ontology.jena.OntologyImpl</value>
>>>>>   <description>Loads the Ontology plugin</description>
>>>>> </property>
>>>>>
>>>>> Second Part:
>>>>>
>>>>> <property>
>>>>>   <name>extension.ontology.urls</name>
>>>>>   <value>file:/path/to/owl/file or http://path/to/owl/file</value>
>>>>>   <description>Shows the owl file</description>
>>>>> </property>
>>>>>
>>>>> Then add the onotology plugin to the nutch plugin section. It 
>>>>> should look like this:
>>>>>
>>>>>
>>>>> <property>
>>>>>   <name>plugin.includes</name>
>>>>>  <value>nutch-extensionpoints|protocol-file|protocol-http|ontology|languageidentifier|analysis-(fr|en|de)| 
>>>>>
>>>>>               
>>>>> urlfilter-regex|parse-(text|pdf|mspowerpoint|msword|msexcel|rtf|rss|html|zip)|index-basic| 
>>>>>
>>>>>               
>>>>> query-(basic|site|url)|lib-jakarta-poi|lib-lucene-analyzers|scoring-opic</value> 
>>>>>
>>>>> </property>
>>>>>
>>>>> That's it...
>>>>>
>>>>> Regards
>>>>>
>>>>>
>>>>> Brian Ulicny wrote:
>>>>>            
>>>>>> I'm trying to get the Ontology plugin to work in an 0.8 environment.
>>>>>>
>>>>>> I set extension.ontology.extension-name to
>>>>>> org.apache.nutch.ontology.jena.OntologyImpl
>>>>>> This seemed to be the correct thing to do according to 
>>>>>> nutch-default.xml
>>>>>> which says:
>>>>>>   <description>Use the specified online ontology extension. If 
>>>>>> empty,
>>>>>>   the first available extension will be used. The "name" here 
>>>>>> refers to
>>>>>>   an 'id'
>>>>>>   attribute of the 'implementation' element in the plugin 
>>>>>> descriptor XML
>>>>>>   file.</description>
>>>>>>
>>>>>> The plugin.xml for ontology is:
>>>>>>
>>>>>> <plugin
>>>>>>    id="ontology"
>>>>>>    name="Ontology Plug-in"
>>>>>>    version="1.0.0"
>>>>>>    provider-name="nutch.org">
>>>>>>
>>>>>>
>>>>>>    <runtime>
>>>>>>       <library name="ontology.jar">
>>>>>>          <export name="*"/>
>>>>>>       </library>
>>>>>>
>>>>>>       <library name="commons-logging-1.0.3.jar"/>
>>>>>>       <library name="icu4j_2_6_1.jar"/>
>>>>>>       <library name="jena-2.1.jar"/>
>>>>>>
>>>>>>    </runtime>
>>>>>>
>>>>>>    <requires>
>>>>>>       <import plugin="nutch-extensionpoints"/>
>>>>>>    </requires>
>>>>>>
>>>>>>    <!-- attribute "point" is the plugin interface class -->
>>>>>>    <!-- seems kinda redundant to have to define the point here 
>>>>>> too -->      <extension id="org.apache.nutch.ontology.jena"
>>>>>>               name="Ontology Model Loader"
>>>>>>               point="org.apache.nutch.ontology.Ontology">
>>>>>>
>>>>>>       <!-- define all the classes that implement the point 
>>>>>> defined above
>>>>>>       -->
>>>>>>       <implementation 
>>>>>> id="org.apache.nutch.ontology.jena.OntologyImpl"
>>>>>>                       
>>>>>> class="org.apache.nutch.ontology.jena.OntologyImpl">
>>>>>>         <parameter name="pathSuffix" value=""/>
>>>>>>       </implementation>
>>>>>>
>>>>>>    </extension>
>>>>>>
>>>>>> </plugin>
>>>>>>
>>>>>>
>>>>>> I then set the extension.ontology.urls property to a valid 
>>>>>> (external)
>>>>>> OWL url.
>>>>>>
>>>>>> However, I get this warning, and no ontologies are loaded:
>>>>>>
>>>>>> 6 WARN  OntologyFactory - Ontology extension not found:
>>>>>> 'org.apache.nutch.ontology.jena.OntologyImpl', trying the default
>>>>>>
>>>>>> What am I doing wrong?
>>>>>>
>>>>>> B Ulicny
>>>>>>                   
>>>>> -- 
>>>>> Florian Fricker                          florian.fricker@wyona.com
>>>>> Wyona Inc.   -   Open Source Content Management   -   Apache Lenya
>>>>> http://www.wyona.com                       http://lenya.apache.org
>>>>>
>>>>>             
>
>


-- 
Florian Fricker                          florian.fricker@wyona.com
Wyona Inc.   -   Open Source Content Management   -   Apache Lenya
http://www.wyona.com                       http://lenya.apache.org


Re: Ontology plugin in 0.8

Posted by "csavage@activeathletemedia.com" <cs...@activeathletemedia.com>.
Hello All,

I'm not sure if this will cause anything but your urlfilter-regex, 
index-basic, query-(basic|site|url) are duplicated in that last post.
Great posts by the way, I'm knee deep in trying to figure out how to 
implement this as well.

Chad

Florian Fricker wrote:

<value>nutch-extensionpoints|protocol-file|protocol-http|ontology|languageidentifier|analysis-(fr|en|de)|urlfilter-regex|parse-(text|pdf|mspowerpoint|msword|msexcel|rtf|rss|html|zip)|index-basic|query-(basic|site|url)|lib-jakarta-poi|lib-lucene-analyzers|urlfilter-regex|index-basic|query-(basic|site|url)|summary-basic|iblog|scoring-opic</value> 



Re: Ontology plugin in 0.8

Posted by Florian Fricker <fl...@wyona.com>.
Hey Brian!

Have you uncomment refine-query-init.jsp and refine-query.jsp in search.jsp?

Can you tell me what version of xeres is in use with your jena and tomcat?

Regards

Brian Ulicny wrote:
> I have tried it with the sample owl file.  That didn't work, either.
>
> On Fri, 22 Sep 2006 17:52:37 +0300, "Sami Siren" <ss...@gmail.com>
> said:
>   
>> try running it first with the submitted .owl 
>> (src/plugin/ontology/sample/time.owl)
>> and verify that it works for you, after that proceed to more complex 
>> ontologies (with what
>> I have no experience with)
>>
>> --
>>  Sami Siren
>>
>> Brian Ulicny wrote:
>>     
>>> Florian, 
>>>
>>> Thanks for replying: my nutch-site.xml under webapps was wrong. 
>>> Correcting it gets me a little further.  However, whether I point to an
>>> OWL file locally (with file:) or remotely (with http://), I get the
>>> following error (under Root Cause):
>>>
>>> with 
>>>
>>>  <name>plugin.includes</name>
>>>   <value>nutch-extensionpoints|protocol-file|protocol-http|ontology|languageidentifier|analysis-(fr|en|de)|urlfilter-regex|parse-(text|pdf|mspowerpoint|msword|msexcel|rtf|rss|html|zip)|index-basic|query-(basic|site|url)|lib-jakarta-poi|lib-lucene-analyzers|urlfilter-regex|index-basic|query-(basic|site|url)|summary-basic|iblog|scoring-opic</value>
>>>  
>>>
>>> java.lang.NoSuchFieldError: actualValueType
>>> 	at com.hp.hpl.jena.datatypes.xsd.XSDDatatype.convertValidatedDataValue(XSDDatatype.java:371)
>>> 	at com.hp.hpl.jena.datatypes.xsd.XSDDatatype.parse(XSDDatatype.java:250)
>>> 	at com.hp.hpl.jena.graph.impl.LiteralLabel.setValue(LiteralLabel.java:166)
>>> 	at com.hp.hpl.jena.graph.impl.LiteralLabel.<init>(LiteralLabel.java:82)
>>> 	at com.hp.hpl.jena.graph.Node.createLiteral(Node.java:151)
>>> 	at com.hp.hpl.jena.rdf.arp.JenaReader.convert(JenaReader.java:147)
>>> 	at com.hp.hpl.jena.rdf.arp.JenaReader.convert(JenaReader.java:169)
>>> 	at com.hp.hpl.jena.rdf.arp.JenaReader$JRStatementHandler.statement(JenaReader.java:57)
>>> 	at com.hp.hpl.jena.rdf.arp.ARPResource.tripleLit(ARPResource.java:190)
>>> 	at com.hp.hpl.jena.rdf.arp.ARPResource.triple(ARPResource.java:172)
>>> 	at com.hp.hpl.jena.rdf.arp.ARPResource.setPredObject(ARPResource.java:146)
>>> 	at com.hp.hpl.jena.rdf.arp.ARPResource.setPredicateObject(ARPResource.java:127)
>>> 	at com.hp.hpl.jena.rdf.arp.ParserSupport.createTriple(ParserSupport.java:315)
>>> 	at com.hp.hpl.jena.rdf.arp.RDFParser.propertyEltIdAndValue(RDFParser.java:692)
>>> 	at com.hp.hpl.jena.rdf.arp.RDFParser.propertyElt(RDFParser.java:647)
>>> 	at com.hp.hpl.jena.rdf.arp.RDFParser.typedNode(RDFParser.java:984)
>>> 	at com.hp.hpl.jena.rdf.arp.RDFParser.description(RDFParser.java:429)
>>> 	at com.hp.hpl.jena.rdf.arp.RDFParser.obj(RDFParser.java:346)
>>> 	at com.hp.hpl.jena.rdf.arp.RDFParser.rdf(RDFParser.java:326)
>>> 	at com.hp.hpl.jena.rdf.arp.RDFParser.rdfFileSub(RDFParser.java:107)
>>> 	at com.hp.hpl.jena.rdf.arp.RDFParser.rdfFile(RDFParser.java:99)
>>> 	at com.hp.hpl.jena.rdf.arp.ARPFilter.parse(ARPFilter.java:314)
>>> 	at com.hp.hpl.jena.rdf.arp.JenaReader.read(JenaReader.java:221)
>>> 	at com.hp.hpl.jena.rdf.arp.JenaReader.read(JenaReader.java:209)
>>> 	at com.hp.hpl.jena.rdf.arp.JenaReader.read(JenaReader.java:261)
>>> 	at com.hp.hpl.jena.rdf.model.impl.ModelCom.read(ModelCom.java:206)
>>> 	at com.hp.hpl.jena.ontology.OntDocumentManager.read(OntDocumentManager.java:986)
>>> 	at com.hp.hpl.jena.ontology.OntDocumentManager.findMetadata(OntDocumentManager.java:807)
>>> 	at com.hp.hpl.jena.ontology.OntDocumentManager.initialiseMetadata(OntDocumentManager.java:781)
>>> 	at com.hp.hpl.jena.ontology.OntDocumentManager.<init>(OntDocumentManager.java:171)
>>> 	at com.hp.hpl.jena.ontology.OntDocumentManager.<init>(OntDocumentManager.java:155)
>>> 	at com.hp.hpl.jena.ontology.OntDocumentManager.<clinit>(OntDocumentManager.java:108)
>>> 	at com.hp.hpl.jena.ontology.OntModelSpec.getDocumentManager(OntModelSpec.java:247)
>>> 	at com.hp.hpl.jena.ontology.impl.OntModelImpl.getDocumentManager(OntModelImpl.java:174)
>>> 	at com.hp.hpl.jena.ontology.impl.OntModelImpl.<init>(OntModelImpl.java:145)
>>> 	at com.hp.hpl.jena.ontology.impl.OntModelImpl.<init>(OntModelImpl.java:116)
>>> 	at com.hp.hpl.jena.rdf.model.ModelFactory.createOntologyModel(ModelFactory.java:375)
>>> 	at org.apache.nutch.ontology.jena.OntologyImpl.<init>(OntologyImpl.java:81)
>>> 	at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)
>>> 	at sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:39)
>>> 	at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:27)
>>> 	at java.lang.reflect.Constructor.newInstance(Constructor.java:494)
>>> 	at java.lang.Class.newInstance0(Class.java:350)
>>> 	at java.lang.Class.newInstance(Class.java:303)
>>> 	at org.apache.nutch.plugin.Extension.getExtensionInstance(Extension.java:151)
>>> 	at org.apache.nutch.ontology.OntologyFactory.getOntology(OntologyFactory.java:64)
>>> 	at org.apache.jsp.search_jsp._jspService(search_jsp.java:118)
>>> 	at org.apache.jasper.runtime.HttpJspBase.service(HttpJspBase.java:92)
>>> 	at javax.servlet.http.HttpServlet.service(HttpServlet.java:809)
>>> 	at org.apache.jasper.servlet.JspServletWrapper.service(JspServletWrapper.java:162)
>>> 	at org.apache.jasper.servlet.JspServlet.serviceJspFile(JspServlet.java:240)
>>> 	at org.apache.jasper.servlet.JspServlet.service(JspServlet.java:187)
>>> 	at javax.servlet.http.HttpServlet.service(HttpServlet.java:809)
>>> 	at org.apache.catalina.core.ApplicationFilterChain.internalDoFilter(ApplicationFilterChain.java:200)
>>> 	at org.apache.catalina.core.ApplicationFilterChain.doFilter(ApplicationFilterChain.java:146)
>>> 	at org.apache.catalina.core.StandardWrapperValve.invoke(StandardWrapperValve.java:209)
>>> 	at org.apache.catalina.core.StandardPipeline$StandardPipelineValveContext.invokeNext(StandardPipeline.java:596)
>>> 	at org.apache.catalina.core.StandardPipeline.invoke(StandardPipeline.java:433)
>>> 	at org.apache.catalina.core.ContainerBase.invoke(ContainerBase.java:948)
>>> 	at org.apache.catalina.core.StandardContextValve.invoke(StandardContextValve.java:144)
>>> 	at org.apache.catalina.core.StandardPipeline$StandardPipelineValveContext.invokeNext(StandardPipeline.java:596)
>>> 	at org.apache.catalina.core.StandardPipeline.invoke(StandardPipeline.java:433)
>>> 	at org.apache.catalina.core.ContainerBase.invoke(ContainerBase.java:948)
>>> 	at org.apache.catalina.core.StandardContext.invoke(StandardContext.java:2358)
>>> 	at org.apache.catalina.core.StandardHostValve.invoke(StandardHostValve.java:133)
>>> 	at org.apache.catalina.core.StandardPipeline$StandardPipelineValveContext.invokeNext(StandardPipeline.java:596)
>>> 	at org.apache.catalina.valves.ErrorDispatcherValve.invoke(ErrorDispatcherValve.java:118)
>>> 	at org.apache.catalina.core.StandardPipeline$StandardPipelineValveContext.invokeNext(StandardPipeline.java:594)
>>> 	at org.apache.catalina.valves.ErrorReportValve.invoke(ErrorReportValve.java:116)
>>> 	at org.apache.catalina.core.StandardPipeline$StandardPipelineValveContext.invokeNext(StandardPipeline.java:594)
>>> 	at org.apache.catalina.core.StandardPipeline.invoke(StandardPipeline.java:433)
>>> 	at org.apache.catalina.core.ContainerBase.invoke(ContainerBase.java:948)
>>> 	at org.apache.catalina.core.StandardEngineValve.invoke(StandardEngineValve.java:127)
>>> 	at org.apache.catalina.core.StandardPipeline$StandardPipelineValveContext.invokeNext(StandardPipeline.java:596)
>>> 	at org.apache.catalina.core.StandardPipeline.invoke(StandardPipeline.java:433)
>>> 	at org.apache.catalina.core.ContainerBase.invoke(ContainerBase.java:948)
>>> 	at org.apache.coyote.tomcat4.CoyoteAdapter.service(CoyoteAdapter.java:152)
>>> 	at org.apache.coyote.http11.Http11Processor.process(Http11Processor.java:799)
>>> 	at org.apache.coyote.http11.Http11Protocol$Http11ConnectionHandler.processConnection(Http11Protocol.java:705)
>>> 	at org.apache.tomcat.util.net.TcpWorkerThread.runIt(PoolTcpEndpoint.java:577)
>>> 	at org.apache.tomcat.util.threads.ThreadPool$ControlRunnable.run(ThreadPool.java:683)
>>> 	at java.lang.Thread.run(Thread.java:595)
>>>
>>> Any thoughts from anyone out there?
>>>
>>> Brian
>>>
>>> On Fri, 22 Sep 2006 09:43:51 +0200, "Florian Fricker"
>>> <fl...@wyona.com> said:
>>>   
>>>       
>>>> Hey Brian,
>>>>
>>>> Your nutch-site.xml has two parts for ontology.
>>>>
>>>> First Part:
>>>>
>>>> <property>
>>>>   <name>extension.ontology.extension-name</name>
>>>>   <value>org.apache.nutch.ontology.jena.OntologyImpl</value>
>>>>   <description>Loads the Ontology plugin</description>
>>>> </property>
>>>>
>>>> Second Part:
>>>>
>>>> <property>
>>>>   <name>extension.ontology.urls</name>
>>>>   <value>file:/path/to/owl/file or http://path/to/owl/file</value>
>>>>   <description>Shows the owl file</description>
>>>> </property>
>>>>
>>>> Then add the onotology plugin to the nutch plugin section. It should 
>>>> look like this:
>>>>
>>>>
>>>> <property>
>>>>   <name>plugin.includes</name>
>>>>  <value>nutch-extensionpoints|protocol-file|protocol-http|ontology|languageidentifier|analysis-(fr|en|de)|
>>>>               
>>>> urlfilter-regex|parse-(text|pdf|mspowerpoint|msword|msexcel|rtf|rss|html|zip)|index-basic|
>>>>               
>>>> query-(basic|site|url)|lib-jakarta-poi|lib-lucene-analyzers|scoring-opic</value>
>>>> </property>
>>>>
>>>> That's it...
>>>>
>>>> Regards
>>>>
>>>>
>>>> Brian Ulicny wrote:
>>>>     
>>>>         
>>>>> I'm trying to get the Ontology plugin to work in an 0.8 environment.
>>>>>
>>>>> I set extension.ontology.extension-name to
>>>>> org.apache.nutch.ontology.jena.OntologyImpl
>>>>> This seemed to be the correct thing to do according to nutch-default.xml
>>>>> which says: 
>>>>>
>>>>>   <description>Use the specified online ontology extension. If empty,
>>>>>   the first available extension will be used. The "name" here refers to
>>>>>   an 'id'
>>>>>   attribute of the 'implementation' element in the plugin descriptor XML
>>>>>   file.</description>
>>>>>
>>>>> The plugin.xml for ontology is:
>>>>>
>>>>> <plugin
>>>>>    id="ontology"
>>>>>    name="Ontology Plug-in"
>>>>>    version="1.0.0"
>>>>>    provider-name="nutch.org">
>>>>>
>>>>>
>>>>>    <runtime>
>>>>>       <library name="ontology.jar">
>>>>>          <export name="*"/>
>>>>>       </library>
>>>>>
>>>>>       <library name="commons-logging-1.0.3.jar"/>
>>>>>       <library name="icu4j_2_6_1.jar"/>
>>>>>       <library name="jena-2.1.jar"/>
>>>>>
>>>>>    </runtime>
>>>>>
>>>>>    <requires>
>>>>>       <import plugin="nutch-extensionpoints"/>
>>>>>    </requires>
>>>>>
>>>>>    <!-- attribute "point" is the plugin interface class -->
>>>>>    <!-- seems kinda redundant to have to define the point here too -->   
>>>>>    <extension id="org.apache.nutch.ontology.jena"
>>>>>               name="Ontology Model Loader"
>>>>>               point="org.apache.nutch.ontology.Ontology">
>>>>>
>>>>>       <!-- define all the classes that implement the point defined above
>>>>>       -->
>>>>>       <implementation id="org.apache.nutch.ontology.jena.OntologyImpl"
>>>>>                       class="org.apache.nutch.ontology.jena.OntologyImpl">
>>>>>         <parameter name="pathSuffix" value=""/>
>>>>>       </implementation>
>>>>>
>>>>>    </extension>
>>>>>
>>>>> </plugin>
>>>>>
>>>>>
>>>>> I then set the extension.ontology.urls property to a valid (external)
>>>>> OWL url.
>>>>>
>>>>> However, I get this warning, and no ontologies are loaded:
>>>>>
>>>>> 6 WARN  OntologyFactory - Ontology extension not found:
>>>>> 'org.apache.nutch.ontology.jena.OntologyImpl', trying the default
>>>>>
>>>>> What am I doing wrong?
>>>>>
>>>>> B Ulicny
>>>>>   
>>>>>       
>>>>>           
>>>> -- 
>>>> Florian Fricker                          florian.fricker@wyona.com
>>>> Wyona Inc.   -   Open Source Content Management   -   Apache Lenya
>>>> http://www.wyona.com                       http://lenya.apache.org
>>>>
>>>>     
>>>>         


-- 
Florian Fricker                          florian.fricker@wyona.com
Wyona Inc.   -   Open Source Content Management   -   Apache Lenya
http://www.wyona.com                       http://lenya.apache.org


Re: Ontology plugin in 0.8

Posted by Brian Ulicny <bu...@alum.mit.edu>.
I have tried it with the sample owl file.  That didn't work, either.

On Fri, 22 Sep 2006 17:52:37 +0300, "Sami Siren" <ss...@gmail.com>
said:
> try running it first with the submitted .owl 
> (src/plugin/ontology/sample/time.owl)
> and verify that it works for you, after that proceed to more complex 
> ontologies (with what
> I have no experience with)
> 
> --
>  Sami Siren
> 
> Brian Ulicny wrote:
> > Florian, 
> >
> > Thanks for replying: my nutch-site.xml under webapps was wrong. 
> > Correcting it gets me a little further.  However, whether I point to an
> > OWL file locally (with file:) or remotely (with http://), I get the
> > following error (under Root Cause):
> >
> > with 
> >
> >  <name>plugin.includes</name>
> >   <value>nutch-extensionpoints|protocol-file|protocol-http|ontology|languageidentifier|analysis-(fr|en|de)|urlfilter-regex|parse-(text|pdf|mspowerpoint|msword|msexcel|rtf|rss|html|zip)|index-basic|query-(basic|site|url)|lib-jakarta-poi|lib-lucene-analyzers|urlfilter-regex|index-basic|query-(basic|site|url)|summary-basic|iblog|scoring-opic</value>
> >  
> >
> > java.lang.NoSuchFieldError: actualValueType
> > 	at com.hp.hpl.jena.datatypes.xsd.XSDDatatype.convertValidatedDataValue(XSDDatatype.java:371)
> > 	at com.hp.hpl.jena.datatypes.xsd.XSDDatatype.parse(XSDDatatype.java:250)
> > 	at com.hp.hpl.jena.graph.impl.LiteralLabel.setValue(LiteralLabel.java:166)
> > 	at com.hp.hpl.jena.graph.impl.LiteralLabel.<init>(LiteralLabel.java:82)
> > 	at com.hp.hpl.jena.graph.Node.createLiteral(Node.java:151)
> > 	at com.hp.hpl.jena.rdf.arp.JenaReader.convert(JenaReader.java:147)
> > 	at com.hp.hpl.jena.rdf.arp.JenaReader.convert(JenaReader.java:169)
> > 	at com.hp.hpl.jena.rdf.arp.JenaReader$JRStatementHandler.statement(JenaReader.java:57)
> > 	at com.hp.hpl.jena.rdf.arp.ARPResource.tripleLit(ARPResource.java:190)
> > 	at com.hp.hpl.jena.rdf.arp.ARPResource.triple(ARPResource.java:172)
> > 	at com.hp.hpl.jena.rdf.arp.ARPResource.setPredObject(ARPResource.java:146)
> > 	at com.hp.hpl.jena.rdf.arp.ARPResource.setPredicateObject(ARPResource.java:127)
> > 	at com.hp.hpl.jena.rdf.arp.ParserSupport.createTriple(ParserSupport.java:315)
> > 	at com.hp.hpl.jena.rdf.arp.RDFParser.propertyEltIdAndValue(RDFParser.java:692)
> > 	at com.hp.hpl.jena.rdf.arp.RDFParser.propertyElt(RDFParser.java:647)
> > 	at com.hp.hpl.jena.rdf.arp.RDFParser.typedNode(RDFParser.java:984)
> > 	at com.hp.hpl.jena.rdf.arp.RDFParser.description(RDFParser.java:429)
> > 	at com.hp.hpl.jena.rdf.arp.RDFParser.obj(RDFParser.java:346)
> > 	at com.hp.hpl.jena.rdf.arp.RDFParser.rdf(RDFParser.java:326)
> > 	at com.hp.hpl.jena.rdf.arp.RDFParser.rdfFileSub(RDFParser.java:107)
> > 	at com.hp.hpl.jena.rdf.arp.RDFParser.rdfFile(RDFParser.java:99)
> > 	at com.hp.hpl.jena.rdf.arp.ARPFilter.parse(ARPFilter.java:314)
> > 	at com.hp.hpl.jena.rdf.arp.JenaReader.read(JenaReader.java:221)
> > 	at com.hp.hpl.jena.rdf.arp.JenaReader.read(JenaReader.java:209)
> > 	at com.hp.hpl.jena.rdf.arp.JenaReader.read(JenaReader.java:261)
> > 	at com.hp.hpl.jena.rdf.model.impl.ModelCom.read(ModelCom.java:206)
> > 	at com.hp.hpl.jena.ontology.OntDocumentManager.read(OntDocumentManager.java:986)
> > 	at com.hp.hpl.jena.ontology.OntDocumentManager.findMetadata(OntDocumentManager.java:807)
> > 	at com.hp.hpl.jena.ontology.OntDocumentManager.initialiseMetadata(OntDocumentManager.java:781)
> > 	at com.hp.hpl.jena.ontology.OntDocumentManager.<init>(OntDocumentManager.java:171)
> > 	at com.hp.hpl.jena.ontology.OntDocumentManager.<init>(OntDocumentManager.java:155)
> > 	at com.hp.hpl.jena.ontology.OntDocumentManager.<clinit>(OntDocumentManager.java:108)
> > 	at com.hp.hpl.jena.ontology.OntModelSpec.getDocumentManager(OntModelSpec.java:247)
> > 	at com.hp.hpl.jena.ontology.impl.OntModelImpl.getDocumentManager(OntModelImpl.java:174)
> > 	at com.hp.hpl.jena.ontology.impl.OntModelImpl.<init>(OntModelImpl.java:145)
> > 	at com.hp.hpl.jena.ontology.impl.OntModelImpl.<init>(OntModelImpl.java:116)
> > 	at com.hp.hpl.jena.rdf.model.ModelFactory.createOntologyModel(ModelFactory.java:375)
> > 	at org.apache.nutch.ontology.jena.OntologyImpl.<init>(OntologyImpl.java:81)
> > 	at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)
> > 	at sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:39)
> > 	at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:27)
> > 	at java.lang.reflect.Constructor.newInstance(Constructor.java:494)
> > 	at java.lang.Class.newInstance0(Class.java:350)
> > 	at java.lang.Class.newInstance(Class.java:303)
> > 	at org.apache.nutch.plugin.Extension.getExtensionInstance(Extension.java:151)
> > 	at org.apache.nutch.ontology.OntologyFactory.getOntology(OntologyFactory.java:64)
> > 	at org.apache.jsp.search_jsp._jspService(search_jsp.java:118)
> > 	at org.apache.jasper.runtime.HttpJspBase.service(HttpJspBase.java:92)
> > 	at javax.servlet.http.HttpServlet.service(HttpServlet.java:809)
> > 	at org.apache.jasper.servlet.JspServletWrapper.service(JspServletWrapper.java:162)
> > 	at org.apache.jasper.servlet.JspServlet.serviceJspFile(JspServlet.java:240)
> > 	at org.apache.jasper.servlet.JspServlet.service(JspServlet.java:187)
> > 	at javax.servlet.http.HttpServlet.service(HttpServlet.java:809)
> > 	at org.apache.catalina.core.ApplicationFilterChain.internalDoFilter(ApplicationFilterChain.java:200)
> > 	at org.apache.catalina.core.ApplicationFilterChain.doFilter(ApplicationFilterChain.java:146)
> > 	at org.apache.catalina.core.StandardWrapperValve.invoke(StandardWrapperValve.java:209)
> > 	at org.apache.catalina.core.StandardPipeline$StandardPipelineValveContext.invokeNext(StandardPipeline.java:596)
> > 	at org.apache.catalina.core.StandardPipeline.invoke(StandardPipeline.java:433)
> > 	at org.apache.catalina.core.ContainerBase.invoke(ContainerBase.java:948)
> > 	at org.apache.catalina.core.StandardContextValve.invoke(StandardContextValve.java:144)
> > 	at org.apache.catalina.core.StandardPipeline$StandardPipelineValveContext.invokeNext(StandardPipeline.java:596)
> > 	at org.apache.catalina.core.StandardPipeline.invoke(StandardPipeline.java:433)
> > 	at org.apache.catalina.core.ContainerBase.invoke(ContainerBase.java:948)
> > 	at org.apache.catalina.core.StandardContext.invoke(StandardContext.java:2358)
> > 	at org.apache.catalina.core.StandardHostValve.invoke(StandardHostValve.java:133)
> > 	at org.apache.catalina.core.StandardPipeline$StandardPipelineValveContext.invokeNext(StandardPipeline.java:596)
> > 	at org.apache.catalina.valves.ErrorDispatcherValve.invoke(ErrorDispatcherValve.java:118)
> > 	at org.apache.catalina.core.StandardPipeline$StandardPipelineValveContext.invokeNext(StandardPipeline.java:594)
> > 	at org.apache.catalina.valves.ErrorReportValve.invoke(ErrorReportValve.java:116)
> > 	at org.apache.catalina.core.StandardPipeline$StandardPipelineValveContext.invokeNext(StandardPipeline.java:594)
> > 	at org.apache.catalina.core.StandardPipeline.invoke(StandardPipeline.java:433)
> > 	at org.apache.catalina.core.ContainerBase.invoke(ContainerBase.java:948)
> > 	at org.apache.catalina.core.StandardEngineValve.invoke(StandardEngineValve.java:127)
> > 	at org.apache.catalina.core.StandardPipeline$StandardPipelineValveContext.invokeNext(StandardPipeline.java:596)
> > 	at org.apache.catalina.core.StandardPipeline.invoke(StandardPipeline.java:433)
> > 	at org.apache.catalina.core.ContainerBase.invoke(ContainerBase.java:948)
> > 	at org.apache.coyote.tomcat4.CoyoteAdapter.service(CoyoteAdapter.java:152)
> > 	at org.apache.coyote.http11.Http11Processor.process(Http11Processor.java:799)
> > 	at org.apache.coyote.http11.Http11Protocol$Http11ConnectionHandler.processConnection(Http11Protocol.java:705)
> > 	at org.apache.tomcat.util.net.TcpWorkerThread.runIt(PoolTcpEndpoint.java:577)
> > 	at org.apache.tomcat.util.threads.ThreadPool$ControlRunnable.run(ThreadPool.java:683)
> > 	at java.lang.Thread.run(Thread.java:595)
> >
> > Any thoughts from anyone out there?
> >
> > Brian
> >
> > On Fri, 22 Sep 2006 09:43:51 +0200, "Florian Fricker"
> > <fl...@wyona.com> said:
> >   
> >> Hey Brian,
> >>
> >> Your nutch-site.xml has two parts for ontology.
> >>
> >> First Part:
> >>
> >> <property>
> >>   <name>extension.ontology.extension-name</name>
> >>   <value>org.apache.nutch.ontology.jena.OntologyImpl</value>
> >>   <description>Loads the Ontology plugin</description>
> >> </property>
> >>
> >> Second Part:
> >>
> >> <property>
> >>   <name>extension.ontology.urls</name>
> >>   <value>file:/path/to/owl/file or http://path/to/owl/file</value>
> >>   <description>Shows the owl file</description>
> >> </property>
> >>
> >> Then add the onotology plugin to the nutch plugin section. It should 
> >> look like this:
> >>
> >>
> >> <property>
> >>   <name>plugin.includes</name>
> >>  <value>nutch-extensionpoints|protocol-file|protocol-http|ontology|languageidentifier|analysis-(fr|en|de)|
> >>               
> >> urlfilter-regex|parse-(text|pdf|mspowerpoint|msword|msexcel|rtf|rss|html|zip)|index-basic|
> >>               
> >> query-(basic|site|url)|lib-jakarta-poi|lib-lucene-analyzers|scoring-opic</value>
> >> </property>
> >>
> >> That's it...
> >>
> >> Regards
> >>
> >>
> >> Brian Ulicny wrote:
> >>     
> >>> I'm trying to get the Ontology plugin to work in an 0.8 environment.
> >>>
> >>> I set extension.ontology.extension-name to
> >>> org.apache.nutch.ontology.jena.OntologyImpl
> >>> This seemed to be the correct thing to do according to nutch-default.xml
> >>> which says: 
> >>>
> >>>   <description>Use the specified online ontology extension. If empty,
> >>>   the first available extension will be used. The "name" here refers to
> >>>   an 'id'
> >>>   attribute of the 'implementation' element in the plugin descriptor XML
> >>>   file.</description>
> >>>
> >>> The plugin.xml for ontology is:
> >>>
> >>> <plugin
> >>>    id="ontology"
> >>>    name="Ontology Plug-in"
> >>>    version="1.0.0"
> >>>    provider-name="nutch.org">
> >>>
> >>>
> >>>    <runtime>
> >>>       <library name="ontology.jar">
> >>>          <export name="*"/>
> >>>       </library>
> >>>
> >>>       <library name="commons-logging-1.0.3.jar"/>
> >>>       <library name="icu4j_2_6_1.jar"/>
> >>>       <library name="jena-2.1.jar"/>
> >>>
> >>>    </runtime>
> >>>
> >>>    <requires>
> >>>       <import plugin="nutch-extensionpoints"/>
> >>>    </requires>
> >>>
> >>>    <!-- attribute "point" is the plugin interface class -->
> >>>    <!-- seems kinda redundant to have to define the point here too -->   
> >>>    <extension id="org.apache.nutch.ontology.jena"
> >>>               name="Ontology Model Loader"
> >>>               point="org.apache.nutch.ontology.Ontology">
> >>>
> >>>       <!-- define all the classes that implement the point defined above
> >>>       -->
> >>>       <implementation id="org.apache.nutch.ontology.jena.OntologyImpl"
> >>>                       class="org.apache.nutch.ontology.jena.OntologyImpl">
> >>>         <parameter name="pathSuffix" value=""/>
> >>>       </implementation>
> >>>
> >>>    </extension>
> >>>
> >>> </plugin>
> >>>
> >>>
> >>> I then set the extension.ontology.urls property to a valid (external)
> >>> OWL url.
> >>>
> >>> However, I get this warning, and no ontologies are loaded:
> >>>
> >>> 6 WARN  OntologyFactory - Ontology extension not found:
> >>> 'org.apache.nutch.ontology.jena.OntologyImpl', trying the default
> >>>
> >>> What am I doing wrong?
> >>>
> >>> B Ulicny
> >>>   
> >>>       
> >> -- 
> >> Florian Fricker                          florian.fricker@wyona.com
> >> Wyona Inc.   -   Open Source Content Management   -   Apache Lenya
> >> http://www.wyona.com                       http://lenya.apache.org
> >>
> >>     
> 
-- 
  Brian Ulicny
  bulicny@alum.mit.edu


Re: Ontology plugin in 0.8

Posted by Sami Siren <ss...@gmail.com>.
try running it first with the submitted .owl 
(src/plugin/ontology/sample/time.owl)
and verify that it works for you, after that proceed to more complex 
ontologies (with what
I have no experience with)

--
 Sami Siren

Brian Ulicny wrote:
> Florian, 
>
> Thanks for replying: my nutch-site.xml under webapps was wrong. 
> Correcting it gets me a little further.  However, whether I point to an
> OWL file locally (with file:) or remotely (with http://), I get the
> following error (under Root Cause):
>
> with 
>
>  <name>plugin.includes</name>
>   <value>nutch-extensionpoints|protocol-file|protocol-http|ontology|languageidentifier|analysis-(fr|en|de)|urlfilter-regex|parse-(text|pdf|mspowerpoint|msword|msexcel|rtf|rss|html|zip)|index-basic|query-(basic|site|url)|lib-jakarta-poi|lib-lucene-analyzers|urlfilter-regex|index-basic|query-(basic|site|url)|summary-basic|iblog|scoring-opic</value>
>  
>
> java.lang.NoSuchFieldError: actualValueType
> 	at com.hp.hpl.jena.datatypes.xsd.XSDDatatype.convertValidatedDataValue(XSDDatatype.java:371)
> 	at com.hp.hpl.jena.datatypes.xsd.XSDDatatype.parse(XSDDatatype.java:250)
> 	at com.hp.hpl.jena.graph.impl.LiteralLabel.setValue(LiteralLabel.java:166)
> 	at com.hp.hpl.jena.graph.impl.LiteralLabel.<init>(LiteralLabel.java:82)
> 	at com.hp.hpl.jena.graph.Node.createLiteral(Node.java:151)
> 	at com.hp.hpl.jena.rdf.arp.JenaReader.convert(JenaReader.java:147)
> 	at com.hp.hpl.jena.rdf.arp.JenaReader.convert(JenaReader.java:169)
> 	at com.hp.hpl.jena.rdf.arp.JenaReader$JRStatementHandler.statement(JenaReader.java:57)
> 	at com.hp.hpl.jena.rdf.arp.ARPResource.tripleLit(ARPResource.java:190)
> 	at com.hp.hpl.jena.rdf.arp.ARPResource.triple(ARPResource.java:172)
> 	at com.hp.hpl.jena.rdf.arp.ARPResource.setPredObject(ARPResource.java:146)
> 	at com.hp.hpl.jena.rdf.arp.ARPResource.setPredicateObject(ARPResource.java:127)
> 	at com.hp.hpl.jena.rdf.arp.ParserSupport.createTriple(ParserSupport.java:315)
> 	at com.hp.hpl.jena.rdf.arp.RDFParser.propertyEltIdAndValue(RDFParser.java:692)
> 	at com.hp.hpl.jena.rdf.arp.RDFParser.propertyElt(RDFParser.java:647)
> 	at com.hp.hpl.jena.rdf.arp.RDFParser.typedNode(RDFParser.java:984)
> 	at com.hp.hpl.jena.rdf.arp.RDFParser.description(RDFParser.java:429)
> 	at com.hp.hpl.jena.rdf.arp.RDFParser.obj(RDFParser.java:346)
> 	at com.hp.hpl.jena.rdf.arp.RDFParser.rdf(RDFParser.java:326)
> 	at com.hp.hpl.jena.rdf.arp.RDFParser.rdfFileSub(RDFParser.java:107)
> 	at com.hp.hpl.jena.rdf.arp.RDFParser.rdfFile(RDFParser.java:99)
> 	at com.hp.hpl.jena.rdf.arp.ARPFilter.parse(ARPFilter.java:314)
> 	at com.hp.hpl.jena.rdf.arp.JenaReader.read(JenaReader.java:221)
> 	at com.hp.hpl.jena.rdf.arp.JenaReader.read(JenaReader.java:209)
> 	at com.hp.hpl.jena.rdf.arp.JenaReader.read(JenaReader.java:261)
> 	at com.hp.hpl.jena.rdf.model.impl.ModelCom.read(ModelCom.java:206)
> 	at com.hp.hpl.jena.ontology.OntDocumentManager.read(OntDocumentManager.java:986)
> 	at com.hp.hpl.jena.ontology.OntDocumentManager.findMetadata(OntDocumentManager.java:807)
> 	at com.hp.hpl.jena.ontology.OntDocumentManager.initialiseMetadata(OntDocumentManager.java:781)
> 	at com.hp.hpl.jena.ontology.OntDocumentManager.<init>(OntDocumentManager.java:171)
> 	at com.hp.hpl.jena.ontology.OntDocumentManager.<init>(OntDocumentManager.java:155)
> 	at com.hp.hpl.jena.ontology.OntDocumentManager.<clinit>(OntDocumentManager.java:108)
> 	at com.hp.hpl.jena.ontology.OntModelSpec.getDocumentManager(OntModelSpec.java:247)
> 	at com.hp.hpl.jena.ontology.impl.OntModelImpl.getDocumentManager(OntModelImpl.java:174)
> 	at com.hp.hpl.jena.ontology.impl.OntModelImpl.<init>(OntModelImpl.java:145)
> 	at com.hp.hpl.jena.ontology.impl.OntModelImpl.<init>(OntModelImpl.java:116)
> 	at com.hp.hpl.jena.rdf.model.ModelFactory.createOntologyModel(ModelFactory.java:375)
> 	at org.apache.nutch.ontology.jena.OntologyImpl.<init>(OntologyImpl.java:81)
> 	at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)
> 	at sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:39)
> 	at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:27)
> 	at java.lang.reflect.Constructor.newInstance(Constructor.java:494)
> 	at java.lang.Class.newInstance0(Class.java:350)
> 	at java.lang.Class.newInstance(Class.java:303)
> 	at org.apache.nutch.plugin.Extension.getExtensionInstance(Extension.java:151)
> 	at org.apache.nutch.ontology.OntologyFactory.getOntology(OntologyFactory.java:64)
> 	at org.apache.jsp.search_jsp._jspService(search_jsp.java:118)
> 	at org.apache.jasper.runtime.HttpJspBase.service(HttpJspBase.java:92)
> 	at javax.servlet.http.HttpServlet.service(HttpServlet.java:809)
> 	at org.apache.jasper.servlet.JspServletWrapper.service(JspServletWrapper.java:162)
> 	at org.apache.jasper.servlet.JspServlet.serviceJspFile(JspServlet.java:240)
> 	at org.apache.jasper.servlet.JspServlet.service(JspServlet.java:187)
> 	at javax.servlet.http.HttpServlet.service(HttpServlet.java:809)
> 	at org.apache.catalina.core.ApplicationFilterChain.internalDoFilter(ApplicationFilterChain.java:200)
> 	at org.apache.catalina.core.ApplicationFilterChain.doFilter(ApplicationFilterChain.java:146)
> 	at org.apache.catalina.core.StandardWrapperValve.invoke(StandardWrapperValve.java:209)
> 	at org.apache.catalina.core.StandardPipeline$StandardPipelineValveContext.invokeNext(StandardPipeline.java:596)
> 	at org.apache.catalina.core.StandardPipeline.invoke(StandardPipeline.java:433)
> 	at org.apache.catalina.core.ContainerBase.invoke(ContainerBase.java:948)
> 	at org.apache.catalina.core.StandardContextValve.invoke(StandardContextValve.java:144)
> 	at org.apache.catalina.core.StandardPipeline$StandardPipelineValveContext.invokeNext(StandardPipeline.java:596)
> 	at org.apache.catalina.core.StandardPipeline.invoke(StandardPipeline.java:433)
> 	at org.apache.catalina.core.ContainerBase.invoke(ContainerBase.java:948)
> 	at org.apache.catalina.core.StandardContext.invoke(StandardContext.java:2358)
> 	at org.apache.catalina.core.StandardHostValve.invoke(StandardHostValve.java:133)
> 	at org.apache.catalina.core.StandardPipeline$StandardPipelineValveContext.invokeNext(StandardPipeline.java:596)
> 	at org.apache.catalina.valves.ErrorDispatcherValve.invoke(ErrorDispatcherValve.java:118)
> 	at org.apache.catalina.core.StandardPipeline$StandardPipelineValveContext.invokeNext(StandardPipeline.java:594)
> 	at org.apache.catalina.valves.ErrorReportValve.invoke(ErrorReportValve.java:116)
> 	at org.apache.catalina.core.StandardPipeline$StandardPipelineValveContext.invokeNext(StandardPipeline.java:594)
> 	at org.apache.catalina.core.StandardPipeline.invoke(StandardPipeline.java:433)
> 	at org.apache.catalina.core.ContainerBase.invoke(ContainerBase.java:948)
> 	at org.apache.catalina.core.StandardEngineValve.invoke(StandardEngineValve.java:127)
> 	at org.apache.catalina.core.StandardPipeline$StandardPipelineValveContext.invokeNext(StandardPipeline.java:596)
> 	at org.apache.catalina.core.StandardPipeline.invoke(StandardPipeline.java:433)
> 	at org.apache.catalina.core.ContainerBase.invoke(ContainerBase.java:948)
> 	at org.apache.coyote.tomcat4.CoyoteAdapter.service(CoyoteAdapter.java:152)
> 	at org.apache.coyote.http11.Http11Processor.process(Http11Processor.java:799)
> 	at org.apache.coyote.http11.Http11Protocol$Http11ConnectionHandler.processConnection(Http11Protocol.java:705)
> 	at org.apache.tomcat.util.net.TcpWorkerThread.runIt(PoolTcpEndpoint.java:577)
> 	at org.apache.tomcat.util.threads.ThreadPool$ControlRunnable.run(ThreadPool.java:683)
> 	at java.lang.Thread.run(Thread.java:595)
>
> Any thoughts from anyone out there?
>
> Brian
>
> On Fri, 22 Sep 2006 09:43:51 +0200, "Florian Fricker"
> <fl...@wyona.com> said:
>   
>> Hey Brian,
>>
>> Your nutch-site.xml has two parts for ontology.
>>
>> First Part:
>>
>> <property>
>>   <name>extension.ontology.extension-name</name>
>>   <value>org.apache.nutch.ontology.jena.OntologyImpl</value>
>>   <description>Loads the Ontology plugin</description>
>> </property>
>>
>> Second Part:
>>
>> <property>
>>   <name>extension.ontology.urls</name>
>>   <value>file:/path/to/owl/file or http://path/to/owl/file</value>
>>   <description>Shows the owl file</description>
>> </property>
>>
>> Then add the onotology plugin to the nutch plugin section. It should 
>> look like this:
>>
>>
>> <property>
>>   <name>plugin.includes</name>
>>  <value>nutch-extensionpoints|protocol-file|protocol-http|ontology|languageidentifier|analysis-(fr|en|de)|
>>               
>> urlfilter-regex|parse-(text|pdf|mspowerpoint|msword|msexcel|rtf|rss|html|zip)|index-basic|
>>               
>> query-(basic|site|url)|lib-jakarta-poi|lib-lucene-analyzers|scoring-opic</value>
>> </property>
>>
>> That's it...
>>
>> Regards
>>
>>
>> Brian Ulicny wrote:
>>     
>>> I'm trying to get the Ontology plugin to work in an 0.8 environment.
>>>
>>> I set extension.ontology.extension-name to
>>> org.apache.nutch.ontology.jena.OntologyImpl
>>> This seemed to be the correct thing to do according to nutch-default.xml
>>> which says: 
>>>
>>>   <description>Use the specified online ontology extension. If empty,
>>>   the first available extension will be used. The "name" here refers to
>>>   an 'id'
>>>   attribute of the 'implementation' element in the plugin descriptor XML
>>>   file.</description>
>>>
>>> The plugin.xml for ontology is:
>>>
>>> <plugin
>>>    id="ontology"
>>>    name="Ontology Plug-in"
>>>    version="1.0.0"
>>>    provider-name="nutch.org">
>>>
>>>
>>>    <runtime>
>>>       <library name="ontology.jar">
>>>          <export name="*"/>
>>>       </library>
>>>
>>>       <library name="commons-logging-1.0.3.jar"/>
>>>       <library name="icu4j_2_6_1.jar"/>
>>>       <library name="jena-2.1.jar"/>
>>>
>>>    </runtime>
>>>
>>>    <requires>
>>>       <import plugin="nutch-extensionpoints"/>
>>>    </requires>
>>>
>>>    <!-- attribute "point" is the plugin interface class -->
>>>    <!-- seems kinda redundant to have to define the point here too -->   
>>>    <extension id="org.apache.nutch.ontology.jena"
>>>               name="Ontology Model Loader"
>>>               point="org.apache.nutch.ontology.Ontology">
>>>
>>>       <!-- define all the classes that implement the point defined above
>>>       -->
>>>       <implementation id="org.apache.nutch.ontology.jena.OntologyImpl"
>>>                       class="org.apache.nutch.ontology.jena.OntologyImpl">
>>>         <parameter name="pathSuffix" value=""/>
>>>       </implementation>
>>>
>>>    </extension>
>>>
>>> </plugin>
>>>
>>>
>>> I then set the extension.ontology.urls property to a valid (external)
>>> OWL url.
>>>
>>> However, I get this warning, and no ontologies are loaded:
>>>
>>> 6 WARN  OntologyFactory - Ontology extension not found:
>>> 'org.apache.nutch.ontology.jena.OntologyImpl', trying the default
>>>
>>> What am I doing wrong?
>>>
>>> B Ulicny
>>>   
>>>       
>> -- 
>> Florian Fricker                          florian.fricker@wyona.com
>> Wyona Inc.   -   Open Source Content Management   -   Apache Lenya
>> http://www.wyona.com                       http://lenya.apache.org
>>
>>     


Re: Ontology plugin in 0.8

Posted by Brian Ulicny <bu...@alum.mit.edu>.
Florian, 

Thanks for replying: my nutch-site.xml under webapps was wrong. 
Correcting it gets me a little further.  However, whether I point to an
OWL file locally (with file:) or remotely (with http://), I get the
following error (under Root Cause):

with 

 <name>plugin.includes</name>
  <value>nutch-extensionpoints|protocol-file|protocol-http|ontology|languageidentifier|analysis-(fr|en|de)|urlfilter-regex|parse-(text|pdf|mspowerpoint|msword|msexcel|rtf|rss|html|zip)|index-basic|query-(basic|site|url)|lib-jakarta-poi|lib-lucene-analyzers|urlfilter-regex|index-basic|query-(basic|site|url)|summary-basic|iblog|scoring-opic</value>
 

java.lang.NoSuchFieldError: actualValueType
	at com.hp.hpl.jena.datatypes.xsd.XSDDatatype.convertValidatedDataValue(XSDDatatype.java:371)
	at com.hp.hpl.jena.datatypes.xsd.XSDDatatype.parse(XSDDatatype.java:250)
	at com.hp.hpl.jena.graph.impl.LiteralLabel.setValue(LiteralLabel.java:166)
	at com.hp.hpl.jena.graph.impl.LiteralLabel.<init>(LiteralLabel.java:82)
	at com.hp.hpl.jena.graph.Node.createLiteral(Node.java:151)
	at com.hp.hpl.jena.rdf.arp.JenaReader.convert(JenaReader.java:147)
	at com.hp.hpl.jena.rdf.arp.JenaReader.convert(JenaReader.java:169)
	at com.hp.hpl.jena.rdf.arp.JenaReader$JRStatementHandler.statement(JenaReader.java:57)
	at com.hp.hpl.jena.rdf.arp.ARPResource.tripleLit(ARPResource.java:190)
	at com.hp.hpl.jena.rdf.arp.ARPResource.triple(ARPResource.java:172)
	at com.hp.hpl.jena.rdf.arp.ARPResource.setPredObject(ARPResource.java:146)
	at com.hp.hpl.jena.rdf.arp.ARPResource.setPredicateObject(ARPResource.java:127)
	at com.hp.hpl.jena.rdf.arp.ParserSupport.createTriple(ParserSupport.java:315)
	at com.hp.hpl.jena.rdf.arp.RDFParser.propertyEltIdAndValue(RDFParser.java:692)
	at com.hp.hpl.jena.rdf.arp.RDFParser.propertyElt(RDFParser.java:647)
	at com.hp.hpl.jena.rdf.arp.RDFParser.typedNode(RDFParser.java:984)
	at com.hp.hpl.jena.rdf.arp.RDFParser.description(RDFParser.java:429)
	at com.hp.hpl.jena.rdf.arp.RDFParser.obj(RDFParser.java:346)
	at com.hp.hpl.jena.rdf.arp.RDFParser.rdf(RDFParser.java:326)
	at com.hp.hpl.jena.rdf.arp.RDFParser.rdfFileSub(RDFParser.java:107)
	at com.hp.hpl.jena.rdf.arp.RDFParser.rdfFile(RDFParser.java:99)
	at com.hp.hpl.jena.rdf.arp.ARPFilter.parse(ARPFilter.java:314)
	at com.hp.hpl.jena.rdf.arp.JenaReader.read(JenaReader.java:221)
	at com.hp.hpl.jena.rdf.arp.JenaReader.read(JenaReader.java:209)
	at com.hp.hpl.jena.rdf.arp.JenaReader.read(JenaReader.java:261)
	at com.hp.hpl.jena.rdf.model.impl.ModelCom.read(ModelCom.java:206)
	at com.hp.hpl.jena.ontology.OntDocumentManager.read(OntDocumentManager.java:986)
	at com.hp.hpl.jena.ontology.OntDocumentManager.findMetadata(OntDocumentManager.java:807)
	at com.hp.hpl.jena.ontology.OntDocumentManager.initialiseMetadata(OntDocumentManager.java:781)
	at com.hp.hpl.jena.ontology.OntDocumentManager.<init>(OntDocumentManager.java:171)
	at com.hp.hpl.jena.ontology.OntDocumentManager.<init>(OntDocumentManager.java:155)
	at com.hp.hpl.jena.ontology.OntDocumentManager.<clinit>(OntDocumentManager.java:108)
	at com.hp.hpl.jena.ontology.OntModelSpec.getDocumentManager(OntModelSpec.java:247)
	at com.hp.hpl.jena.ontology.impl.OntModelImpl.getDocumentManager(OntModelImpl.java:174)
	at com.hp.hpl.jena.ontology.impl.OntModelImpl.<init>(OntModelImpl.java:145)
	at com.hp.hpl.jena.ontology.impl.OntModelImpl.<init>(OntModelImpl.java:116)
	at com.hp.hpl.jena.rdf.model.ModelFactory.createOntologyModel(ModelFactory.java:375)
	at org.apache.nutch.ontology.jena.OntologyImpl.<init>(OntologyImpl.java:81)
	at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)
	at sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:39)
	at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:27)
	at java.lang.reflect.Constructor.newInstance(Constructor.java:494)
	at java.lang.Class.newInstance0(Class.java:350)
	at java.lang.Class.newInstance(Class.java:303)
	at org.apache.nutch.plugin.Extension.getExtensionInstance(Extension.java:151)
	at org.apache.nutch.ontology.OntologyFactory.getOntology(OntologyFactory.java:64)
	at org.apache.jsp.search_jsp._jspService(search_jsp.java:118)
	at org.apache.jasper.runtime.HttpJspBase.service(HttpJspBase.java:92)
	at javax.servlet.http.HttpServlet.service(HttpServlet.java:809)
	at org.apache.jasper.servlet.JspServletWrapper.service(JspServletWrapper.java:162)
	at org.apache.jasper.servlet.JspServlet.serviceJspFile(JspServlet.java:240)
	at org.apache.jasper.servlet.JspServlet.service(JspServlet.java:187)
	at javax.servlet.http.HttpServlet.service(HttpServlet.java:809)
	at org.apache.catalina.core.ApplicationFilterChain.internalDoFilter(ApplicationFilterChain.java:200)
	at org.apache.catalina.core.ApplicationFilterChain.doFilter(ApplicationFilterChain.java:146)
	at org.apache.catalina.core.StandardWrapperValve.invoke(StandardWrapperValve.java:209)
	at org.apache.catalina.core.StandardPipeline$StandardPipelineValveContext.invokeNext(StandardPipeline.java:596)
	at org.apache.catalina.core.StandardPipeline.invoke(StandardPipeline.java:433)
	at org.apache.catalina.core.ContainerBase.invoke(ContainerBase.java:948)
	at org.apache.catalina.core.StandardContextValve.invoke(StandardContextValve.java:144)
	at org.apache.catalina.core.StandardPipeline$StandardPipelineValveContext.invokeNext(StandardPipeline.java:596)
	at org.apache.catalina.core.StandardPipeline.invoke(StandardPipeline.java:433)
	at org.apache.catalina.core.ContainerBase.invoke(ContainerBase.java:948)
	at org.apache.catalina.core.StandardContext.invoke(StandardContext.java:2358)
	at org.apache.catalina.core.StandardHostValve.invoke(StandardHostValve.java:133)
	at org.apache.catalina.core.StandardPipeline$StandardPipelineValveContext.invokeNext(StandardPipeline.java:596)
	at org.apache.catalina.valves.ErrorDispatcherValve.invoke(ErrorDispatcherValve.java:118)
	at org.apache.catalina.core.StandardPipeline$StandardPipelineValveContext.invokeNext(StandardPipeline.java:594)
	at org.apache.catalina.valves.ErrorReportValve.invoke(ErrorReportValve.java:116)
	at org.apache.catalina.core.StandardPipeline$StandardPipelineValveContext.invokeNext(StandardPipeline.java:594)
	at org.apache.catalina.core.StandardPipeline.invoke(StandardPipeline.java:433)
	at org.apache.catalina.core.ContainerBase.invoke(ContainerBase.java:948)
	at org.apache.catalina.core.StandardEngineValve.invoke(StandardEngineValve.java:127)
	at org.apache.catalina.core.StandardPipeline$StandardPipelineValveContext.invokeNext(StandardPipeline.java:596)
	at org.apache.catalina.core.StandardPipeline.invoke(StandardPipeline.java:433)
	at org.apache.catalina.core.ContainerBase.invoke(ContainerBase.java:948)
	at org.apache.coyote.tomcat4.CoyoteAdapter.service(CoyoteAdapter.java:152)
	at org.apache.coyote.http11.Http11Processor.process(Http11Processor.java:799)
	at org.apache.coyote.http11.Http11Protocol$Http11ConnectionHandler.processConnection(Http11Protocol.java:705)
	at org.apache.tomcat.util.net.TcpWorkerThread.runIt(PoolTcpEndpoint.java:577)
	at org.apache.tomcat.util.threads.ThreadPool$ControlRunnable.run(ThreadPool.java:683)
	at java.lang.Thread.run(Thread.java:595)

Any thoughts from anyone out there?

Brian

On Fri, 22 Sep 2006 09:43:51 +0200, "Florian Fricker"
<fl...@wyona.com> said:
> Hey Brian,
> 
> Your nutch-site.xml has two parts for ontology.
> 
> First Part:
> 
> <property>
>   <name>extension.ontology.extension-name</name>
>   <value>org.apache.nutch.ontology.jena.OntologyImpl</value>
>   <description>Loads the Ontology plugin</description>
> </property>
> 
> Second Part:
> 
> <property>
>   <name>extension.ontology.urls</name>
>   <value>file:/path/to/owl/file or http://path/to/owl/file</value>
>   <description>Shows the owl file</description>
> </property>
> 
> Then add the onotology plugin to the nutch plugin section. It should 
> look like this:
> 
> 
> <property>
>   <name>plugin.includes</name>
>  <value>nutch-extensionpoints|protocol-file|protocol-http|ontology|languageidentifier|analysis-(fr|en|de)|
>               
> urlfilter-regex|parse-(text|pdf|mspowerpoint|msword|msexcel|rtf|rss|html|zip)|index-basic|
>               
> query-(basic|site|url)|lib-jakarta-poi|lib-lucene-analyzers|scoring-opic</value>
> </property>
> 
> That's it...
> 
> Regards
> 
> 
> Brian Ulicny wrote:
> > I'm trying to get the Ontology plugin to work in an 0.8 environment.
> >
> > I set extension.ontology.extension-name to
> > org.apache.nutch.ontology.jena.OntologyImpl
> > This seemed to be the correct thing to do according to nutch-default.xml
> > which says: 
> >
> >   <description>Use the specified online ontology extension. If empty,
> >   the first available extension will be used. The "name" here refers to
> >   an 'id'
> >   attribute of the 'implementation' element in the plugin descriptor XML
> >   file.</description>
> >
> > The plugin.xml for ontology is:
> >
> > <plugin
> >    id="ontology"
> >    name="Ontology Plug-in"
> >    version="1.0.0"
> >    provider-name="nutch.org">
> >
> >
> >    <runtime>
> >       <library name="ontology.jar">
> >          <export name="*"/>
> >       </library>
> >
> >       <library name="commons-logging-1.0.3.jar"/>
> >       <library name="icu4j_2_6_1.jar"/>
> >       <library name="jena-2.1.jar"/>
> >
> >    </runtime>
> >
> >    <requires>
> >       <import plugin="nutch-extensionpoints"/>
> >    </requires>
> >
> >    <!-- attribute "point" is the plugin interface class -->
> >    <!-- seems kinda redundant to have to define the point here too -->   
> >    <extension id="org.apache.nutch.ontology.jena"
> >               name="Ontology Model Loader"
> >               point="org.apache.nutch.ontology.Ontology">
> >
> >       <!-- define all the classes that implement the point defined above
> >       -->
> >       <implementation id="org.apache.nutch.ontology.jena.OntologyImpl"
> >                       class="org.apache.nutch.ontology.jena.OntologyImpl">
> >         <parameter name="pathSuffix" value=""/>
> >       </implementation>
> >
> >    </extension>
> >
> > </plugin>
> >
> >
> > I then set the extension.ontology.urls property to a valid (external)
> > OWL url.
> >
> > However, I get this warning, and no ontologies are loaded:
> >
> > 6 WARN  OntologyFactory - Ontology extension not found:
> > 'org.apache.nutch.ontology.jena.OntologyImpl', trying the default
> >
> > What am I doing wrong?
> >
> > B Ulicny
> >   
> 
> 
> -- 
> Florian Fricker                          florian.fricker@wyona.com
> Wyona Inc.   -   Open Source Content Management   -   Apache Lenya
> http://www.wyona.com                       http://lenya.apache.org
> 
-- 
  Brian Ulicny
  bulicny@alum.mit.edu


Re: Ontology plugin in 0.8

Posted by Florian Fricker <fl...@wyona.com>.
Hey Brian,

Your nutch-site.xml has two parts for ontology.

First Part:

<property>
  <name>extension.ontology.extension-name</name>
  <value>org.apache.nutch.ontology.jena.OntologyImpl</value>
  <description>Loads the Ontology plugin</description>
</property>

Second Part:

<property>
  <name>extension.ontology.urls</name>
  <value>file:/path/to/owl/file or http://path/to/owl/file</value>
  <description>Shows the owl file</description>
</property>

Then add the onotology plugin to the nutch plugin section. It should 
look like this:


<property>
  <name>plugin.includes</name>
 <value>nutch-extensionpoints|protocol-file|protocol-http|ontology|languageidentifier|analysis-(fr|en|de)|
              
urlfilter-regex|parse-(text|pdf|mspowerpoint|msword|msexcel|rtf|rss|html|zip)|index-basic|
              
query-(basic|site|url)|lib-jakarta-poi|lib-lucene-analyzers|scoring-opic</value>
</property>

That's it...

Regards


Brian Ulicny wrote:
> I'm trying to get the Ontology plugin to work in an 0.8 environment.
>
> I set extension.ontology.extension-name to
> org.apache.nutch.ontology.jena.OntologyImpl
> This seemed to be the correct thing to do according to nutch-default.xml
> which says: 
>
>   <description>Use the specified online ontology extension. If empty,
>   the first available extension will be used. The "name" here refers to
>   an 'id'
>   attribute of the 'implementation' element in the plugin descriptor XML
>   file.</description>
>
> The plugin.xml for ontology is:
>
> <plugin
>    id="ontology"
>    name="Ontology Plug-in"
>    version="1.0.0"
>    provider-name="nutch.org">
>
>
>    <runtime>
>       <library name="ontology.jar">
>          <export name="*"/>
>       </library>
>
>       <library name="commons-logging-1.0.3.jar"/>
>       <library name="icu4j_2_6_1.jar"/>
>       <library name="jena-2.1.jar"/>
>
>    </runtime>
>
>    <requires>
>       <import plugin="nutch-extensionpoints"/>
>    </requires>
>
>    <!-- attribute "point" is the plugin interface class -->
>    <!-- seems kinda redundant to have to define the point here too -->   
>    <extension id="org.apache.nutch.ontology.jena"
>               name="Ontology Model Loader"
>               point="org.apache.nutch.ontology.Ontology">
>
>       <!-- define all the classes that implement the point defined above
>       -->
>       <implementation id="org.apache.nutch.ontology.jena.OntologyImpl"
>                       class="org.apache.nutch.ontology.jena.OntologyImpl">
>         <parameter name="pathSuffix" value=""/>
>       </implementation>
>
>    </extension>
>
> </plugin>
>
>
> I then set the extension.ontology.urls property to a valid (external)
> OWL url.
>
> However, I get this warning, and no ontologies are loaded:
>
> 6 WARN  OntologyFactory - Ontology extension not found:
> 'org.apache.nutch.ontology.jena.OntologyImpl', trying the default
>
> What am I doing wrong?
>
> B Ulicny
>   


-- 
Florian Fricker                          florian.fricker@wyona.com
Wyona Inc.   -   Open Source Content Management   -   Apache Lenya
http://www.wyona.com                       http://lenya.apache.org