Posted to user@nutch.apache.org by Brian Ulicny <bu...@alum.mit.edu> on 2006/09/21 15:54:59 UTC
Ontology plugin in 0.8
I'm trying to get the Ontology plugin to work in a 0.8 environment.
I set extension.ontology.extension-name to
org.apache.nutch.ontology.jena.OntologyImpl
This seemed to be the correct thing to do according to nutch-default.xml
which says:
<description>Use the specified online ontology extension. If empty,
the first available extension will be used. The "name" here refers to
an 'id' attribute of the 'implementation' element in the plugin
descriptor XML file.</description>
The plugin.xml for ontology is:
<plugin
   id="ontology"
   name="Ontology Plug-in"
   version="1.0.0"
   provider-name="nutch.org">
   <runtime>
      <library name="ontology.jar">
         <export name="*"/>
      </library>
      <library name="commons-logging-1.0.3.jar"/>
      <library name="icu4j_2_6_1.jar"/>
      <library name="jena-2.1.jar"/>
   </runtime>
   <requires>
      <import plugin="nutch-extensionpoints"/>
   </requires>
   <!-- attribute "point" is the plugin interface class -->
   <!-- seems kinda redundant to have to define the point here too -->
   <extension id="org.apache.nutch.ontology.jena"
              name="Ontology Model Loader"
              point="org.apache.nutch.ontology.Ontology">
      <!-- define all the classes that implement the point defined above -->
      <implementation id="org.apache.nutch.ontology.jena.OntologyImpl"
                      class="org.apache.nutch.ontology.jena.OntologyImpl">
         <parameter name="pathSuffix" value=""/>
      </implementation>
   </extension>
</plugin>
I then set the extension.ontology.urls property to a valid (external)
OWL URL.
However, I get this warning, and no ontologies are loaded:
6 WARN OntologyFactory - Ontology extension not found:
'org.apache.nutch.ontology.jena.OntologyImpl', trying the default
What am I doing wrong?
B Ulicny
--
Brian Ulicny
bulicny@alum.mit.edu
Re: Ontology plugin in 0.8
Posted by "csavage@activeathletemedia.com" <cs...@activeathletemedia.com>.
Can anyone give me an opinion on how adding an ontology to their search
engine helps to improve search results?
Thanks in advance,
Chad
Re: Ontology plugin in 0.8
Posted by Florian Fricker <fl...@wyona.com>.
Yes, after a closer look, it seems to be a problem with your Xerces library.
To solve this, you need to update Tomcat's Xerces library. Here's a
reference:
http://jena.sourceforge.net/jena-faq.html#general-
Hope it will help you.
Regards
--
Florian Fricker florian.fricker@wyona.com
Wyona Inc. - Open Source Content Management - Apache Lenya
http://www.wyona.com http://lenya.apache.org
Re: Ontology plugin in 0.8
Posted by "csavage@activeathletemedia.com" <cs...@activeathletemedia.com>.
Hello All,
I'm not sure whether this causes any problems, but urlfilter-regex,
index-basic, and query-(basic|site|url) each appear twice in the
plugin.includes value in that last post.
Great posts, by the way. I'm knee-deep in trying to figure out how to
implement this as well.
Chad
Florian Fricker wrote:
<value>nutch-extensionpoints|protocol-file|protocol-http|ontology|languageidentifier|analysis-(fr|en|de)|urlfilter-regex|parse-(text|pdf|mspowerpoint|msword|msexcel|rtf|rss|html|zip)|index-basic|query-(basic|site|url)|lib-jakarta-poi|lib-lucene-analyzers|urlfilter-regex|index-basic|query-(basic|site|url)|summary-basic|iblog|scoring-opic</value>
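The duplication noted above can be checked mechanically. The following is a minimal sketch, not part of Nutch: the class name PluginIncludesCheck and its methods are hypothetical helpers. It assumes the plugin.includes value is a '|'-separated list in which parenthesised groups such as query-(basic|site|url) count as a single entry. If the value is interpreted as a regular expression, repeated alternatives are probably harmless, but removing them keeps the configuration easier to read.

```java
import java.util.ArrayList;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Set;

// Hypothetical helper (not a Nutch class): reports entries that appear
// more than once in a plugin.includes-style value.
public class PluginIncludesCheck {

    // Splits on '|' only where we are outside any parentheses, so that
    // "query-(basic|site|url)" stays in one piece.
    public static List<String> topLevelAlternatives(String pattern) {
        List<String> parts = new ArrayList<>();
        StringBuilder current = new StringBuilder();
        int depth = 0;
        for (char ch : pattern.toCharArray()) {
            if (ch == '(') {
                depth++;
            } else if (ch == ')') {
                depth--;
            }
            if (ch == '|' && depth == 0) {
                parts.add(current.toString());
                current.setLength(0);
            } else {
                current.append(ch);
            }
        }
        parts.add(current.toString());
        return parts;
    }

    // Returns the repeated entries, in the order their repeats are found.
    public static Set<String> duplicates(String pattern) {
        Set<String> seen = new LinkedHashSet<>();
        Set<String> repeated = new LinkedHashSet<>();
        for (String part : topLevelAlternatives(pattern)) {
            String entry = part.trim();
            if (!entry.isEmpty() && !seen.add(entry)) {
                repeated.add(entry);
            }
        }
        return repeated;
    }

    public static void main(String[] args) {
        // Pass the plugin.includes value as the first argument.
        String value = args.length > 0 ? args[0] : "";
        System.out.println("Duplicated entries: " + duplicates(value));
    }
}
```

Run it with the quoted value as a single command-line argument (in quotes, so the shell does not interpret the '|' characters).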
Re: Ontology plugin in 0.8
Posted by Florian Fricker <fl...@wyona.com>.
Hey Brian!
Have you uncommented refine-query-init.jsp and refine-query.jsp in search.jsp?
Can you tell me which version of Xerces is in use with your Jena and Tomcat?
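One way to answer the version question is to ask the JVM where it actually loaded Xerces from. The following is a minimal sketch, not part of Nutch or Jena: the class name XercesCheck is hypothetical, and it assumes that the actualValueType field named in the stack trace belongs to org.apache.xerces.impl.dv.ValidatedInfo. To be meaningful it has to run with the same class loader as the web application (for example, called from a JSP inside Tomcat), because a standalone run sees a different classpath.

```java
import java.security.CodeSource;

// Hypothetical diagnostic helper: shows which jar supplies a class and
// whether that class declares a given public field.
public class XercesCheck {

    // Returns the location a class was loaded from, or a short reason
    // when the location cannot be determined.
    public static String locate(String className) {
        try {
            Class<?> cls = Class.forName(className);
            CodeSource source = cls.getProtectionDomain().getCodeSource();
            return source == null
                    ? "bootstrap class loader"
                    : String.valueOf(source.getLocation());
        } catch (ClassNotFoundException | LinkageError e) {
            return "not found";
        }
    }

    // True if the class can be loaded and has a public field of that name.
    public static boolean hasField(String className, String fieldName) {
        try {
            Class.forName(className).getField(fieldName);
            return true;
        } catch (ClassNotFoundException | NoSuchFieldException | LinkageError e) {
            return false;
        }
    }

    public static void main(String[] args) {
        // Assumption: this is the Xerces class that owns actualValueType.
        String cls = "org.apache.xerces.impl.dv.ValidatedInfo";
        System.out.println(cls + " loaded from: " + locate(cls));
        System.out.println("has actualValueType: " + hasField(cls, "actualValueType"));
    }
}
```

If the class turns out to come from a jar outside the ontology plugin and the field is missing, that would be consistent with an older Xerces taking precedence over the one Jena expects.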
Regards
Brian Ulicny wrote:
> I have tried it with the sample owl file. That didn't work, either.
>
> On Fri, 22 Sep 2006 17:52:37 +0300, "Sami Siren" <ss...@gmail.com>
> said:
>
>> try running it first with the submitted .owl
>> (src/plugin/ontology/sample/time.owl)
>> and verify that it works for you, after that proceed to more complex
>> ontologies (with what
>> I have no experience with)
>>
>> --
>> Sami Siren
>>
>> Brian Ulicny wrote:
>>
>>> Florian,
>>>
>>> Thanks for replying: my nutch-site.xml under webapps was wrong.
>>> Correcting it gets me a little further. However, whether I point to an
>>> OWL file locally (with file:) or remotely (with http://), I get the
>>> following error (under Root Cause):
>>>
>>> with
>>>
>>> <name>plugin.includes</name>
>>> <value>nutch-extensionpoints|protocol-file|protocol-http|ontology|languageidentifier|analysis-(fr|en|de)|urlfilter-regex|parse-(text|pdf|mspowerpoint|msword|msexcel|rtf|rss|html|zip)|index-basic|query-(basic|site|url)|lib-jakarta-poi|lib-lucene-analyzers|urlfilter-regex|index-basic|query-(basic|site|url)|summary-basic|iblog|scoring-opic</value>
>>>
>>>
>>> java.lang.NoSuchFieldError: actualValueType
>>> at com.hp.hpl.jena.datatypes.xsd.XSDDatatype.convertValidatedDataValue(XSDDatatype.java:371)
>>> at com.hp.hpl.jena.datatypes.xsd.XSDDatatype.parse(XSDDatatype.java:250)
>>> at com.hp.hpl.jena.graph.impl.LiteralLabel.setValue(LiteralLabel.java:166)
>>> at com.hp.hpl.jena.graph.impl.LiteralLabel.<init>(LiteralLabel.java:82)
>>> at com.hp.hpl.jena.graph.Node.createLiteral(Node.java:151)
>>> at com.hp.hpl.jena.rdf.arp.JenaReader.convert(JenaReader.java:147)
>>> at com.hp.hpl.jena.rdf.arp.JenaReader.convert(JenaReader.java:169)
>>> at com.hp.hpl.jena.rdf.arp.JenaReader$JRStatementHandler.statement(JenaReader.java:57)
>>> at com.hp.hpl.jena.rdf.arp.ARPResource.tripleLit(ARPResource.java:190)
>>> at com.hp.hpl.jena.rdf.arp.ARPResource.triple(ARPResource.java:172)
>>> at com.hp.hpl.jena.rdf.arp.ARPResource.setPredObject(ARPResource.java:146)
>>> at com.hp.hpl.jena.rdf.arp.ARPResource.setPredicateObject(ARPResource.java:127)
>>> at com.hp.hpl.jena.rdf.arp.ParserSupport.createTriple(ParserSupport.java:315)
>>> at com.hp.hpl.jena.rdf.arp.RDFParser.propertyEltIdAndValue(RDFParser.java:692)
>>> at com.hp.hpl.jena.rdf.arp.RDFParser.propertyElt(RDFParser.java:647)
>>> at com.hp.hpl.jena.rdf.arp.RDFParser.typedNode(RDFParser.java:984)
>>> at com.hp.hpl.jena.rdf.arp.RDFParser.description(RDFParser.java:429)
>>> at com.hp.hpl.jena.rdf.arp.RDFParser.obj(RDFParser.java:346)
>>> at com.hp.hpl.jena.rdf.arp.RDFParser.rdf(RDFParser.java:326)
>>> at com.hp.hpl.jena.rdf.arp.RDFParser.rdfFileSub(RDFParser.java:107)
>>> at com.hp.hpl.jena.rdf.arp.RDFParser.rdfFile(RDFParser.java:99)
>>> at com.hp.hpl.jena.rdf.arp.ARPFilter.parse(ARPFilter.java:314)
>>> at com.hp.hpl.jena.rdf.arp.JenaReader.read(JenaReader.java:221)
>>> at com.hp.hpl.jena.rdf.arp.JenaReader.read(JenaReader.java:209)
>>> at com.hp.hpl.jena.rdf.arp.JenaReader.read(JenaReader.java:261)
>>> at com.hp.hpl.jena.rdf.model.impl.ModelCom.read(ModelCom.java:206)
>>> at com.hp.hpl.jena.ontology.OntDocumentManager.read(OntDocumentManager.java:986)
>>> at com.hp.hpl.jena.ontology.OntDocumentManager.findMetadata(OntDocumentManager.java:807)
>>> at com.hp.hpl.jena.ontology.OntDocumentManager.initialiseMetadata(OntDocumentManager.java:781)
>>> at com.hp.hpl.jena.ontology.OntDocumentManager.<init>(OntDocumentManager.java:171)
>>> at com.hp.hpl.jena.ontology.OntDocumentManager.<init>(OntDocumentManager.java:155)
>>> at com.hp.hpl.jena.ontology.OntDocumentManager.<clinit>(OntDocumentManager.java:108)
>>> at com.hp.hpl.jena.ontology.OntModelSpec.getDocumentManager(OntModelSpec.java:247)
>>> at com.hp.hpl.jena.ontology.impl.OntModelImpl.getDocumentManager(OntModelImpl.java:174)
>>> at com.hp.hpl.jena.ontology.impl.OntModelImpl.<init>(OntModelImpl.java:145)
>>> at com.hp.hpl.jena.ontology.impl.OntModelImpl.<init>(OntModelImpl.java:116)
>>> at com.hp.hpl.jena.rdf.model.ModelFactory.createOntologyModel(ModelFactory.java:375)
>>> at org.apache.nutch.ontology.jena.OntologyImpl.<init>(OntologyImpl.java:81)
>>> at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)
>>> at sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:39)
>>> at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:27)
>>> at java.lang.reflect.Constructor.newInstance(Constructor.java:494)
>>> at java.lang.Class.newInstance0(Class.java:350)
>>> at java.lang.Class.newInstance(Class.java:303)
>>> at org.apache.nutch.plugin.Extension.getExtensionInstance(Extension.java:151)
>>> at org.apache.nutch.ontology.OntologyFactory.getOntology(OntologyFactory.java:64)
>>> at org.apache.jsp.search_jsp._jspService(search_jsp.java:118)
>>> at org.apache.jasper.runtime.HttpJspBase.service(HttpJspBase.java:92)
>>> at javax.servlet.http.HttpServlet.service(HttpServlet.java:809)
>>> at org.apache.jasper.servlet.JspServletWrapper.service(JspServletWrapper.java:162)
>>> at org.apache.jasper.servlet.JspServlet.serviceJspFile(JspServlet.java:240)
>>> at org.apache.jasper.servlet.JspServlet.service(JspServlet.java:187)
>>> at javax.servlet.http.HttpServlet.service(HttpServlet.java:809)
>>> at org.apache.catalina.core.ApplicationFilterChain.internalDoFilter(ApplicationFilterChain.java:200)
>>> at org.apache.catalina.core.ApplicationFilterChain.doFilter(ApplicationFilterChain.java:146)
>>> at org.apache.catalina.core.StandardWrapperValve.invoke(StandardWrapperValve.java:209)
>>> at org.apache.catalina.core.StandardPipeline$StandardPipelineValveContext.invokeNext(StandardPipeline.java:596)
>>> at org.apache.catalina.core.StandardPipeline.invoke(StandardPipeline.java:433)
>>> at org.apache.catalina.core.ContainerBase.invoke(ContainerBase.java:948)
>>> at org.apache.catalina.core.StandardContextValve.invoke(StandardContextValve.java:144)
>>> at org.apache.catalina.core.StandardPipeline$StandardPipelineValveContext.invokeNext(StandardPipeline.java:596)
>>> at org.apache.catalina.core.StandardPipeline.invoke(StandardPipeline.java:433)
>>> at org.apache.catalina.core.ContainerBase.invoke(ContainerBase.java:948)
>>> at org.apache.catalina.core.StandardContext.invoke(StandardContext.java:2358)
>>> at org.apache.catalina.core.StandardHostValve.invoke(StandardHostValve.java:133)
>>> at org.apache.catalina.core.StandardPipeline$StandardPipelineValveContext.invokeNext(StandardPipeline.java:596)
>>> at org.apache.catalina.valves.ErrorDispatcherValve.invoke(ErrorDispatcherValve.java:118)
>>> at org.apache.catalina.core.StandardPipeline$StandardPipelineValveContext.invokeNext(StandardPipeline.java:594)
>>> at org.apache.catalina.valves.ErrorReportValve.invoke(ErrorReportValve.java:116)
>>> at org.apache.catalina.core.StandardPipeline$StandardPipelineValveContext.invokeNext(StandardPipeline.java:594)
>>> at org.apache.catalina.core.StandardPipeline.invoke(StandardPipeline.java:433)
>>> at org.apache.catalina.core.ContainerBase.invoke(ContainerBase.java:948)
>>> at org.apache.catalina.core.StandardEngineValve.invoke(StandardEngineValve.java:127)
>>> at org.apache.catalina.core.StandardPipeline$StandardPipelineValveContext.invokeNext(StandardPipeline.java:596)
>>> at org.apache.catalina.core.StandardPipeline.invoke(StandardPipeline.java:433)
>>> at org.apache.catalina.core.ContainerBase.invoke(ContainerBase.java:948)
>>> at org.apache.coyote.tomcat4.CoyoteAdapter.service(CoyoteAdapter.java:152)
>>> at org.apache.coyote.http11.Http11Processor.process(Http11Processor.java:799)
>>> at org.apache.coyote.http11.Http11Protocol$Http11ConnectionHandler.processConnection(Http11Protocol.java:705)
>>> at org.apache.tomcat.util.net.TcpWorkerThread.runIt(PoolTcpEndpoint.java:577)
>>> at org.apache.tomcat.util.threads.ThreadPool$ControlRunnable.run(ThreadPool.java:683)
>>> at java.lang.Thread.run(Thread.java:595)
>>>
>>> Any thoughts from anyone out there?
>>>
>>> Brian
>>>
>>> On Fri, 22 Sep 2006 09:43:51 +0200, "Florian Fricker"
>>> <fl...@wyona.com> said:
>>>
>>>
>>>> Hey Brian,
>>>>
>>>> Your nutch-site.xml has two parts for ontology.
>>>>
>>>> First Part:
>>>>
>>>> <property>
>>>> <name>extension.ontology.extension-name</name>
>>>> <value>org.apache.nutch.ontology.jena.OntologyImpl</value>
>>>> <description>Loads the Ontology plugin</description>
>>>> </property>
>>>>
>>>> Second Part:
>>>>
>>>> <property>
>>>> <name>extension.ontology.urls</name>
>>>> <value>file:/path/to/owl/file or http://path/to/owl/file</value>
>>>> <description>Shows the owl file</description>
>>>> </property>
>>>>
>>>> Then add the onotology plugin to the nutch plugin section. It should
>>>> look like this:
>>>>
>>>>
>>>> <property>
>>>> <name>plugin.includes</name>
>>>> <value>nutch-extensionpoints|protocol-file|protocol-http|ontology|languageidentifier|analysis-(fr|en|de)|
>>>>
>>>> urlfilter-regex|parse-(text|pdf|mspowerpoint|msword|msexcel|rtf|rss|html|zip)|index-basic|
>>>>
>>>> query-(basic|site|url)|lib-jakarta-poi|lib-lucene-analyzers|scoring-opic</value>
>>>> </property>
>>>>
>>>> That's it...
>>>>
>>>> Regards
>>>>
>>>>
>>>> --
>>>> Florian Fricker florian.fricker@wyona.com
>>>> Wyona Inc. - Open Source Content Management - Apache Lenya
>>>> http://www.wyona.com http://lenya.apache.org
>>>>
>>>>
>>>>
--
Florian Fricker florian.fricker@wyona.com
Wyona Inc. - Open Source Content Management - Apache Lenya
http://www.wyona.com http://lenya.apache.org
Re: Ontology plugin in 0.8
Posted by Brian Ulicny <bu...@alum.mit.edu>.
I have tried it with the sample owl file. That didn't work, either.
On Fri, 22 Sep 2006 17:52:37 +0300, "Sami Siren" <ss...@gmail.com>
said:
> try running it first with the submitted .owl
> (src/plugin/ontology/sample/time.owl)
> and verify that it works for you, after that proceed to more complex
> ontologies (with what
> I have no experience with)
>
> --
> Sami Siren
>
> Brian Ulicny wrote:
> > Florian,
> >
> > Thanks for replying: my nutch-site.xml under webapps was wrong.
> > Correcting it gets me a little further. However, whether I point to an
> > OWL file locally (with file:) or remotely (with http://), I get the
> > following error (under Root Cause):
> >
> > with
> >
> > <name>plugin.includes</name>
> > <value>nutch-extensionpoints|protocol-file|protocol-http|ontology|languageidentifier|analysis-(fr|en|de)|urlfilter-regex|parse-(text|pdf|mspowerpoint|msword|msexcel|rtf|rss|html|zip)|index-basic|query-(basic|site|url)|lib-jakarta-poi|lib-lucene-analyzers|urlfilter-regex|index-basic|query-(basic|site|url)|summary-basic|iblog|scoring-opic</value>
> >
> >
> > java.lang.NoSuchFieldError: actualValueType
> > at com.hp.hpl.jena.datatypes.xsd.XSDDatatype.convertValidatedDataValue(XSDDatatype.java:371)
> > at com.hp.hpl.jena.datatypes.xsd.XSDDatatype.parse(XSDDatatype.java:250)
> > at com.hp.hpl.jena.graph.impl.LiteralLabel.setValue(LiteralLabel.java:166)
> > at com.hp.hpl.jena.graph.impl.LiteralLabel.<init>(LiteralLabel.java:82)
> > at com.hp.hpl.jena.graph.Node.createLiteral(Node.java:151)
> > at com.hp.hpl.jena.rdf.arp.JenaReader.convert(JenaReader.java:147)
> > at com.hp.hpl.jena.rdf.arp.JenaReader.convert(JenaReader.java:169)
> > at com.hp.hpl.jena.rdf.arp.JenaReader$JRStatementHandler.statement(JenaReader.java:57)
> > at com.hp.hpl.jena.rdf.arp.ARPResource.tripleLit(ARPResource.java:190)
> > at com.hp.hpl.jena.rdf.arp.ARPResource.triple(ARPResource.java:172)
> > at com.hp.hpl.jena.rdf.arp.ARPResource.setPredObject(ARPResource.java:146)
> > at com.hp.hpl.jena.rdf.arp.ARPResource.setPredicateObject(ARPResource.java:127)
> > at com.hp.hpl.jena.rdf.arp.ParserSupport.createTriple(ParserSupport.java:315)
> > at com.hp.hpl.jena.rdf.arp.RDFParser.propertyEltIdAndValue(RDFParser.java:692)
> > at com.hp.hpl.jena.rdf.arp.RDFParser.propertyElt(RDFParser.java:647)
> > at com.hp.hpl.jena.rdf.arp.RDFParser.typedNode(RDFParser.java:984)
> > at com.hp.hpl.jena.rdf.arp.RDFParser.description(RDFParser.java:429)
> > at com.hp.hpl.jena.rdf.arp.RDFParser.obj(RDFParser.java:346)
> > at com.hp.hpl.jena.rdf.arp.RDFParser.rdf(RDFParser.java:326)
> > at com.hp.hpl.jena.rdf.arp.RDFParser.rdfFileSub(RDFParser.java:107)
> > at com.hp.hpl.jena.rdf.arp.RDFParser.rdfFile(RDFParser.java:99)
> > at com.hp.hpl.jena.rdf.arp.ARPFilter.parse(ARPFilter.java:314)
> > at com.hp.hpl.jena.rdf.arp.JenaReader.read(JenaReader.java:221)
> > at com.hp.hpl.jena.rdf.arp.JenaReader.read(JenaReader.java:209)
> > at com.hp.hpl.jena.rdf.arp.JenaReader.read(JenaReader.java:261)
> > at com.hp.hpl.jena.rdf.model.impl.ModelCom.read(ModelCom.java:206)
> > at com.hp.hpl.jena.ontology.OntDocumentManager.read(OntDocumentManager.java:986)
> > at com.hp.hpl.jena.ontology.OntDocumentManager.findMetadata(OntDocumentManager.java:807)
> > at com.hp.hpl.jena.ontology.OntDocumentManager.initialiseMetadata(OntDocumentManager.java:781)
> > at com.hp.hpl.jena.ontology.OntDocumentManager.<init>(OntDocumentManager.java:171)
> > at com.hp.hpl.jena.ontology.OntDocumentManager.<init>(OntDocumentManager.java:155)
> > at com.hp.hpl.jena.ontology.OntDocumentManager.<clinit>(OntDocumentManager.java:108)
> > at com.hp.hpl.jena.ontology.OntModelSpec.getDocumentManager(OntModelSpec.java:247)
> > at com.hp.hpl.jena.ontology.impl.OntModelImpl.getDocumentManager(OntModelImpl.java:174)
> > at com.hp.hpl.jena.ontology.impl.OntModelImpl.<init>(OntModelImpl.java:145)
> > at com.hp.hpl.jena.ontology.impl.OntModelImpl.<init>(OntModelImpl.java:116)
> > at com.hp.hpl.jena.rdf.model.ModelFactory.createOntologyModel(ModelFactory.java:375)
> > at org.apache.nutch.ontology.jena.OntologyImpl.<init>(OntologyImpl.java:81)
> > at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)
> > at sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:39)
> > at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:27)
> > at java.lang.reflect.Constructor.newInstance(Constructor.java:494)
> > at java.lang.Class.newInstance0(Class.java:350)
> > at java.lang.Class.newInstance(Class.java:303)
> > at org.apache.nutch.plugin.Extension.getExtensionInstance(Extension.java:151)
> > at org.apache.nutch.ontology.OntologyFactory.getOntology(OntologyFactory.java:64)
> > at org.apache.jsp.search_jsp._jspService(search_jsp.java:118)
> > at org.apache.jasper.runtime.HttpJspBase.service(HttpJspBase.java:92)
> > at javax.servlet.http.HttpServlet.service(HttpServlet.java:809)
> > at org.apache.jasper.servlet.JspServletWrapper.service(JspServletWrapper.java:162)
> > at org.apache.jasper.servlet.JspServlet.serviceJspFile(JspServlet.java:240)
> > at org.apache.jasper.servlet.JspServlet.service(JspServlet.java:187)
> > at javax.servlet.http.HttpServlet.service(HttpServlet.java:809)
> > at org.apache.catalina.core.ApplicationFilterChain.internalDoFilter(ApplicationFilterChain.java:200)
> > at org.apache.catalina.core.ApplicationFilterChain.doFilter(ApplicationFilterChain.java:146)
> > at org.apache.catalina.core.StandardWrapperValve.invoke(StandardWrapperValve.java:209)
> > at org.apache.catalina.core.StandardPipeline$StandardPipelineValveContext.invokeNext(StandardPipeline.java:596)
> > at org.apache.catalina.core.StandardPipeline.invoke(StandardPipeline.java:433)
> > at org.apache.catalina.core.ContainerBase.invoke(ContainerBase.java:948)
> > at org.apache.catalina.core.StandardContextValve.invoke(StandardContextValve.java:144)
> > at org.apache.catalina.core.StandardPipeline$StandardPipelineValveContext.invokeNext(StandardPipeline.java:596)
> > at org.apache.catalina.core.StandardPipeline.invoke(StandardPipeline.java:433)
> > at org.apache.catalina.core.ContainerBase.invoke(ContainerBase.java:948)
> > at org.apache.catalina.core.StandardContext.invoke(StandardContext.java:2358)
> > at org.apache.catalina.core.StandardHostValve.invoke(StandardHostValve.java:133)
> > at org.apache.catalina.core.StandardPipeline$StandardPipelineValveContext.invokeNext(StandardPipeline.java:596)
> > at org.apache.catalina.valves.ErrorDispatcherValve.invoke(ErrorDispatcherValve.java:118)
> > at org.apache.catalina.core.StandardPipeline$StandardPipelineValveContext.invokeNext(StandardPipeline.java:594)
> > at org.apache.catalina.valves.ErrorReportValve.invoke(ErrorReportValve.java:116)
> > at org.apache.catalina.core.StandardPipeline$StandardPipelineValveContext.invokeNext(StandardPipeline.java:594)
> > at org.apache.catalina.core.StandardPipeline.invoke(StandardPipeline.java:433)
> > at org.apache.catalina.core.ContainerBase.invoke(ContainerBase.java:948)
> > at org.apache.catalina.core.StandardEngineValve.invoke(StandardEngineValve.java:127)
> > at org.apache.catalina.core.StandardPipeline$StandardPipelineValveContext.invokeNext(StandardPipeline.java:596)
> > at org.apache.catalina.core.StandardPipeline.invoke(StandardPipeline.java:433)
> > at org.apache.catalina.core.ContainerBase.invoke(ContainerBase.java:948)
> > at org.apache.coyote.tomcat4.CoyoteAdapter.service(CoyoteAdapter.java:152)
> > at org.apache.coyote.http11.Http11Processor.process(Http11Processor.java:799)
> > at org.apache.coyote.http11.Http11Protocol$Http11ConnectionHandler.processConnection(Http11Protocol.java:705)
> > at org.apache.tomcat.util.net.TcpWorkerThread.runIt(PoolTcpEndpoint.java:577)
> > at org.apache.tomcat.util.threads.ThreadPool$ControlRunnable.run(ThreadPool.java:683)
> > at java.lang.Thread.run(Thread.java:595)
> >
> > Any thoughts from anyone out there?
> >
> > Brian
> >
--
Brian Ulicny
bulicny@alum.mit.edu
Re: Ontology plugin in 0.8
Posted by Sami Siren <ss...@gmail.com>.
Try running it first with the .owl file included with the plugin
(src/plugin/ontology/sample/time.owl) and verify that it works for you.
After that, proceed to more complex ontologies (with which I have no
experience).
--
Sami Siren
Re: Ontology plugin in 0.8
Posted by Brian Ulicny <bu...@alum.mit.edu>.
Florian,
Thanks for replying. My nutch-site.xml under webapps was wrong, and
correcting it gets me a little further. However, whether I point to an
OWL file locally (with file:) or remotely (with http://), I get the
error below (under Root Cause) with this plugin.includes setting:
<name>plugin.includes</name>
<value>nutch-extensionpoints|protocol-file|protocol-http|ontology|languageidentifier|analysis-(fr|en|de)|urlfilter-regex|parse-(text|pdf|mspowerpoint|msword|msexcel|rtf|rss|html|zip)|index-basic|query-(basic|site|url)|lib-jakarta-poi|lib-lucene-analyzers|urlfilter-regex|index-basic|query-(basic|site|url)|summary-basic|iblog|scoring-opic</value>
java.lang.NoSuchFieldError: actualValueType
at com.hp.hpl.jena.datatypes.xsd.XSDDatatype.convertValidatedDataValue(XSDDatatype.java:371)
at com.hp.hpl.jena.datatypes.xsd.XSDDatatype.parse(XSDDatatype.java:250)
at com.hp.hpl.jena.graph.impl.LiteralLabel.setValue(LiteralLabel.java:166)
at com.hp.hpl.jena.graph.impl.LiteralLabel.<init>(LiteralLabel.java:82)
at com.hp.hpl.jena.graph.Node.createLiteral(Node.java:151)
at com.hp.hpl.jena.rdf.arp.JenaReader.convert(JenaReader.java:147)
at com.hp.hpl.jena.rdf.arp.JenaReader.convert(JenaReader.java:169)
at com.hp.hpl.jena.rdf.arp.JenaReader$JRStatementHandler.statement(JenaReader.java:57)
at com.hp.hpl.jena.rdf.arp.ARPResource.tripleLit(ARPResource.java:190)
at com.hp.hpl.jena.rdf.arp.ARPResource.triple(ARPResource.java:172)
at com.hp.hpl.jena.rdf.arp.ARPResource.setPredObject(ARPResource.java:146)
at com.hp.hpl.jena.rdf.arp.ARPResource.setPredicateObject(ARPResource.java:127)
at com.hp.hpl.jena.rdf.arp.ParserSupport.createTriple(ParserSupport.java:315)
at com.hp.hpl.jena.rdf.arp.RDFParser.propertyEltIdAndValue(RDFParser.java:692)
at com.hp.hpl.jena.rdf.arp.RDFParser.propertyElt(RDFParser.java:647)
at com.hp.hpl.jena.rdf.arp.RDFParser.typedNode(RDFParser.java:984)
at com.hp.hpl.jena.rdf.arp.RDFParser.description(RDFParser.java:429)
at com.hp.hpl.jena.rdf.arp.RDFParser.obj(RDFParser.java:346)
at com.hp.hpl.jena.rdf.arp.RDFParser.rdf(RDFParser.java:326)
at com.hp.hpl.jena.rdf.arp.RDFParser.rdfFileSub(RDFParser.java:107)
at com.hp.hpl.jena.rdf.arp.RDFParser.rdfFile(RDFParser.java:99)
at com.hp.hpl.jena.rdf.arp.ARPFilter.parse(ARPFilter.java:314)
at com.hp.hpl.jena.rdf.arp.JenaReader.read(JenaReader.java:221)
at com.hp.hpl.jena.rdf.arp.JenaReader.read(JenaReader.java:209)
at com.hp.hpl.jena.rdf.arp.JenaReader.read(JenaReader.java:261)
at com.hp.hpl.jena.rdf.model.impl.ModelCom.read(ModelCom.java:206)
at com.hp.hpl.jena.ontology.OntDocumentManager.read(OntDocumentManager.java:986)
at com.hp.hpl.jena.ontology.OntDocumentManager.findMetadata(OntDocumentManager.java:807)
at com.hp.hpl.jena.ontology.OntDocumentManager.initialiseMetadata(OntDocumentManager.java:781)
at com.hp.hpl.jena.ontology.OntDocumentManager.<init>(OntDocumentManager.java:171)
at com.hp.hpl.jena.ontology.OntDocumentManager.<init>(OntDocumentManager.java:155)
at com.hp.hpl.jena.ontology.OntDocumentManager.<clinit>(OntDocumentManager.java:108)
at com.hp.hpl.jena.ontology.OntModelSpec.getDocumentManager(OntModelSpec.java:247)
at com.hp.hpl.jena.ontology.impl.OntModelImpl.getDocumentManager(OntModelImpl.java:174)
at com.hp.hpl.jena.ontology.impl.OntModelImpl.<init>(OntModelImpl.java:145)
at com.hp.hpl.jena.ontology.impl.OntModelImpl.<init>(OntModelImpl.java:116)
at com.hp.hpl.jena.rdf.model.ModelFactory.createOntologyModel(ModelFactory.java:375)
at org.apache.nutch.ontology.jena.OntologyImpl.<init>(OntologyImpl.java:81)
at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)
at sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:39)
at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:27)
at java.lang.reflect.Constructor.newInstance(Constructor.java:494)
at java.lang.Class.newInstance0(Class.java:350)
at java.lang.Class.newInstance(Class.java:303)
at org.apache.nutch.plugin.Extension.getExtensionInstance(Extension.java:151)
at org.apache.nutch.ontology.OntologyFactory.getOntology(OntologyFactory.java:64)
at org.apache.jsp.search_jsp._jspService(search_jsp.java:118)
at org.apache.jasper.runtime.HttpJspBase.service(HttpJspBase.java:92)
at javax.servlet.http.HttpServlet.service(HttpServlet.java:809)
at org.apache.jasper.servlet.JspServletWrapper.service(JspServletWrapper.java:162)
at org.apache.jasper.servlet.JspServlet.serviceJspFile(JspServlet.java:240)
at org.apache.jasper.servlet.JspServlet.service(JspServlet.java:187)
at javax.servlet.http.HttpServlet.service(HttpServlet.java:809)
at org.apache.catalina.core.ApplicationFilterChain.internalDoFilter(ApplicationFilterChain.java:200)
at org.apache.catalina.core.ApplicationFilterChain.doFilter(ApplicationFilterChain.java:146)
at org.apache.catalina.core.StandardWrapperValve.invoke(StandardWrapperValve.java:209)
at org.apache.catalina.core.StandardPipeline$StandardPipelineValveContext.invokeNext(StandardPipeline.java:596)
at org.apache.catalina.core.StandardPipeline.invoke(StandardPipeline.java:433)
at org.apache.catalina.core.ContainerBase.invoke(ContainerBase.java:948)
at org.apache.catalina.core.StandardContextValve.invoke(StandardContextValve.java:144)
at org.apache.catalina.core.StandardPipeline$StandardPipelineValveContext.invokeNext(StandardPipeline.java:596)
at org.apache.catalina.core.StandardPipeline.invoke(StandardPipeline.java:433)
at org.apache.catalina.core.ContainerBase.invoke(ContainerBase.java:948)
at org.apache.catalina.core.StandardContext.invoke(StandardContext.java:2358)
at org.apache.catalina.core.StandardHostValve.invoke(StandardHostValve.java:133)
at org.apache.catalina.core.StandardPipeline$StandardPipelineValveContext.invokeNext(StandardPipeline.java:596)
at org.apache.catalina.valves.ErrorDispatcherValve.invoke(ErrorDispatcherValve.java:118)
at org.apache.catalina.core.StandardPipeline$StandardPipelineValveContext.invokeNext(StandardPipeline.java:594)
at org.apache.catalina.valves.ErrorReportValve.invoke(ErrorReportValve.java:116)
at org.apache.catalina.core.StandardPipeline$StandardPipelineValveContext.invokeNext(StandardPipeline.java:594)
at org.apache.catalina.core.StandardPipeline.invoke(StandardPipeline.java:433)
at org.apache.catalina.core.ContainerBase.invoke(ContainerBase.java:948)
at org.apache.catalina.core.StandardEngineValve.invoke(StandardEngineValve.java:127)
at org.apache.catalina.core.StandardPipeline$StandardPipelineValveContext.invokeNext(StandardPipeline.java:596)
at org.apache.catalina.core.StandardPipeline.invoke(StandardPipeline.java:433)
at org.apache.catalina.core.ContainerBase.invoke(ContainerBase.java:948)
at org.apache.coyote.tomcat4.CoyoteAdapter.service(CoyoteAdapter.java:152)
at org.apache.coyote.http11.Http11Processor.process(Http11Processor.java:799)
at org.apache.coyote.http11.Http11Protocol$Http11ConnectionHandler.processConnection(Http11Protocol.java:705)
at org.apache.tomcat.util.net.TcpWorkerThread.runIt(PoolTcpEndpoint.java:577)
at org.apache.tomcat.util.threads.ThreadPool$ControlRunnable.run(ThreadPool.java:683)
at java.lang.Thread.run(Thread.java:595)
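A NoSuchFieldError generally means that a class was compiled against one version of a library but is running against a different one, for example when the servlet container and the plugin each bring their own copy of a jar. As a rough diagnostic sketch (not part of Nutch or Jena; the class name ClassOrigin is made up for illustration, and the second default class name is only a guess at where the missing field might live), the following reports where the JVM found a given class:

```java
import java.security.CodeSource;
import java.security.ProtectionDomain;

/** Reports where the JVM loaded a class from, to help spot stale or duplicate jars. */
public class ClassOrigin {

    /** Returns the jar or directory a class came from, or a note if that is unknown. */
    public static String locate(Class<?> type) {
        ProtectionDomain domain = type.getProtectionDomain();
        CodeSource source = (domain == null) ? null : domain.getCodeSource();
        if (source == null || source.getLocation() == null) {
            // Classes loaded by the bootstrap loader have no code source.
            return "(bootstrap or unknown location)";
        }
        return source.getLocation().toString();
    }

    /** Looks a class up by name and reports its origin, or that it is missing. */
    public static String locateByName(String className) {
        try {
            // initialize=false: find the class without running its static initializers.
            Class<?> type = Class.forName(className, false, ClassOrigin.class.getClassLoader());
            return locate(type);
        } catch (ClassNotFoundException e) {
            return "(not on classpath)";
        } catch (LinkageError e) {
            return "(found but could not be linked: " + e + ")";
        }
    }

    public static void main(String[] args) {
        String[] names = args.length > 0 ? args : new String[] {
            "com.hp.hpl.jena.datatypes.xsd.XSDDatatype", // first frame of the stack trace
            "org.apache.xerces.impl.dv.ValidatedInfo"    // assumption: possible home of the field
        };
        for (String name : names) {
            System.out.println(name + " -> " + locateByName(name));
        }
    }
}
```

To say anything useful about the web application, this has to run with the same class loaders as the webapp (for instance by calling locate() from a JSP), since a standalone run only sees its own classpath.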
Any thoughts from anyone out there?
Brian
On Fri, 22 Sep 2006 09:43:51 +0200, "Florian Fricker"
<fl...@wyona.com> said:
> Hey Brian,
>
> Your nutch-site.xml has two parts for ontology.
>
> First Part:
>
> <property>
> <name>extension.ontology.extension-name</name>
> <value>org.apache.nutch.ontology.jena.OntologyImpl</value>
> <description>Loads the Ontology plugin</description>
> </property>
>
> Second Part:
>
> <property>
> <name>extension.ontology.urls</name>
> <value>file:/path/to/owl/file or http://path/to/owl/file</value>
> <description>Shows the owl file</description>
> </property>
>
> Then add the onotology plugin to the nutch plugin section. It should
> look like this:
>
>
> <property>
> <name>plugin.includes</name>
> <value>nutch-extensionpoints|protocol-file|protocol-http|ontology|languageidentifier|analysis-(fr|en|de)|
>
> urlfilter-regex|parse-(text|pdf|mspowerpoint|msword|msexcel|rtf|rss|html|zip)|index-basic|
>
> query-(basic|site|url)|lib-jakarta-poi|lib-lucene-analyzers|scoring-opic</value>
> </property>
>
> That's it...
>
> Regards
>
>
> --
> Florian Fricker florian.fricker@wyona.com
> Wyona Inc. - Open Source Content Management - Apache Lenya
> http://www.wyona.com http://lenya.apache.org
>
--
Brian Ulicny
bulicny@alum.mit.edu
Re: Ontology plugin in 0.8
Posted by Florian Fricker <fl...@wyona.com>.
Hey Brian,
Your nutch-site.xml needs two parts for the ontology plugin.
First Part:
<property>
<name>extension.ontology.extension-name</name>
<value>org.apache.nutch.ontology.jena.OntologyImpl</value>
<description>Loads the Ontology plugin</description>
</property>
Second Part:
<property>
<name>extension.ontology.urls</name>
<value>file:/path/to/owl/file or http://path/to/owl/file</value>
<description>Shows the owl file</description>
</property>
Then add the ontology plugin to the plugin.includes property. It should
look like this:
<property>
<name>plugin.includes</name>
<value>nutch-extensionpoints|protocol-file|protocol-http|ontology|languageidentifier|analysis-(fr|en|de)|urlfilter-regex|parse-(text|pdf|mspowerpoint|msword|msexcel|rtf|rss|html|zip)|index-basic|query-(basic|site|url)|lib-jakarta-poi|lib-lucene-analyzers|scoring-opic</value>
</property>
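The plugin.includes value is a regular expression over plugin ids, so it can be checked outside Nutch before deploying. The sketch below is only an illustration: the class name PluginIncludesCheck is made up, and it assumes that a plugin id has to match the whole expression, which may not be exactly how Nutch applies it.

```java
import java.util.regex.Pattern;

/** Checks which plugin ids a plugin.includes-style regular expression accepts. */
public class PluginIncludesCheck {

    /** The plugin.includes value from the property above, joined into one string. */
    public static final String INCLUDES =
        "nutch-extensionpoints|protocol-file|protocol-http|ontology|"
        + "languageidentifier|analysis-(fr|en|de)|urlfilter-regex|"
        + "parse-(text|pdf|mspowerpoint|msword|msexcel|rtf|rss|html|zip)|index-basic|"
        + "query-(basic|site|url)|lib-jakarta-poi|lib-lucene-analyzers|scoring-opic";

    /** True if the whole plugin id matches the includes expression. */
    public static boolean isIncluded(String includesRegex, String pluginId) {
        // Assumption: full match required, not just a matching substring.
        return Pattern.compile(includesRegex).matcher(pluginId).matches();
    }

    public static void main(String[] args) {
        // A few ids to try; pass your own on the command line instead if you like.
        String[] ids = args.length > 0
            ? args
            : new String[] {"ontology", "parse-html", "summary-basic"};
        for (String id : ids) {
            System.out.println(id + " included: " + isIncluded(INCLUDES, id));
        }
    }
}
```

Under the same assumption, a line break inside the value becomes part of the alternative that follows it, so that alternative no longer matches its plugin id. Keeping the value on a single line avoids this.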
That's it...
Regards
--
Florian Fricker florian.fricker@wyona.com
Wyona Inc. - Open Source Content Management - Apache Lenya
http://www.wyona.com http://lenya.apache.org