You are viewing a plain text version of this content. The canonical link for it is here.

Posted to dev@any23.apache.org by armon <zh...@gmail.com> on 2012/06/21 17:15:58 UTC

about the supported input format of any23

Hi,
       I do some data transform currently from xml-format wiki data
(retrieved by wikipedia API) to turtle, but it seems that the any23 can't
work correctly. (I used the command: ./any23 rover url )

       Does any23 actually support the xml data retrieved by wikipedia API
as the input format ?

      If not, is there any extensive extract module do the same work ?

      Or if we can develop a extractor named wiki-xml extractor ?

      Thank you!

      All the best!

armon.

Re: about the supported input format of any23

Posted by armon <zh...@gmail.com>.

ok, 

get it, 

and do you have any suggestion for us the do the conversion from the data (retrieved by wiki API) to RDF format or triple?

thank you very much!

armon.chen

On 2012年6月22日星期五 at 下午5:49, Lewis John Mcgibbney wrote:

> Hi Armon,
> 
> On Fri, Jun 22, 2012 at 10:47 AM, armon <zhimeng9@gmail.com (mailto:zhimeng9@gmail.com)> wrote:
> > I just ask you sincerely whether asny23 0.7 will support the common xml format as input or not.
> 
> No this is certainly not a direction the project is moving in. We have
> a clear aim and this is not part of the that remit.
> 
> Best
> Lewis

Re: about the supported input format of any23

Posted by Lewis John Mcgibbney <le...@gmail.com>.

Hi Armon,

On Fri, Jun 22, 2012 at 10:47 AM, armon <zh...@gmail.com> wrote:
>  I just ask you sincerely whether asny23 0.7 will support the common xml format as input or not.

No this is certainly not a direction the project is moving in. We have
a clear aim and this is not part of the that remit.

Best
Lewis

Re: about the supported input format of any23

Posted by armon <zh...@gmail.com>.

Hi Lewis,

 I DO agree with your opinion, yes, actually any23 do a great work,

 and I was used to think that it can support the common xml structure data while it doesn't.

 So it is ok, maybe I need to develop a new module to meet my requirement.

 And if there is anything make you misunderstand my real mean, I am sorry about that.

 I just ask you sincerely whether asny23 0.7 will support the common xml format as input or not.

 If not, it is ok, I will get other solution.

 Thank you very much!

 All the best! 

armon.chen


On 2012年6月22日星期五 at 下午5:35, Lewis John Mcgibbney wrote:

> Hi Armon,
> 
> I think we need to clarify something here
> 
> Any23 parsers extract structured data... the parsers DO NOT aim to
> extract unstructured text like some kind of 'traditional' parser.
> By structure we are not referring to markup as such but instead relate
> solely to semantic/structural relationships between concepts within
> some given data resource.
> Within the context of this thread, we refer (somewhat ambiguously) to
> resources as one of the following formats
> 
> RDF/XML, Turtle, Notation 3, RDFa with RDFa1.1 prefix mechanism,
> Microformats: Adr, Geo, hCalendar, hCard, hListing, hResume, hReview,
> License, XFN and Species, HTML5 Microdata: (such as Schema.org (http://Schema.org)), CSV:
> Comma Separated Values with separator autodetection.
> 
> Does this make sense?
> 
> The Any23 parser is doing it's job as it should.
> 
> Lewis
> 
> On Fri, Jun 22, 2012 at 10:26 AM, armon <zhimeng9@gmail.com (mailto:zhimeng9@gmail.com)> wrote:
> > Hi Lewis,
> > 
> > I even as the xml data in a file, and then command: ./any23 rover @filepath ，but it still can't work, finally,I create a simply xml data file to test, again nothing retrieved, so I think maybe it is not the url issue, but related with parser engine.
> > 
> > Is the any23 0.7 coming, will it meet my particular request? If so, then I just get the latest 0.7 and test it again.
> > 
> > thanks for your reply.
> > 
> > All the best!
> > 
> > armon.chen
> > 
> > 
> > 
> > On 2012年6月22日星期五 at 下午5:13, Lewis John Mcgibbney wrote:
> > 
> > > So I suppose there are a couple of options here.
> > > 
> > > On Fri, Jun 22, 2012 at 10:02 AM, armon <zhimeng9@gmail.com (mailto:zhimeng9@gmail.com)> wrote:
> > > > 
> > > > but we know that there is some other data in the page that can't be retrieved, such as the xml data (in the attachment of last email).
> > > 
> > > Yes there is a good bit more content but the parsing implementations
> > > within Any23 do not aim to extract content strings... instead the
> > > project (parsing anyway) gains its strength from extracting triples
> > > and such like.
> > > 
> > > You could quickly fire up a Nutch instance to gather content then use
> > > the basic-crawler from Any23 for triples... this is until we implement
> > > an Any23 parsing and indexing filter within Nutch which will provide a
> > > complete solution to your particular request.
> > > 
> > > You could easily implement the above programmatically which would
> > > enable you to fetch page content as well as extract the triples from
> > > it separately.
> 
> 
> 
> -- 
> Lewis

Re: about the supported input format of any23

Posted by Lewis John Mcgibbney <le...@gmail.com>.

Hi Armon,

I think we need to clarify something here

Any23 parsers extract structured data... the parsers DO NOT aim to
extract unstructured text like some kind of 'traditional' parser.
By structure we are not referring to markup as such but instead relate
solely to semantic/structural relationships between concepts within
some given data resource.
Within the context of this thread, we refer (somewhat ambiguously) to
resources as one of the following formats

RDF/XML, Turtle, Notation 3, RDFa with RDFa1.1 prefix mechanism,
Microformats: Adr, Geo, hCalendar, hCard, hListing, hResume, hReview,
License, XFN and Species, HTML5 Microdata: (such as Schema.org), CSV:
Comma Separated Values with separator autodetection.

Does this make sense?

The Any23 parser is doing it's job as it should.

Lewis

On Fri, Jun 22, 2012 at 10:26 AM, armon <zh...@gmail.com> wrote:
> Hi Lewis,
>
> I even as the xml data in a file, and then command: ./any23 rover @filepath ，but it still can't work, finally,I create a simply xml data file to test, again nothing retrieved, so I think maybe it is not the url issue, but related with parser engine.
>
> Is the any23 0.7 coming, will it meet my particular request? If so, then I just get the latest 0.7 and test it again.
>
> thanks for your reply.
>
> All the best!
>
> armon.chen
>
>
>
> On 2012年6月22日星期五 at 下午5:13, Lewis John Mcgibbney wrote:
>
>> So I suppose there are a couple of options here.
>>
>> On Fri, Jun 22, 2012 at 10:02 AM, armon <zhimeng9@gmail.com (mailto:zhimeng9@gmail.com)> wrote:
>> >
>> > but we know that there is some other data in the page that can't be retrieved, such as the xml data (in the attachment of last email).
>>
>> Yes there is a good bit more content but the parsing implementations
>> within Any23 do not aim to extract content strings... instead the
>> project (parsing anyway) gains its strength from extracting triples
>> and such like.
>>
>> You could quickly fire up a Nutch instance to gather content then use
>> the basic-crawler from Any23 for triples... this is until we implement
>> an Any23 parsing and indexing filter within Nutch which will provide a
>> complete solution to your particular request.
>>
>> You could easily implement the above programmatically which would
>> enable you to fetch page content as well as extract the triples from
>> it separately.
>

-- 
Lewis

Re: about the supported input format of any23

Posted by armon <zh...@gmail.com>.

Hi Lewis, 

I even as the xml data in a file, and then command: ./any23 rover @filepath ，but it still can't work, finally,I create a simply xml data file to test, again nothing retrieved, so I think maybe it is not the url issue, but related with parser engine. 

Is the any23 0.7 coming, will it meet my particular request? If so, then I just get the latest 0.7 and test it again.

thanks for your reply.

All the best!

armon.chen

On 2012年6月22日星期五 at 下午5:13, Lewis John Mcgibbney wrote:

> So I suppose there are a couple of options here.
> 
> On Fri, Jun 22, 2012 at 10:02 AM, armon <zhimeng9@gmail.com (mailto:zhimeng9@gmail.com)> wrote:
> > 
> > but we know that there is some other data in the page that can't be retrieved, such as the xml data (in the attachment of last email).
> 
> Yes there is a good bit more content but the parsing implementations
> within Any23 do not aim to extract content strings... instead the
> project (parsing anyway) gains its strength from extracting triples
> and such like.
> 
> You could quickly fire up a Nutch instance to gather content then use
> the basic-crawler from Any23 for triples... this is until we implement
> an Any23 parsing and indexing filter within Nutch which will provide a
> complete solution to your particular request.
> 
> You could easily implement the above programmatically which would
> enable you to fetch page content as well as extract the triples from
> it separately.

Re: about the supported input format of any23

Posted by Lewis John Mcgibbney <le...@gmail.com>.

So I suppose there are a couple of options here.

On Fri, Jun 22, 2012 at 10:02 AM, armon <zh...@gmail.com> wrote:
>
>  but we know that there is some other data in the page that can't be retrieved, such as the xml data (in the attachment of last email).

Yes there is a good bit more content but the parsing implementations
within Any23 do not aim to extract content strings... instead the
project (parsing anyway) gains its strength from extracting triples
and such like.

You could quickly fire up a Nutch instance to gather content then use
the basic-crawler from Any23 for triples... this is until we implement
an Any23 parsing and indexing filter within Nutch which will provide a
complete solution to your particular request.

You could easily implement the above programmatically which would
enable you to fetch page content as well as extract the triples from
it separately.

Re: about the supported input format of any23

Posted by armon <zh...@gmail.com>.

Hi Lewis, 

 thanks for your reply

 the result we get from the url (in last email) is:

@prefix dcterms: <http://purl.org/dc/terms/> .

<http://en.wikipedia.org/w/api.php?action=query> dcterms:title "MediaWiki API Result" .

 but we know that there is some other data in the page that can't be retrieved, such as the xml data (in the attachment of last email).

 Is there any other way for me to have a try if the ./any23 rover "@url" can't work?

Thanks!

All the best!

armon

On 2012年6月22日星期五 at 下午4:51, Lewis John Mcgibbney wrote:

> Hi Armon,
> 
> I was tripping last night and forgot the quotes around your URL
> 
> if you do any23 rover "$URL" you will be returned the relevant triples
> from the page.
> 
> I also quickly used the parserchecker from Nutch to fetch the URL and
> I get it no bother.
> 
> What were you expecting to get from the page? Yesterday when I
> originally navigated to the URL you provided from within my browser it
> was presented to me in the wiki markup, however today it is in some
> XML and contains a tiny fraction of the content it did yesterday...
> 
> On Thu, Jun 21, 2012 at 11:12 PM, armon <zhimeng9@gmail.com (mailto:zhimeng9@gmail.com)> wrote:
> > and use the xml file as the input data, then use the command ./any23 rover filename
> > 
> > armon

Re: about the supported input format of any23

Posted by Lewis John Mcgibbney <le...@gmail.com>.

Hi Armon,

I was tripping last night and forgot the quotes around your URL

if you do any23 rover "$URL" you will be returned the relevant triples
from the page.

I also quickly used the parserchecker from Nutch to fetch the URL and
I get it no bother.

What were you expecting to get from the page? Yesterday when I
originally navigated to the URL you provided from within my browser it
was presented to me in the wiki markup, however today it is in some
XML and contains a tiny fraction of the content it did yesterday...

On Thu, Jun 21, 2012 at 11:12 PM, armon <zh...@gmail.com> wrote:
> and use the xml file as the input data, then use the command ./any23 rover filename
>
> armon
>

Re: about the supported input format of any23

Posted by armon <zh...@gmail.com>.

and use the xml file as the input data, then use the command ./any23 rover filename

armon


On 2012年6月22日星期五 at 上午6:10, armon wrote:

>  yep,so how to solve it, BTW, it still can't work while I save the xml part of the data in http://en.wikipedia.org/w/api.php?action=query&list=search&srwhat=text&srsearch=meaning , the xml file is in the attachment file.
> 
> 
> 
> armon
> 
> 
> On 2012年6月22日星期五 at 上午5:59, Lewis John Mcgibbney wrote:
> 
> > No your doing nothing incorrectly. I get pretty dismal results both
> > with basic-crawler within Any23 please see below
> > 
> > lewismc@lewismc-HP-Mini-110-3100:~/ASF/trunk/runtime/local$ any23
> > rover http://en.wikipedia.org/w/api.php?action=query&list=search&srwhat=text&srsearch=meaning
> > [1] 2956
> > [2] 2957
> > [3] 2958
> > lewismc@lewismc-HP-Mini-110-3100:~/ASF/trunk/runtime/local$
> > ------------------------------------------------------------------------
> > Apache Any23 :: rover
> > ------------------------------------------------------------------------
> > 
> > @prefix dcterms: <http://purl.org/dc/terms/> .
> > 
> > <http://en.wikipedia.org/w/api.php?action=query> dcterms:title
> > "MediaWiki API Result" .
> > 
> > ------------------------------------------------------------------------
> > Apache Any23 SUCCESS
> > Total time: 2s
> > Finished at: Thu Jun 21 22:53:27 BST 2012
> > Final Memory: 24M/483M
> > ------------------------------------------------------------
> > [1] Done any23 rover
> > http://en.wikipedia.org/w/api.php?action=query
> > [2]- Done listDsearch
> > [3]+ Done srwhat=text
> > 
> > The problem is that I don't know how crawler4j deals with some
> > characters such as '?' within URL strings. and whether it treats them
> > as queries or not? By the looks of the log output above, the URL
> > string is being treated incorrectly.
> > 
> > Sitting above all of this is the fact that I don't think the wiki
> > markup syntax is not supported within Any23 parser implementations.
> > 
> > Lewis
> > 
> > 
> > On Thu, Jun 21, 2012 at 10:29 PM, armon <zhimeng9@gmail.com (mailto:zhimeng9@gmail.com)> wrote:
> > > and even when I copy the xml part of data in the url as the input content,
> > > it still can't work well, but when I try a rdf file, it works well, is
> > > there anything I do incorrectly?
> > > 
> > > 
> > > 2012/6/22 armon <zhimeng9@gmail.com (mailto:zhimeng9@gmail.com)>
> > > 
> > > > Hi Lewis, thanks very much for your reply, I am sorry to interrupt you so
> > > > late,
> > > > 
> > > > the url I used was:
> > > > 
> > > > 
> > > > http://en.wikipedia.org/w/api.php?action=query&list=search&srwhat=text&srsearch=meaning
> > > > 
> > > > 
> > > > and then I used command: ./any23 rover url(showed above) to run the
> > > > result.
> > > > 
> > > > thanks.
> > > > 
> > > > armon
> > > > 
> > > > 
> > > > 
> > > > 
> > > > 
> > > > 
> > > > 2012/6/22 Lewis John Mcgibbney <lewis.mcgibbney@gmail.com (mailto:lewis.mcgibbney@gmail.com)>
> > > > 
> > > > > Hi Armon,
> > > > > 
> > > > > On Thu, Jun 21, 2012 at 4:15 PM, armon <zhimeng9@gmail.com (mailto:zhimeng9@gmail.com)> wrote:
> > > > > > Hi,
> > > > > >  I do some data transform currently from xml-format wiki data
> > > > > 
> > > > > Can you give a small example of this xml?
> > > > > 
> > > > > > (retrieved by wikipedia API) to turtle,
> > > > > 
> > > > > Also a small example of your turtle
> > > > > 
> > > > > > but it seems that the any23 can't
> > > > > > work correctly. (I used the command: ./any23 rover url )
> > > > > 
> > > > > What do you get to std out? I am easily able to use any23 parsers on
> > > > > fetching structure from wikipedia pages... but this is not what you
> > > > > are referring to... I need some more information from you please.
> > > > > 
> > > > > > 
> > > > > >  Does any23 actually support the xml data retrieved by wikipedia
> > > > > API
> > > > > > as the input format ?
> > > > > 
> > > > > Please see above
> > > > > 
> > > > > 
> > > > > 
> > > > > 
> > > > > 
> > > > > --
> > > > > Lewis
> > 
> > 
> > 
> > -- 
> > Lewis
> 
> 
> Attachments: 
> - search.xml
>

Re: about the supported input format of any23

Posted by armon <zh...@gmail.com>.

 yep,so how to solve it, BTW, it still can't work while I save the xml part of the data in http://en.wikipedia.org/w/api.php?action=query&list=search&srwhat=text&srsearch=meaning , the xml file is in the attachment file.



armon


On 2012年6月22日星期五 at 上午5:59, Lewis John Mcgibbney wrote:

> No your doing nothing incorrectly. I get pretty dismal results both
> with basic-crawler within Any23 please see below
> 
> lewismc@lewismc-HP-Mini-110-3100:~/ASF/trunk/runtime/local$ any23
> rover http://en.wikipedia.org/w/api.php?action=query&list=search&srwhat=text&srsearch=meaning
> [1] 2956
> [2] 2957
> [3] 2958
> lewismc@lewismc-HP-Mini-110-3100:~/ASF/trunk/runtime/local$
> ------------------------------------------------------------------------
> Apache Any23 :: rover
> ------------------------------------------------------------------------
> 
> @prefix dcterms: <http://purl.org/dc/terms/> .
> 
> <http://en.wikipedia.org/w/api.php?action=query> dcterms:title
> "MediaWiki API Result" .
> 
> ------------------------------------------------------------------------
> Apache Any23 SUCCESS
> Total time: 2s
> Finished at: Thu Jun 21 22:53:27 BST 2012
> Final Memory: 24M/483M
> ------------------------------------------------------------
> [1] Done any23 rover
> http://en.wikipedia.org/w/api.php?action=query
> [2]- Done list=search
> [3]+ Done srwhat=text
> 
> The problem is that I don't know how crawler4j deals with some
> characters such as '?' within URL strings. and whether it treats them
> as queries or not? By the looks of the log output above, the URL
> string is being treated incorrectly.
> 
> Sitting above all of this is the fact that I don't think the wiki
> markup syntax is not supported within Any23 parser implementations.
> 
> Lewis
> 
> 
> On Thu, Jun 21, 2012 at 10:29 PM, armon <zhimeng9@gmail.com (mailto:zhimeng9@gmail.com)> wrote:
> > and even when I copy the xml part of data in the url as the input content,
> > it still can't work well, but when I try a rdf file, it works well, is
> > there anything I do incorrectly?
> > 
> > 
> > 2012/6/22 armon <zhimeng9@gmail.com (mailto:zhimeng9@gmail.com)>
> > 
> > > Hi Lewis, thanks very much for your reply, I am sorry to interrupt you so
> > > late,
> > > 
> > > the url I used was:
> > > 
> > > 
> > > http://en.wikipedia.org/w/api.php?action=query&list=search&srwhat=text&srsearch=meaning
> > > 
> > > 
> > > and then I used command: ./any23 rover url(showed above) to run the
> > > result.
> > > 
> > > thanks.
> > > 
> > > armon
> > > 
> > > 
> > > 
> > > 
> > > 
> > > 
> > > 2012/6/22 Lewis John Mcgibbney <lewis.mcgibbney@gmail.com (mailto:lewis.mcgibbney@gmail.com)>
> > > 
> > > > Hi Armon,
> > > > 
> > > > On Thu, Jun 21, 2012 at 4:15 PM, armon <zhimeng9@gmail.com (mailto:zhimeng9@gmail.com)> wrote:
> > > > > Hi,
> > > > >  I do some data transform currently from xml-format wiki data
> > > > 
> > > > Can you give a small example of this xml?
> > > > 
> > > > > (retrieved by wikipedia API) to turtle,
> > > > 
> > > > Also a small example of your turtle
> > > > 
> > > > > but it seems that the any23 can't
> > > > > work correctly. (I used the command: ./any23 rover url )
> > > > 
> > > > What do you get to std out? I am easily able to use any23 parsers on
> > > > fetching structure from wikipedia pages... but this is not what you
> > > > are referring to... I need some more information from you please.
> > > > 
> > > > > 
> > > > >  Does any23 actually support the xml data retrieved by wikipedia
> > > > API
> > > > > as the input format ?
> > > > 
> > > > Please see above
> > > > 
> > > > 
> > > > 
> > > > 
> > > > 
> > > > --
> > > > Lewis
> 
> 
> 
> -- 
> Lewis

Re: about the supported input format of any23

Posted by Lewis John Mcgibbney <le...@gmail.com>.

No your doing nothing incorrectly. I get pretty dismal results both
with basic-crawler within Any23 please see below

lewismc@lewismc-HP-Mini-110-3100:~/ASF/trunk/runtime/local$ any23
rover http://en.wikipedia.org/w/api.php?action=query&list=search&srwhat=text&srsearch=meaning
[1] 2956
[2] 2957
[3] 2958
lewismc@lewismc-HP-Mini-110-3100:~/ASF/trunk/runtime/local$
------------------------------------------------------------------------
Apache Any23 :: rover
------------------------------------------------------------------------

@prefix dcterms: <http://purl.org/dc/terms/> .

<http://en.wikipedia.org/w/api.php?action=query> dcterms:title
"MediaWiki API Result" .

------------------------------------------------------------------------
Apache Any23 SUCCESS
Total time: 2s
Finished at: Thu Jun 21 22:53:27 BST 2012
Final Memory: 24M/483M
------------------------------------------------------------
[1]   Done                    any23 rover
http://en.wikipedia.org/w/api.php?action=query
[2]-  Done                    list=search
[3]+  Done                    srwhat=text

The problem is that I don't know how crawler4j deals with some
characters such as '?' within URL strings. and whether it treats them
as queries or not? By the looks of the log output above, the URL
string is being treated incorrectly.

Sitting above all of this is the fact that I don't think the wiki
markup syntax is not supported within Any23 parser implementations.

Lewis


On Thu, Jun 21, 2012 at 10:29 PM, armon <zh...@gmail.com> wrote:
> and even when I copy the xml part of data in the url as the input content,
> it still can't work well,  but when I try a rdf file, it works well, is
> there anything I do incorrectly?
>
>
> 2012/6/22 armon <zh...@gmail.com>
>
>> Hi Lewis, thanks very much for your reply, I am sorry to interrupt you so
>> late,
>>
>> the url I used was:
>>
>>
>> http://en.wikipedia.org/w/api.php?action=query&list=search&srwhat=text&srsearch=meaning
>>
>>
>> and then I used command: ./any23 rover url(showed above) to run the
>> result.
>>
>> thanks.
>>
>> armon
>>
>>
>>
>>
>>
>>
>> 2012/6/22 Lewis John Mcgibbney <le...@gmail.com>
>>
>>> Hi Armon,
>>>
>>> On Thu, Jun 21, 2012 at 4:15 PM, armon <zh...@gmail.com> wrote:
>>> > Hi,
>>> >       I do some data transform currently from xml-format wiki data
>>>
>>> Can you give a small example of this xml?
>>>
>>> > (retrieved by wikipedia API) to turtle,
>>>
>>> Also a small example of your turtle
>>>
>>> > but it seems that the any23 can't
>>> > work correctly. (I used the command: ./any23 rover url )
>>>
>>> What do you get to std out? I am easily able to use any23 parsers on
>>> fetching structure from wikipedia pages... but this is not what you
>>> are referring to... I need some more information from you please.
>>>
>>> >
>>> >       Does any23 actually support the xml data retrieved by wikipedia
>>> API
>>> > as the input format ?
>>>
>>> Please see above
>>>
>>>
>>>
>>>
>>>
>>> --
>>> Lewis
>>>
>>
>>



-- 
Lewis

Re: about the supported input format of any23

Posted by armon <zh...@gmail.com>.

and even when I copy the xml part of data in the url as the input content,
it still can't work well,  but when I try a rdf file, it works well, is
there anything I do incorrectly?


2012/6/22 armon <zh...@gmail.com>

> Hi Lewis, thanks very much for your reply, I am sorry to interrupt you so
> late,
>
> the url I used was:
>
>
> http://en.wikipedia.org/w/api.php?action=query&list=search&srwhat=text&srsearch=meaning
>
>
> and then I used command: ./any23 rover url(showed above) to run the
> result.
>
> thanks.
>
> armon
>
>
>
>
>
>
> 2012/6/22 Lewis John Mcgibbney <le...@gmail.com>
>
>> Hi Armon,
>>
>> On Thu, Jun 21, 2012 at 4:15 PM, armon <zh...@gmail.com> wrote:
>> > Hi,
>> >       I do some data transform currently from xml-format wiki data
>>
>> Can you give a small example of this xml?
>>
>> > (retrieved by wikipedia API) to turtle,
>>
>> Also a small example of your turtle
>>
>> > but it seems that the any23 can't
>> > work correctly. (I used the command: ./any23 rover url )
>>
>> What do you get to std out? I am easily able to use any23 parsers on
>> fetching structure from wikipedia pages... but this is not what you
>> are referring to... I need some more information from you please.
>>
>> >
>> >       Does any23 actually support the xml data retrieved by wikipedia
>> API
>> > as the input format ?
>>
>> Please see above
>>
>>
>>
>>
>>
>> --
>> Lewis
>>
>
>

Re: about the supported input format of any23

Posted by armon <zh...@gmail.com>.

Hi Lewis, thanks very much for your reply, I am sorry to interrupt you so
late,

the url I used was:

http://en.wikipedia.org/w/api.php?action=query&list=search&srwhat=text&srsearch=meaning


and then I used command: ./any23 rover url(showed above) to run the result.

thanks.

armon






2012/6/22 Lewis John Mcgibbney <le...@gmail.com>

> Hi Armon,
>
> On Thu, Jun 21, 2012 at 4:15 PM, armon <zh...@gmail.com> wrote:
> > Hi,
> >       I do some data transform currently from xml-format wiki data
>
> Can you give a small example of this xml?
>
> > (retrieved by wikipedia API) to turtle,
>
> Also a small example of your turtle
>
> > but it seems that the any23 can't
> > work correctly. (I used the command: ./any23 rover url )
>
> What do you get to std out? I am easily able to use any23 parsers on
> fetching structure from wikipedia pages... but this is not what you
> are referring to... I need some more information from you please.
>
> >
> >       Does any23 actually support the xml data retrieved by wikipedia API
> > as the input format ?
>
> Please see above
>
>
>
>
>
> --
> Lewis
>

Re: about the supported input format of any23

Posted by Lewis John Mcgibbney <le...@gmail.com>.

Hi Armon,

On Thu, Jun 21, 2012 at 4:15 PM, armon <zh...@gmail.com> wrote:
> Hi,
>       I do some data transform currently from xml-format wiki data

Can you give a small example of this xml?

> (retrieved by wikipedia API) to turtle,

Also a small example of your turtle

> but it seems that the any23 can't
> work correctly. (I used the command: ./any23 rover url )

What do you get to std out? I am easily able to use any23 parsers on
fetching structure from wikipedia pages... but this is not what you
are referring to... I need some more information from you please.

>
>       Does any23 actually support the xml data retrieved by wikipedia API
> as the input format ?

Please see above

-- 
Lewis