You are viewing a plain text version of this content. The canonical link for it is here.
Posted to xindice-dev@xml.apache.org by Dawid Weiss <Da...@cs.put.poznan.pl> on 2002/02/18 14:05:16 UTC

Re[2]: Metadata (was: Planning the future)

KS> What if it was something that was stupid simple like automatically
KS> creating two collections, one for documents and one for metadata? Each

We have adopted such a 'stupid simple' approach. It works well. We'd rather
think of the approach as simple than stupid of course... Any term used, the
separation  of metadata from the contents based on a prefixed subcollection
(prefix   reserved)   has  several  advantages  over  storing  them  inside
documents.

KS> The metadata doc
KS> would have database managed portions and user managed portions,

Amazing  coincidence  in  thinking. It only proves the design is simple and
intuitive...

KS> The meta-data collection name would just have a reserved prefix/suffix and
KS> could be ignored in collection listings.

Exactly :)

KS> Maybe It could be an option so that if you don't want the overhead you

Exactly :)

KS> access the data like any other collection. It wouldn't even require any
KS> changes to the XML:DB APi if you layered on top.

Exactly. Amazing, I could have been an author of your e-mail...

KS> It's kind of hacky, but it's simple and in reality it might work decently 
KS> well.

It does. We have such an API.

KS> I don't know, just a thought.

A  very  good  one.  Thanks,  it  to  some extent proves we chose the right
approach to the problem we faced (i.e. lack of versioning/ metadata).

Dawid


Re: Metadata (was: Planning the future)

Posted by Kimbro Staken <ks...@xmldatabases.org>.
On Monday, February 18, 2002, at 07:16 AM, Gianugo Rabellino wrote:

> Kimbro Staken wrote:
>> On Monday, February 18, 2002, at 06:05 AM, Dawid Weiss wrote:
>>>
>>> KS> What if it was something that was stupid simple like automatically
>>> KS> creating two collections, one for documents and one for metadata?
>
> OK, this is the cleanest hack I can think of, but still it's a hack. :)
>

Never claimed otherwise, "It's kind of hacky". :-)

> It seems to me that we are playing different levels, let's try to make 
> some points clear: you're talking about practical solutions to achieve 
> the result of having metadata, I'm stuck to a generic metadata support, i.
> e. if should metadata be offered to Joe User.  This is *vital*, and we 
> all agree on this, at least for basic "stat-like" metadata. Now:
>

You're right, this is a completely different issue. The problem with the 
XML:DB API is that it's going to be colored by what other databases 
support. I don't know exactly what they support though, so I have to do 
research in order to figure it out. From what I remember most don't have 
very rich support, if any at all.

> * Supporting this means IMHO extending the XML:DB APIs somehow (I'm not 
> in favor of proprietary extensions and the like).
>

You're probably right, I just don't want to tackle that yet. The XML:DB 
API is probably going to change quite a bit before too much longer. I want 
to focus on Xindice right now. Let's figure out what we want to support 
and how we want to expose it through our network API. Then we can look at 
seeing how the XML:DB API shapes up. I'd also prefer to have a little more 
real world experience with this before tackling the real API changes. My 
opinion is that we just add a proprietary service until then

In addition, I'm kind of driving both projects, among others, and right 
now I want to make sure we get Xindice moving nicely first. We've been 
kind of stalled as far as development goes. Once we're rolling along 
nicely, then I'll focus a little more on the XML:DB API project which is 
also kind of stalled. I also need to factor in SixDML which is becoming an 
XML:DB project too.

The major problem with XML:DB is that the XML database industry overall is 
really struggling. We need to get SiXDML going as there is a lot more 
vendor interest in that then the raw API. The good thing is it builds on 
the raw API so it will drive that too. It's just going to take some time 
and I don't want it to affect this project too much. It's more important 
that we get solid support in Xindice first. After all if the XML database 
industry doesn't pick up soon, Xindice may be the only one left and then 
we won't have to worry about anything else. :-(

> * If we are to undertake an effort to change the XML:DB APIs IMHO it's 
> worth to have some investigation and understand what kind of metadata we 
> are to offer. I'm all for having a set of "system level" metadata plus a 
> hook for user ones, but I might be stuck by FS at its best, so please don'
> t hesitate to try my asbestos underwear on this issue. :)
>
> * Given this, it doesn't really matter, at this point of the discussion, 
> whether metadata should be stored inside the DOM, in a parallel document,
>  in a parallel Collection or in my closet: this is just an implementation 
> detail.
>
> I hope I made myself clear now :) Yet the more I think about it the more 
> I'm convinced that this discussion really belongs to xapi-dev...
>

Yes it does.

> Ciao,
>
> -- Gianugo Rabellino
>
>
Kimbro Staken
XML Database Software, Consulting and Writing
http://www.xmldatabases.org/


Re: Metadata (was: Planning the future)

Posted by Gianugo Rabellino <gi...@apache.org>.
Kimbro Staken wrote:
> 
> On Monday, February 18, 2002, at 06:05 AM, Dawid Weiss wrote:
> 
>>
>> KS> What if it was something that was stupid simple like automatically
>> KS> creating two collections, one for documents and one for metadata? 

OK, this is the cleanest hack I can think of, but still it's a hack. :)

It seems to me that we are playing different levels, let's try to make 
some points clear: you're talking about practical solutions to achieve 
the result of having metadata, I'm stuck to a generic metadata support, 
i.e. if should metadata be offered to Joe User.  This is *vital*, and we 
all agree on this, at least for basic "stat-like" metadata. Now:

* Supporting this means IMHO extending the XML:DB APIs somehow (I'm not 
in favor of proprietary extensions and the like).

* If we are to undertake an effort to change the XML:DB APIs IMHO it's 
worth to have some investigation and understand what kind of metadata we 
are to offer. I'm all for having a set of "system level" metadata plus a 
hook for user ones, but I might be stuck by FS at its best, so please 
don't hesitate to try my asbestos underwear on this issue. :)

* Given this, it doesn't really matter, at this point of the discussion, 
whether metadata should be stored inside the DOM, in a parallel 
document, in a parallel Collection or in my closet: this is just an 
implementation detail.

I hope I made myself clear now :) Yet the more I think about it the more 
I'm convinced that this discussion really belongs to xapi-dev...

Ciao,

-- 
Gianugo Rabellino


Re[6]: Metadata (was: Planning the future)

Posted by Dawid Weiss <Da...@cs.put.poznan.pl>.
KS> Are you still planning to contribute this code? It sounds like it would be
KS> a really good starting place.

I  think it would be no problem. We have JUST finished the prototype though
and  JUnit  shows it works ok. My good guess would be to hold on a month or
two  -  we'll  gain  experience  on  the  interface, its pros and cons, and
contribute  a  more  reliable, stable version. Another side of this coin is
that   we   have  it  implemented  in a very structural-like approach, i.e.
there  has  been no class-responsibility analysis etc. The reason for it is
that we intentionally wanted the interface to be so simple.

We  may change it in the future - time will tell.

KS> That's what we're trying to figure out. I don't actually use Xindice for
KS> anything, so I rely on the real users to tell us what we need.

I'm glad to hear it.
Dawid


Re: Re[4]: Metadata (was: Planning the future)

Posted by Kimbro Staken <ks...@xmldatabases.org>.
On Monday, February 18, 2002, at 06:21 AM, Dawid Weiss wrote:

>
> KS> Yeah, sorry I just realized Krzysztof posted a description of that 
> very
> KS> thing right before I sent that mail. Heh, heh, see what I get for 
> being
> KS> too busy writing and not reading. So Kudos to you guys for not only 
> coming
> KS> up with it, but also implementing it.
>
> That's  all right, it was just funny to see you came out with the same 
> idea
> :)  We  actually spent some time investigating various concepts and the 
> one
> we  chose  seemed  to  have least drawbacks: the database remains "clean"
>  -
> xpath,  document  retrieval  etc - these are all transparent, which was 
> out
> goal.
>

Are you still planning to contribute this code? It sounds like it would be 
a really good starting place.

> Nonetheless,  I  would  love  to  see  a  standard  API  for versioning 
> and
> metadata, and it seems many other people seem to crave for it, not only 
> me.
> It  may  be quite easy to walk around the problem, but since the project 
> is
> open  source, it should perhaps be taken into account what the actual 
> users
> need,  not  only  what  is the 'ideal path', which Xindice should follow 
> ;)
> Just my remark, nothing personal.
>

That's what we're trying to figure out. I don't actually use Xindice for 
anything, so I rely on the real users to tell us what we need. Now, I'm 
definitely hearing that metadata is an important issue, even more then I 
already thought. This is what I need to hear, and I'm listening.


> Dawid
>
>
Kimbro Staken
XML Database Software, Consulting and Writing
http://www.xmldatabases.org/


Re[4]: Metadata (was: Planning the future)

Posted by Dawid Weiss <Da...@cs.put.poznan.pl>.
KS> Yeah, sorry I just realized Krzysztof posted a description of that very 
KS> thing right before I sent that mail. Heh, heh, see what I get for being 
KS> too busy writing and not reading. So Kudos to you guys for not only coming 
KS> up with it, but also implementing it.

That's  all right, it was just funny to see you came out with the same idea
:)  We  actually spent some time investigating various concepts and the one
we  chose  seemed  to  have least drawbacks: the database remains "clean" -
xpath,  document  retrieval  etc - these are all transparent, which was out
goal.

Nonetheless,  I  would  love  to  see  a  standard  API  for versioning and
metadata, and it seems many other people seem to crave for it, not only me.
It  may  be quite easy to walk around the problem, but since the project is
open  source, it should perhaps be taken into account what the actual users
need,  not  only  what  is the 'ideal path', which Xindice should follow ;)
Just my remark, nothing personal.

Dawid


Re: Re[2]: Metadata (was: Planning the future)

Posted by Kimbro Staken <ks...@xmldatabases.org>.
On Monday, February 18, 2002, at 06:05 AM, Dawid Weiss wrote:

>
> KS> What if it was something that was stupid simple like automatically
> KS> creating two collections, one for documents and one for metadata? Each
>
> We have adopted such a 'stupid simple' approach. It works well. We'd 
> rather
> think of the approach as simple than stupid of course... Any term used, 
> the
> separation  of metadata from the contents based on a prefixed 
> subcollection
> (prefix   reserved)   has  several  advantages  over  storing  them  
> inside
> documents.
>

Yeah, sorry I just realized Krzysztof posted a description of that very 
thing right before I sent that mail. Heh, heh, see what I get for being 
too busy writing and not reading. So Kudos to you guys for not only coming 
up with it, but also implementing it.

> KS> The metadata doc
> KS> would have database managed portions and user managed portions,
>
> Amazing  coincidence  in  thinking. It only proves the design is simple 
> and
> intuitive...
>
> KS> The meta-data collection name would just have a reserved 
> prefix/suffix and
> KS> could be ignored in collection listings.
>
> Exactly :)
>
> KS> Maybe It could be an option so that if you don't want the overhead you
>
> Exactly :)
>
> KS> access the data like any other collection. It wouldn't even require 
> any
> KS> changes to the XML:DB APi if you layered on top.
>
> Exactly. Amazing, I could have been an author of your e-mail...
>
> KS> It's kind of hacky, but it's simple and in reality it might work 
> decently
> KS> well.
>
> It does. We have such an API.
>
> KS> I don't know, just a thought.
>
> A  very  good  one.  Thanks,  it  to  some extent proves we chose the 
> right
> approach to the problem we faced (i.e. lack of versioning/ metadata).

>
> Dawid
>
>
Kimbro Staken
XML Database Software, Consulting and Writing
http://www.xmldatabases.org/