You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@gora.apache.org by Lewis John Mcgibbney <le...@gmail.com> on 2012/01/26 13:16:05 UTC

[DISCUSS] Strategy moving towards Gora 0.2 release

Hi Guys,

I wanted to wait until we heard back from the board regarding graduation
before passing this one off.

What are the general feelings about preparing for a 0.2 release?

I would like to view the following

1) Currently there are 16 open issues for the 0.2-incubating release [1],
of these there don't appear to be any major bugs, however there are some
which I think should definitely be addressed such as adding Cassandra
tests, getting the new website and accompanying datastore specific
documentation resolved, as well as the licensing issues with the sqlbuilder
dependency (possible rewrite of gora-sql to support JOOQ).
2) It would be absolutely great if we could get the gora-solr store (nearly
there pending review and tests passing), and the gora-accumulo store
(nearly there pending review) integrated as this would give us an
oppertunity to open Gora up to two other communities, hopefully driving on
development between 0.2 release and 0.3 release phase.
3) I would like to rejig the entire site to a Maven build, working on
another incubating project, some of the dev's recently got a new site up
and running [2], it looks great and is incredibly easy to deploy to. I
think we could do with something similar for Gora. As we require a TLP
site, this would be a great time to sort this stuff out.

What does everyone think about this? I know there is a bit gfo workt o be
done, and there is no immediate rush to do this, however I'm really keen to
get Gora out there and to get people buzzing about it. This will also mean
that some addition dev work can be put in to Nutchgora branch, hopefully
more people from the Nutch community will see the usefulness of Gora in
solving their problems.

Thanks

Lewis

[1]
https://issues.apache.org/jira/secure/IssueNavigator.jspa?reset=true&jqlQuery=project+%3D+GORA+AND+resolution+%3D+Unresolved+AND+fixVersion+%3D+%220.2-incubating%22+ORDER+BY+priority+DESC&mode=hide
[2] http://incubator.apache.org/any23/
-- 
*Lewis*

Re: [DISCUSS] Strategy moving towards Gora 0.2 release

Posted by Lewis John Mcgibbney <le...@gmail.com>.
Hi Guys,

I'm updating this thread because there has been some progress with Gora in
recent weeks and I firmly believe that we should be aiming for a 0.2
release soon.

Of the 5 issues which are still left for 0.2 release I've got for following
to say about them:

- GORA-76 & 94 Upgrade to Hadoop 1.0.0 & Avro 1.6.2 respectively. I've got
a 80% baked patch which has merged into addressing both issues. It would be
great if someone slightly more clued up on Avro could pick this up and
finish off. There is little I can do before I get an explanation of
replacements for the deprecated API from user@avro.apache.org

- GORA-53 Cassandra tests. Again I have a pretty advanced patch(es) for
this, it uses Hector's embedded server similar to HBase's
HBaseTestingUtility for managing the embedded server. In it's current form
the first test seems to setUp, then the code trips and it keeps settingUp
all of the tests, before tearing them down and progressing to succesfully
build the project put miserably fail all tests. I've not yet written any
Mapreduce tests simply because I couldn't get any further with the patch AS
IS. Some help would be greatly appreciated.

- GORA-74 Removal of Sqlbuilder library. I haven't made much progress with
this at all... I've briefly reviewed JOOQ replacements and I think I have
some idea of what is required, however I've not had sufficient time and
patience to get personal with this issue yet!

- GORA-63 bin/compile-examples.sh does not work. This looks to be as
trivial as they come... however I'm not able to confirm if this is the case
or not and whether the patch Enis attached actually solves the problem. Can
anyone confirm and if so we can get it committed... if not then we can dig
deeper!

That's about it from me for now, I realise that we're all busy with other
things, it would however be excellent to see a Gora 0.2 release out the
door. Would be an excellent milestone to report back to the board about as
well for the upcoming monthly report.

Thanks, have a great weekend.

Lewis

On Mon, Jan 30, 2012 at 7:20 PM, Keith Turner <ke...@deenlo.com> wrote:

> On Sun, Jan 29, 2012 at 6:28 AM, Lewis John Mcgibbney
> <le...@gmail.com> wrote:
> > OK Keith thanks for this.
> >
> > So I suppose we can wait until you guys pass the 1.4.0 release... it is
> no
> > problem.
> >
> >
>
> Ok,  we are hoping to get it out within a week or two.  We would not
> want to hold up your next release though.   Now that Gora is TLP it
> does not have to go through the IPMC for a release.   Accumulo still
> has to, so that could further hold things up.
>



-- 
*Lewis*

Re: [DISCUSS] Strategy moving towards Gora 0.2 release

Posted by Keith Turner <ke...@deenlo.com>.
On Sun, Jan 29, 2012 at 6:28 AM, Lewis John Mcgibbney
<le...@gmail.com> wrote:
> OK Keith thanks for this.
>
> So I suppose we can wait until you guys pass the 1.4.0 release... it is no
> problem.
>
>

Ok,  we are hoping to get it out within a week or two.  We would not
want to hold up your next release though.   Now that Gora is TLP it
does not have to go through the IPMC for a release.   Accumulo still
has to, so that could further hold things up.

Re: [DISCUSS] Strategy moving towards Gora 0.2 release

Posted by Lewis John Mcgibbney <le...@gmail.com>.
OK Keith thanks for this.

So I suppose we can wait until you guys pass the 1.4.0 release... it is no
problem.


On Fri, Jan 27, 2012 at 4:57 PM, Keith Turner <ke...@deenlo.com> wrote:

> On Thu, Jan 26, 2012 at 5:46 PM, Lewis John Mcgibbney
> <le...@gmail.com> wrote:
> > Finally, has anyone had time to review Keith's gora-accumulo store? I'm
> > going to have a crack at it this weekend if I get time.
> >
> > Ta
> > --
> > *Lewis*
>
> FYI
>
> I have been concentrating on getting Accumulo 1.4.0 out the door.  The
> patch I submitted  for a gora-accumulo strore is based on
> 1.4.0-SNAPSHOT.  I tried back porting it to Accumulo 1.3.5 (which is
> released) and ran into some issues.  Instead of spending more time on
> that I decided to put all of my time towards 1.4.0 release.
>



-- 
*Lewis*

Re: [DISCUSS] Strategy moving towards Gora 0.2 release

Posted by Keith Turner <ke...@deenlo.com>.
On Thu, Jan 26, 2012 at 5:46 PM, Lewis John Mcgibbney
<le...@gmail.com> wrote:
> Finally, has anyone had time to review Keith's gora-accumulo store? I'm
> going to have a crack at it this weekend if I get time.
>
> Ta
> --
> *Lewis*

FYI

I have been concentrating on getting Accumulo 1.4.0 out the door.  The
patch I submitted  for a gora-accumulo strore is based on
1.4.0-SNAPSHOT.  I tried back porting it to Accumulo 1.3.5 (which is
released) and ran into some issues.  Instead of spending more time on
that I decided to put all of my time towards 1.4.0 release.

Re: [DISCUSS] Strategy moving towards Gora 0.2 release

Posted by Lewis John Mcgibbney <le...@gmail.com>.
OK.

Hi Henry,

{bq}What additional work need to be done in Nutchgora branch?{bq}

There is quite a bit in the pipeline, and the release discussion hasn't
really taken off. The more mature Gora gets the better chance we have of
getting a real good shot at the Nutchgora release. You can view the open
issues here
https://issues.apache.org/jira/browse/NUTCH/fixforversion/12314893#atl_token=A5KQ-2QAV-T4JA-FDED|cc06a8a6a7cc1f3cdd3255aa108474acd60cc914|lin&selectedTab=com.atlassian.jira.plugin.system.project%3Aversion-issues-panel

Hi Andrzej,

On Thu, Jan 26, 2012 at 9:59 PM, Andrzej Bialecki <ab...@getopt.org> wrote:

>
> I'm interested in pushing forward gora-solr. I believe that with the
> recent developments in Solr (SolrCloud has just been committed to trunk)
> this backend + Nutch Gora could offer a convenient, mostly
> single-technology, combined crawl/store/search platform.
>
Huge +1


> The issues with gora-solr tests were related to unclear semantics of some
> operations. I know there has been some progress on this since that patch
> was created, so I'd like to bring it up to date (using Solr trunk).
>
> I saw this in your patch and merely brought the Maven stuff up to date. I
understodd that you were wanting to use Solr trunk, but the patch will read
3.5 (I think), this was the most stable release I could find on mvn central.

Finally, has anyone had time to review Keith's gora-accumulo store? I'm
going to have a crack at it this weekend if I get time.

Ta
-- 
*Lewis*

Re: [DISCUSS] Strategy moving towards Gora 0.2 release

Posted by Andrzej Bialecki <ab...@getopt.org>.
On 26/01/2012 13:16, Lewis John Mcgibbney wrote:
> Hi Guys,
>
> I wanted to wait until we heard back from the board regarding graduation
> before passing this one off.
>
> What are the general feelings about preparing for a 0.2 release?
>
> I would like to view the following
>
> 1) Currently there are 16 open issues for the 0.2-incubating release [1],
> of these there don't appear to be any major bugs, however there are some
> which I think should definitely be addressed such as adding Cassandra
> tests, getting the new website and accompanying datastore specific
> documentation resolved, as well as the licensing issues with the sqlbuilder
> dependency (possible rewrite of gora-sql to support JOOQ).
> 2) It would be absolutely great if we could get the gora-solr store (nearly
> there pending review and tests passing), and the gora-accumulo store
> (nearly there pending review) integrated as this would give us an
> oppertunity to open Gora up to two other communities, hopefully driving on
> development between 0.2 release and 0.3 release phase.

I'm interested in pushing forward gora-solr. I believe that with the 
recent developments in Solr (SolrCloud has just been committed to trunk) 
this backend + Nutch Gora could offer a convenient, mostly 
single-technology, combined crawl/store/search platform.

The issues with gora-solr tests were related to unclear semantics of 
some operations. I know there has been some progress on this since that 
patch was created, so I'd like to bring it up to date (using Solr trunk).

-- 
Best regards,
Andrzej Bialecki     <><
  ___. ___ ___ ___ _ _   __________________________________
[__ || __|__/|__||\/|  Information Retrieval, Semantic Web
___|||__||  \|  ||  |  Embedded Unix, System Integration
http://www.sigram.com  Contact: info at sigram dot com


Re: [DISCUSS] Strategy moving towards Gora 0.2 release

Posted by Henry Saputra <he...@gmail.com>.
Makes sense to me.

+1

What additional work need to be done in Nutchgora branch?

- Henry

On Thu, Jan 26, 2012 at 4:16 AM, Lewis John Mcgibbney
<le...@gmail.com> wrote:
> Hi Guys,
>
> I wanted to wait until we heard back from the board regarding graduation
> before passing this one off.
>
> What are the general feelings about preparing for a 0.2 release?
>
> I would like to view the following
>
> 1) Currently there are 16 open issues for the 0.2-incubating release [1],
> of these there don't appear to be any major bugs, however there are some
> which I think should definitely be addressed such as adding Cassandra
> tests, getting the new website and accompanying datastore specific
> documentation resolved, as well as the licensing issues with the sqlbuilder
> dependency (possible rewrite of gora-sql to support JOOQ).
> 2) It would be absolutely great if we could get the gora-solr store (nearly
> there pending review and tests passing), and the gora-accumulo store
> (nearly there pending review) integrated as this would give us an
> oppertunity to open Gora up to two other communities, hopefully driving on
> development between 0.2 release and 0.3 release phase.
> 3) I would like to rejig the entire site to a Maven build, working on
> another incubating project, some of the dev's recently got a new site up
> and running [2], it looks great and is incredibly easy to deploy to. I
> think we could do with something similar for Gora. As we require a TLP
> site, this would be a great time to sort this stuff out.
>
> What does everyone think about this? I know there is a bit gfo workt o be
> done, and there is no immediate rush to do this, however I'm really keen to
> get Gora out there and to get people buzzing about it. This will also mean
> that some addition dev work can be put in to Nutchgora branch, hopefully
> more people from the Nutch community will see the usefulness of Gora in
> solving their problems.
>
> Thanks
>
> Lewis
>
> [1]
> https://issues.apache.org/jira/secure/IssueNavigator.jspa?reset=true&jqlQuery=project+%3D+GORA+AND+resolution+%3D+Unresolved+AND+fixVersion+%3D+%220.2-incubating%22+ORDER+BY+priority+DESC&mode=hide
> [2] http://incubator.apache.org/any23/
> --
> *Lewis*