Posted to dev@stanbol.apache.org by Fabian Christ <ch...@googlemail.com> on 2017/01/16 08:33:02 UTC

The Future of Apache Stanbol

Hi Stanbolers,

the project shipped its 1.0.0 release in November 2016. I
would like to get some feedback about what you think should happen
with Apache Stanbol in the future.

At the moment I have no overview of who is using Stanbol or for
whom Stanbol is an essential software component in a larger solution.
If there are such people, I would like to invite them to write down
their view on Stanbol and what should happen.

Maybe the time for Stanbol is over after 5 years of being a top level
Apache project. The ASF has the concept of moving projects to the
attic once there is not enough interest or community for a project
anymore. Another way would be that people start their own GitHub
branch without the overhead of an Apache top level project. Just
thoughts here!

Best
Fabian

-- 
Fabian
http://twitter.com/fctwitt

RE: The Future of Apache Stanbol

Posted by "Lejtovicz, Katalin Eszter" <Ka...@oeaw.ac.at>.
Dear All,

We are also using Stanbol in our research project to do entity linking to LOD resources like GeoNames, GND, etc. 
Stanbol is a critical component for us, as it is a highly configurable tool that allows us to index reference resources and perform enhancement on natural language documents.

Best regards,
Katalin Lejtovicz

-----Original Message-----
From: Yauhen Klimovich [mailto:yauhenklimovich@gmail.com] 
Sent: Monday, January 16, 2017 10:06 AM
To: dev@stanbol.apache.org
Subject: Re: The Future of Apache Stanbol

Hi Fabian,
I guess I could call myself a Stanboler.

In our case we use Stanbol in production as a critical part of our content-enhancement engine.
The main use case is extracting the concepts we provide via our ontology, which is handled by Stanbol.

Thank you, all.

— Yauhen



Re: The Future of Apache Stanbol

Posted by Yauhen Klimovich <ya...@gmail.com>.
Hi Fabian,
I guess I could call myself a Stanboler.

In our case we use Stanbol in production as a critical part of our content-enhancement engine.
The main use case is extracting the concepts we provide via our ontology, which is handled by Stanbol.

Thank you, all.

— Yauhen



Re: The Future of Apache Stanbol

Posted by "A. Soroka" <aj...@virginia.edu>.
We use Stanbol in production (at the library of a major public university) but in a slightly different way than has been described. We use the EntityHub in conjunction with the Open Refine RDF extension:

http://refine.deri.ie/

In fact, I would like to use some of the Enhancer tools, but I face a lot of opposition from librarians who are confused and frightened by change or automation. In any event, some thoughts on where the project might go:

If it ends up moving to the attic, there are some components that seem to me to have places in other Apache projects. For example, some of the enhancer technology might find a home in UIMA (not speaking from expertise on UIMA, just as a casual observer). The EntityHub might work well as a component in Marmotta. It might be worth mapping out some of the potential moves so that we could start making connections with those communities. 

One of the things that has always held me back from getting involved with Stanbol more deeply is the relatively high barrier to development. I am a committer for Apache Jena, and it was not too difficult to delve into the codebase and begin to make real contributions (of course I am still learning). Stanbol's very complex module structure doesn't help in this regard. IMHO, it would also be nice if Stanbol used a more common OSGi provisioning system, like Karaf Features, but I am probably biased in that regard.

Just some idle thoughts. I would like to take part in this conversation. I think there are some great ideas in Stanbol, and whatever happens to the project/codebase, it's important to find good homes for those ideas.

---
A. Soroka
The University of Virginia Library

> On Jan 16, 2017, at 6:01 AM, Rafa Haro <rh...@apache.org> wrote:
> 
> Hi Fabian and Devs,
> 
> We also use Stanbol widely, both in customer projects and within
> our main product. I admit that we should be contributing more than we
> currently do, but sometimes it is difficult to find the time to turn
> something you have customized for your concrete needs into a more
> generic contribution.
> 
> Apart from that, it seems that Stanbol nowadays covers a couple of
> well-known use cases that most end users adopt and where, in my opinion,
> further contributions and improvements are unlikely to arise. Still in
> my opinion, one reason is the complexity of the code. Another reason,
> given how little activity there is on the list about this, is that the
> Stanbol Enhancer features seem to be enough for end users.
> 
> I think we are not failing in the objective of making Apache Stanbol a
> great tool for developers. We are probably failing, I don't know why, in
> the objective of building and maintaining a community.
> 
> Those are just my thoughts
> 
> On Mon, Jan 16, 2017 at 11:48 AM Rafa Haro <rh...@apache.org> wrote:
> 
>> Hi Arthi,
>> 
>> I started to develop one: https://github.com/rafaharo/pystanbol
>> 
>> It only covers Enhancer for now. Contributions are more than welcome.
>> 
>> Rafa
>> 
>> On Mon, Jan 16, 2017 at 11:41 AM <ar...@wipro.com> wrote:
>> 
>> Hi,
>> Stanbol is a great solution for entity extraction and many NLP problems.
>> I have used it for different pilots and customer implementations.
>> I also plan to use it in the future.
>> The community is also great and very helpful.
>> If there were a way for a non-Java programmer, say a Python programmer, to
>> more easily set up and consume the REST services, more folks could use it.
>> 
>> Thanks and Regards,
>> Arthi
>> 
>> 
>> 
>> -----Original Message-----
>> From: Bertrand Delacretaz [mailto:bdelacretaz@apache.org]
>> Sent: Monday, January 16, 2017 3:40 PM
>> To: dev@stanbol.apache.org
>> Subject: Re: The Future of Apache Stanbol
>> 
>> ** This mail has been sent from an external source **
>> 
>> Hi,
>> 
>> On Mon, Jan 16, 2017 at 9:33 AM, Fabian Christ <
>> christ.fabian@googlemail.com> wrote:
>>> ...Maybe the time for Stanbol is over after 5 years of being a top
>>> level Apache project. The ASF has the concept of moving projects to
>>> the attic once there is not enough interest or community for a project
>>> anymore....
>> 
>> To be precise, an ASF project has to move to Attic if there are less than
>> 3 active PMC members, which is the minimum required to vote on releases.
>> 
>> Or if the project is unable to respond to security or other critical bug
>> reports, due to lack of available contributors.
>> 
>> It's also fine to move to Attic voluntarily if people think the above
>> criteria might not be met for much longer, of course - I just wanted to
>> clarify the requirements.
>> 
>> -Bertrand
>> The information contained in this electronic message and any attachments
>> to this message are intended for the exclusive use of the addressee(s) and
>> may contain proprietary, confidential or privileged information. If you are
>> not the intended recipient, you should not disseminate, distribute or copy
>> this e-mail. Please notify the sender immediately and destroy all copies of
>> this message and any attachments. WARNING: Computer viruses can be
>> transmitted via email. The recipient should check this email and any
>> attachments for the presence of viruses. The company accepts no liability
>> for any damage caused by any virus transmitted by this email.
>> www.wipro.com
>> 
>> 


Re: The Future of Apache Stanbol

Posted by Rafa Haro <rh...@apache.org>.
Hi Fabian and Devs,

We also use Stanbol widely, both in customer projects and within
our main product. I admit that we should be contributing more than we
currently do, but sometimes it is difficult to find the time to turn
something you have customized for your concrete needs into a more
generic contribution.

Apart from that, it seems that Stanbol nowadays covers a couple of
well-known use cases that most end users adopt and where, in my opinion,
further contributions and improvements are unlikely to arise. Still in
my opinion, one reason is the complexity of the code. Another reason,
given how little activity there is on the list about this, is that the
Stanbol Enhancer features seem to be enough for end users.

I think we are not failing in the objective of making Apache Stanbol a
great tool for developers. We are probably failing, I don't know why, in
the objective of building and maintaining a community.

Those are just my thoughts

On Mon, Jan 16, 2017 at 11:48 AM Rafa Haro <rh...@apache.org> wrote:

> Hi Arthi,
>
> I started to develop one: https://github.com/rafaharo/pystanbol
>
> It only covers Enhancer for now. Contributions are more than welcome.
>
> Rafa
>
> On Mon, Jan 16, 2017 at 11:41 AM <ar...@wipro.com> wrote:
>
> Hi,
> Stanbol is a great solution for entity extraction and many NLP problems.
> I have used it for different pilots and customer implementations.
> I also plan to use it in the future.
> The community is also great and very helpful.
> If there were a way for a non-Java programmer, say a Python programmer, to
> more easily set up and consume the REST services, more folks could use it.
>
> Thanks and Regards,
> Arthi
>
>
>
> -----Original Message-----
> From: Bertrand Delacretaz [mailto:bdelacretaz@apache.org]
> Sent: Monday, January 16, 2017 3:40 PM
> To: dev@stanbol.apache.org
> Subject: Re: The Future of Apache Stanbol
>
> ** This mail has been sent from an external source **
>
> Hi,
>
> On Mon, Jan 16, 2017 at 9:33 AM, Fabian Christ <
> christ.fabian@googlemail.com> wrote:
> > ...Maybe the time for Stanbol is over after 5 years of being a top
> > level Apache project. The ASF has the concept of moving projects to
> > the attic once there is not enough interest or community for a project
> > anymore....
>
> To be precise, an ASF project has to move to Attic if there are less than
> 3 active PMC members, which is the minimum required to vote on releases.
>
> Or if the project is unable to respond to security or other critical bug
> reports, due to lack of available contributors.
>
> It's also fine to move to Attic voluntarily if people think the above
> criteria might not be met for much longer, of course - I just wanted to
> clarify the requirements.
>
> -Bertrand
>
>
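
On the point raised above about consuming the REST services from Python: a minimal sketch using only the standard library, separate from the pystanbol client mentioned in the thread. It assumes a Stanbol launcher reachable at http://localhost:8080 with the Enhancer exposed under /enhancer, and that text/turtle is an acceptable response format; the URL, the function names, and the sample sentence are illustrative assumptions, not something stated in this thread.

```python
import urllib.request

# Assumption: a Stanbol launcher running locally; adjust to your deployment.
DEFAULT_ENHANCER_URL = "http://localhost:8080/enhancer"


def build_enhancer_request(text, url=DEFAULT_ENHANCER_URL, accept="text/turtle"):
    """Build (but do not send) a POST request carrying plain text to the Enhancer."""
    return urllib.request.Request(
        url,
        data=text.encode("utf-8"),
        headers={
            "Content-Type": "text/plain; charset=utf-8",
            "Accept": accept,  # requested RDF serialization of the results
        },
        method="POST",
    )


def enhance(text, url=DEFAULT_ENHANCER_URL, accept="text/turtle", timeout=30):
    """Send the text to the Enhancer and return the response body as a string."""
    request = build_enhancer_request(text, url, accept)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return response.read().decode("utf-8")


# Usage (needs a running Stanbol instance, so it is left commented out):
# print(enhance("The Stanbol enhancer can detect famous cities such as Paris."))
```

Splitting request construction from sending keeps the first half checkable without a running server; a real client would likely also parse the returned RDF with a library of its choice.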

Re: The Future of Apache Stanbol

Posted by Phillip Rhodes <mo...@gmail.com>.
Stanbol is crucially important to us at Fogbeam Labs, and I have
intended for some time to get more involved in the project.  Time has
been the limiting factor, but I will try to find a way to carve out
some time now.  Especially now that we're wrapping up some other work
that was sort of pre-requisite to the stuff we're doing with Stanbol.


Phil

This message optimized for indexing by NSA PRISM


On Mon, Mar 20, 2017 at 4:43 AM, Fabian Christ
<ch...@googlemail.com> wrote:
> Hi Stanbolers,
>
> thanks for all the replies. This looks really promising and is a good
> sign that so much interest still exists in using the Stanbol
> technology.
>
> Now, we have to figure out a way to get everyone on board to start
> contributing and taking responsibility for the project.
>
> Best
> Fabian
>
> 2017-02-27 13:14 GMT+01:00 Andrea Nuzzolese <an...@istc.cnr.it>:
>> Hi all,
>>
>> first, sorry for the late reply; I hope this email is still helpful to the discussion.
>> At the Semantic Technology Laboratory of the CNR in Italy we have been using Apache Stanbol since its beginnings, and we still use it in different research projects dealing with semantic enhancement of content.
>>
>> Additionally, a few months ago we founded a start-up (basically a spin-off of the CNR) which reuses many parts of the Stanbol stack in its products.
>> Hence, as a start-up we are really interested in the future of Apache Stanbol, and we can put in some effort by providing additional developers to contribute to its evolution.
>>
>> —
>> Andrea Giovanni Nuzzolese, Ph.D.
>> Semantic Technology Laboratory (STLab)
>> Institute for Cognitive Science and Technology (ISTC)
>> National Research Council (CNR)
>> Via Nomentana 56, Roma - Italy
>>
>>> On 24 Feb 2017, at 16:08, A. Soroka <aj...@virginia.edu> wrote:
>>>
>>> Yes, it would certainly be best to have more than one opportunity, with such a global group. If you are not available on Monday, Rafa, let's not push for that date. Let's find a date to start with that is good for both you and Rupert.
>>>
>>> Rupert?
>>>
>>> ---
>>> A. Soroka
>>> The University of Virginia Library
>>>
>>>> On Feb 24, 2017, at 10:00 AM, Rafa Haro <rh...@apache.org> wrote:
>>>>
>>>> Hi,
>>>>
>>>> I won't be available next Monday, but maybe we can organize more than
>>>> one :-). I insist: for this to make any sense, we need @Rupert
>>>> to join these meetings. He is, by far, the most experienced Stanbol
>>>> developer and the one with the broadest knowledge. Please, @Rupert, raise
>>>> your voice, man :-)
>>>>
>>>> Cheers,
>>>> Rafa
>>>>
>>>> On Fri, Feb 24, 2017 at 3:33 PM Michal Krajňanský <
>>>> michal.krajnansky@gmail.com> wrote:
>>>>
>>>>> Hi Mr. Soroka,
>>>>>
>>>>> Thank you for the quick reply and the direct call proposal. It will be my
>>>>> pleasure to meet you virtually.
>>>>>
>>>>> The suggested time, Monday 27th at 10AM EST, works well, so you may consider
>>>>> it settled. I will also try to involve the managing director of our
>>>>> department located in Brno, Czech Republic, Matej Dusik.
>>>>>
>>>>> I am looking forward to having a fruitful discussion.
>>>>>
>>>>> Best Regards,
>>>>>
>>>>> Michal Krajnansky
>>>>>
>>>>> On Fri, Feb 24, 2017 at 3:22 PM A. Soroka <aj...@virginia.edu> wrote:
>>>>>
>>>>>> We had a few volunteers to begin learning the codebase to take it forward,
>>>>>> and willingness from at least some of the current committers to teach, but
>>>>>> I think we need a bit more organization! :grin:
>>>>>>
>>>>>> I will make a concrete suggestion. Would it be possible to have a video
>>>>>> call (perhaps with Google Hangout) about this on (just throwing out a date
>>>>>> here) this coming week, Monday 27 February, at 10AM EST?
>>>>>>
>>>>>> I would be able to attend. I realize that we are a far-flung group, so I'm
>>>>>> just throwing out that date to get us started. The most important thing, of
>>>>>> course, is to get as many current committers involved as is practical.
>>>>>>
>>>>>> ---
>>>>>> A. Soroka
>>>>>> The University of Virginia Library
>>>>>>
>>>>>>> On Feb 24, 2017, at 8:50 AM, Michal Krajňanský <
>>>>>> michal.krajnansky@gmail.com> wrote:
>>>>>>>
>>>>>>> Dear Stanbol users,
>>>>>>>
>>>>>>> I was wondering whether there were any results from the discussion about
>>>>>>> the future of Apache Stanbol.
>>>>>>>
>>>>>>> I work for an R&D team at Konica Minolta Laboratory Europe, and we have
>>>>>>> been using the Stanbol enhancement pipeline in our prototypes concerning
>>>>>>> information extraction from unstructured data.
>>>>>>>
>>>>>>> We are highly interested in the continuing evolution of the Stanbol
>>>>>>> project and willing to actively support it. Is there anyone who could
>>>>>>> tell us what would be a good way to approach the existing Stanbol
>>>>>>> stakeholders and support the project through active development and
>>>>>>> possibly in other ways?
>>>>>>>
>>>>>>>
>>>>>>> Michal Krajnansky
>>>>>>> Research Specialist Junior, Konica Minolta Laboratory Europe
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>> On Mon, Jan 23, 2017 at 6:14 PM A. Soroka <aj...@virginia.edu> wrote:
>>>>>>>
>>>>>>>> Perhaps we can start a page for people to put their name down for this?
>>>>>>>> I couldn't find an Apache wiki site-- is there somewhere the developers
>>>>>>>> think appropriate?
>>>>>>>>
>>>>>>>> ---
>>>>>>>> A. Soroka
>>>>>>>> Apache Jena / The University of Virginia Library
>>>>>>>>
>>>>>>>>> On Jan 23, 2017, at 12:08 PM, Aaron Coburn <ac...@amherst.edu>
>>>>>> wrote:
>>>>>>>>>
>>>>>>>>> I would also be very interested.
>>>>>>>>>
>>>>>>>>> We use the entityhub component quite a bit at our institution, and I
>>>>>>>>> would be happy to be involved.
>>>>>>>>>
>>>>>>>>> Aaron Coburn
>>>>>>>>>
>>>>>>>>>
>>>>>>>>>> On Jan 22, 2017, at 12:30 PM, Antero Duarte <a....@gmail.com>
>>>>>>>> wrote:
>>>>>>>>>>
>>>>>>>>>> I would definitely be up for that!
>>>>>>>>>>
>>>>>>>>>> On Sat, 21 Jan 2017, 3:33 p.m. A. Soroka, <aj...@virginia.edu>
>>>>> wrote:
>>>>>>>>>>
>>>>>>>>>>> Ditto. If there are other folks who would be interested in participating
>>>>>>>>>>> in something like this, now would be a good time to raise your voice!
>>>>>>>>>>>
>>>>>>>>>>> ---
>>>>>>>>>>> A. Soroka
>>>>>>>>>>> The University of Virginia Library
>>>>>>>>>>>
>>>>>>>>>>>
>>>>>>>>>>>> On Jan 21, 2017, at 10:20 AM, Andrew Valencik <an...@affin.io>
>>>>>>>> wrote:
>>>>>>>>>>>>
>>>>>>>>>>>> I think the "mandatory" documentation contribution is a great idea (and
>>>>>>>>>>>> very reasonable). I'd be happy to partake!
>>>>>>>>>>>> On Wed, Jan 18, 2017 at 11:13 AM Rafa Haro <rh...@apache.org>
>>>>>> wrote:
>>>>>>>>>>>>
>>>>>>>>>>>>> I wouldn't mind being involved in that, but it would be almost
>>>>>>>>>>>>> "mandatory" to contribute some developer documentation as an outcome
>>>>>>>>>>>>> of those meetings :-). @Rupert, we especially need you here :-)
>>>>>>>>>>>>>
>>>>>>>>>>>>> On Wed, Jan 18, 2017 at 4:09 PM A. Soroka <aj...@virginia.edu>
>>>>>>>> wrote:
>>>>>>>>>>>>>
>>>>>>>>>>>>>>> I agree that the barrier to contribution is very high. I recall having
>>>>>>>>>>>>>>> issues with the documentation initially and the only
>>>>>>>>>>>>>>> available book on Stanbol was not sufficient.
>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>> If there were renewed interest in bringing on other developers, I would
>>>>>>>>>>>>>>> be interested in investing the time to learn the codebase.
>>>>>>>>>>>>>>
>>>>>>>>>>>>>> I second this!
>>>>>>>>>>>>>>
>>>>>>>>>>>>>> Perhaps (I know it's very difficult to organize synchronous time for a
>>>>>>>>>>>>>> globally-distributed group but perhaps) we could try to organize a boot
>>>>>>>>>>>>>> camp meeting on-line? In other words, those people who (like Andrew and
>>>>>>>>>>>>>> myself) would be willing to contribute as part of a larger effort could
>>>>>>>>>>>>>> get some virtual time with one or more committers/PMC members to take an
>>>>>>>>>>>>>> in-depth tour of the system from the developer point of view and hear
>>>>>>>>>>>>>> about the outstanding architectural issues, maybe start to figure out
>>>>>>>>>>>>>> points of contribution.
>>>>>>>>>>>>>>
>>>>>>>>>>>>>> I realize this would make a lot of demands on the committers involved,
>>>>>>>>>>>>>> but it might be a way to inject some fuel into the effort.
>>>>>>>>>>>>>>
>>>>>>>>>>>>>> Just an idea...
>>>>>>>>>>>>>>
>>>>>>>>>>>>>> ---
>>>>>>>>>>>>>> A. Soroka
>>>>>>>>>>>>>> The University of Virginia Library
>>>>>>>>>>>>>>
>>>>>>>>>>>>>>> On Jan 18, 2017, at 9:54 AM, Andrew Valencik <an...@affin.io>
>>>>>>>> wrote:
>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>> Hello!
>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>> We use Stanbol in production to annotate text with entities as part of
>>>>>>>>>>>>>>> some of our data products.
>>>>>>>>>>>>>>> We do this via the REST API.
>>>>>>>>>>>>>>> Originally we were using the content hub to store all the documents
>>>>>>>>>>>>>>> but saw higher-than-expected failures.
>>>>>>>>>>>>>>> The entity engines seem a bit more resilient to varying content types /
>>>>>>>>>>>>>>> encoding.
>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>> I agree that the barrier to contribution is very high.
>>>>>>>>>>>>>>> I recall having issues with the documentation initially and the only
>>>>>>>>>>>>>>> available book on Stanbol was not sufficient.
>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>> If there were renewed interest in bringing on other developers, I would
>>>>>>>>>>>>>>> be interested in investing the time to learn the codebase.
>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>> Thanks!
>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>> On Wed, Jan 18, 2017 at 5:53 AM Raffaele Palmieri <
>>>>>>>>>>>>>>> raffaele.palmieri@gmail.com> wrote:
>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>> Dear community,
>>>>>>>>>>>>>>>> we are using version 0.12 with the content hub. I find Stanbol very
>>>>>>>>>>>>>>>> flexible for enhancing content, especially unstructured content.
>>>>>>>>>>>>>>>> As regards connections with other projects, we have given some
>>>>>>>>>>>>>>>> thought to using it with Apache Marmotta with NoSQL backends for big
>>>>>>>>>>>>>>>> data scenarios, and also with Apache ManifoldCF to enrich existing
>>>>>>>>>>>>>>>> document repositories, which is a common request.
>>>>>>>>>>>>>>>> Regards,
>>>>>>>>>>>>>>>> Raffaele.
>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>> 2017-01-16 22:41 GMT+01:00 Antero Duarte <
>>>>> a.fduarte1@gmail.com
>>>>>>> :
>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>> Hi there,
>>>>>>>>>>>>>>>>> Stanbol is very useful for me! Greatest and easiest tool for us to do
>>>>>>>>>>>>>>>>> NLP and linked data. Has there been any discussion to move towards a
>>>>>>>>>>>>>>>>> NoSQL storage solution, or is Solr still the best thing for us? Also,
>>>>>>>>>>>>>>>>> what about upgrading Solr? How much work would be involved in that?
>>>>>>>>>>>>>>>>> Anyway, great tool, really hope this doesn't die!!!
>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>> Regards,
>>>>>>>>>>>>>>>>> Antero
>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>> On Mon, 16 Jan 2017, 4:43 p.m. Bertrand Delacretaz, <
>>>>>>>>>>>>>>>>> bdelacretaz@apache.org>
>>>>>>>>>>>>>>>>> wrote:
>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>> On Mon, Jan 16, 2017 at 2:28 PM, Rafa Haro <
>>>>> rharo@apache.org>
>>>>>>>>>>>>> wrote:
>>>>>>>>>>>>>>>>>>> ...I participated
>>>>>>>>>>>>>>>>>>> also in the development of the Java client, so I could take the
>>>>>>>>>>>>>>>>>>> responsibility of bringing that one as well...
>>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>> FWIW, I won't be involved in decisions about this as I left the
>>>>>>>>>>>>>>>>>> Stanbol PMC a while ago - I'm just commenting from a community point
>>>>>>>>>>>>>>>>>> of view, as an experienced Apache member.
>>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>> A while ago Stanbol was "larger" and more focused on its core, but as
>>>>>>>>>>>>>>>>>> its community becomes smaller (IIUC) it's probably good to bring
>>>>>>>>>>>>>>>>>> everyone here, as much as possible, even if it means a slightly less
>>>>>>>>>>>>>>>>>> focused codebase. This can also help recruiting more active committers
>>>>>>>>>>>>>>>>>> and PMC members by involving them directly here.
>>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>> That might make Stanbol more sustainable, as a community of people who
>>>>>>>>>>>>>>>>>> need similar functionality.
>>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>> -Bertrand
>>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>> --
>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>> Andrew Valencik
>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>> Data Scientist
>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>> Affinio <http://www.affinio.com> | Twitter <
>>>>>>>>>>>>> http://twitter.com/valencik>
>>>>>>>>>>>>>> |
>>>>>>>>>>>>>>> LinkedIn <https://www.linkedin.com/in/andrew-valencik-472b2aa4
>>>>>>
>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>
>>>>>>>>>>>>>>
>>>>>>>>>>>>>
>>>>>>>>>>>> --
>>>>>>>>>>>>
>>>>>>>>>>>> Andrew Valencik
>>>>>>>>>>>>
>>>>>>>>>>>> Data Scientist
>>>>>>>>>>>>
>>>>>>>>>>>> Affinio <http://www.affinio.com> | Twitter <
>>>>>>>> http://twitter.com/valencik>
>>>>>>>>>>> |
>>>>>>>>>>>> LinkedIn <https://www.linkedin.com/in/andrew-valencik-472b2aa4>
>>>>>>>>>>>>
>>>>>>>>>>>>
>>>>>>>>>>>
>>>>>>>>>>>
>>>>>>>>>>>
>>>>>>>>>>>
>>>>>>>>>
>>>>>>>>
>>>>>>>>
>>>>>>>>
>>>>>>>>
>>>>>>>>
>>>>>>>>
>>>>>>
>>>>>>
>>>>>
>>>
>>
>
>
>
> --
> Fabian
> http://twitter.com/fctwitt

Re: The Future of Apache Stanbol

Posted by Fabian Christ <ch...@googlemail.com>.
Hi Stanbolers,

thanks for all the replies. This looks really promising and is a good
sign that so much interest still exists in using the Stanbol
technology.

Now, we have to figure out a way to get everyone on board to start
contributing and taking responsibility for the project.

Best
Fabian

2017-02-27 13:14 GMT+01:00 Andrea Nuzzolese <an...@istc.cnr.it>:
> Hi all,
>
> first, sorry for the late reply; I hope this email is still helpful to the discussion.
> At the Semantic Technology Laboratory of the CNR in Italy we have been using Apache Stanbol since its beginnings, and we still use it in different research projects dealing with semantic enhancement of content.
>
> Additionally, a few months ago we founded a start-up (basically a spin-off of the CNR) which reuses many parts of the Stanbol stack in its products.
> Hence, as a start-up we are really interested in the future of Apache Stanbol, and we can put in some effort by providing additional developers to contribute to its evolution.
>
> —
> Andrea Giovanni Nuzzolese, Ph.D.
> Semantic Technology Laboratory (STLab)
> Institute for Cognitive Science and Technology (ISTC)
> National Research Council (CNR)
> Via Nomentana 56, Roma - Italy
>
>> On 24 Feb 2017, at 16:08, A. Soroka <aj...@virginia.edu> wrote:
>>
>> Yes, it would certainly be best to have more than one opportunity, with such a global group. If you are not available on Monday, Rafa, let's not push for that date. Let's find a date to start with that is good for both you and Rupert.
>>
>> Rupert?
>>
>> ---
>> A. Soroka
>> The University of Virginia Library
>>
>>> On Feb 24, 2017, at 10:00 AM, Rafa Haro <rh...@apache.org> wrote:
>>>
>>> Hi,
>>>
>>> I won't be available next Monday, but maybe we can organize more than
>>> one :-). I insist: for this to make any sense, we need @Rupert
>>> to join these meetings. He is, by far, the most experienced Stanbol
>>> developer and the one with the broadest knowledge. Please, @Rupert, raise
>>> your voice, man :-)
>>>
>>> Cheers,
>>> Rafa
>>>
>>> On Fri, Feb 24, 2017 at 3:33 PM Michal Krajňanský <
>>> michal.krajnansky@gmail.com> wrote:
>>>
>>>> Hi Mr. Soroka,
>>>>
>>>> Thank you for the quick reply and the direct call proposal. It will be my
>>>> pleasure to meet you virtually.
>>>>
>>>> The suggested time, Monday 27th at 10AM EST, works well, so you may consider
>>>> it settled. I will also try to involve the managing director of our
>>>> department located in Brno, Czech Republic, Matej Dusik.
>>>>
>>>> I am looking forward to having a fruitful discussion.
>>>>
>>>> Best Regards,
>>>>
>>>> Michal Krajnansky
>>>>
>>>> On Fri, Feb 24, 2017 at 3:22 PM A. Soroka <aj...@virginia.edu> wrote:
>>>>
>>>>> We had a few volunteers to begin learning the codebase to take it forward,
>>>>> and willingness from at least some of the current committers to teach, but
>>>>> I think we need a bit more organization! :grin:
>>>>>
>>>>> I will make a concrete suggestion. Would it be possible to have a video
>>>>> call (perhaps with Google Hangout) about this on (just throwing out a date
>>>>> here) this coming week, Monday 27 February, at 10AM EST?
>>>>>
>>>>> I would be able to attend. I realize that we are a far-flung group, so I'm
>>>>> just throwing out that date to get us started. The most important thing, of
>>>>> course, is to get as many current committers involved as is practical.
>>>>>
>>>>> ---
>>>>> A. Soroka
>>>>> The University of Virginia Library
>>>>>
>>>>>> On Feb 24, 2017, at 8:50 AM, Michal Krajňanský <
>>>>> michal.krajnansky@gmail.com> wrote:
>>>>>>
>>>>>> Dear Stanbol users,
>>>>>>
>>>>>> I was wondering whether there were any results from the discussion about
>>>>>> the future of Apache Stanbol.
>>>>>>
>>>>>> I work for an R&D team at Konica Minolta Laboratory Europe, and we have
>>>>>> been using the Stanbol enhancement pipeline in our prototypes concerning
>>>>>> information extraction from unstructured data.
>>>>>>
>>>>>> We are highly interested in the continuing evolution of the Stanbol
>>>>>> project and willing to actively support it. Is there anyone who could
>>>>>> tell us what would be a good way to approach the existing Stanbol
>>>>>> stakeholders and support the project through active development and
>>>>>> possibly in other ways?
>>>>>>
>>>>>>
>>>>>> Michal Krajnansky
>>>>>> Research Specialist Junior, Konica Minolta Laboratory Europe
>>>>>>
>>>>>>
>>>>>>
>>>>>> On Mon, Jan 23, 2017 at 6:14 PM A. Soroka <aj...@virginia.edu> wrote:
>>>>>>
>>>>>>> Perhaps we can start a page for people to put their name down for
>>>> this?
>>>>> I
>>>>>>> couldn't find an Apache wiki site-- is there somewhere the developers
>>>>> think
>>>>>>> appropriate?
>>>>>>>
>>>>>>> ---
>>>>>>> A. Soroka
>>>>>>> Apache Jena / The University of Virginia Library
>>>>>>>
>>>>>>>> On Jan 23, 2017, at 12:08 PM, Aaron Coburn <ac...@amherst.edu>
>>>>> wrote:
>>>>>>>>
>>>>>>>> I would also be very interested.
>>>>>>>>
>>>>>>>> We use the entityhub component quite a bit at our institution, and I
>>>>>>> would be happy to be involved.
>>>>>>>>
>>>>>>>> Aaron Coburn
>>>>>>>>
>>>>>>>>
>>>>>>>>> On Jan 22, 2017, at 12:30 PM, Antero Duarte <a....@gmail.com>
>>>>>>> wrote:
>>>>>>>>>
>>>>>>>>> I would definitely be up for that!
>>>>>>>>>
>>>>>>>>> On Sat, 21 Jan 2017, 3:33 p.m. A. Soroka, <aj...@virginia.edu>
>>>> wrote:
>>>>>>>>>
>>>>>>>>>> Ditto. If there are other folks who would be interested in
>>>>>>> participating
>>>>>>>>>> in something like this, now would be a good time to raise your
>>>> voice!
>>>>>>>>>>
>>>>>>>>>> ---
>>>>>>>>>> A. Soroka
>>>>>>>>>> The University of Virginia Library
>>>>>>>>>>
>>>>>>>>>>
>>>>>>>>>>> On Jan 21, 2017, at 10:20 AM, Andrew Valencik <an...@affin.io>
>>>>>>> wrote:
>>>>>>>>>>>
>>>>>>>>>>> I think the "mandatory" documentation contribution is a great idea
>>>>>>> (and
>>>>>>>>>>> very reasonable). I'd be happy to partake!
>>>>>>>>>>> On Wed, Jan 18, 2017 at 11:13 AM Rafa Haro <rh...@apache.org>
>>>>> wrote:
>>>>>>>>>>>
>>>>>>>>>>>> I wouldn't mind being involved in that, but it would be almost
>>>>>>>>>> "mandatory"
>>>>>>>>>>>> to contribute some Developer Documentation as an outcome of those
>>>>>>> meetings
>>>>>>>>>>>> :-). @Rupert we especially need you here :-)
>>>>>>>>>>>>
>>>>>>>>>>>> On Wed, Jan 18, 2017 at 4:09 PM A. Soroka <aj...@virginia.edu>
>>>>>>> wrote:
>>>>>>>>>>>>
>>>>>>>>>>>>>> I agree that the barrier to contribution is very high. I recall
>>>>>>> having
>>>>>>>>>>>>> issues with the documentation initially and the only
>>>>>>>>>>>>>> available book on Stanbol was not sufficient.
>>>>>>>>>>>>>>
>>>>>>>>>>>>>> If there was renewed interest in bringing on other developers I
>>>>>>> would
>>>>>>>>>>>> be
>>>>>>>>>>>>> interested in investing the time to learn the codebase.
>>>>>>>>>>>>>
>>>>>>>>>>>>> I second this!
>>>>>>>>>>>>>
>>>>>>>>>>>>> Perhaps (I know it's very difficult to organize synchronous time
>>>>>>> for a
>>>>>>>>>>>>> globally-distributed group but perhaps) we could try to
>>>> organize a
>>>>>>> boot
>>>>>>>>>>>>> camp meeting on-line? In other words, those people who (like
>>>>> Andrew
>>>>>>> and
>>>>>>>>>>>>> myself) would be willing to contribute as part of a larger
>>>> effort
>>>>>>> could
>>>>>>>>>>>> get
>>>>>>>>>>>>> some virtual time with one or more committers/PMC members to
>>>> take
>>>>> an
>>>>>>>>>>>>> in-depth tour of the system from the developer point of view and
>>>>>>> hear
>>>>>>>>>>>> about
>>>>>>>>>>>>> the outstanding architectural issues, maybe start to figure out
>>>>>>> points
>>>>>>>>>> of
>>>>>>>>>>>>> contribution.
>>>>>>>>>>>>>
>>>>>>>>>>>>> I realize this would make a lot of demands on the committers
>>>>>>> involved,
>>>>>>>>>>>> but
>>>>>>>>>>>>> it might be a way to inject some fuel into the effort.
>>>>>>>>>>>>>
>>>>>>>>>>>>> Just an idea...
>>>>>>>>>>>>>
>>>>>>>>>>>>> ---
>>>>>>>>>>>>> A. Soroka
>>>>>>>>>>>>> The University of Virginia Library
>>>>>>>>>>>>>
>>>>>>>>>>>>>> On Jan 18, 2017, at 9:54 AM, Andrew Valencik <an...@affin.io>
>>>>>>> wrote:
>>>>>>>>>>>>>>
>>>>>>>>>>>>>> Hello!
>>>>>>>>>>>>>>
>>>>>>>>>>>>>> We use Stanbol in production to annotate text with entities as
>>>>>>> part of
>>>>>>>>>>>>> some
>>>>>>>>>>>>>> of our data products.
>>>>>>>>>>>>>> We do this via the REST API.
>>>>>>>>>>>>>> Originally we were using the content hub to store all the
>>>>> documents
>>>>>>>>>> but
>>>>>>>>>>>>> saw
>>>>>>>>>>>>>> higher than expected failures.
>>>>>>>>>>>>>> The entity engines seem a bit more resilient to varying content
>>>>>>> types
>>>>>>>>>> /
>>>>>>>>>>>>>> encoding.
>>>>>>>>>>>>>>
>>>>>>>>>>>>>> I agree that the barrier to contribution is very high.
>>>>>>>>>>>>>> I recall having issues with the documentation initially and the
>>>>>>> only
>>>>>>>>>>>>>> available book on Stanbol was not sufficient.
>>>>>>>>>>>>>>
>>>>>>>>>>>>>> If there was renewed interest in bringing on other developers I
>>>>>>> would
>>>>>>>>>>>> be
>>>>>>>>>>>>>> interested in investing the time to learn the codebase.
>>>>>>>>>>>>>>
>>>>>>>>>>>>>> Thanks!
>>>>>>>>>>>>>>
>>>>>>>>>>>>>> On Wed, Jan 18, 2017 at 5:53 AM Raffaele Palmieri <
>>>>>>>>>>>>>> raffaele.palmieri@gmail.com> wrote:
>>>>>>>>>>>>>>
>>>>>>>>>>>>>>> Dear community,
>>>>>>>>>>>>>>> we are using 0.12 version with content hub. I find Stanbol
>>>> very
>>>>>>>>>>>>> flexible to
>>>>>>>>>>>>>>> enhance content, especially unstructured content.
>>>>>>>>>>>>>>> Regarding connections with other projects, we have given
>>>>>>> some
>>>>>>>>>>>>>>> thought to its use with Apache Marmotta with nosql
>>>> backends
>>>>>>> for
>>>>>>>>>>>> big
>>>>>>>>>>>>>>> data scenarios. Also with Apache Manifold to implement
>>>>> enrichment
>>>>>>> of
>>>>>>>>>>>>>>> existent document repositories, that's a common request.
>>>>>>>>>>>>>>> Regards,
>>>>>>>>>>>>>>> Raffaele.
>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>> 2017-01-16 22:41 GMT+01:00 Antero Duarte <
>>>> a.fduarte1@gmail.com
>>>>>> :
>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>> Hi there,
>>>>>>>>>>>>>>>> Stanbol is very useful for me! Greatest and easiest tool for
>>>> us
>>>>>>> to
>>>>>>>>>> do
>>>>>>>>>>>>> NLP
>>>>>>>>>>>>>>>> and linked data. Has there been any discussion to move
>>>> towards
>>>>> a
>>>>>>>>>>>> nosql
>>>>>>>>>>>>>>>> storage solution, or is solr still the best thing for us?
>>>> Also,
>>>>>>> what
>>>>>>>>>>>>>>> about
>>>>>>>>>>>>>>>> upgrading solr? How much work would be involved in that?
>>>>> Anyway,
>>>>>>>>>>>> great
>>>>>>>>>>>>>>>> tool, really hope this doesn't die!!!
>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>> Regards,
>>>>>>>>>>>>>>>> Antero
>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>> On Mon, 16 Jan 2017, 4:43 p.m. Bertrand Delacretaz, <
>>>>>>>>>>>>>>>> bdelacretaz@apache.org>
>>>>>>>>>>>>>>>> wrote:
>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>> On Mon, Jan 16, 2017 at 2:28 PM, Rafa Haro <
>>>> rharo@apache.org>
>>>>>>>>>>>> wrote:
>>>>>>>>>>>>>>>>>> ...I participated
>>>>>>>>>>>>>>>>>> also in the development of the Java client, so I could take
>>>>> the
>>>>>>>>>>>>>>>>>> responsibility of bringing that one as well...
>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>> FWIW, I won't be involved in decisions about this as I left
>>>>> the
>>>>>>>>>>>>>>>>> Stanbol PMC a while ago - I'm just commenting from a
>>>> community
>>>>>>>>>> point
>>>>>>>>>>>>>>>>> of view, as an experienced Apache member.
>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>> A while ago Stanbol was "larger" and more focused on its
>>>> core,
>>>>>>> but
>>>>>>>>>>>> as
>>>>>>>>>>>>>>>>> its community becomes smaller (IIUC) it's probably good to
>>>>> bring
>>>>>>>>>>>>>>>>> everyone here, as much as possible, even if it means a
>>>>> slightly
>>>>>>>>>> less
>>>>>>>>>>>>>>>>> focused codebase. This can also help recruiting more active
>>>>>>>>>>>> committers
>>>>>>>>>>>>>>>>> and PMC members by involving them directly here.
>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>> That might make Stanbol more sustainable, as a community of
>>>>>>> people
>>>>>>>>>>>> who
>>>>>>>>>>>>>>>>> need similar functionality.
>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>> -Bertrand
>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>
>>>>>>>>>>>>>> --
>>>>>>>>>>>>>>
>>>>>>>>>>>>>> Andrew Valencik
>>>>>>>>>>>>>>
>>>>>>>>>>>>>> Data Scientist
>>>>>>>>>>>>>>
>>>>>>>>>>>>>> Affinio <http://www.affinio.com> | Twitter <
>>>>>>>>>>>> http://twitter.com/valencik>
>>>>>>>>>>>>> |
>>>>>>>>>>>>>> LinkedIn <https://www.linkedin.com/in/andrew-valencik-472b2aa4
>>>>>
>>>>>>>>>>>>>>
>>>>>>>>>>>>>>
>>>>>>>>>>>>>
>>>>>>>>>>>>>
>>>>>>>>>>>>
>>>>>>>>>>> --
>>>>>>>>>>>
>>>>>>>>>>> Andrew Valencik
>>>>>>>>>>>
>>>>>>>>>>> Data Scientist
>>>>>>>>>>>
>>>>>>>>>>> Affinio <http://www.affinio.com> | Twitter <
>>>>>>> http://twitter.com/valencik>
>>>>>>>>>> |
>>>>>>>>>>> LinkedIn <https://www.linkedin.com/in/andrew-valencik-472b2aa4>
>>>>>>>>>>>
>>>>>>>>>>>
>>>>>>>>>>
>>>>>>>>>>
>>>>>>>>>>
>>>>>>>>>>
>>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>
>>>>>
>>>>
>>
>



-- 
Fabian
http://twitter.com/fctwitt

Re: The Future of Apache Stanbol

Posted by Andrea Nuzzolese <an...@istc.cnr.it>.
Hi all,

First, sorry for the late reply; nevertheless, I hope this email is helpful to the discussion.
At the Semantic Technology Laboratory of the CNR in Italy we have been using Apache Stanbol since it was born, and we are still using it in different research projects dealing with semantic enhancement of content.

Additionally, a few months ago we founded a start-up (basically a spin-off of the CNR) which re-uses many parts of the Stanbol stack within its products.
Hence, as a start-up we are really interested in the future of Apache Stanbol, and we can put in some effort by providing additional developers to contribute to its evolution.

— 
Andrea Giovanni Nuzzolese, Ph.D.
Semantic Technology Laboratory (STLab)
Institute for Cognitive Science and Technology (ISTC)
National Research Council (CNR)
Via Nomentana 56, Roma - Italy

> On 24 Feb 2017, at 16:08, A. Soroka <aj...@virginia.edu> wrote:
> 
> Yes, it would certainly be best to have more than one opportunity, with such a global group. If you are not available on Monday, Rafa, let's not push for that date. Let's find a date to start with that is good for both you and Rupert.
> 
> Rupert?
> 
> ---
> A. Soroka
> The University of Virginia Library
> 
>> On Feb 24, 2017, at 10:00 AM, Rafa Haro <rh...@apache.org> wrote:
>> 
>> Hi,
>> 
>> I wouldn't be available next Monday, but maybe we can organize more than
>> one :-). I insist, in order to make any sense out of this we need @Rupert
>> to join these meetings. He is, by far, the most experienced Stanbol
>> developer and the one with the broadest knowledge. Please @Rupert, raise
>> your voice man :-)
>> 
>> Cheers,
>> Rafa
>> 
>> On Fri, Feb 24, 2017 at 3:33 PM Michal Krajňanský <
>> michal.krajnansky@gmail.com> wrote:
>> 
>>> Hi Mr. Soroka,
>>> 
>>> Thank you for the quick reply and the direct call proposal. It will be my pleasure
>>> to meet you virtually.
>>> 
>>> The suggested time Monday 27th 10AM EST works well, so you may consider it
>>> settled. I will also try to involve the managing director of our
>>> department located in Brno, Czech Republic, Matej Dusik.
>>> 
>>> I am looking forward to having a fruitful discussion.
>>> 
>>> Best Regards,
>>> 
>>> Michal Krajnansky
>>> 
>>> On Fri, Feb 24, 2017 at 3:22 PM A. Soroka <aj...@virginia.edu> wrote:
>>> 
>>>> We had a few volunteers to begin learning the codebase to take it
>>> forward,
>>>> and willingness from at least some of the current committers to teach,
>>> but
>>>> I think we need a bit more organization! :grin:
>>>> 
>>>> I will make a concrete suggestion. Would it be possible to have a video
>>>> call (perhaps with Google Hangout) about this on (just throwing out a
>>> date
>>>> here) this coming week, Monday 27 February, at 10AM EST?
>>>> 
>>>> I would be able to attend. I realize that we are a far-flung group, so
>>> I'm
>>>> just throwing out that date to get us started. The most important thing,
>>> of
>>>> course, is to get as many current committers involved as is practical.
>>>> 
>>>> ---
>>>> A. Soroka
>>>> The University of Virginia Library
>>>> 
>>>>> On Feb 24, 2017, at 8:50 AM, Michal Krajňanský <
>>>> michal.krajnansky@gmail.com> wrote:
>>>>> 
>>>>> Dear Stanbol users,
>>>>> 
>>>>> I was wondering, if there were any results of the discussion about the
>>>>> Apache Stanbol future.
>>>>> 
>>>>> I work for an R&D team of Konica Minolta Laboratory Europe, and we have
>>>> been
>>>>> using the Stanbol enhancement pipeline in our prototypes concerning
>>>> information
>>>>> extraction from unstructured data.
>>>>> 
>>>>> We are highly interested in the continuing evolution of the Stanbol
>>>>> project, and willing to actively support it. Is there anyone who could
>>>> tell
>>>>> us, what would be a good way to approach the existing Stanbol
>>>> stakeholders,
>>>>> and support the project by active development and possibly via other
>>>> ways?
>>>>> 
>>>>> 
>>>>> Michal Krajnansky
>>>>> Research Specialist Junior, Konica Minolta Laboratory Europe
>>>>> 
>>>>> 
>>>>> 
>>>>> On Mon, Jan 23, 2017 at 6:14 PM A. Soroka <aj...@virginia.edu> wrote:
>>>>> 
>>>>>> Perhaps we can start a page for people to put their name down for
>>> this?
>>>> I
>>>>>> couldn't find an Apache wiki site-- is there somewhere the developers
>>>> think
>>>>>> appropriate?
>>>>>> 
>>>>>> ---
>>>>>> A. Soroka
>>>>>> Apache Jena / The University of Virginia Library
>>>>>> 
>>>>>>> On Jan 23, 2017, at 12:08 PM, Aaron Coburn <ac...@amherst.edu>
>>>> wrote:
>>>>>>> 
>>>>>>> I would also be very interested.
>>>>>>> 
>>>>>>> We use the entityhub component quite a bit at our institution, and I
>>>>>> would be happy to be involved.
>>>>>>> 
>>>>>>> Aaron Coburn
>>>>>>> 
>>>>>>> 
>>>>>>>> On Jan 22, 2017, at 12:30 PM, Antero Duarte <a....@gmail.com>
>>>>>> wrote:
>>>>>>>> 
>>>>>>>> I would definitely be up for that!
>>>>>>>> 
>>>>>>>> On Sat, 21 Jan 2017, 3:33 p.m. A. Soroka, <aj...@virginia.edu>
>>> wrote:
>>>>>>>> 
>>>>>>>>> Ditto. If there are other folks who would be interested in
>>>>>> participating
>>>>>>>>> in something like this, now would be a good time to raise your
>>> voice!
>>>>>>>>> 
>>>>>>>>> ---
>>>>>>>>> A. Soroka
>>>>>>>>> The University of Virginia Library
>>>>>>>>> 
>>>>>>>>> 
>>>>>>>>>> On Jan 21, 2017, at 10:20 AM, Andrew Valencik <an...@affin.io>
>>>>>> wrote:
>>>>>>>>>> 
>>>>>>>>>> I think the "mandatory" documentation contribution is a great idea
>>>>>> (and
>>>>>>>>>> very reasonable). I'd be happy to partake!
>>>>>>>>>> On Wed, Jan 18, 2017 at 11:13 AM Rafa Haro <rh...@apache.org>
>>>> wrote:
>>>>>>>>>> 
>>>>>>>>>>> I wouldn't mind being involved in that, but it would be almost
>>>>>>>>> "mandatory"
>>>>>>>>>>> to contribute some Developer Documentation as an outcome of those
>>>>>> meetings
>>>>>>>>>>> :-). @Rupert we especially need you here :-)
>>>>>>>>>>> 
>>>>>>>>>>> On Wed, Jan 18, 2017 at 4:09 PM A. Soroka <aj...@virginia.edu>
>>>>>> wrote:
>>>>>>>>>>> 
>>>>>>>>>>>>> I agree that the barrier to contribution is very high. I recall
>>>>>> having
>>>>>>>>>>>> issues with the documentation initially and the only
>>>>>>>>>>>>> available book on Stanbol was not sufficient.
>>>>>>>>>>>>> 
>>>>>>>>>>>>> If there was renewed interest in bringing on other developers I
>>>>>> would
>>>>>>>>>>> be
>>>>>>>>>>>> interested in investing the time to learn the codebase.
>>>>>>>>>>>> 
>>>>>>>>>>>> I second this!
>>>>>>>>>>>> 
>>>>>>>>>>>> Perhaps (I know it's very difficult to organize synchronous time
>>>>>> for a
>>>>>>>>>>>> globally-distributed group but perhaps) we could try to
>>> organize a
>>>>>> boot
>>>>>>>>>>>> camp meeting on-line? In other words, those people who (like
>>>> Andrew
>>>>>> and
>>>>>>>>>>>> myself) would be willing to contribute as part of a larger
>>> effort
>>>>>> could
>>>>>>>>>>> get
>>>>>>>>>>>> some virtual time with one or more committers/PMC members to
>>> take
>>>> an
>>>>>>>>>>>> in-depth tour of the system from the developer point of view and
>>>>>> hear
>>>>>>>>>>> about
>>>>>>>>>>>> the outstanding architectural issues, maybe start to figure out
>>>>>> points
>>>>>>>>> of
>>>>>>>>>>>> contribution.
>>>>>>>>>>>> 
>>>>>>>>>>>> I realize this would make a lot of demands on the committers
>>>>>> involved,
>>>>>>>>>>> but
>>>>>>>>>>>> it might be a way to inject some fuel into the effort.
>>>>>>>>>>>> 
>>>>>>>>>>>> Just an idea...
>>>>>>>>>>>> 
>>>>>>>>>>>> ---
>>>>>>>>>>>> A. Soroka
>>>>>>>>>>>> The University of Virginia Library
>>>>>>>>>>>> 
>>>>>>>>>>>>> On Jan 18, 2017, at 9:54 AM, Andrew Valencik <an...@affin.io>
>>>>>> wrote:
>>>>>>>>>>>>> 
>>>>>>>>>>>>> Hello!
>>>>>>>>>>>>> 
>>>>>>>>>>>>> We use Stanbol in production to annotate text with entities as
>>>>>> part of
>>>>>>>>>>>> some
>>>>>>>>>>>>> of our data products.
>>>>>>>>>>>>> We do this via the REST API.
>>>>>>>>>>>>> Originally we were using the content hub to store all the
>>>> documents
>>>>>>>>> but
>>>>>>>>>>>> saw
>>>>>>>>>>>>> higher than expected failures.
>>>>>>>>>>>>> The entity engines seem a bit more resilient to varying content
>>>>>> types
>>>>>>>>> /
>>>>>>>>>>>>> encoding.
>>>>>>>>>>>>> 
>>>>>>>>>>>>> I agree that the barrier to contribution is very high.
>>>>>>>>>>>>> I recall having issues with the documentation initially and the
>>>>>> only
>>>>>>>>>>>>> available book on Stanbol was not sufficient.
>>>>>>>>>>>>> 
>>>>>>>>>>>>> If there was renewed interest in bringing on other developers I
>>>>>> would
>>>>>>>>>>> be
>>>>>>>>>>>>> interested in investing the time to learn the codebase.
>>>>>>>>>>>>> 
>>>>>>>>>>>>> Thanks!
>>>>>>>>>>>>> 
>>>>>>>>>>>>> On Wed, Jan 18, 2017 at 5:53 AM Raffaele Palmieri <
>>>>>>>>>>>>> raffaele.palmieri@gmail.com> wrote:
>>>>>>>>>>>>> 
>>>>>>>>>>>>>> Dear community,
>>>>>>>>>>>>>> we are using 0.12 version with content hub. I find Stanbol
>>> very
>>>>>>>>>>>> flexible to
>>>>>>>>>>>>>> enhance content, especially unstructured content.
>>>>>>>>>>>>>> Regarding connections with other projects, we have given
>>>>>> some
>>>>>>>>>>>>>> thought to its use with Apache Marmotta with nosql
>>> backends
>>>>>> for
>>>>>>>>>>> big
>>>>>>>>>>>>>> data scenarios. Also with Apache Manifold to implement
>>>> enrichment
>>>>>> of
>>>>>>>>>>>>>> existent document repositories, that's a common request.
>>>>>>>>>>>>>> Regards,
>>>>>>>>>>>>>> Raffaele.
>>>>>>>>>>>>>> 
>>>>>>>>>>>>>> 
>>>>>>>>>>>>>> 2017-01-16 22:41 GMT+01:00 Antero Duarte <
>>> a.fduarte1@gmail.com
>>>>> :
>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>> Hi there,
>>>>>>>>>>>>>>> Stanbol is very useful for me! Greatest and easiest tool for
>>> us
>>>>>> to
>>>>>>>>> do
>>>>>>>>>>>> NLP
>>>>>>>>>>>>>>> and linked data. Has there been any discussion to move
>>> towards
>>>> a
>>>>>>>>>>> nosql
>>>>>>>>>>>>>>> storage solution, or is solr still the best thing for us?
>>> Also,
>>>>>> what
>>>>>>>>>>>>>> about
>>>>>>>>>>>>>>> upgrading solr? How much work would be involved in that?
>>>> Anyway,
>>>>>>>>>>> great
>>>>>>>>>>>>>>> tool, really hope this doesn't die!!!
>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>> Regards,
>>>>>>>>>>>>>>> Antero
>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>> On Mon, 16 Jan 2017, 4:43 p.m. Bertrand Delacretaz, <
>>>>>>>>>>>>>>> bdelacretaz@apache.org>
>>>>>>>>>>>>>>> wrote:
>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>> On Mon, Jan 16, 2017 at 2:28 PM, Rafa Haro <
>>> rharo@apache.org>
>>>>>>>>>>> wrote:
>>>>>>>>>>>>>>>>> ...I participated
>>>>>>>>>>>>>>>>> also in the development of the Java client, so I could take
>>>> the
>>>>>>>>>>>>>>>>> responsibility of bringing that one as well...
>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>> FWIW, I won't be involved in decisions about this as I left
>>>> the
>>>>>>>>>>>>>>>> Stanbol PMC a while ago - I'm just commenting from a
>>> community
>>>>>>>>> point
>>>>>>>>>>>>>>>> of view, as an experienced Apache member.
>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>> A while ago Stanbol was "larger" and more focused on its
>>> core,
>>>>>> but
>>>>>>>>>>> as
>>>>>>>>>>>>>>>> its community becomes smaller (IIUC) it's probably good to
>>>> bring
>>>>>>>>>>>>>>>> everyone here, as much as possible, even if it means a
>>>> slightly
>>>>>>>>> less
>>>>>>>>>>>>>>>> focused codebase. This can also help recruiting more active
>>>>>>>>>>> committers
>>>>>>>>>>>>>>>> and PMC members by involving them directly here.
>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>> That might make Stanbol more sustainable, as a community of
>>>>>> people
>>>>>>>>>>> who
>>>>>>>>>>>>>>>> need similar functionality.
>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>> -Bertrand
>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>> 
>>>>>>>>>>>>> --
>>>>>>>>>>>>> 
>>>>>>>>>>>>> Andrew Valencik
>>>>>>>>>>>>> 
>>>>>>>>>>>>> Data Scientist
>>>>>>>>>>>>> 
>>>>>>>>>>>>> Affinio <http://www.affinio.com> | Twitter <
>>>>>>>>>>> http://twitter.com/valencik>
>>>>>>>>>>>> |
>>>>>>>>>>>>> LinkedIn <https://www.linkedin.com/in/andrew-valencik-472b2aa4
>>>> 
>>>>>>>>>>>>> 
>>>>>>>>>>>>> 
>>>>>>>>>>>> 
>>>>>>>>>>>> 
>>>>>>>>>>> 
>>>>>>>>>> --
>>>>>>>>>> 
>>>>>>>>>> Andrew Valencik
>>>>>>>>>> 
>>>>>>>>>> Data Scientist
>>>>>>>>>> 
>>>>>>>>>> Affinio <http://www.affinio.com> | Twitter <
>>>>>> http://twitter.com/valencik>
>>>>>>>>> |
>>>>>>>>>> LinkedIn <https://www.linkedin.com/in/andrew-valencik-472b2aa4>
>>>>>>>>>> 
>>>>>>>>>> 
>>>>>>>>> 
>>>>>>>>> 
>>>>>>>>> 
>>>>>>>>> 
>>>>>>> 
>>>>>> 
>>>>>> 
>>>>>> 
>>>>>> 
>>>>>> 
>>>>>> 
>>>> 
>>>> 
>>> 
> 


Re: The Future of Apache Stanbol

Posted by "A. Soroka" <aj...@virginia.edu>.
This is very encouraging. I think we are just waiting to hear from Rupert and Rafa to find a date that works for both of them.

Perhaps tomorrow or Wednesday?

---
A. Soroka
The University of Virginia Library

> On Feb 27, 2017, at 8:35 AM, Andrea Nuzzolese <an...@gmail.com> wrote:
> 
> Hi all,
> 
> First, sorry for the late reply; nevertheless, I hope this email is helpful to the discussion.
> At the Semantic Technology Laboratory of the CNR in Italy we have been using Apache Stanbol since it was born, and we are still using it in different research projects dealing with semantic enhancement of content.
> 
> Additionally, a few months ago we founded a start-up (basically a spin-off of the CNR) which re-uses many parts of the Stanbol stack within its products.
> Hence, as a start-up we are really interested in the future of Apache Stanbol, and we can put in some effort by providing additional developers to contribute to its evolution.
> 
> — 
> Andrea Giovanni Nuzzolese, Ph.D.
> Semantic Technology Laboratory (STLab)
> Institute for Cognitive Science and Technology (ISTC)
> National Research Council (CNR)
> Via Nomentana 56, Roma - Italy
> 
>> On 24 Feb 2017, at 16:08, A. Soroka <aj...@virginia.edu> wrote:
>> 
>> Yes, it would certainly be best to have more than one opportunity, with such a global group. If you are not available on Monday, Rafa, let's not push for that date. Let's find a date to start with that is good for both you and Rupert.
>> 
>> Rupert?
>> 
>> ---
>> A. Soroka
>> The University of Virginia Library
>> 
>>> On Feb 24, 2017, at 10:00 AM, Rafa Haro <rh...@apache.org> wrote:
>>> 
>>> Hi,
>>> 
>>> I wouldn't be available next Monday, but maybe we can organize more than
>>> one :-). I insist, in order to make any sense out of this we need @Rupert
>>> to join these meetings. He is, by far, the most experienced Stanbol
>>> developer and the one with the broadest knowledge. Please @Rupert, raise
>>> your voice man :-)
>>> 
>>> Cheers,
>>> Rafa
>>> 
>>> On Fri, Feb 24, 2017 at 3:33 PM Michal Krajňanský <
>>> michal.krajnansky@gmail.com> wrote:
>>> 
>>>> Hi Mr. Soroka,
>>>> 
>>>> Thank you for the quick reply and the direct call proposal. It will be my pleasure
>>>> to meet you virtually.
>>>> 
>>>> The suggested time Monday 27th 10AM EST works well, so you may consider it
>>>> settled. I will also try to involve the managing director of our
>>>> department located in Brno, Czech Republic, Matej Dusik.
>>>> 
>>>> I am looking forward to having a fruitful discussion.
>>>> 
>>>> Best Regards,
>>>> 
>>>> Michal Krajnansky
>>>> 
>>>> On Fri, Feb 24, 2017 at 3:22 PM A. Soroka <aj...@virginia.edu> wrote:
>>>> 
>>>>> We had a few volunteers to begin learning the codebase to take it
>>>> forward,
>>>>> and willingness from at least some of the current committers to teach,
>>>> but
>>>>> I think we need a bit more organization! :grin:
>>>>> 
>>>>> I will make a concrete suggestion. Would it be possible to have a video
>>>>> call (perhaps with Google Hangout) about this on (just throwing out a
>>>> date
>>>>> here) this coming week, Monday 27 February, at 10AM EST?
>>>>> 
>>>>> I would be able to attend. I realize that we are a far-flung group, so
>>>> I'm
>>>>> just throwing out that date to get us started. The most important thing,
>>>> of
>>>>> course, is to get as many current committers involved as is practical.
>>>>> 
>>>>> ---
>>>>> A. Soroka
>>>>> The University of Virginia Library
>>>>> 
>>>>>> On Feb 24, 2017, at 8:50 AM, Michal Krajňanský <
>>>>> michal.krajnansky@gmail.com> wrote:
>>>>>> 
>>>>>> Dear Stanbol users,
>>>>>> 
>>>>>> I was wondering, if there were any results of the discussion about the
>>>>>> Apache Stanbol future.
>>>>>> 
>>>>>> I work for an R&D team of Konica Minolta Laboratory Europe, and we have
>>>>> been
>>>>>> using the Stanbol enhancement pipeline in our prototypes concerning
>>>>> information
>>>>>> extraction from unstructured data.
>>>>>> 
>>>>>> We are highly interested in the continuing evolution of the Stanbol
>>>>>> project, and willing to actively support it. Is there anyone who could
>>>>> tell
>>>>>> us, what would be a good way to approach the existing Stanbol
>>>>> stakeholders,
>>>>>> and support the project by active development and possibly via other
>>>>> ways?
>>>>>> 
>>>>>> 
>>>>>> Michal Krajnansky
>>>>>> Research Specialist Junior, Konica Minolta Laboratory Europe
>>>>>> 
>>>>>> 
>>>>>> 
>>>>>> On Mon, Jan 23, 2017 at 6:14 PM A. Soroka <aj...@virginia.edu> wrote:
>>>>>> 
>>>>>>> Perhaps we can start a page for people to put their name down for
>>>> this?
>>>>> I
>>>>>>> couldn't find an Apache wiki site-- is there somewhere the developers
>>>>> think
>>>>>>> appropriate?
>>>>>>> 
>>>>>>> ---
>>>>>>> A. Soroka
>>>>>>> Apache Jena / The University of Virginia Library
>>>>>>> 
>>>>>>>> On Jan 23, 2017, at 12:08 PM, Aaron Coburn <ac...@amherst.edu>
>>>>> wrote:
>>>>>>>> 
>>>>>>>> I would also be very interested.
>>>>>>>> 
>>>>>>>> We use the entityhub component quite a bit at our institution, and I
>>>>>>> would be happy to be involved.
>>>>>>>> 
>>>>>>>> Aaron Coburn
>>>>>>>> 
>>>>>>>> 
>>>>>>>>> On Jan 22, 2017, at 12:30 PM, Antero Duarte <a....@gmail.com>
>>>>>>> wrote:
>>>>>>>>> 
>>>>>>>>> I would definitely be up for that!
>>>>>>>>> 
>>>>>>>>> On Sat, 21 Jan 2017, 3:33 p.m. A. Soroka, <aj...@virginia.edu>
>>>> wrote:
>>>>>>>>> 
>>>>>>>>>> Ditto. If there are other folks who would be interested in
>>>>>>> participating
>>>>>>>>>> in something like this, now would be a good time to raise your
>>>> voice!
>>>>>>>>>> 
>>>>>>>>>> ---
>>>>>>>>>> A. Soroka
>>>>>>>>>> The University of Virginia Library
>>>>>>>>>> 
>>>>>>>>>> 
>>>>>>>>>>> On Jan 21, 2017, at 10:20 AM, Andrew Valencik <an...@affin.io>
>>>>>>> wrote:
>>>>>>>>>>> 
>>>>>>>>>>> I think the "mandatory" documentation contribution is a great idea
>>>>>>> (and
>>>>>>>>>>> very reasonable). I'd be happy to partake!
>>>>>>>>>>> On Wed, Jan 18, 2017 at 11:13 AM Rafa Haro <rh...@apache.org>
>>>>> wrote:
>>>>>>>>>>> 
>>>>>>>>>>>> I wouldn't mind being involved in that, but it would be almost
>>>>>>>>>> "mandatory"
>>>>>>>>>>>> to contribute some Developer Documentation as an outcome of those
>>>>>>> meetings
>>>>>>>>>>>> :-). @Rupert we especially need you here :-)
>>>>>>>>>>>> 
>>>>>>>>>>>> On Wed, Jan 18, 2017 at 4:09 PM A. Soroka <aj...@virginia.edu>
>>>>>>> wrote:
>>>>>>>>>>>> 
>>>>>>>>>>>>>> I agree that the barrier to contribution is very high. I recall
>>>>>>> having
>>>>>>>>>>>>> issues with the documentation initially and the only
>>>>>>>>>>>>>> available book on Stanbol was not sufficient.
>>>>>>>>>>>>>> 
>>>>>>>>>>>>>> If there was renewed interest in bringing on other developers I
>>>>>>> would
>>>>>>>>>>>> be
>>>>>>>>>>>>> interested in investing the time to learn the codebase.
>>>>>>>>>>>>> 
>>>>>>>>>>>>> I second this!
>>>>>>>>>>>>> 
>>>>>>>>>>>>> Perhaps (I know it's very difficult to organize synchronous time
>>>>>>> for a
>>>>>>>>>>>>> globally-distributed group but perhaps) we could try to
>>>> organize a
>>>>>>> boot
>>>>>>>>>>>>> camp meeting on-line? In other words, those people who (like
>>>>> Andrew
>>>>>>> and
>>>>>>>>>>>>> myself) would be willing to contribute as part of a larger
>>>> effort
>>>>>>> could
>>>>>>>>>>>> get
>>>>>>>>>>>>> some virtual time with one or more committers/PMC members to
>>>> take
>>>>> an
>>>>>>>>>>>>> in-depth tour of the system from the developer point of view and
>>>>>>> hear
>>>>>>>>>>>> about
>>>>>>>>>>>>> the outstanding architectural issues, maybe start to figure out
>>>>>>> points
>>>>>>>>>> of
>>>>>>>>>>>>> contribution.
>>>>>>>>>>>>> 
>>>>>>>>>>>>> I realize this would make a lot of demands on the committers
>>>>>>> involved,
>>>>>>>>>>>> but
>>>>>>>>>>>>> it might be a way to inject some fuel into the effort.
>>>>>>>>>>>>> 
>>>>>>>>>>>>> Just an idea...
>>>>>>>>>>>>> 
>>>>>>>>>>>>> ---
>>>>>>>>>>>>> A. Soroka
>>>>>>>>>>>>> The University of Virginia Library
>>>>>>>>>>>>> 
>>>>>>>>>>>>>> On Jan 18, 2017, at 9:54 AM, Andrew Valencik <an...@affin.io>
>>>>>>> wrote:
>>>>>>>>>>>>>> 
>>>>>>>>>>>>>> Hello!
>>>>>>>>>>>>>> 
>>>>>>>>>>>>>> We use Stanbol in production to annotate text with entities as
>>>>>>> part of
>>>>>>>>>>>>> some
>>>>>>>>>>>>>> of our data products.
>>>>>>>>>>>>>> We do this via the REST API.
>>>>>>>>>>>>>> Originally we were using the content hub to store all the
>>>>> documents
>>>>>>>>>> but
>>>>>>>>>>>>> saw
>>>>>>>>>>>>>> higher than expected failures.
>>>>>>>>>>>>>> The entity engines seem a bit more resilient to varying content
>>>>>>> types
>>>>>>>>>> /
>>>>>>>>>>>>>> encoding.
>>>>>>>>>>>>>> 
>>>>>>>>>>>>>> I agree that the barrier to contribution is very high.
>>>>>>>>>>>>>> I recall having issues with the documentation initially and the
>>>>>>> only
>>>>>>>>>>>>>> available book on Stanbol was not sufficient.
>>>>>>>>>>>>>> 
>>>>>>>>>>>>>> If there was renewed interest in bringing on other developers I
>>>>>>> would
>>>>>>>>>>>> be
>>>>>>>>>>>>>> interested in investing the time to learn the codebase.
>>>>>>>>>>>>>> 
>>>>>>>>>>>>>> Thanks!
>>>>>>>>>>>>>> 
>>>>>>>>>>>>>> On Wed, Jan 18, 2017 at 5:53 AM Raffaele Palmieri <
>>>>>>>>>>>>>> raffaele.palmieri@gmail.com> wrote:
>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>> Dear community,
>>>>>>>>>>>>>>> we are using 0.12 version with content hub. I find Stanbol
>>>> very
>>>>>>>>>>>>> flexible to
>>>>>>>>>>>>>>> enhance content, specially those unstructured ones.
>>>>>>>>>>>>>>> For that regards connections with other projects, we have done
>>>>>>> some
>>>>>>>>>>>>>>> thoughts about the use with Apache Marmotta with nosql
>>>> backends
>>>>>>> for
>>>>>>>>>>>> big
>>>>>>>>>>>>>>> data scenarios. Also with Apache Manifold to implement
>>>>> enrichment
>>>>>>> of
>>>>>>>>>>>>>>> existent document repositories, that's a common request.
>>>>>>>>>>>>>>> Regards,
>>>>>>>>>>>>>>> Raffaele.
>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>> 2017-01-16 22:41 GMT+01:00 Antero Duarte <
>>>> a.fduarte1@gmail.com
>>>>>> :
>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>> Hi there,
>>>>>>>>>>>>>>>> Stanbol is very useful for me! Greatest and easiest tool for
>>>> us
>>>>>>> to
>>>>>>>>>> do
>>>>>>>>>>>>> NLP
>>>>>>>>>>>>>>>> and linked data. Has there been any discussion to move
>>>> towards
>>>>> a
>>>>>>>>>>>> nosql
>>>>>>>>>>>>>>>> storage solution, or is solr still the best thing for us?
>>>> Also,
>>>>>>> what
>>>>>>>>>>>>>>> about
>>>>>>>>>>>>>>>> upgrading solr? How much work would be involved in that?
>>>>> Anyway,
>>>>>>>>>>>> great
>>>>>>>>>>>>>>>> tool, really hope this doesn't die!!!
>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>> Regards,
>>>>>>>>>>>>>>>> Antero
>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>> On Mon, 16 Jan 2017, 4:43 p.m. Bertrand Delacretaz, <
>>>>>>>>>>>>>>>> bdelacretaz@apache.org>
>>>>>>>>>>>>>>>> wrote:
>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>>> On Mon, Jan 16, 2017 at 2:28 PM, Rafa Haro <
>>>> rharo@apache.org>
>>>>>>>>>>>> wrote:
>>>>>>>>>>>>>>>>>> ...I participated
>>>>>>>>>>>>>>>>>> also in the development of the Java client, so I could take
>>>>> the
>>>>>>>>>>>>>>>>>> responsibility of bringing that one as well...
>>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>>> FWIW, I won't be involved in decisions about this as I left
>>>>> the
>>>>>>>>>>>>>>>>> Stanbol PMC a while ago - I'm just commenting from a
>>>> community
>>>>>>>>>> point
>>>>>>>>>>>>>>>>> of view, as an experienced Apache member.
>>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>>> A while ago Stanbol was "larger" and more focused on its
>>>> core,
>>>>>>> but
>>>>>>>>>>>> as
>>>>>>>>>>>>>>>>> its community becomes smaller (IIUC) it's probably good to
>>>>> bring
>>>>>>>>>>>>>>>>> everyone here, as much as possible, even if it means a
>>>>> slightly
>>>>>>>>>> less
>>>>>>>>>>>>>>>>> focused codebase. This can also help recruiting more active
>>>>>>>>>>>> committers
>>>>>>>>>>>>>>>>> and PMC members by involving them directly here.
>>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>>> That might make Stanbol more sustainable, as a community of
>>>>>>> people
>>>>>>>>>>>> who
>>>>>>>>>>>>>>>>> need similar functionality.
>>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>>> -Bertrand
>>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>> --
>>>>>>>>>>>>>> 
>>>>>>>>>>>>>> Andrew Valencik
>>>>>>>>>>>>>> 
>>>>>>>>>>>>>> Data Scientist
>>>>>>>>>>>>>> 
>>>>>>>>>>>>>> Affinio <http://www.affinio.com> | Twitter <
>>>>>>>>>>>> http://twitter.com/valencik>
>>>>>>>>>>>>> |
>>>>>>>>>>>>>> LinkedIn <https://www.linkedin.com/in/andrew-valencik-472b2aa4
>>>>> 
>>>>>>>>>>>>>> 
>>>>>>>>>>>>>> 
>>>>>>>>>>>>> 
>>>>>>>>>>>>> 
>>>>>>>>>>>> 
>>>>>>>>>>> --
>>>>>>>>>>> 
>>>>>>>>>>> Andrew Valencik
>>>>>>>>>>> 
>>>>>>>>>>> Data Scientist
>>>>>>>>>>> 
>>>>>>>>>>> Affinio <http://www.affinio.com> | Twitter <
>>>>>>> http://twitter.com/valencik>
>>>>>>>>>> |
>>>>>>>>>>> LinkedIn <https://www.linkedin.com/in/andrew-valencik-472b2aa4>
>>>>>>>>>>> 
>>>>>>>>>>> 
>>>>>>>>>> 
>>>>>>>>>> 
>>>>>>>>>> 
>>>>>>>>>> 
>>>>>>>> 
>>>>>>> 
>>>>>>> 
>>>>>>> 
>>>>>>> 
>>>>>>> 
>>>>>>> 
>>>>> 
>>>>> 
>>>> 
>> 
> 


Re: The Future of Apache Stanbol

Posted by Andrea Nuzzolese <an...@gmail.com>.
Hi all,

First, sorry for the late reply; I hope this email can nevertheless be helpful to the discussion.
At the Semantic Technology Laboratory of the CNR in Italy, we have been using Apache Stanbol since its inception, and we still use it in several research projects dealing with semantic enhancement of content.

Additionally, a few months ago we founded a start-up (basically a spin-off of the CNR) that reuses many parts of the Stanbol stack in its products.
Hence, as a start-up, we are very interested in the future of Apache Stanbol, and we can contribute effort by providing additional developers to work on its evolution.

— 
Andrea Giovanni Nuzzolese, Ph.D.
Semantic Technology Laboratory (STLab)
Institute for Cognitive Science and Technology (ISTC)
National Research Council (CNR)
Via Nomentana 56, Roma - Italy

> On 24 Feb 2017, at 16:08, A. Soroka <aj...@virginia.edu> wrote:
> 
> Yes, it would certainly be best to have more than one opportunity, with such a global group. If you are not available on Monday, Rafa, let's not push for that date. Let's find a date to start with that is good for both you and Rupert.
> 
> Rupert?
> 
> ---
> A. Soroka
> The University of Virginia Library
> 
>> On Feb 24, 2017, at 10:00 AM, Rafa Haro <rh...@apache.org> wrote:
>> 
>> Hi,
>> 
>> I won't be available next Monday, but maybe we can organize more than
>> one :-). I insist: for this to make any sense, we need @Rupert
>> to join these meetings. He is, by far, the most experienced Stanbol
>> developer and the one with the broadest knowledge. Please, @Rupert, raise
>> your voice, man :-)
>> 
>> Cheers,
>> Rafa
>> 
>> On Fri, Feb 24, 2017 at 3:33 PM Michal Krajňanský <
>> michal.krajnansky@gmail.com> wrote:
>> 
>>> Hi Mr. Soroka,
>>> 
>>> Thank you for the quick reply and the direct call proposal. It will be my
>>> pleasure to meet you virtually.
>>> 
>>> The suggested time, Monday 27th at 10AM EST, works well, so you may consider
>>> it settled. I will also try to involve the managing director of our
>>> department located in Brno, Czech Republic, Matej Dusik.
>>> 
>>> I am looking forward to having a fruitful discussion.
>>> 
>>> Best Regards,
>>> 
>>> Michal Krajnansky
>>> 
>>> On Fri, Feb 24, 2017 at 3:22 PM A. Soroka <aj...@virginia.edu> wrote:
>>> 
>>>> We had a few volunteers to begin learning the codebase to take it forward,
>>>> and willingness from at least some of the current committers to teach, but
>>>> I think we need a bit more organization! :grin:
>>>> 
>>>> I will make a concrete suggestion. Would it be possible to have a video
>>>> call (perhaps with Google Hangout) about this on (just throwing out a date
>>>> here) this coming week, Monday 27 February, at 10AM EST?
>>>> 
>>>> I would be able to attend. I realize that we are a far-flung group, so I'm
>>>> just throwing out that date to get us started. The most important thing, of
>>>> course, is to get as many current committers involved as is practical.
>>>> 
>>>> ---
>>>> A. Soroka
>>>> The University of Virginia Library
>>>> 
>>>>> On Feb 24, 2017, at 8:50 AM, Michal Krajňanský <
>>>> michal.krajnansky@gmail.com> wrote:
>>>>> 
>>>>> Dear Stanbol users,
>>>>> 
>>>>> I was wondering whether there were any results from the discussion about
>>>>> the future of Apache Stanbol.
>>>>> 
>>>>> I work for an R&D team at Konica Minolta Laboratory Europe, and we have been
>>>>> using the Stanbol enhancement pipeline in our prototypes concerning
>>>>> information extraction from unstructured data.
>>>>> 
>>>>> We are highly interested in the continuing evolution of the Stanbol
>>>>> project, and willing to actively support it. Is there anyone who could tell
>>>>> us what would be a good way to approach the existing Stanbol stakeholders,
>>>>> and to support the project through active development and possibly in other ways?
>>>>> 
>>>>> 
>>>>> Michal Krajnansky
>>>>> Research Specialist Junior, Konica Minolta Laboratory Europe
>>>>> 
>>>>> 
>>>>> 
>>>>> On Mon, Jan 23, 2017 at 6:14 PM A. Soroka <aj...@virginia.edu> wrote:
>>>>> 
>>>>>> Perhaps we can start a page for people to put their name down for
>>> this?
>>>> I
>>>>>> couldn't find an Apache wiki site-- is there somewhere the developers
>>>> think
>>>>>> appropriate?
>>>>>> 
>>>>>> ---
>>>>>> A. Soroka
>>>>>> Apache Jena / The University of Virginia Library
>>>>>> 
>>>>>>> On Jan 23, 2017, at 12:08 PM, Aaron Coburn <ac...@amherst.edu>
>>>> wrote:
>>>>>>> 
>>>>>>> I would also be very interested.
>>>>>>> 
>>>>>>> We use the entityhub component quite a bit at our institution, and I
>>>>>> would be happy to be involved.
>>>>>>> 
>>>>>>> Aaron Coburn
>>>>>>> 
>>>>>>> 
>>>>>>>> On Jan 22, 2017, at 12:30 PM, Antero Duarte <a....@gmail.com>
>>>>>> wrote:
>>>>>>>> 
>>>>>>>> I would definitely be up for that!
>>>>>>>> 
>>>>>>>> On Sat, 21 Jan 2017, 3:33 p.m. A. Soroka, <aj...@virginia.edu>
>>> wrote:
>>>>>>>> 
>>>>>>>>> Ditto. If there are other folks who would be interested in
>>>>>> participating
>>>>>>>>> in something like this, now would be a good time to raise your
>>> voice!
>>>>>>>>> 
>>>>>>>>> ---
>>>>>>>>> A. Soroka
>>>>>>>>> The University of Virginia Library
>>>>>>>>> 
>>>>>>>>> 
>>>>>>>>>> On Jan 21, 2017, at 10:20 AM, Andrew Valencik <an...@affin.io>
>>>>>> wrote:
>>>>>>>>>> 
>>>>>>>>>> I think the "mandatory" documentation contribution is a great idea
>>>>>> (and
>>>>>>>>>> very reasonable). I'd be happy to partake!
>>>>>>>>>> On Wed, Jan 18, 2017 at 11:13 AM Rafa Haro <rh...@apache.org>
>>>> wrote:
>>>>>>>>>> 
>>>>>>>>>>> I wouldn't mind to be involved in that but it would be almost
>>>>>>>>> "mandatory"
>>>>>>>>>>> to contribute some Developer Documentation as outcome of those
>>>>>> meetings
>>>>>>>>>>> :-). @Rupert we specially need you here :-)
>>>>>>>>>>> 
>>>>>>>>>>> On Wed, Jan 18, 2017 at 4:09 PM A. Soroka <aj...@virginia.edu>
>>>>>> wrote:
>>>>>>>>>>> 
>>>>>>>>>>>>> I agree that the barrier to contribution is very high. I recall
>>>>>> having
>>>>>>>>>>>> issues with the documentation initially and the only
>>>>>>>>>>>>> available book on Stanbol was not sufficient.
>>>>>>>>>>>>> 
>>>>>>>>>>>>> If there was renewed interest in bringing on other developers I would be
>>>>>>>>>>>>> interested in investing the time to learn the codebase.
>>>>>>>>>>>> 
>>>>>>>>>>>> I second this!
>>>>>>>>>>>> 
>>>>>>>>>>>> Perhaps (I know it's very difficult to organize synchronous time
>>>>>> for a
>>>>>>>>>>>> globally-distributed group but perhaps) we could try to
>>> organize a
>>>>>> boot
>>>>>>>>>>>> camp meeting on-line? In other words, those people who (like
>>>> Andrew
>>>>>> and
>>>>>>>>>>>> myself) would be willing to contribute as part of a larger
>>> effort
>>>>>> could
>>>>>>>>>>> get
>>>>>>>>>>>> some virtual time with one or more committers/PMC members to
>>> take
>>>> an
>>>>>>>>>>>> in-depth tour of the system from the developer point of view and
>>>>>> hear
>>>>>>>>>>> about
>>>>>>>>>>>> the outstanding architectural issues, maybe start to figure out
>>>>>> points
>>>>>>>>> of
>>>>>>>>>>>> contribution.
>>>>>>>>>>>> 
>>>>>>>>>>>> I realize this would make a lot of demands on the committers
>>>>>> involved,
>>>>>>>>>>> but
>>>>>>>>>>>> it might be a way to inject some fuel into the effort.
>>>>>>>>>>>> 
>>>>>>>>>>>> Just an idea...
>>>>>>>>>>>> 
>>>>>>>>>>>> ---
>>>>>>>>>>>> A. Soroka
>>>>>>>>>>>> The University of Virginia Library
>>>>>>>>>>>> 
>>>>>>>>>>>>> On Jan 18, 2017, at 9:54 AM, Andrew Valencik <an...@affin.io>
>>>>>> wrote:
>>>>>>>>>>>>> 
>>>>>>>>>>>>> Hello!
>>>>>>>>>>>>> 
>>>>>>>>>>>>> We use Stanbol in production to annotate text with entities as
>>>>>> part of
>>>>>>>>>>>> some
>>>>>>>>>>>>> of our data products.
>>>>>>>>>>>>> We do this via the REST API.
>>>>>>>>>>>>> Originally we were using the content hub to store all the
>>>> documents
>>>>>>>>> but
>>>>>>>>>>>> saw
>>>>>>>>>>>>> higher than expected failures.
>>>>>>>>>>>>> The entity engines seem a bit more resilient to varying content
>>>>>> types
>>>>>>>>> /
>>>>>>>>>>>>> encoding.
>>>>>>>>>>>>> 
>>>>>>>>>>>>> I agree that the barrier to contribution is very high.
>>>>>>>>>>>>> I recall having issues with the documentation initially and the
>>>>>> only
>>>>>>>>>>>>> available book on Stanbol was not sufficient.
>>>>>>>>>>>>> 
>>>>>>>>>>>>> If there was renewed interest in bringing on other developers I would be
>>>>>>>>>>>>> interested in investing the time to learn the codebase.
>>>>>>>>>>>>> 
>>>>>>>>>>>>> Thanks!
>>>>>>>>>>>>> 
>>>>>>>>>>>>> On Wed, Jan 18, 2017 at 5:53 AM Raffaele Palmieri <
>>>>>>>>>>>>> raffaele.palmieri@gmail.com> wrote:
>>>>>>>>>>>>> 
>>>>>>>>>>>>>> Dear community,
>>>>>>>>>>>>>> we are using 0.12 version with content hub. I find Stanbol
>>> very
>>>>>>>>>>>> flexible to
>>>>>>>>>>>>>> enhance content, specially those unstructured ones.
>>>>>>>>>>>>>> For that regards connections with other projects, we have done
>>>>>> some
>>>>>>>>>>>>>> thoughts about the use with Apache Marmotta with nosql
>>> backends
>>>>>> for
>>>>>>>>>>> big
>>>>>>>>>>>>>> data scenarios. Also with Apache Manifold to implement
>>>> enrichment
>>>>>> of
>>>>>>>>>>>>>> existent document repositories, that's a common request.
>>>>>>>>>>>>>> Regards,
>>>>>>>>>>>>>> Raffaele.
>>>>>>>>>>>>>> 
>>>>>>>>>>>>>> 
>>>>>>>>>>>>>> 2017-01-16 22:41 GMT+01:00 Antero Duarte <
>>> a.fduarte1@gmail.com
>>>>> :
>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>> Hi there,
>>>>>>>>>>>>>>> Stanbol is very useful for me! Greatest and easiest tool for
>>> us
>>>>>> to
>>>>>>>>> do
>>>>>>>>>>>> NLP
>>>>>>>>>>>>>>> and linked data. Has there been any discussion to move
>>> towards
>>>> a
>>>>>>>>>>> nosql
>>>>>>>>>>>>>>> storage solution, or is solr still the best thing for us?
>>> Also,
>>>>>> what
>>>>>>>>>>>>>> about
>>>>>>>>>>>>>>> upgrading solr? How much work would be involved in that?
>>>> Anyway,
>>>>>>>>>>> great
>>>>>>>>>>>>>>> tool, really hope this doesn't die!!!
>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>> Regards,
>>>>>>>>>>>>>>> Antero
>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>> On Mon, 16 Jan 2017, 4:43 p.m. Bertrand Delacretaz, <
>>>>>>>>>>>>>>> bdelacretaz@apache.org>
>>>>>>>>>>>>>>> wrote:
>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>> On Mon, Jan 16, 2017 at 2:28 PM, Rafa Haro <
>>> rharo@apache.org>
>>>>>>>>>>> wrote:
>>>>>>>>>>>>>>>>> ...I participated
>>>>>>>>>>>>>>>>> also in the development of the Java client, so I could take
>>>> the
>>>>>>>>>>>>>>>>> responsibility of bringing that one as well...
>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>> FWIW, I won't be involved in decisions about this as I left
>>>> the
>>>>>>>>>>>>>>>> Stanbol PMC a while ago - I'm just commenting from a
>>> community
>>>>>>>>> point
>>>>>>>>>>>>>>>> of view, as an experienced Apache member.
>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>> A while ago Stanbol was "larger" and more focused on its
>>> core,
>>>>>> but
>>>>>>>>>>> as
>>>>>>>>>>>>>>>> its community becomes smaller (IIUC) it's probably good to
>>>> bring
>>>>>>>>>>>>>>>> everyone here, as much as possible, even if it means a
>>>> slightly
>>>>>>>>> less
>>>>>>>>>>>>>>>> focused codebase. This can also help recruiting more active
>>>>>>>>>>> committers
>>>>>>>>>>>>>>>> and PMC members by involving them directly here.
>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>> That might make Stanbol more sustainable, as a community of
>>>>>> people
>>>>>>>>>>> who
>>>>>>>>>>>>>>>> need similar functionality.
>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>> -Bertrand
>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>> 
>>>>>>>>>>>>> --
>>>>>>>>>>>>> 
>>>>>>>>>>>>> Andrew Valencik
>>>>>>>>>>>>> 
>>>>>>>>>>>>> Data Scientist
>>>>>>>>>>>>> 
>>>>>>>>>>>>> Affinio <http://www.affinio.com> | Twitter <
>>>>>>>>>>> http://twitter.com/valencik>
>>>>>>>>>>>> |
>>>>>>>>>>>>> LinkedIn <https://www.linkedin.com/in/andrew-valencik-472b2aa4
>>>> 
>>>>>>>>>>>>> 
>>>>>>>>>>>>> 
>>>>>>>>>>>> 
>>>>>>>>>>>> 
>>>>>>>>>>> 
>>>>>>>>>> --
>>>>>>>>>> 
>>>>>>>>>> Andrew Valencik
>>>>>>>>>> 
>>>>>>>>>> Data Scientist
>>>>>>>>>> 
>>>>>>>>>> Affinio <http://www.affinio.com> | Twitter <
>>>>>> http://twitter.com/valencik>
>>>>>>>>> |
>>>>>>>>>> LinkedIn <https://www.linkedin.com/in/andrew-valencik-472b2aa4>
>>>>>>>>>> 
>>>>>>>>>> 
>>>>>>>>> 
>>>>>>>>> 
>>>>>>>>> 
>>>>>>>>> 
>>>>>>> 
>>>>>> 
>>>>>> 
>>>>>> 
>>>>>> 
>>>>>> 
>>>>>> 
>>>> 
>>>> 
>>> 
> 


Re: The Future of Apache Stanbol

Posted by "A. Soroka" <aj...@virginia.edu>.
Yes, it would certainly be best to have more than one opportunity, with such a global group. If you are not available on Monday, Rafa, let's not push for that date. Let's find a date to start with that is good for both you and Rupert.

Rupert?

---
A. Soroka
The University of Virginia Library

> On Feb 24, 2017, at 10:00 AM, Rafa Haro <rh...@apache.org> wrote:
> 
> Hi,
> 
> I won't be available next Monday, but maybe we can organize more than
> one :-). I insist: for this to make any sense, we need @Rupert
> to join these meetings. He is, by far, the most experienced Stanbol
> developer and the one with the broadest knowledge. Please, @Rupert, raise
> your voice, man :-)
> 
> Cheers,
> Rafa
> 
> On Fri, Feb 24, 2017 at 3:33 PM Michal Krajňanský <
> michal.krajnansky@gmail.com> wrote:
> 
>> Hi Mr. Soroka,
>> 
>> Thank you for the quick reply and the direct call proposal. It will be my
>> pleasure to meet you virtually.
>> 
>> The suggested time, Monday 27th at 10AM EST, works well, so you may consider
>> it settled. I will also try to involve the managing director of our
>> department located in Brno, Czech Republic, Matej Dusik.
>> 
>> I am looking forward to having a fruitful discussion.
>> 
>> Best Regards,
>> 
>> Michal Krajnansky
>> 
>> On Fri, Feb 24, 2017 at 3:22 PM A. Soroka <aj...@virginia.edu> wrote:
>> 
>>> We had a few volunteers to begin learning the codebase to take it forward,
>>> and willingness from at least some of the current committers to teach, but
>>> I think we need a bit more organization! :grin:
>>> 
>>> I will make a concrete suggestion. Would it be possible to have a video
>>> call (perhaps with Google Hangout) about this on (just throwing out a date
>>> here) this coming week, Monday 27 February, at 10AM EST?
>>> 
>>> I would be able to attend. I realize that we are a far-flung group, so I'm
>>> just throwing out that date to get us started. The most important thing, of
>>> course, is to get as many current committers involved as is practical.
>>> 
>>> ---
>>> A. Soroka
>>> The University of Virginia Library
>>> 
>>>> On Feb 24, 2017, at 8:50 AM, Michal Krajňanský <
>>> michal.krajnansky@gmail.com> wrote:
>>>> 
>>>> Dear Stanbol users,
>>>> 
>>>> I was wondering whether there were any results from the discussion about
>>>> the future of Apache Stanbol.
>>>> 
>>>> I work for an R&D team at Konica Minolta Laboratory Europe, and we have been
>>>> using the Stanbol enhancement pipeline in our prototypes concerning
>>>> information extraction from unstructured data.
>>>> 
>>>> We are highly interested in the continuing evolution of the Stanbol
>>>> project, and willing to actively support it. Is there anyone who could tell
>>>> us what would be a good way to approach the existing Stanbol stakeholders,
>>>> and to support the project through active development and possibly in other ways?
>>>> 
>>>> 
>>>> Michal Krajnansky
>>>> Research Specialist Junior, Konica Minolta Laboratory Europe
>>>> 
>>>> 
>>>> 
>>>> On Mon, Jan 23, 2017 at 6:14 PM A. Soroka <aj...@virginia.edu> wrote:
>>>> 
>>>>> Perhaps we can start a page for people to put their name down for
>> this?
>>> I
>>>>> couldn't find an Apache wiki site-- is there somewhere the developers
>>> think
>>>>> appropriate?
>>>>> 
>>>>> ---
>>>>> A. Soroka
>>>>> Apache Jena / The University of Virginia Library
>>>>> 
>>>>>> On Jan 23, 2017, at 12:08 PM, Aaron Coburn <ac...@amherst.edu>
>>> wrote:
>>>>>> 
>>>>>> I would also be very interested.
>>>>>> 
>>>>>> We use the entityhub component quite a bit at our institution, and I
>>>>> would be happy to be involved.
>>>>>> 
>>>>>> Aaron Coburn
>>>>>> 
>>>>>> 
>>>>>>> On Jan 22, 2017, at 12:30 PM, Antero Duarte <a....@gmail.com>
>>>>> wrote:
>>>>>>> 
>>>>>>> I would definitely be up for that!
>>>>>>> 
>>>>>>> On Sat, 21 Jan 2017, 3:33 p.m. A. Soroka, <aj...@virginia.edu>
>> wrote:
>>>>>>> 
>>>>>>>> Ditto. If there are other folks who would be interested in
>>>>> participating
>>>>>>>> in something like this, now would be a good time to raise your
>> voice!
>>>>>>>> 
>>>>>>>> ---
>>>>>>>> A. Soroka
>>>>>>>> The University of Virginia Library
>>>>>>>> 
>>>>>>>> 
>>>>>>>>> On Jan 21, 2017, at 10:20 AM, Andrew Valencik <an...@affin.io>
>>>>> wrote:
>>>>>>>>> 
>>>>>>>>> I think the "mandatory" documentation contribution is a great idea
>>>>> (and
>>>>>>>>> very reasonable). I'd be happy to partake!
>>>>>>>>> On Wed, Jan 18, 2017 at 11:13 AM Rafa Haro <rh...@apache.org>
>>> wrote:
>>>>>>>>> 
>>>>>>>>>> I wouldn't mind to be involved in that but it would be almost
>>>>>>>> "mandatory"
>>>>>>>>>> to contribute some Developer Documentation as outcome of those
>>>>> meetings
>>>>>>>>>> :-). @Rupert we specially need you here :-)
>>>>>>>>>> 
>>>>>>>>>> On Wed, Jan 18, 2017 at 4:09 PM A. Soroka <aj...@virginia.edu>
>>>>> wrote:
>>>>>>>>>> 
>>>>>>>>>>>> I agree that the barrier to contribution is very high. I recall
>>>>> having
>>>>>>>>>>> issues with the documentation initially and the only
>>>>>>>>>>>> available book on Stanbol was not sufficient.
>>>>>>>>>>>> 
>>>>>>>>>>>> If there was renewed interest in bringing on other developers I would be
>>>>>>>>>>>> interested in investing the time to learn the codebase.
>>>>>>>>>>> 
>>>>>>>>>>> I second this!
>>>>>>>>>>> 
>>>>>>>>>>> Perhaps (I know it's very difficult to organize synchronous time
>>>>> for a
>>>>>>>>>>> globally-distributed group but perhaps) we could try to
>> organize a
>>>>> boot
>>>>>>>>>>> camp meeting on-line? In other words, those people who (like
>>> Andrew
>>>>> and
>>>>>>>>>>> myself) would be willing to contribute as part of a larger
>> effort
>>>>> could
>>>>>>>>>> get
>>>>>>>>>>> some virtual time with one or more committers/PMC members to
>> take
>>> an
>>>>>>>>>>> in-depth tour of the system from the developer point of view and
>>>>> hear
>>>>>>>>>> about
>>>>>>>>>>> the outstanding architectural issues, maybe start to figure out
>>>>> points
>>>>>>>> of
>>>>>>>>>>> contribution.
>>>>>>>>>>> 
>>>>>>>>>>> I realize this would make a lot of demands on the committers
>>>>> involved,
>>>>>>>>>> but
>>>>>>>>>>> it might be a way to inject some fuel into the effort.
>>>>>>>>>>> 
>>>>>>>>>>> Just an idea...
>>>>>>>>>>> 
>>>>>>>>>>> ---
>>>>>>>>>>> A. Soroka
>>>>>>>>>>> The University of Virginia Library
>>>>>>>>>>> 
>>>>>>>>>>>> On Jan 18, 2017, at 9:54 AM, Andrew Valencik <an...@affin.io>
>>>>> wrote:
>>>>>>>>>>>> 
>>>>>>>>>>>> Hello!
>>>>>>>>>>>> 
>>>>>>>>>>>> We use Stanbol in production to annotate text with entities as
>>>>> part of
>>>>>>>>>>> some
>>>>>>>>>>>> of our data products.
>>>>>>>>>>>> We do this via the REST API.
>>>>>>>>>>>> Originally we were using the content hub to store all the
>>> documents
>>>>>>>> but
>>>>>>>>>>> saw
>>>>>>>>>>>> higher than expected failures.
>>>>>>>>>>>> The entity engines seem a bit more resilient to varying content
>>>>> types
>>>>>>>> /
>>>>>>>>>>>> encoding.
>>>>>>>>>>>> 
>>>>>>>>>>>> I agree that the barrier to contribution is very high.
>>>>>>>>>>>> I recall having issues with the documentation initially and the
>>>>> only
>>>>>>>>>>>> available book on Stanbol was not sufficient.
>>>>>>>>>>>> 
>>>>>>>>>>>> If there was renewed interest in bringing on other developers I would be
>>>>>>>>>>>> interested in investing the time to learn the codebase.
>>>>>>>>>>>> 
>>>>>>>>>>>> Thanks!
>>>>>>>>>>>> 
>>>>>>>>>>>> On Wed, Jan 18, 2017 at 5:53 AM Raffaele Palmieri <
>>>>>>>>>>>> raffaele.palmieri@gmail.com> wrote:
>>>>>>>>>>>> 
>>>>>>>>>>>>> Dear community,
>>>>>>>>>>>>> we are using 0.12 version with content hub. I find Stanbol
>> very
>>>>>>>>>>> flexible to
>>>>>>>>>>>>> enhance content, specially those unstructured ones.
>>>>>>>>>>>>> For that regards connections with other projects, we have done
>>>>> some
>>>>>>>>>>>>> thoughts about the use with Apache Marmotta with nosql
>> backends
>>>>> for
>>>>>>>>>> big
>>>>>>>>>>>>> data scenarios. Also with Apache Manifold to implement
>>> enrichment
>>>>> of
>>>>>>>>>>>>> existent document repositories, that's a common request.
>>>>>>>>>>>>> Regards,
>>>>>>>>>>>>> Raffaele.
>>>>>>>>>>>>> 
>>>>>>>>>>>>> 
>>>>>>>>>>>>> 2017-01-16 22:41 GMT+01:00 Antero Duarte <
>> a.fduarte1@gmail.com
>>>> :
>>>>>>>>>>>>> 
>>>>>>>>>>>>>> Hi there,
>>>>>>>>>>>>>> Stanbol is very useful for me! Greatest and easiest tool for
>> us
>>>>> to
>>>>>>>> do
>>>>>>>>>>> NLP
>>>>>>>>>>>>>> and linked data. Has there been any discussion to move
>> towards
>>> a
>>>>>>>>>> nosql
>>>>>>>>>>>>>> storage solution, or is solr still the best thing for us?
>> Also,
>>>>> what
>>>>>>>>>>>>> about
>>>>>>>>>>>>>> upgrading solr? How much work would be involved in that?
>>> Anyway,
>>>>>>>>>> great
>>>>>>>>>>>>>> tool, really hope this doesn't die!!!
>>>>>>>>>>>>>> 
>>>>>>>>>>>>>> Regards,
>>>>>>>>>>>>>> Antero
>>>>>>>>>>>>>> 
>>>>>>>>>>>>>> On Mon, 16 Jan 2017, 4:43 p.m. Bertrand Delacretaz, <
>>>>>>>>>>>>>> bdelacretaz@apache.org>
>>>>>>>>>>>>>> wrote:
>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>> On Mon, Jan 16, 2017 at 2:28 PM, Rafa Haro <
>> rharo@apache.org>
>>>>>>>>>> wrote:
>>>>>>>>>>>>>>>> ...I participated
>>>>>>>>>>>>>>>> also in the development of the Java client, so I could take
>>> the
>>>>>>>>>>>>>>>> responsibility of bringing that one as well...
>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>> FWIW, I won't be involved in decisions about this as I left
>>> the
>>>>>>>>>>>>>>> Stanbol PMC a while ago - I'm just commenting from a
>> community
>>>>>>>> point
>>>>>>>>>>>>>>> of view, as an experienced Apache member.
>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>> A while ago Stanbol was "larger" and more focused on its
>> core,
>>>>> but
>>>>>>>>>> as
>>>>>>>>>>>>>>> its community becomes smaller (IIUC) it's probably good to
>>> bring
>>>>>>>>>>>>>>> everyone here, as much as possible, even if it means a
>>> slightly
>>>>>>>> less
>>>>>>>>>>>>>>> focused codebase. This can also help recruiting more active
>>>>>>>>>> committers
>>>>>>>>>>>>>>> and PMC members by involving them directly here.
>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>> That might make Stanbol more sustainable, as a community of
>>>>> people
>>>>>>>>>> who
>>>>>>>>>>>>>>> need similar functionality.
>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>> -Bertrand
>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>> 
>>>>>>>>>>>>> 
>>>>>>>>>>>> --
>>>>>>>>>>>> 
>>>>>>>>>>>> Andrew Valencik
>>>>>>>>>>>> 
>>>>>>>>>>>> Data Scientist
>>>>>>>>>>>> 
>>>>>>>>>>>> Affinio <http://www.affinio.com> | Twitter <
>>>>>>>>>> http://twitter.com/valencik>
>>>>>>>>>>> |
>>>>>>>>>>>> LinkedIn <https://www.linkedin.com/in/andrew-valencik-472b2aa4
>>> 
>>>>>>>>>>>> 
>>>>>>>>>>>> 
>>>>>>>>>>> 
>>>>>>>>>>> 
>>>>>>>>>> 
>>>>>>>>> --
>>>>>>>>> 
>>>>>>>>> Andrew Valencik
>>>>>>>>> 
>>>>>>>>> Data Scientist
>>>>>>>>> 
>>>>>>>>> Affinio <http://www.affinio.com> | Twitter <
>>>>> http://twitter.com/valencik>
>>>>>>>> |
>>>>>>>>> LinkedIn <https://www.linkedin.com/in/andrew-valencik-472b2aa4>
>>>>>>>>> 
>>>>>>>>> 
>>>>>>>> 
>>>>>>>> 
>>>>>>>> 
>>>>>>>> 
>>>>>> 
>>>>> 
>>>>> 
>>>>> 
>>>>> 
>>>>> 
>>>>> 
>>> 
>>> 
>> 


Re: The Future of Apache Stanbol

Posted by Rafa Haro <rh...@apache.org>.
Hi,

I won't be available next Monday, but maybe we can organize more than
one :-). I insist: for this to make any sense, we need @Rupert
to join these meetings. He is, by far, the most experienced Stanbol
developer and the one with the broadest knowledge. Please, @Rupert, raise
your voice, man :-)

Cheers,
Rafa

On Fri, Feb 24, 2017 at 3:33 PM Michal Krajňanský <
michal.krajnansky@gmail.com> wrote:

> Hi Mr. Soroka,
>
> Thank you for the quick reply and the direct call proposal. It will be my
> pleasure to meet you virtually.
>
> The suggested time, Monday 27th at 10AM EST, works well, so you may consider
> it settled. I will also try to involve the managing director of our
> department located in Brno, Czech Republic, Matej Dusik.
>
> I am looking forward to having a fruitful discussion.
>
> Best Regards,
>
> Michal Krajnansky
>
> On Fri, Feb 24, 2017 at 3:22 PM A. Soroka <aj...@virginia.edu> wrote:
>
> > We had a few volunteers to begin learning the codebase to take it forward,
> > and willingness from at least some of the current committers to teach, but
> > I think we need a bit more organization! :grin:
> >
> > I will make a concrete suggestion. Would it be possible to have a video
> > call (perhaps with Google Hangout) about this on (just throwing out a date
> > here) this coming week, Monday 27 February, at 10AM EST?
> >
> > I would be able to attend. I realize that we are a far-flung group, so I'm
> > just throwing out that date to get us started. The most important thing, of
> > course, is to get as many current committers involved as is practical.
> >
> > ---
> > A. Soroka
> > The University of Virginia Library
> >
> > > On Feb 24, 2017, at 8:50 AM, Michal Krajňanský <
> > michal.krajnansky@gmail.com> wrote:
> > >
> > > Dear Stanbol users,
> > >
> > > > I was wondering whether there were any results from the discussion about
> > > > the future of Apache Stanbol.
> > > >
> > > > I work for an R&D team at Konica Minolta Laboratory Europe, and we have been
> > > > using the Stanbol enhancement pipeline in our prototypes concerning
> > > > information extraction from unstructured data.
> > > >
> > > > We are highly interested in the continuing evolution of the Stanbol
> > > > project, and willing to actively support it. Is there anyone who could tell
> > > > us what would be a good way to approach the existing Stanbol stakeholders,
> > > > and to support the project through active development and possibly in other ways?
> > >
> > >
> > > Michal Krajnansky
> > > Research Specialist Junior, Konica Minolta Laboratory Europe
> > >
> > >
> > >
> > > On Mon, Jan 23, 2017 at 6:14 PM A. Soroka <aj...@virginia.edu> wrote:
> > >
> > >> Perhaps we can start a page for people to put their name down for this? I
> > >> couldn't find an Apache wiki site-- is there somewhere the developers think
> > >> appropriate?
> > >>
> > >> ---
> > >> A. Soroka
> > >> Apache Jena / The University of Virginia Library
> > >>
> > >>> On Jan 23, 2017, at 12:08 PM, Aaron Coburn <ac...@amherst.edu>
> > wrote:
> > >>>
> > >>> I would also be very interested.
> > >>>
> > >>> We use the entityhub component quite a bit at our institution, and I
> > >> would be happy to be involved.
> > >>>
> > >>> Aaron Coburn
> > >>>
> > >>>
> > >>>> On Jan 22, 2017, at 12:30 PM, Antero Duarte <a....@gmail.com>
> > >> wrote:
> > >>>>
> > >>>> I would definitely be up for that!
> > >>>>
> > >>>> On Sat, 21 Jan 2017, 3:33 p.m. A. Soroka, <aj...@virginia.edu>
> wrote:
> > >>>>
> > >>>>> Ditto. If there are other folks who would be interested in
> > >> participating
> > >>>>> in something like this, now would be a good time to raise your
> voice!
> > >>>>>
> > >>>>> ---
> > >>>>> A. Soroka
> > >>>>> The University of Virginia Library
> > >>>>>
> > >>>>>
> > >>>>>> On Jan 21, 2017, at 10:20 AM, Andrew Valencik <an...@affin.io>
> > >> wrote:
> > >>>>>>
> > >>>>>> I think the "mandatory" documentation contribution is a great idea
> > >> (and
> > >>>>>> very reasonable). I'd be happy to partake!
> > >>>>>> On Wed, Jan 18, 2017 at 11:13 AM Rafa Haro <rh...@apache.org>
> > wrote:
> > >>>>>>
> > >>>>>>> I wouldn't mind to be involved in that but it would be almost
> > >>>>> "mandatory"
> > >>>>>>> to contribute some Developer Documentation as outcome of those
> > >> meetings
> > >>>>>>> :-). @Rupert we specially need you here :-)
> > >>>>>>>
> > >>>>>>> On Wed, Jan 18, 2017 at 4:09 PM A. Soroka <aj...@virginia.edu>
> > >> wrote:
> > >>>>>>>
> > >>>>>>>>> I agree that the barrier to contribution is very high. I recall
> > >> having
> > >>>>>>>> issues with the documentation initially and the only
> > >>>>>>>>> available book on Stanbol was not sufficient.
> > >>>>>>>>>
> > >>>>>>>>> If there was renewed interest in bringing on other developers I
> > >> would
> > >>>>>>> be
> > >>>>>>>> interested in investing the time to learn the codebase.
> > >>>>>>>>
> > >>>>>>>> I second this!
> > >>>>>>>>
> > >>>>>>>> Perhaps (I know it's very difficult to organize synchronous time
> > >> for a
> > >>>>>>>> globally-distributed group but perhaps) we could try to
> organize a
> > >> boot
> > >>>>>>>> camp meeting on-line? In other words, those people who (like
> > Andrew
> > >> and
> > >>>>>>>> myself) would be willing to contribute as part of a larger
> effort
> > >> could
> > >>>>>>> get
> > >>>>>>>> some virtual time with one or more committers/PMC members to
> take
> > an
> > >>>>>>>> in-depth tour of the system from the developer point of view and
> > >> hear
> > >>>>>>> about
> > >>>>>>>> the outstanding architectural issues, maybe start to figure out
> > >> points
> > >>>>> of
> > >>>>>>>> contribution.
> > >>>>>>>>
> > >>>>>>>> I realize this would make a lot of demands on the committers
> > >> involved,
> > >>>>>>> but
> > >>>>>>>> it might be a way to inject some fuel into the effort.
> > >>>>>>>>
> > >>>>>>>> Just an idea...
> > >>>>>>>>
> > >>>>>>>> ---
> > >>>>>>>> A. Soroka
> > >>>>>>>> The University of Virginia Library
> > >>>>>>>>
> > >>>>>>>>> On Jan 18, 2017, at 9:54 AM, Andrew Valencik <an...@affin.io>
> > >> wrote:
> > >>>>>>>>>
> > >>>>>>>>> Hello!
> > >>>>>>>>>
> > >>>>>>>>> We use Stanbol in production to annotate text with entities as
> > >> part of
> > >>>>>>>> some
> > >>>>>>>>> of our data products.
> > >>>>>>>>> We do this via the REST API.
> > >>>>>>>>> Originally we were using the content hub to store all the
> > documents
> > >>>>> but
> > >>>>>>>> saw
> > >>>>>>>>> higher than expected failures.
> > >>>>>>>>> The entity engines seem a bit more resilient to varying content
> > >> types
> > >>>>> /
> > >>>>>>>>> encoding.
> > >>>>>>>>>
> > >>>>>>>>> I agree that the barrier to contribution is very high.
> > >>>>>>>>> I recall having issues with the documentation initially and the
> > >> only
> > >>>>>>>>> available book on Stanbol was not sufficient.
> > >>>>>>>>>
> > >>>>>>>>> If there was renewed interest in bringing on other developers I
> > >> would
> > >>>>>>> be
> > >>>>>>>>> interested in investing the time to learn the codebase.
> > >>>>>>>>>
> > >>>>>>>>> Thanks!
> > >>>>>>>>>
> > >>>>>>>>> On Wed, Jan 18, 2017 at 5:53 AM Raffaele Palmieri <
> > >>>>>>>>> raffaele.palmieri@gmail.com> wrote:
> > >>>>>>>>>
> > >>>>>>>>>> Dear community,
> > >>>>>>>>>> we are using 0.12 version with content hub. I find Stanbol
> very
> > >>>>>>>> flexible to
> > >>>>>>>>>> enhance content, specially those unstructured ones.
> > >>>>>>>>>> For that regards connections with other projects, we have done
> > >> some
> > >>>>>>>>>> thoughts about the use with Apache Marmotta with nosql
> backends
> > >> for
> > >>>>>>> big
> > >>>>>>>>>> data scenarios. Also with Apache Manifold to implement
> > enrichment
> > >> of
> > >>>>>>>>>> existent document repositories, that's a common request.
> > >>>>>>>>>> Regards,
> > >>>>>>>>>> Raffaele.
> > >>>>>>>>>>
> > >>>>>>>>>>
> > >>>>>>>>>> 2017-01-16 22:41 GMT+01:00 Antero Duarte <
> a.fduarte1@gmail.com
> > >:
> > >>>>>>>>>>
> > >>>>>>>>>>> Hi there,
> > >>>>>>>>>>> Stanbol is very useful for me! Greatest and easiest tool for
> us
> > >> to
> > >>>>> do
> > >>>>>>>> NLP
> > >>>>>>>>>>> and linked data. Has there been any discussion to move
> towards
> > a
> > >>>>>>> nosql
> > >>>>>>>>>>> storage solution, or is solr still the best thing for us?
> Also,
> > >> what
> > >>>>>>>>>> about
> > >>>>>>>>>>> upgrading solr? How much work would be involved in that?
> > Anyway,
> > >>>>>>> great
> > >>>>>>>>>>> tool, really hope this doesn't die!!!
> > >>>>>>>>>>>
> > >>>>>>>>>>> Regards,
> > >>>>>>>>>>> Antero
> > >>>>>>>>>>>
> > >>>>>>>>>>> On Mon, 16 Jan 2017, 4:43 p.m. Bertrand Delacretaz, <
> > >>>>>>>>>>> bdelacretaz@apache.org>
> > >>>>>>>>>>> wrote:
> > >>>>>>>>>>>
> > >>>>>>>>>>>> On Mon, Jan 16, 2017 at 2:28 PM, Rafa Haro <
> rharo@apache.org>
> > >>>>>>> wrote:
> > >>>>>>>>>>>>> ...I participated
> > >>>>>>>>>>>>> also in the development of the Java client, so I could take
> > the
> > >>>>>>>>>>>>> responsibility of bringing that one as well...
> > >>>>>>>>>>>>
> > >>>>>>>>>>>> FWIW, I won't be involved in decisions about this as I left
> > the
> > >>>>>>>>>>>> Stanbol PMC a while ago - I'm just commenting from a
> community
> > >>>>> point
> > >>>>>>>>>>>> of view, as an experienced Apache member.
> > >>>>>>>>>>>>
> > >>>>>>>>>>>> A while ago Stanbol was "larger" and more focused on its
> core,
> > >> but
> > >>>>>>> as
> > >>>>>>>>>>>> its community becomes smaller (IIUC) it's probably good to
> > bring
> > >>>>>>>>>>>> everyone here, as much as possible, even if it means a
> > slightly
> > >>>>> less
> > >>>>>>>>>>>> focused codebase. This can also help recruiting more active
> > >>>>>>> committers
> > >>>>>>>>>>>> and PMC members by involving them directly here.
> > >>>>>>>>>>>>
> > >>>>>>>>>>>> That might make Stanbol more sustainable, as a community of
> > >> people
> > >>>>>>> who
> > >>>>>>>>>>>> need similar functionality.
> > >>>>>>>>>>>>
> > >>>>>>>>>>>> -Bertrand
> > >>>>>>>>>>>>
> > >>>>>>>>>>>
> > >>>>>>>>>>
> > >>>>>>>>> --
> > >>>>>>>>>
> > >>>>>>>>> Andrew Valencik
> > >>>>>>>>>
> > >>>>>>>>> Data Scientist
> > >>>>>>>>>
> > >>>>>>>>> Affinio <http://www.affinio.com> | Twitter <
> > >>>>>>> http://twitter.com/valencik>
> > >>>>>>>> |
> > >>>>>>>>> LinkedIn <https://www.linkedin.com/in/andrew-valencik-472b2aa4
> >
> > >>>>>>>>>
> > >>>>>>>>>
> > >>>>>>>>
> > >>>>>>>>
> > >>>>>>>
> > >>>>>> --
> > >>>>>>
> > >>>>>> Andrew Valencik
> > >>>>>>
> > >>>>>> Data Scientist
> > >>>>>>
> > >>>>>> Affinio <http://www.affinio.com> | Twitter <
> > >> http://twitter.com/valencik>
> > >>>>> |
> > >>>>>> LinkedIn <https://www.linkedin.com/in/andrew-valencik-472b2aa4>
> > >>>>>>

Re: The Future of Apache Stanbol

Posted by Michal Krajňanský <mi...@gmail.com>.
Hi Mr. Soroka,

Thank you for the quick reply and the proposal of a direct call. It will be my
pleasure to meet you virtually.

The suggested time, Monday the 27th at 10AM EST, works well, so you may
consider it settled. I will also try to involve the managing director of our
department in Brno, Czech Republic, Matej Dusik.

I am looking forward to having a fruitful discussion.

Best Regards,

Michal Krajnansky

On Fri, Feb 24, 2017 at 3:22 PM A. Soroka <aj...@virginia.edu> wrote:

> We had a few volunteers to begin learning the codebase to take it forward,
> and willingness from at least some of the current committers to teach, but
> I think we need a bit more organization! :grin:
>
> I will make a concrete suggestion. Would it be possible to have a video
> call (perhaps with Google Hangout) about this on (just throwing out a date
> here) this coming week, Monday 27 February, at 10AM EST?
>
> I would be able to attend. I realize that we are a far-flung group, so I'm
> just throwing out that date to get us started. The most important thing, of
> course, is to get as many current committers involved as is practical.
>
> ---
> A. Soroka
> The University of Virginia Library
>
> > On Feb 24, 2017, at 8:50 AM, Michal Krajňanský <
> michal.krajnansky@gmail.com> wrote:
> >
> > Dear Stanbol users,
> >
> > I was wondering whether there were any results of the discussion about
> > the future of Apache Stanbol.
> >
> > I work for an R&D team at Konica Minolta Laboratory Europe, and we have
> > been using the Stanbol enhancement pipeline in our prototypes concerning
> > information extraction from unstructured data.
> >
> > We are highly interested in the continuing evolution of the Stanbol
> > project, and willing to actively support it. Is there anyone here who
> > could tell us what would be a good way to approach the existing Stanbol
> > stakeholders, and to support the project by active development and
> > possibly in other ways?
> >
> >
> > Michal Krajnansky
> > Research Specialist Junior, Konica Minolta Laboratory Europe
> >
> >
> >
> > On Mon, Jan 23, 2017 at 6:14 PM A. Soroka <aj...@virginia.edu> wrote:
> >
> >> Perhaps we can start a page for people to put their name down for this? I
> >> couldn't find an Apache wiki site-- is there somewhere the developers think
> >> appropriate?
> >>
> >> ---
> >> A. Soroka
> >> Apache Jena / The University of Virginia Library
> >>
> >>> On Jan 23, 2017, at 12:08 PM, Aaron Coburn <ac...@amherst.edu>
> wrote:
> >>>
> >>> I would also be very interested.
> >>>
> >>> We use the entityhub component quite a bit at our institution, and I
> >> would be happy to be involved.
> >>>
> >>> Aaron Coburn
> >>>
> >>>
> >>>> On Jan 22, 2017, at 12:30 PM, Antero Duarte <a....@gmail.com>
> >> wrote:
> >>>>
> >>>> I would definitely be up for that!
> >>>>
> >>>> On Sat, 21 Jan 2017, 3:33 p.m. A. Soroka, <aj...@virginia.edu> wrote:
> >>>>
> >>>>> Ditto. If there are other folks who would be interested in
> >> participating
> >>>>> in something like this, now would be a good time to raise your voice!
> >>>>>
> >>>>> ---
> >>>>> A. Soroka
> >>>>> The University of Virginia Library
> >>>>>
> >>>>>
> >>>>>> On Jan 21, 2017, at 10:20 AM, Andrew Valencik <an...@affin.io>
> >> wrote:
> >>>>>>
> >>>>>> I think the "mandatory" documentation contribution is a great idea
> >> (and
> >>>>>> very reasonable). I'd be happy to partake!
> >>>>>> On Wed, Jan 18, 2017 at 11:13 AM Rafa Haro <rh...@apache.org>
> wrote:
> >>>>>>
> >>>>>>> I wouldn't mind to be involved in that but it would be almost
> >>>>> "mandatory"
> >>>>>>> to contribute some Developer Documentation as outcome of those
> >> meetings
> >>>>>>> :-). @Rupert we specially need you here :-)
> >>>>>>>
> >>>>>>> On Wed, Jan 18, 2017 at 4:09 PM A. Soroka <aj...@virginia.edu>
> >> wrote:
> >>>>>>>
> >>>>>>>>> I agree that the barrier to contribution is very high. I recall
> >> having
> >>>>>>>> issues with the documentation initially and the only
> >>>>>>>>> available book on Stanbol was not sufficient.
> >>>>>>>>>
> >>>>>>>>> If there was renewed interest in bringing on other developers I
> >> would
> >>>>>>> be
> >>>>>>>> interested in investing the time to learn the codebase.
> >>>>>>>>
> >>>>>>>> I second this!
> >>>>>>>>
> >>>>>>>> Perhaps (I know it's very difficult to organize synchronous time
> >> for a
> >>>>>>>> globally-distributed group but perhaps) we could try to organize a
> >> boot
> >>>>>>>> camp meeting on-line? In other words, those people who (like
> Andrew
> >> and
> >>>>>>>> myself) would be willing to contribute as part of a larger effort
> >> could
> >>>>>>> get
> >>>>>>>> some virtual time with one or more committers/PMC members to take
> an
> >>>>>>>> in-depth tour of the system from the developer point of view and
> >> hear
> >>>>>>> about
> >>>>>>>> the outstanding architectural issues, maybe start to figure out
> >> points
> >>>>> of
> >>>>>>>> contribution.
> >>>>>>>>
> >>>>>>>> I realize this would make a lot of demands on the committers
> >> involved,
> >>>>>>> but
> >>>>>>>> it might be a way to inject some fuel into the effort.
> >>>>>>>>
> >>>>>>>> Just an idea...
> >>>>>>>>
> >>>>>>>> ---
> >>>>>>>> A. Soroka
> >>>>>>>> The University of Virginia Library
> >>>>>>>>
> >>>>>>>>> On Jan 18, 2017, at 9:54 AM, Andrew Valencik <an...@affin.io>
> >> wrote:
> >>>>>>>>>
> >>>>>>>>> Hello!
> >>>>>>>>>
> >>>>>>>>> We use Stanbol in production to annotate text with entities as
> >> part of
> >>>>>>>> some
> >>>>>>>>> of our data products.
> >>>>>>>>> We do this via the REST API.
> >>>>>>>>> Originally we were using the content hub to store all the
> documents
> >>>>> but
> >>>>>>>> saw
> >>>>>>>>> higher than expected failures.
> >>>>>>>>> The entity engines seem a bit more resilient to varying content
> >> types
> >>>>> /
> >>>>>>>>> encoding.
> >>>>>>>>>
> >>>>>>>>> I agree that the barrier to contribution is very high.
> >>>>>>>>> I recall having issues with the documentation initially and the
> >> only
> >>>>>>>>> available book on Stanbol was not sufficient.
> >>>>>>>>>
> >>>>>>>>> If there was renewed interest in bringing on other developers I
> >> would
> >>>>>>> be
> >>>>>>>>> interested in investing the time to learn the codebase.
> >>>>>>>>>
> >>>>>>>>> Thanks!
> >>>>>>>>>
> >>>>>>>>> On Wed, Jan 18, 2017 at 5:53 AM Raffaele Palmieri <
> >>>>>>>>> raffaele.palmieri@gmail.com> wrote:
> >>>>>>>>>
> >>>>>>>>>> Dear community,
> >>>>>>>>>> we are using 0.12 version with content hub. I find Stanbol very
> >>>>>>>> flexible to
> >>>>>>>>>> enhance content, specially those unstructured ones.
> >>>>>>>>>> For that regards connections with other projects, we have done
> >> some
> >>>>>>>>>> thoughts about the use with Apache Marmotta with nosql backends
> >> for
> >>>>>>> big
> >>>>>>>>>> data scenarios. Also with Apache Manifold to implement
> enrichment
> >> of
> >>>>>>>>>> existent document repositories, that's a common request.
> >>>>>>>>>> Regards,
> >>>>>>>>>> Raffaele.
> >>>>>>>>>>
> >>>>>>>>>>
> >>>>>>>>>> 2017-01-16 22:41 GMT+01:00 Antero Duarte <a.fduarte1@gmail.com
> >:
> >>>>>>>>>>
> >>>>>>>>>>> Hi there,
> >>>>>>>>>>> Stanbol is very useful for me! Greatest and easiest tool for us
> >> to
> >>>>> do
> >>>>>>>> NLP
> >>>>>>>>>>> and linked data. Has there been any discussion to move towards
> a
> >>>>>>> nosql
> >>>>>>>>>>> storage solution, or is solr still the best thing for us? Also,
> >> what
> >>>>>>>>>> about
> >>>>>>>>>>> upgrading solr? How much work would be involved in that?
> Anyway,
> >>>>>>> great
> >>>>>>>>>>> tool, really hope this doesn't die!!!
> >>>>>>>>>>>
> >>>>>>>>>>> Regards,
> >>>>>>>>>>> Antero
> >>>>>>>>>>>
> >>>>>>>>>>> On Mon, 16 Jan 2017, 4:43 p.m. Bertrand Delacretaz, <
> >>>>>>>>>>> bdelacretaz@apache.org>
> >>>>>>>>>>> wrote:
> >>>>>>>>>>>
> >>>>>>>>>>>> On Mon, Jan 16, 2017 at 2:28 PM, Rafa Haro <rh...@apache.org>
> >>>>>>> wrote:
> >>>>>>>>>>>>> ...I participated
> >>>>>>>>>>>>> also in the development of the Java client, so I could take
> the
> >>>>>>>>>>>>> responsibility of bringing that one as well...
> >>>>>>>>>>>>
> >>>>>>>>>>>> FWIW, I won't be involved in decisions about this as I left
> the
> >>>>>>>>>>>> Stanbol PMC a while ago - I'm just commenting from a community
> >>>>> point
> >>>>>>>>>>>> of view, as an experienced Apache member.
> >>>>>>>>>>>>
> >>>>>>>>>>>> A while ago Stanbol was "larger" and more focused on its core,
> >> but
> >>>>>>> as
> >>>>>>>>>>>> its community becomes smaller (IIUC) it's probably good to
> bring
> >>>>>>>>>>>> everyone here, as much as possible, even if it means a
> slightly
> >>>>> less
> >>>>>>>>>>>> focused codebase. This can also help recruiting more active
> >>>>>>> committers
> >>>>>>>>>>>> and PMC members by involving them directly here.
> >>>>>>>>>>>>
> >>>>>>>>>>>> That might make Stanbol more sustainable, as a community of
> >> people
> >>>>>>> who
> >>>>>>>>>>>> need similar functionality.
> >>>>>>>>>>>>
> >>>>>>>>>>>> -Bertrand
> >>>>>>>>>>>>
> >>>>>>>>>>>
> >>>>>>>>>>
> >>>>>>>>> --
> >>>>>>>>>
> >>>>>>>>> Andrew Valencik
> >>>>>>>>>
> >>>>>>>>> Data Scientist
> >>>>>>>>>
> >>>>>>>>> Affinio <http://www.affinio.com> | Twitter <
> >>>>>>> http://twitter.com/valencik>
> >>>>>>>> |
> >>>>>>>>> LinkedIn <https://www.linkedin.com/in/andrew-valencik-472b2aa4>
> >>>>>>>>>
> >>>>>>>>>
> >>>>>>>>
> >>>>>>>>
> >>>>>>>
> >>>>>> --
> >>>>>>
> >>>>>> Andrew Valencik
> >>>>>>
> >>>>>> Data Scientist
> >>>>>>
> >>>>>> Affinio <http://www.affinio.com> | Twitter <
> >> http://twitter.com/valencik>
> >>>>> |
> >>>>>> LinkedIn <https://www.linkedin.com/in/andrew-valencik-472b2aa4>
> >>>>>>

Re: The Future of Apache Stanbol

Posted by "A. Soroka" <aj...@virginia.edu>.
We had a few volunteers to begin learning the codebase to take it forward, and willingness from at least some of the current committers to teach, but I think we need a bit more organization! :grin:

I will make a concrete suggestion. Would it be possible to have a video call (perhaps with Google Hangout) about this on (just throwing out a date here) this coming week, Monday 27 February, at 10AM EST? 

I would be able to attend. I realize that we are a far-flung group, so I'm just throwing out that date to get us started. The most important thing, of course, is to get as many current committers involved as is practical.

---
A. Soroka
The University of Virginia Library

> On Feb 24, 2017, at 8:50 AM, Michal Krajňanský <mi...@gmail.com> wrote:
> 
> Dear Stanbol users,
> 
> I was wondering whether there were any results of the discussion about the
> future of Apache Stanbol.
> 
> I work for an R&D team at Konica Minolta Laboratory Europe, and we have been
> using the Stanbol enhancement pipeline in our prototypes concerning information
> extraction from unstructured data.
> 
> We are highly interested in the continuing evolution of the Stanbol
> project, and willing to actively support it. Is there anyone here who could
> tell us what would be a good way to approach the existing Stanbol stakeholders,
> and to support the project by active development and possibly in other ways?
> 
> 
> Michal Krajnansky
> Research Specialist Junior, Konica Minolta Laboratory Europe
> 
> 
> 
> On Mon, Jan 23, 2017 at 6:14 PM A. Soroka <aj...@virginia.edu> wrote:
> 
>> Perhaps we can start a page for people to put their name down for this? I
>> couldn't find an Apache wiki site-- is there somewhere the developers think
>> appropriate?
>> 
>> ---
>> A. Soroka
>> Apache Jena / The University of Virginia Library
>> 
>>> On Jan 23, 2017, at 12:08 PM, Aaron Coburn <ac...@amherst.edu> wrote:
>>> 
>>> I would also be very interested.
>>> 
>>> We use the entityhub component quite a bit at our institution, and I
>> would be happy to be involved.
>>> 
>>> Aaron Coburn
>>> 
>>> 
>>>> On Jan 22, 2017, at 12:30 PM, Antero Duarte <a....@gmail.com>
>> wrote:
>>>> 
>>>> I would definitely be up for that!
>>>> 
>>>> On Sat, 21 Jan 2017, 3:33 p.m. A. Soroka, <aj...@virginia.edu> wrote:
>>>> 
>>>>> Ditto. If there are other folks who would be interested in
>> participating
>>>>> in something like this, now would be a good time to raise your voice!
>>>>> 
>>>>> ---
>>>>> A. Soroka
>>>>> The University of Virginia Library
>>>>> 
>>>>> 
>>>>>> On Jan 21, 2017, at 10:20 AM, Andrew Valencik <an...@affin.io>
>> wrote:
>>>>>> 
>>>>>> I think the "mandatory" documentation contribution is a great idea
>> (and
>>>>>> very reasonable). I'd be happy to partake!
>>>>>> On Wed, Jan 18, 2017 at 11:13 AM Rafa Haro <rh...@apache.org> wrote:
>>>>>> 
>>>>>>> I wouldn't mind to be involved in that but it would be almost
>>>>> "mandatory"
>>>>>>> to contribute some Developer Documentation as outcome of those
>> meetings
>>>>>>> :-). @Rupert we specially need you here :-)
>>>>>>> 
>>>>>>> On Wed, Jan 18, 2017 at 4:09 PM A. Soroka <aj...@virginia.edu>
>> wrote:
>>>>>>> 
>>>>>>>>> I agree that the barrier to contribution is very high. I recall
>> having
>>>>>>>> issues with the documentation initially and the only
>>>>>>>>> available book on Stanbol was not sufficient.
>>>>>>>>> 
>>>>>>>>> If there was renewed interest in bringing on other developers I
>> would
>>>>>>> be
>>>>>>>> interested in investing the time to learn the codebase.
>>>>>>>> 
>>>>>>>> I second this!
>>>>>>>> 
>>>>>>>> Perhaps (I know it's very difficult to organize synchronous time
>> for a
>>>>>>>> globally-distributed group but perhaps) we could try to organize a
>> boot
>>>>>>>> camp meeting on-line? In other words, those people who (like Andrew
>> and
>>>>>>>> myself) would be willing to contribute as part of a larger effort
>> could
>>>>>>> get
>>>>>>>> some virtual time with one or more committers/PMC members to take an
>>>>>>>> in-depth tour of the system from the developer point of view and
>> hear
>>>>>>> about
>>>>>>>> the outstanding architectural issues, maybe start to figure out
>> points
>>>>> of
>>>>>>>> contribution.
>>>>>>>> 
>>>>>>>> I realize this would make a lot of demands on the committers
>> involved,
>>>>>>> but
>>>>>>>> it might be a way to inject some fuel into the effort.
>>>>>>>> 
>>>>>>>> Just an idea...
>>>>>>>> 
>>>>>>>> ---
>>>>>>>> A. Soroka
>>>>>>>> The University of Virginia Library
>>>>>>>> 
>>>>>>>>> On Jan 18, 2017, at 9:54 AM, Andrew Valencik <an...@affin.io>
>> wrote:
>>>>>>>>> 
>>>>>>>>> Hello!
>>>>>>>>> 
>>>>>>>>> We use Stanbol in production to annotate text with entities as
>> part of
>>>>>>>> some
>>>>>>>>> of our data products.
>>>>>>>>> We do this via the REST API.
>>>>>>>>> Originally we were using the content hub to store all the documents
>>>>> but
>>>>>>>> saw
>>>>>>>>> higher than expected failures.
>>>>>>>>> The entity engines seem a bit more resilient to varying content
>> types
>>>>> /
>>>>>>>>> encoding.
>>>>>>>>> 
>>>>>>>>> I agree that the barrier to contribution is very high.
>>>>>>>>> I recall having issues with the documentation initially and the
>> only
>>>>>>>>> available book on Stanbol was not sufficient.
>>>>>>>>> 
>>>>>>>>> If there was renewed interest in bringing on other developers I
>> would
>>>>>>> be
>>>>>>>>> interested in investing the time to learn the codebase.
>>>>>>>>> 
>>>>>>>>> Thanks!
>>>>>>>>> 
>>>>>>>>> On Wed, Jan 18, 2017 at 5:53 AM Raffaele Palmieri <
>>>>>>>>> raffaele.palmieri@gmail.com> wrote:
>>>>>>>>> 
>>>>>>>>>> Dear community,
>>>>>>>>>> we are using 0.12 version with content hub. I find Stanbol very
>>>>>>>> flexible to
>>>>>>>>>> enhance content, specially those unstructured ones.
>>>>>>>>>> For that regards connections with other projects, we have done
>> some
>>>>>>>>>> thoughts about the use with Apache Marmotta with nosql backends
>> for
>>>>>>> big
>>>>>>>>>> data scenarios. Also with Apache Manifold to implement enrichment
>> of
>>>>>>>>>> existent document repositories, that's a common request.
>>>>>>>>>> Regards,
>>>>>>>>>> Raffaele.
>>>>>>>>>> 
>>>>>>>>>> 
>>>>>>>>>> 2017-01-16 22:41 GMT+01:00 Antero Duarte <a....@gmail.com>:
>>>>>>>>>> 
>>>>>>>>>>> Hi there,
>>>>>>>>>>> Stanbol is very useful for me! Greatest and easiest tool for us
>> to
>>>>> do
>>>>>>>> NLP
>>>>>>>>>>> and linked data. Has there been any discussion to move towards a
>>>>>>> nosql
>>>>>>>>>>> storage solution, or is solr still the best thing for us? Also,
>> what
>>>>>>>>>> about
>>>>>>>>>>> upgrading solr? How much work would be involved in that? Anyway,
>>>>>>> great
>>>>>>>>>>> tool, really hope this doesn't die!!!
>>>>>>>>>>> 
>>>>>>>>>>> Regards,
>>>>>>>>>>> Antero
>>>>>>>>>>> 
>>>>>>>>>>> On Mon, 16 Jan 2017, 4:43 p.m. Bertrand Delacretaz, <
>>>>>>>>>>> bdelacretaz@apache.org>
>>>>>>>>>>> wrote:
>>>>>>>>>>> 
>>>>>>>>>>>> On Mon, Jan 16, 2017 at 2:28 PM, Rafa Haro <rh...@apache.org>
>>>>>>> wrote:
>>>>>>>>>>>>> ...I participated
>>>>>>>>>>>>> also in the development of the Java client, so I could take the
>>>>>>>>>>>>> responsibility of bringing that one as well...
>>>>>>>>>>>> 
>>>>>>>>>>>> FWIW, I won't be involved in decisions about this as I left the
>>>>>>>>>>>> Stanbol PMC a while ago - I'm just commenting from a community
>>>>> point
>>>>>>>>>>>> of view, as an experienced Apache member.
>>>>>>>>>>>> 
>>>>>>>>>>>> A while ago Stanbol was "larger" and more focused on its core,
>> but
>>>>>>> as
>>>>>>>>>>>> its community becomes smaller (IIUC) it's probably good to bring
>>>>>>>>>>>> everyone here, as much as possible, even if it means a slightly
>>>>> less
>>>>>>>>>>>> focused codebase. This can also help recruiting more active
>>>>>>> committers
>>>>>>>>>>>> and PMC members by involving them directly here.
>>>>>>>>>>>> 
>>>>>>>>>>>> That might make Stanbol more sustainable, as a community of
>> people
>>>>>>> who
>>>>>>>>>>>> need similar functionality.
>>>>>>>>>>>> 
>>>>>>>>>>>> -Bertrand
>>>>>>>>>>>> 
>>>>>>>>>>> 
>>>>>>>>>> 
>>>>>>>>> --
>>>>>>>>> 
>>>>>>>>> Andrew Valencik
>>>>>>>>> 
>>>>>>>>> Data Scientist
>>>>>>>>> 
>>>>>>>>> Affinio <http://www.affinio.com> | Twitter <
>>>>>>> http://twitter.com/valencik>
>>>>>>>> |
>>>>>>>>> LinkedIn <https://www.linkedin.com/in/andrew-valencik-472b2aa4>
>>>>>>>>> 
>>>>>>>>> 
>>>>>>>> 
>>>>>>>> 
>>>>>>> 
>>>>>> --
>>>>>> 
>>>>>> Andrew Valencik
>>>>>> 
>>>>>> Data Scientist
>>>>>> 
>>>>>> Affinio <http://www.affinio.com> | Twitter <
>> http://twitter.com/valencik>
>>>>> |
>>>>>> LinkedIn <https://www.linkedin.com/in/andrew-valencik-472b2aa4>
>>>>>> 


Re: The Future of Apache Stanbol

Posted by Michal Krajňanský <mi...@gmail.com>.
Dear Stanbol users,

I was wondering whether there were any results of the discussion about the
future of Apache Stanbol.

I work for an R&D team at Konica Minolta Laboratory Europe, and we have been
using the Stanbol enhancement pipeline in our prototypes concerning information
extraction from unstructured data.

We are highly interested in the continuing evolution of the Stanbol
project, and willing to actively support it. Is there anyone here who could
tell us what would be a good way to approach the existing Stanbol stakeholders,
and to support the project by active development and possibly in other ways?


Michal Krajnansky
Research Specialist Junior, Konica Minolta Laboratory Europe



On Mon, Jan 23, 2017 at 6:14 PM A. Soroka <aj...@virginia.edu> wrote:

> Perhaps we can start a page for people to put their name down for this? I
> couldn't find an Apache wiki site-- is there somewhere the developers think
> appropriate?
>
> ---
> A. Soroka
> Apache Jena / The University of Virginia Library
>
> > On Jan 23, 2017, at 12:08 PM, Aaron Coburn <ac...@amherst.edu> wrote:
> >
> > I would also be very interested.
> >
> > We use the entityhub component quite a bit at our institution, and I
> would be happy to be involved.
> >
> > Aaron Coburn
> >
> >
> >> On Jan 22, 2017, at 12:30 PM, Antero Duarte <a....@gmail.com>
> wrote:
> >>
> >> I would definitely be up for that!
> >>
> >> On Sat, 21 Jan 2017, 3:33 p.m. A. Soroka, <aj...@virginia.edu> wrote:
> >>
> >>> Ditto. If there are other folks who would be interested in
> participating
> >>> in something like this, now would be a good time to raise your voice!
> >>>
> >>> ---
> >>> A. Soroka
> >>> The University of Virginia Library
> >>>
> >>>
> >>>> On Jan 21, 2017, at 10:20 AM, Andrew Valencik <an...@affin.io>
> wrote:
> >>>>
> >>>> I think the "mandatory" documentation contribution is a great idea
> (and
> >>>> very reasonable). I'd be happy to partake!
> >>>> On Wed, Jan 18, 2017 at 11:13 AM Rafa Haro <rh...@apache.org> wrote:
> >>>>
> >>>>> I wouldn't mind to be involved in that but it would be almost
> >>> "mandatory"
> >>>>> to contribute some Developer Documentation as outcome of those
> meetings
> >>>>> :-). @Rupert we specially need you here :-)
> >>>>>
> >>>>> On Wed, Jan 18, 2017 at 4:09 PM A. Soroka <aj...@virginia.edu>
> wrote:
> >>>>>
> >>>>>>> I agree that the barrier to contribution is very high. I recall
> having
> >>>>>> issues with the documentation initially and the only
> >>>>>>> available book on Stanbol was not sufficient.
> >>>>>>>
> >>>>>>> If there was renewed interest in bringing on other developers I
> would
> >>>>> be
> >>>>>> interested in investing the time to learn the codebase.
> >>>>>>
> >>>>>> I second this!
> >>>>>>
> >>>>>> Perhaps (I know it's very difficult to organize synchronous time
> for a
> >>>>>> globally-distributed group but perhaps) we could try to organize a
> boot
> >>>>>> camp meeting on-line? In other words, those people who (like Andrew
> and
> >>>>>> myself) would be willing to contribute as part of a larger effort
> could
> >>>>> get
> >>>>>> some virtual time with one or more committers/PMC members to take an
> >>>>>> in-depth tour of the system from the developer point of view and
> hear
> >>>>> about
> >>>>>> the outstanding architectural issues, maybe start to figure out
> points
> >>> of
> >>>>>> contribution.
> >>>>>>
> >>>>>> I realize this would make a lot of demands on the committers
> involved,
> >>>>> but
> >>>>>> it might be a way to inject some fuel into the effort.
> >>>>>>
> >>>>>> Just an idea...
> >>>>>>
> >>>>>> ---
> >>>>>> A. Soroka
> >>>>>> The University of Virginia Library
> >>>>>>
> >>>>>>> On Jan 18, 2017, at 9:54 AM, Andrew Valencik <an...@affin.io>
> wrote:
> >>>>>>>
> >>>>>>> Hello!
> >>>>>>>
> >>>>>>> We use Stanbol in production to annotate text with entities as
> part of
> >>>>>> some
> >>>>>>> of our data products.
> >>>>>>> We do this via the REST API.
> >>>>>>> Originally we were using the content hub to store all the documents
> >>> but
> >>>>>> saw
> >>>>>>> higher than expected failures.
> >>>>>>> The entity engines seem a bit more resilient to varying content
> types
> >>> /
> >>>>>>> encoding.
> >>>>>>>
> >>>>>>> I agree that the barrier to contribution is very high.
> >>>>>>> I recall having issues with the documentation initially and the
> only
> >>>>>>> available book on Stanbol was not sufficient.
> >>>>>>>
> >>>>>>> If there was renewed interest in bringing on other developers I
> would
> >>>>> be
> >>>>>>> interested in investing the time to learn to codebase.
> >>>>>>>
> >>>>>>> Thanks!
> >>>>>>>
> >>>>>>> On Wed, Jan 18, 2017 at 5:53 AM Raffaele Palmieri <
> >>>>>>> raffaele.palmieri@gmail.com> wrote:
> >>>>>>>
> >>>>>>>> Dear community,
> >>>>>>>> we are using 0.12 version with content hub. I find Stanbol very
> >>>>>> flexible to
> >>>>>>>> enhance content, specially those unstructured ones.
> >>>>>>>> For that regards connections with other projects, we have done
> some
> >>>>>>>> thoughts about the use with Apache Marmotta with nosql backends
> for
> >>>>> big
> >>>>>>>> data scenarios. Also with Apache Manifold to implement enrichment
> of
> >>>>>>>> existent document repositories, that's a common request.
> >>>>>>>> Regards,
> >>>>>>>> Raffaele.
> >>>>>>>>
> >>>>>>>>
> >>>>>>>> 2017-01-16 22:41 GMT+01:00 Antero Duarte <a....@gmail.com>:
> >>>>>>>>
> >>>>>>>>> Hi there,
> >>>>>>>>> Stanbol is very useful for me! Greatest and easiest tool for us
> to
> >>> do
> >>>>>> NLP
> >>>>>>>>> and linked data. Has there been any discussion to move towards a
> >>>>> nosql
> >>>>>>>>> storage solution, or is solr still the best thing for us? Also,
> what
> >>>>>>>> about
> >>>>>>>>> upgrading solr? How much work would be involved in that? Anyway,
> >>>>> great
> >>>>>>>>> tool, really hope this doesn't die!!!
> >>>>>>>>>
> >>>>>>>>> Regards,
> >>>>>>>>> Antero
> >>>>>>>>>
> >>>>>>>>> On Mon, 16 Jan 2017, 4:43 p.m. Bertrand Delacretaz, <
> >>>>>>>>> bdelacretaz@apache.org>
> >>>>>>>>> wrote:
> >>>>>>>>>
> >>>>>>>>>> On Mon, Jan 16, 2017 at 2:28 PM, Rafa Haro <rh...@apache.org>
> >>>>> wrote:
> >>>>>>>>>>> ...I participated
> >>>>>>>>>>> also in the development of the Java client, so I could take the
> >>>>>>>>>>> responsibility of bringing that one as well...
> >>>>>>>>>>
> >>>>>>>>>> FWIW, I won't be involved in decisions about this as I left the
> >>>>>>>>>> Stanbol PMC a while ago - I'm just commenting from a community
> >>> point
> >>>>>>>>>> of view, as an experienced Apache member.
> >>>>>>>>>>
> >>>>>>>>>> A while ago Stanbol was "larger" and more focused on its core,
> but
> >>>>> as
> >>>>>>>>>> its community becomes smaller (IIUC) it's probably good to bring
> >>>>>>>>>> everyone here, as much as possible, even it it means a slightly
> >>> less
> >>>>>>>>>> focused codebase. This can also help recruiting more active
> >>>>> committers
> >>>>>>>>>> and PMC members by involving them directly here.
> >>>>>>>>>>
> >>>>>>>>>> That might make Stanbol more sustainable, as a community of
> people
> >>>>> who
> >>>>>>>>>> need similar functionality.
> >>>>>>>>>>
> >>>>>>>>>> -Bertrand
> >>>>>>>>>>
> >>>>>>>>>
> >>>>>>>>
> >>>>>>> --
> >>>>>>>
> >>>>>>> Andrew Valencik
> >>>>>>>
> >>>>>>> Data Scientist
> >>>>>>>
> >>>>>>> Affinio <http://www.affinio.com> | Twitter <
> >>>>> http://twitter.com/valencik>
> >>>>>> |
> >>>>>>> LinkedIn <https://www.linkedin.com/in/andrew-valencik-472b2aa4>
> >>>>>>>
> >>>>>>> Discover your personal brand for free with Discovery by Affinio
> >>>>>>> <
> >>>>>>
> >>>>>
> >>>
> http://t.sidekickopen36.com/e1t/c/5/f18dQhb0S7lC8dDMPbW2n0x6l2B9nMJW7t5XZs2zGp1-W3Ljs1C5vfVNRVQJSG056dPwzdmbQgC02?t=http%3A%2F%2Fdiscovery.affinio.com%2F&si=5220160139427840&pi=8070b80d-4591-486c-891e-f9743f8a6c3f
> >>>>>>>
> >>>>>>
> >>>>>>
> >>>>>
> >>>> --
> >>>>
> >>>> Andrew Valencik
> >>>>
> >>>> Data Scientist
> >>>>
> >>>> Affinio <http://www.affinio.com> | Twitter <
> http://twitter.com/valencik>
> >>> |
> >>>> LinkedIn <https://www.linkedin.com/in/andrew-valencik-472b2aa4>
> >>>>
> >>>> Discover your personal brand for free with Discovery by Affinio
> >>>> <
> >>>
> http://t.sidekickopen36.com/e1t/c/5/f18dQhb0S7lC8dDMPbW2n0x6l2B9nMJW7t5XZs2zGp1-W3Ljs1C5vfVNRVQJSG056dPwzdmbQgC02?t=http%3A%2F%2Fdiscovery.affinio.com%2F&si=5220160139427840&pi=8070b80d-4591-486c-891e-f9743f8a6c3f
> >>>>
> >>>
> >>>
> >>>
> >>>
> >
>
>
>
>
>
>

Re: The Future of Apache Stanbol

Posted by "A. Soroka" <aj...@virginia.edu>.
Perhaps we can start a page for people to put their names down for this? I couldn't find an Apache wiki site. Is there somewhere the developers think appropriate?

---
A. Soroka
Apache Jena / The University of Virginia Library

> On Jan 23, 2017, at 12:08 PM, Aaron Coburn <ac...@amherst.edu> wrote:
> 
> I would also be very interested.
> 
> We use the entityhub component quite a bit at our institution, and I would be happy to be involved.
> 
> Aaron Coburn
> 
> 

Re: The Future of Apache Stanbol

Posted by Aaron Coburn <ac...@amherst.edu>.
I would also be very interested.

We use the entityhub component quite a bit at our institution, and I would be happy to be involved.

Aaron Coburn


> On Jan 22, 2017, at 12:30 PM, Antero Duarte <a....@gmail.com> wrote:
> 
> I would definitely be up for that!
> 

Re: The Future of Apache Stanbol

Posted by Antero Duarte <a....@gmail.com>.
I would definitely be up for that!

On Sat, 21 Jan 2017, 3:33 p.m. A. Soroka, <aj...@virginia.edu> wrote:

> Ditto. If there are other folks who would be interested in participating
> in something like this, now would be a good time to raise your voice!
>
> ---
> A. Soroka
> The University of Virginia Library
>
>

Re: The Future of Apache Stanbol

Posted by "A. Soroka" <aj...@virginia.edu>.
Ditto. If there are other folks who would be interested in participating in something like this, now would be a good time to raise your voice!

---
A. Soroka
The University of Virginia Library


> On Jan 21, 2017, at 10:20 AM, Andrew Valencik <an...@affin.io> wrote:
> 
> I think the "mandatory" documentation contribution is a great idea (and
> very reasonable). I'd be happy to partake!

Re: The Future of Apache Stanbol

Posted by Andrew Valencik <an...@affin.io>.
I think the "mandatory" documentation contribution is a great idea (and
very reasonable). I'd be happy to partake!
On Wed, Jan 18, 2017 at 11:13 AM Rafa Haro <rh...@apache.org> wrote:

> I wouldn't mind to be involved in that but it would be almost "mandatory"
> to contribute some Developer Documentation as outcome of those meetings
> :-). @Rupert we specially need you here :-)
>
> On Wed, Jan 18, 2017 at 4:09 PM A. Soroka <aj...@virginia.edu> wrote:
>
> > > I agree that the barrier to contribution is very high. I recall having
> > issues with the documentation initially and the only
> > > available book on Stanbol was not sufficient.
> > >
> > > If there was renewed interest in bringing on other developers I would
> be
> > interested in investing the time to learn to codebase.
> >
> > I second this!
> >
> > Perhaps (I know it's very difficult to organize synchronous time for a
> > globally-distributed group but perhaps) we could try to organize a boot
> > camp meeting on-line? In other words, those people who (like Andrew and
> > myself) would be willing to contribute as part of a larger effort could
> get
> > some virtual time with one or more committers/PMC members to take an
> > in-depth tour of the system from the developer point of view and hear
> about
> > the outstanding architectural issues, maybe start to figure out points of
> > contribution.
> >
> > I realize this would make a lot of demands on the committers involved,
> but
> > it might be a way to inject some fuel into the effort.
> >
> > Just an idea...
> >
> > ---
> > A. Soroka
> > The University of Virginia Library
> >
> > > On Jan 18, 2017, at 9:54 AM, Andrew Valencik <an...@affin.io> wrote:
> > >
> > > Hello!
> > >
> > > We use Stanbol in production to annotate text with entities as part of
> > some
> > > of our data products.
> > > We do this via the REST API.
> > > Originally we were using the content hub to store all the documents but
> > saw
> > > higher than expected failures.
> > > The entity engines seem a bit more resilient to varying content types /
> > > encoding.
> > >
> > > I agree that the barrier to contribution is very high.
> > > I recall having issues with the documentation initially and the only
> > > available book on Stanbol was not sufficient.
> > >
> > > If there was renewed interest in bringing on other developers I would
> be
> > > interested in investing the time to learn to codebase.
> > >
> > > Thanks!
> > >
> > > On Wed, Jan 18, 2017 at 5:53 AM Raffaele Palmieri <
> > > raffaele.palmieri@gmail.com> wrote:
> > >
> > >> Dear community,
> > >> we are using 0.12 version with content hub. I find Stanbol very
> > flexible to
> > >> enhance content, specially those unstructured ones.
> > >> For that regards connections with other projects, we have done some
> > >> thoughts about the use with Apache Marmotta with nosql backends for
> big
> > >> data scenarios. Also with Apache Manifold to implement enrichment of
> > >> existent document repositories, that's a common request.
> > >> Regards,
> > >> Raffaele.
> > >>
> > >>
> > >> 2017-01-16 22:41 GMT+01:00 Antero Duarte <a....@gmail.com>:
> > >>
> > >>> Hi there,
> > >>> Stanbol is very useful for me! Greatest and easiest tool for us to do
> > NLP
> > >>> and linked data. Has there been any discussion to move towards a
> nosql
> > >>> storage solution, or is solr still the best thing for us? Also, what
> > >> about
> > >>> upgrading solr? How much work would be involved in that? Anyway,
> great
> > >>> tool, really hope this doesn't die!!!
> > >>>
> > >>> Regards,
> > >>> Antero
> > >>>
> > >>> On Mon, 16 Jan 2017, 4:43 p.m. Bertrand Delacretaz, <
> > >>> bdelacretaz@apache.org>
> > >>> wrote:
> > >>>
> > >>>> On Mon, Jan 16, 2017 at 2:28 PM, Rafa Haro <rh...@apache.org>
> wrote:
> > >>>>> ...I participated
> > >>>>> also in the development of the Java client, so I could take the
> > >>>>> responsibility of bringing that one as well...
> > >>>>
> > >>>> FWIW, I won't be involved in decisions about this as I left the
> > >>>> Stanbol PMC a while ago - I'm just commenting from a community point
> > >>>> of view, as an experienced Apache member.
> > >>>>
> > >>>> A while ago Stanbol was "larger" and more focused on its core, but
> as
> > >>>> its community becomes smaller (IIUC) it's probably good to bring
> > >>>> everyone here, as much as possible, even it it means a slightly less
> > >>>> focused codebase. This can also help recruiting more active
> committers
> > >>>> and PMC members by involving them directly here.
> > >>>>
> > >>>> That might make Stanbol more sustainable, as a community of people
> who
> > >>>> need similar functionality.
> > >>>>
> > >>>> -Bertrand
> > >>>>
> > >>>
> > >>
> >
> >
>
-- 

Andrew Valencik

Data Scientist

Affinio <http://www.affinio.com> | Twitter <http://twitter.com/valencik> |
LinkedIn <https://www.linkedin.com/in/andrew-valencik-472b2aa4>

Discover your personal brand for free with Discovery by Affinio
<http://t.sidekickopen36.com/e1t/c/5/f18dQhb0S7lC8dDMPbW2n0x6l2B9nMJW7t5XZs2zGp1-W3Ljs1C5vfVNRVQJSG056dPwzdmbQgC02?t=http%3A%2F%2Fdiscovery.affinio.com%2F&si=5220160139427840&pi=8070b80d-4591-486c-891e-f9743f8a6c3f>

Re: The Future of Apache Stanbol

Posted by Rafa Haro <rh...@apache.org>.
I wouldn't mind being involved in that, but it would be almost "mandatory"
to contribute some developer documentation as an outcome of those meetings
:-). @Rupert, we especially need you here :-)

On Wed, Jan 18, 2017 at 4:09 PM A. Soroka <aj...@virginia.edu> wrote:

> > I agree that the barrier to contribution is very high. I recall having
> issues with the documentation initially and the only
> > available book on Stanbol was not sufficient.
> >
> > If there was renewed interest in bringing on other developers I would be
> interested in investing the time to learn to codebase.
>
> I second this!
>
> Perhaps (I know it's very difficult to organize synchronous time for a
> globally-distributed group but perhaps) we could try to organize a boot
> camp meeting on-line? In other words, those people who (like Andrew and
> myself) would be willing to contribute as part of a larger effort could get
> some virtual time with one or more committers/PMC members to take an
> in-depth tour of the system from the developer point of view and hear about
> the outstanding architectural issues, maybe start to figure out points of
> contribution.
>
> I realize this would make a lot of demands on the committers involved, but
> it might be a way to inject some fuel into the effort.
>
> Just an idea...
>
> ---
> A. Soroka
> The University of Virginia Library
>
> > On Jan 18, 2017, at 9:54 AM, Andrew Valencik <an...@affin.io> wrote:
> >
> > Hello!
> >
> > We use Stanbol in production to annotate text with entities as part of
> some
> > of our data products.
> > We do this via the REST API.
> > Originally we were using the content hub to store all the documents but
> saw
> > higher than expected failures.
> > The entity engines seem a bit more resilient to varying content types /
> > encoding.
> >
> > I agree that the barrier to contribution is very high.
> > I recall having issues with the documentation initially and the only
> > available book on Stanbol was not sufficient.
> >
> > If there was renewed interest in bringing on other developers I would be
> > interested in investing the time to learn to codebase.
> >
> > Thanks!
> >
> > On Wed, Jan 18, 2017 at 5:53 AM Raffaele Palmieri <
> > raffaele.palmieri@gmail.com> wrote:
> >
> >> Dear community,
> >> we are using 0.12 version with content hub. I find Stanbol very
> flexible to
> >> enhance content, specially those unstructured ones.
> >> For that regards connections with other projects, we have done some
> >> thoughts about the use with Apache Marmotta with nosql backends for big
> >> data scenarios. Also with Apache Manifold to implement enrichment of
> >> existent document repositories, that's a common request.
> >> Regards,
> >> Raffaele.
> >>
> >>
> >> 2017-01-16 22:41 GMT+01:00 Antero Duarte <a....@gmail.com>:
> >>
> >>> Hi there,
> >>> Stanbol is very useful for me! Greatest and easiest tool for us to do
> NLP
> >>> and linked data. Has there been any discussion to move towards a nosql
> >>> storage solution, or is solr still the best thing for us? Also, what
> >> about
> >>> upgrading solr? How much work would be involved in that? Anyway, great
> >>> tool, really hope this doesn't die!!!
> >>>
> >>> Regards,
> >>> Antero
> >>>
> >>> On Mon, 16 Jan 2017, 4:43 p.m. Bertrand Delacretaz, <
> >>> bdelacretaz@apache.org>
> >>> wrote:
> >>>
> >>>> On Mon, Jan 16, 2017 at 2:28 PM, Rafa Haro <rh...@apache.org> wrote:
> >>>>> ...I participated
> >>>>> also in the development of the Java client, so I could take the
> >>>>> responsibility of bringing that one as well...
> >>>>
> >>>> FWIW, I won't be involved in decisions about this as I left the
> >>>> Stanbol PMC a while ago - I'm just commenting from a community point
> >>>> of view, as an experienced Apache member.
> >>>>
> >>>> A while ago Stanbol was "larger" and more focused on its core, but as
> >>>> its community becomes smaller (IIUC) it's probably good to bring
> >>>> everyone here, as much as possible, even it it means a slightly less
> >>>> focused codebase. This can also help recruiting more active committers
> >>>> and PMC members by involving them directly here.
> >>>>
> >>>> That might make Stanbol more sustainable, as a community of people who
> >>>> need similar functionality.
> >>>>
> >>>> -Bertrand
> >>>>
> >>>
> >>
>
>

Re: The Future of Apache Stanbol

Posted by "A. Soroka" <aj...@virginia.edu>.
> I agree that the barrier to contribution is very high. I recall having issues with the documentation initially and the only
> available book on Stanbol was not sufficient.
> 
> If there was renewed interest in bringing on other developers I would be interested in investing the time to learn to codebase.

I second this!

Perhaps (I know it's very difficult to organize synchronous time for a globally-distributed group but perhaps) we could try to organize a boot camp meeting on-line? In other words, those people who (like Andrew and myself) would be willing to contribute as part of a larger effort could get some virtual time with one or more committers/PMC members to take an in-depth tour of the system from the developer point of view and hear about the outstanding architectural issues, maybe start to figure out points of contribution.

I realize this would make a lot of demands on the committers involved, but it might be a way to inject some fuel into the effort.

Just an idea...

---
A. Soroka
The University of Virginia Library

> On Jan 18, 2017, at 9:54 AM, Andrew Valencik <an...@affin.io> wrote:
> 
> Hello!
> 
> We use Stanbol in production to annotate text with entities as part of some
> of our data products.
> We do this via the REST API.
> Originally we were using the content hub to store all the documents but saw
> higher than expected failures.
> The entity engines seem a bit more resilient to varying content types /
> encoding.
> 
> I agree that the barrier to contribution is very high.
> I recall having issues with the documentation initially and the only
> available book on Stanbol was not sufficient.
> 
> If there was renewed interest in bringing on other developers I would be
> interested in investing the time to learn to codebase.
> 
> Thanks!
> 
> On Wed, Jan 18, 2017 at 5:53 AM Raffaele Palmieri <
> raffaele.palmieri@gmail.com> wrote:
> 
>> Dear community,
>> we are using 0.12 version with content hub. I find Stanbol very flexible to
>> enhance content, specially those unstructured ones.
>> For that regards connections with other projects, we have done some
>> thoughts about the use with Apache Marmotta with nosql backends for big
>> data scenarios. Also with Apache Manifold to implement enrichment of
>> existent document repositories, that's a common request.
>> Regards,
>> Raffaele.
>> 
>> 
>> 2017-01-16 22:41 GMT+01:00 Antero Duarte <a....@gmail.com>:
>> 
>>> Hi there,
>>> Stanbol is very useful for me! Greatest and easiest tool for us to do NLP
>>> and linked data. Has there been any discussion to move towards a nosql
>>> storage solution, or is solr still the best thing for us? Also, what
>> about
>>> upgrading solr? How much work would be involved in that? Anyway, great
>>> tool, really hope this doesn't die!!!
>>> 
>>> Regards,
>>> Antero
>>> 
>>> On Mon, 16 Jan 2017, 4:43 p.m. Bertrand Delacretaz, <
>>> bdelacretaz@apache.org>
>>> wrote:
>>> 
>>>> On Mon, Jan 16, 2017 at 2:28 PM, Rafa Haro <rh...@apache.org> wrote:
>>>>> ...I participated
>>>>> also in the development of the Java client, so I could take the
>>>>> responsibility of bringing that one as well...
>>>> 
>>>> FWIW, I won't be involved in decisions about this as I left the
>>>> Stanbol PMC a while ago - I'm just commenting from a community point
>>>> of view, as an experienced Apache member.
>>>> 
>>>> A while ago Stanbol was "larger" and more focused on its core, but as
>>>> its community becomes smaller (IIUC) it's probably good to bring
>>>> everyone here, as much as possible, even it it means a slightly less
>>>> focused codebase. This can also help recruiting more active committers
>>>> and PMC members by involving them directly here.
>>>> 
>>>> That might make Stanbol more sustainable, as a community of people who
>>>> need similar functionality.
>>>> 
>>>> -Bertrand
>>>> 
>>> 
>> 


Re: The Future of Apache Stanbol

Posted by Andrew Valencik <an...@affin.io>.
Hello!

We use Stanbol in production to annotate text with entities as part of some
of our data products.
We do this via the REST API.
Originally we were using the content hub to store all the documents but saw
higher than expected failures.
The entity engines seem a bit more resilient to varying content types /
encoding.

I agree that the barrier to contribution is very high.
I recall having issues with the documentation initially and the only
available book on Stanbol was not sufficient.

If there were renewed interest in bringing on other developers, I would be
interested in investing the time to learn the codebase.

Thanks!

On Wed, Jan 18, 2017 at 5:53 AM Raffaele Palmieri <
raffaele.palmieri@gmail.com> wrote:

> Dear community,
> we are using 0.12 version with content hub. I find Stanbol very flexible to
> enhance content, specially those unstructured ones.
> For that regards connections with other projects, we have done some
> thoughts about the use with Apache Marmotta with nosql backends for big
> data scenarios. Also with Apache Manifold to implement enrichment of
> existent document repositories, that's a common request.
> Regards,
> Raffaele.
>
>
> 2017-01-16 22:41 GMT+01:00 Antero Duarte <a....@gmail.com>:
>
> > Hi there,
> > Stanbol is very useful for me! Greatest and easiest tool for us to do NLP
> > and linked data. Has there been any discussion to move towards a nosql
> > storage solution, or is solr still the best thing for us? Also, what
> about
> > upgrading solr? How much work would be involved in that? Anyway, great
> > tool, really hope this doesn't die!!!
> >
> > Regards,
> > Antero
> >
> > On Mon, 16 Jan 2017, 4:43 p.m. Bertrand Delacretaz, <
> > bdelacretaz@apache.org>
> > wrote:
> >
> > > On Mon, Jan 16, 2017 at 2:28 PM, Rafa Haro <rh...@apache.org> wrote:
> > > > ...I participated
> > > > also in the development of the Java client, so I could take the
> > > > responsibility of bringing that one as well...
> > >
> > > FWIW, I won't be involved in decisions about this as I left the
> > > Stanbol PMC a while ago - I'm just commenting from a community point
> > > of view, as an experienced Apache member.
> > >
> > > A while ago Stanbol was "larger" and more focused on its core, but as
> > > its community becomes smaller (IIUC) it's probably good to bring
> > > everyone here, as much as possible, even it it means a slightly less
> > > focused codebase. This can also help recruiting more active committers
> > > and PMC members by involving them directly here.
> > >
> > > That might make Stanbol more sustainable, as a community of people who
> > > need similar functionality.
> > >
> > > -Bertrand
> > >
> >
>
-- 

Andrew Valencik

Data Scientist

Affinio <http://www.affinio.com> | Twitter <http://twitter.com/valencik> |
LinkedIn <https://www.linkedin.com/in/andrew-valencik-472b2aa4>

Discover your personal brand for free with Discovery by Affinio
<http://t.sidekickopen36.com/e1t/c/5/f18dQhb0S7lC8dDMPbW2n0x6l2B9nMJW7t5XZs2zGp1-W3Ljs1C5vfVNRVQJSG056dPwzdmbQgC02?t=http%3A%2F%2Fdiscovery.affinio.com%2F&si=5220160139427840&pi=8070b80d-4591-486c-891e-f9743f8a6c3f>
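
For readers who have not used the REST API mentioned in this message, here is a minimal sketch of submitting plain text to the Enhancer from Python. It assumes a default local Stanbol launcher serving the Enhancer at http://localhost:8080/enhancer; the host, port, helper names and chosen Accept type are illustrative assumptions, not details taken from this thread.

```python
import urllib.request

# Assumption: a local Stanbol launcher on its default port; adjust as needed.
ENHANCER_URL = "http://localhost:8080/enhancer"


def build_enhancer_request(text, url=ENHANCER_URL, accept="application/rdf+xml"):
    """Build (but do not send) a POST request that carries plain text."""
    return urllib.request.Request(
        url,
        data=text.encode("utf-8"),
        headers={
            "Content-Type": "text/plain; charset=utf-8",
            # Serialization requested for the enhancement results.
            "Accept": accept,
        },
        method="POST",
    )


def enhance(text, **kwargs):
    """Send the text to the Enhancer and return the raw response body."""
    request = build_enhancer_request(text, **kwargs)
    with urllib.request.urlopen(request, timeout=30) as response:
        return response.read().decode("utf-8")
```

With a Stanbol instance running, `enhance("Paris is the capital of France.")` would return the serialized enhancement results; parsing them (for example with an RDF library) is left out of this sketch.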

Re: The Future of Apache Stanbol

Posted by Raffaele Palmieri <ra...@gmail.com>.
Dear community,
we are using version 0.12 with the content hub. I find Stanbol very flexible
for enhancing content, especially unstructured content.
Regarding connections with other projects, we have given some thought to
using it with Apache Marmotta with NoSQL backends for big data scenarios,
and with Apache Manifold to enrich existing document repositories, which is
a common request.
Regards,
Raffaele.


2017-01-16 22:41 GMT+01:00 Antero Duarte <a....@gmail.com>:

> Hi there,
> Stanbol is very useful for me! Greatest and easiest tool for us to do NLP
> and linked data. Has there been any discussion to move towards a nosql
> storage solution, or is solr still the best thing for us? Also, what about
> upgrading solr? How much work would be involved in that? Anyway, great
> tool, really hope this doesn't die!!!
>
> Regards,
> Antero
>
> On Mon, 16 Jan 2017, 4:43 p.m. Bertrand Delacretaz, <
> bdelacretaz@apache.org>
> wrote:
>
> > On Mon, Jan 16, 2017 at 2:28 PM, Rafa Haro <rh...@apache.org> wrote:
> > > ...I participated
> > > also in the development of the Java client, so I could take the
> > > responsibility of bringing that one as well...
> >
> > FWIW, I won't be involved in decisions about this as I left the
> > Stanbol PMC a while ago - I'm just commenting from a community point
> > of view, as an experienced Apache member.
> >
> > A while ago Stanbol was "larger" and more focused on its core, but as
> > its community becomes smaller (IIUC) it's probably good to bring
> > everyone here, as much as possible, even it it means a slightly less
> > focused codebase. This can also help recruiting more active committers
> > and PMC members by involving them directly here.
> >
> > That might make Stanbol more sustainable, as a community of people who
> > need similar functionality.
> >
> > -Bertrand
> >
>

Re: The Future of Apache Stanbol

Posted by Antero Duarte <a....@gmail.com>.
Hi there,
Stanbol is very useful for me! It is the greatest and easiest tool for us to
do NLP and linked data. Has there been any discussion about moving towards a
NoSQL storage solution, or is Solr still the best thing for us? Also, what
about upgrading Solr? How much work would be involved in that? Anyway, great
tool, really hope this doesn't die!!!

Regards,
Antero

On Mon, 16 Jan 2017, 4:43 p.m. Bertrand Delacretaz, <bd...@apache.org>
wrote:

> On Mon, Jan 16, 2017 at 2:28 PM, Rafa Haro <rh...@apache.org> wrote:
> > ...I participated
> > also in the development of the Java client, so I could take the
> > responsibility of bringing that one as well...
>
> FWIW, I won't be involved in decisions about this as I left the
> Stanbol PMC a while ago - I'm just commenting from a community point
> of view, as an experienced Apache member.
>
> A while ago Stanbol was "larger" and more focused on its core, but as
> its community becomes smaller (IIUC) it's probably good to bring
> everyone here, as much as possible, even it it means a slightly less
> focused codebase. This can also help recruiting more active committers
> and PMC members by involving them directly here.
>
> That might make Stanbol more sustainable, as a community of people who
> need similar functionality.
>
> -Bertrand
>

Re: The Future of Apache Stanbol

Posted by Bertrand Delacretaz <bd...@apache.org>.
On Mon, Jan 16, 2017 at 2:28 PM, Rafa Haro <rh...@apache.org> wrote:
> ...I participated
> also in the development of the Java client, so I could take the
> responsibility of bringing that one as well...

FWIW, I won't be involved in decisions about this as I left the
Stanbol PMC a while ago - I'm just commenting from a community point
of view, as an experienced Apache member.

A while ago Stanbol was "larger" and more focused on its core, but as
its community becomes smaller (IIUC) it's probably good to bring
everyone here, as much as possible, even if it means a slightly less
focused codebase. This can also help recruit more active committers
and PMC members by involving them directly here.

That might make Stanbol more sustainable, as a community of people who
need similar functionality.

-Bertrand

Re: The Future of Apache Stanbol

Posted by Rafa Haro <rh...@apache.org>.
Hi Bertrand,

There were initiatives in the past to bring Stanbol REST API clients into
the codebase, but they were rejected at the time. There is also a Java one
that has been available for a long time and is licensed under the Apache
License. I will be more than happy to contribute the Python client. I
participated in the development of the Java client as well, so I could take
responsibility for bringing that one too.

Cheers,
Rafa

On Mon, Jan 16, 2017 at 2:21 PM Bertrand Delacretaz <bd...@apache.org>
wrote:

> Hi,
>
> On Mon, Jan 16, 2017 at 11:48 AM, Rafa Haro <rh...@apache.org> wrote:
> > ...I started to develop one: https://github.com/rafaharo/pystanbol ...
>
> How about bringing this to Apache Stanbol?
>
> As this is a small community, getting it all together here might make
> it more sustainable, as opposed to fragmented efforts.
>
> -Bertrand
>

Re: The Future of Apache Stanbol

Posted by Bertrand Delacretaz <bd...@apache.org>.
Hi,

On Mon, Jan 16, 2017 at 11:48 AM, Rafa Haro <rh...@apache.org> wrote:
> ...I started to develop one: https://github.com/rafaharo/pystanbol ...

How about bringing this to Apache Stanbol?

As this is a small community, getting it all together here might make
it more sustainable, as opposed to fragmented efforts.

-Bertrand

RE: The Future of Apache Stanbol

Posted by ar...@wipro.com.
Thanks Rafa.
I will check it out.

Thanks and Regards,
Arthi


-----Original Message-----
From: Rafa Haro [mailto:rharo@apache.org]
Sent: Monday, January 16, 2017 4:19 PM
To: dev@stanbol.apache.org
Subject: Re: The Future of Apache Stanbol

Hi Arthi,

I started to develop one: https://github.com/rafaharo/pystanbol

It only covers Enhancer for now. Contributions are more than welcome.

Rafa

On Mon, Jan 16, 2017 at 11:41 AM <ar...@wipro.com> wrote:

> Hi,
>  Stanbol is a great solution for entity extraction and many NLP problems.
> I have used it for different pilots and customer implementations.
> I also plan to use same in future.
> The community is also great and very helpful.
> If there is a way for non Java programmer  say a Python programmer to
> more easily set up and consume the Rest services more folks can use.
>
> Thanks and Regards,
> Arthi
>
>
>
> -----Original Message-----
> From: Bertrand Delacretaz [mailto:bdelacretaz@apache.org]
> Sent: Monday, January 16, 2017 3:40 PM
> To: dev@stanbol.apache.org
> Subject: Re: The Future of Apache Stanbol
>
> Hi,
>
> On Mon, Jan 16, 2017 at 9:33 AM, Fabian Christ <
> christ.fabian@googlemail.com> wrote:
> > ...Maybe the time for Stanbol is over after 5 years of being a top
> > level Apache project. The ASF has the concept of moving projects to
> > the attic once there is not enough interest or community for a
> > project anymore....
>
> To be precise, an ASF project has to move to Attic if there are less
> than
> 3 active PMC members, which is the minimum required to vote on releases.
>
> Or if the project is unable to respond to security or other critical
> bug reports, due to lack of available contributors.
>
> It's also fine to move to Attic voluntarily if people think the above
> criteria might not be met for much longer, of course - I just wanted
> to clarify the requirements.
>
> -Bertrand
>
The information contained in this electronic message and any attachments to this message are intended for the exclusive use of the addressee(s) and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you should not disseminate, distribute or copy this e-mail. Please notify the sender immediately and destroy all copies of this message and any attachments. WARNING: Computer viruses can be transmitted via email. The recipient should check this email and any attachments for the presence of viruses. The company accepts no liability for any damage caused by any virus transmitted by this email. www.wipro.com

Re: The Future of Apache Stanbol

Posted by Rafa Haro <rh...@apache.org>.
Hi Arthi,

I started to develop one: https://github.com/rafaharo/pystanbol

It only covers Enhancer for now. Contributions are more than welcome.

Rafa

On Mon, Jan 16, 2017 at 11:41 AM <ar...@wipro.com> wrote:

> Hi,
>  Stanbol is a great solution for entity extraction and many NLP problems.
> I have used it for different pilots and customer implementations.
> I also plan to use same in future.
> The community is also great and very helpful.
> If there is a way for non Java programmer  say a Python programmer to more
> easily set up and consume the Rest services more folks can use.
>
> Thanks and Regards,
> Arthi
>
>
>
> -----Original Message-----
> From: Bertrand Delacretaz [mailto:bdelacretaz@apache.org]
> Sent: Monday, January 16, 2017 3:40 PM
> To: dev@stanbol.apache.org
> Subject: Re: The Future of Apache Stanbol
>
> Hi,
>
> On Mon, Jan 16, 2017 at 9:33 AM, Fabian Christ <
> christ.fabian@googlemail.com> wrote:
> > ...Maybe the time for Stanbol is over after 5 years of being a top
> > level Apache project. The ASF has the concept of moving projects to
> > the attic once there is not enough interest or community for a project
> > anymore....
>
> To be precise, an ASF project has to move to Attic if there are less than
> 3 active PMC members, which is the minimum required to vote on releases.
>
> Or if the project is unable to respond to security or other critical bug
> reports, due to lack of available contributors.
>
> It's also fine to move to Attic voluntarily if people think the above
> criteria might not be met for much longer, of course - I just wanted to
> clarify the requirements.
>
> -Bertrand
>

RE: The Future of Apache Stanbol

Posted by ar...@wipro.com.
Hi,
Stanbol is a great solution for entity extraction and many NLP problems. I have used it for different pilots and customer implementations.
I also plan to use it in the future.
The community is also great and very helpful.
If there were a way for non-Java programmers, say Python programmers, to more easily set up and consume the REST services, more people could use it.

Thanks and Regards,
Arthi



-----Original Message-----
From: Bertrand Delacretaz [mailto:bdelacretaz@apache.org]
Sent: Monday, January 16, 2017 3:40 PM
To: dev@stanbol.apache.org
Subject: Re: The Future of Apache Stanbol

Hi,

On Mon, Jan 16, 2017 at 9:33 AM, Fabian Christ <ch...@googlemail.com> wrote:
> ...Maybe the time for Stanbol is over after 5 years of being a top
> level Apache project. The ASF has the concept of moving projects to
> the attic once there is not enough interest or community for a project
> anymore....

To be precise, an ASF project has to move to Attic if there are less than 3 active PMC members, which is the minimum required to vote on releases.

Or if the project is unable to respond to security or other critical bug reports, due to lack of available contributors.

It's also fine to move to Attic voluntarily if people think the above criteria might not be met for much longer, of course - I just wanted to clarify the requirements.

-Bertrand
The information contained in this electronic message and any attachments to this message are intended for the exclusive use of the addressee(s) and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you should not disseminate, distribute or copy this e-mail. Please notify the sender immediately and destroy all copies of this message and any attachments. WARNING: Computer viruses can be transmitted via email. The recipient should check this email and any attachments for the presence of viruses. The company accepts no liability for any damage caused by any virus transmitted by this email. www.wipro.com

Re: The Future of Apache Stanbol

Posted by Bertrand Delacretaz <bd...@apache.org>.
Hi,

On Mon, Jan 16, 2017 at 9:33 AM, Fabian Christ
<ch...@googlemail.com> wrote:
> ...Maybe the time for Stanbol is over after 5 years of being a top level
> Apache project. The ASF has the concept of moving projects to the
> attic once there is not enough interest or community for a project
> anymore....

To be precise, an ASF project has to move to Attic if there are fewer
than 3 active PMC members, which is the minimum required to vote on
releases.

Or if the project is unable to respond to security or other critical
bug reports, due to lack of available contributors.

It's also fine to move to Attic voluntarily if people think the above
criteria might not be met for much longer, of course - I just wanted
to clarify the requirements.

-Bertrand