You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@stanbol.apache.org by Phillip Rhodes <mo...@gmail.com> on 2017/11/08 18:21:20 UTC

Re: The Future of Apache Stanbol

Stanbol is crucially important to us at Fogbeam Labs, and I have
intended for some time to get more involved in the project.  Time has
been the limiting factor, but I will try to find a way to carve out
some time now.  Especially now that we're wrapping up some other work
that was sort of pre-requisite to the stuff we're doing with Stanbol.


Phil

This message optimized for indexing by NSA PRISM


On Mon, Mar 20, 2017 at 4:43 AM, Fabian Christ
<ch...@googlemail.com> wrote:
> Hi Stanbolers,
>
> thanks for all the replies. This looks really promising and is a good
> sign that so much interest still exists in using the Stanbol
> technology.
>
> Now, we have to figure out a way to get everyone on board to start
> contributing and taking responsibility for the project.
>
> Best
> Fabian
>
> 2017-02-27 13:14 GMT+01:00 Andrea Nuzzolese <an...@istc.cnr.it>:
>> Hi all,
>>
>> first, sorry for the late reply, nevertheless I hope this email could be helpful to the discussion.
>> At the Semantic Technology Laboratory of the CNR in Italy we have been using Apache Stanbol since it was born and we are still using it in different research projects dealing with semantic enhancement of content.
>>
>> Additionally, few months ago we founded a start-up (basically a spin-off of the CNR) which re-uses many parts of the Stanbol stack within its products.
>> Hence, as a start-up we would be really interested in the future of Apache Stanbol and we can put some effort by providing some additional developers to contribute to its evolution.
>>
>> —
>> Andrea Giovanni Nuzzolese, Ph.D.
>> Semantic Technology Laboratory (STLab)
>> Institute for Cognitive Science and Technology (ISTC)
>> National Research Council (CNR)
>> Via Nomentana 56, Roma - Italy
>>
>>> On 24 Feb 2017, at 16:08, A. Soroka <aj...@virginia.edu> wrote:
>>>
>>> Yes, it would certainly be best to have more than one opportunity, with such a global group. If you are not available on Monday, Rafa, let's not push for that date. Let's find a date to start with that is good for both you and Rupert.
>>>
>>> Rupert?
>>>
>>> ---
>>> A. Soroka
>>> The University of Virginia Library
>>>
>>>> On Feb 24, 2017, at 10:00 AM, Rafa Haro <rh...@apache.org> wrote:
>>>>
>>>> Hi,
>>>>
>>>> I wouldn't be available next Monday, but maybe we can organize more than
>>>> one :-). I insist, in order to make any sense out of this we need @Rupert
>>>> to join these meetings. He is, by far, the most experienced Stanbol
>>>> developer and the one with the broader knowledge. Please @Rupert, raise
>>>> your voice man :-)
>>>>
>>>> Cheers,
>>>> Rafa
>>>>
>>>> On Fri, Feb 24, 2017 at 3:33 PM Michal Krajňanský <
>>>> michal.krajnansky@gmail.com> wrote:
>>>>
>>>>> Hi Mr. Soroka,
>>>>>
>>>>> Thank you for quick reply and direct call proposal. It will be my pleasure
>>>>> to meeting you virtually.
>>>>>
>>>>> The suggested time Monday 27th 10AM EST works well, so you may consider it
>>>>> settled. I will try to get involved also the managing director of our
>>>>> department located in Brno, Czech Republic, Matej Dusik.
>>>>>
>>>>> I am looking forward to having a fruitful discussion.
>>>>>
>>>>> Best Regards,
>>>>>
>>>>> Michal Krajnansky
>>>>>
>>>>> On Fri, Feb 24, 2017 at 3:22 PM A. Soroka <aj...@virginia.edu> wrote:
>>>>>
>>>>>> We had a few volunteers to begin learning the codebase to take it
>>>>> forward,
>>>>>> and willingness from at least some of the current committers to teach,
>>>>> but
>>>>>> I think we need a bit more organization! :grin:
>>>>>>
>>>>>> I will make a concrete suggestion. Would it be possible to have a video
>>>>>> call (perhaps with Google Hangout) about this on (just throwing out a
>>>>> date
>>>>>> here) this coming week, Monday 27 February, at 10AM EST?
>>>>>>
>>>>>> I would be able to attend. I realize that we are a far-flung group, so
>>>>> I'm
>>>>>> just throwing out that date to get us started. The most important thing,
>>>>> of
>>>>>> course, is to get as many current committers involved as is practical.
>>>>>>
>>>>>> ---
>>>>>> A. Soroka
>>>>>> The University of Virginia Library
>>>>>>
>>>>>>> On Feb 24, 2017, at 8:50 AM, Michal Krajňanský <
>>>>>> michal.krajnansky@gmail.com> wrote:
>>>>>>>
>>>>>>> Dear Stanbol users,
>>>>>>>
>>>>>>> I was wondering, if there were any results of the discussion about the
>>>>>>> Apache Stanbol future.
>>>>>>>
>>>>>>> I work for a R&D team of Konica Minolta Laboratory Europe, and we have
>>>>>> been
>>>>>>> using Stanbol enhancement pipeline in our prototypes concerning
>>>>>> information
>>>>>>> extraction from unstructured data.
>>>>>>>
>>>>>>> We are highly interested in the continuing evolutoin of the Stanbol
>>>>>>> project, and willing to actively support it. Is here anyone who could
>>>>>> tell
>>>>>>> us, what would be a good way to approach the existing Stanbol
>>>>>> stakeholders,
>>>>>>> and support the project by active development and possibly via other
>>>>>> ways?
>>>>>>>
>>>>>>>
>>>>>>> Michal Krajnansky
>>>>>>> Research Specialist Junior, Konica Minolta Laboratory Europe
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>> On Mon, Jan 23, 2017 at 6:14 PM A. Soroka <aj...@virginia.edu> wrote:
>>>>>>>
>>>>>>>> Perhaps we can start a page for people to put their name down for
>>>>> this?
>>>>>> I
>>>>>>>> couldn't find an Apache wiki site-- is there somewhere the developers
>>>>>> think
>>>>>>>> appropriate?
>>>>>>>>
>>>>>>>> ---
>>>>>>>> A. Soroka
>>>>>>>> Apache Jena / The University of Virginia Library
>>>>>>>>
>>>>>>>>> On Jan 23, 2017, at 12:08 PM, Aaron Coburn <ac...@amherst.edu>
>>>>>> wrote:
>>>>>>>>>
>>>>>>>>> I would also be very interested.
>>>>>>>>>
>>>>>>>>> We use the entityhub component quite a bit at our institution, and I
>>>>>>>> would be happy to be involved.
>>>>>>>>>
>>>>>>>>> Aaron Coburn
>>>>>>>>>
>>>>>>>>>
>>>>>>>>>> On Jan 22, 2017, at 12:30 PM, Antero Duarte <a....@gmail.com>
>>>>>>>> wrote:
>>>>>>>>>>
>>>>>>>>>> I would definitely be up for that!
>>>>>>>>>>
>>>>>>>>>> On Sat, 21 Jan 2017, 3:33 p.m. A. Soroka, <aj...@virginia.edu>
>>>>> wrote:
>>>>>>>>>>
>>>>>>>>>>> Ditto. If there are other folks who would be interested in
>>>>>>>> participating
>>>>>>>>>>> in something like this, now would be a good time to raise your
>>>>> voice!
>>>>>>>>>>>
>>>>>>>>>>> ---
>>>>>>>>>>> A. Soroka
>>>>>>>>>>> The University of Virginia Library
>>>>>>>>>>>
>>>>>>>>>>>
>>>>>>>>>>>> On Jan 21, 2017, at 10:20 AM, Andrew Valencik <an...@affin.io>
>>>>>>>> wrote:
>>>>>>>>>>>>
>>>>>>>>>>>> I think the "mandatory" documentation contribution is a great idea
>>>>>>>> (and
>>>>>>>>>>>> very reasonable). I'd be happy to partake!
>>>>>>>>>>>> On Wed, Jan 18, 2017 at 11:13 AM Rafa Haro <rh...@apache.org>
>>>>>> wrote:
>>>>>>>>>>>>
>>>>>>>>>>>>> I wouldn't mind to be involved in that but it would be almost
>>>>>>>>>>> "mandatory"
>>>>>>>>>>>>> to contribute some Developer Documentation as outcome of those
>>>>>>>> meetings
>>>>>>>>>>>>> :-). @Rupert we specially need you here :-)
>>>>>>>>>>>>>
>>>>>>>>>>>>> On Wed, Jan 18, 2017 at 4:09 PM A. Soroka <aj...@virginia.edu>
>>>>>>>> wrote:
>>>>>>>>>>>>>
>>>>>>>>>>>>>>> I agree that the barrier to contribution is very high. I recall
>>>>>>>> having
>>>>>>>>>>>>>> issues with the documentation initially and the only
>>>>>>>>>>>>>>> available book on Stanbol was not sufficient.
>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>> If there was renewed interest in bringing on other developers I
>>>>>>>> would
>>>>>>>>>>>>> be
>>>>>>>>>>>>>> interested in investing the time to learn to codebase.
>>>>>>>>>>>>>>
>>>>>>>>>>>>>> I second this!
>>>>>>>>>>>>>>
>>>>>>>>>>>>>> Perhaps (I know it's very difficult to organize synchronous time
>>>>>>>> for a
>>>>>>>>>>>>>> globally-distributed group but perhaps) we could try to
>>>>> organize a
>>>>>>>> boot
>>>>>>>>>>>>>> camp meeting on-line? In other words, those people who (like
>>>>>> Andrew
>>>>>>>> and
>>>>>>>>>>>>>> myself) would be willing to contribute as part of a larger
>>>>> effort
>>>>>>>> could
>>>>>>>>>>>>> get
>>>>>>>>>>>>>> some virtual time with one or more committers/PMC members to
>>>>> take
>>>>>> an
>>>>>>>>>>>>>> in-depth tour of the system from the developer point of view and
>>>>>>>> hear
>>>>>>>>>>>>> about
>>>>>>>>>>>>>> the outstanding architectural issues, maybe start to figure out
>>>>>>>> points
>>>>>>>>>>> of
>>>>>>>>>>>>>> contribution.
>>>>>>>>>>>>>>
>>>>>>>>>>>>>> I realize this would make a lot of demands on the committers
>>>>>>>> involved,
>>>>>>>>>>>>> but
>>>>>>>>>>>>>> it might be a way to inject some fuel into the effort.
>>>>>>>>>>>>>>
>>>>>>>>>>>>>> Just an idea...
>>>>>>>>>>>>>>
>>>>>>>>>>>>>> ---
>>>>>>>>>>>>>> A. Soroka
>>>>>>>>>>>>>> The University of Virginia Library
>>>>>>>>>>>>>>
>>>>>>>>>>>>>>> On Jan 18, 2017, at 9:54 AM, Andrew Valencik <an...@affin.io>
>>>>>>>> wrote:
>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>> Hello!
>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>> We use Stanbol in production to annotate text with entities as
>>>>>>>> part of
>>>>>>>>>>>>>> some
>>>>>>>>>>>>>>> of our data products.
>>>>>>>>>>>>>>> We do this via the REST API.
>>>>>>>>>>>>>>> Originally we were using the content hub to store all the
>>>>>> documents
>>>>>>>>>>> but
>>>>>>>>>>>>>> saw
>>>>>>>>>>>>>>> higher than expected failures.
>>>>>>>>>>>>>>> The entity engines seem a bit more resilient to varying content
>>>>>>>> types
>>>>>>>>>>> /
>>>>>>>>>>>>>>> encoding.
>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>> I agree that the barrier to contribution is very high.
>>>>>>>>>>>>>>> I recall having issues with the documentation initially and the
>>>>>>>> only
>>>>>>>>>>>>>>> available book on Stanbol was not sufficient.
>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>> If there was renewed interest in bringing on other developers I
>>>>>>>> would
>>>>>>>>>>>>> be
>>>>>>>>>>>>>>> interested in investing the time to learn to codebase.
>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>> Thanks!
>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>> On Wed, Jan 18, 2017 at 5:53 AM Raffaele Palmieri <
>>>>>>>>>>>>>>> raffaele.palmieri@gmail.com> wrote:
>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>> Dear community,
>>>>>>>>>>>>>>>> we are using 0.12 version with content hub. I find Stanbol
>>>>> very
>>>>>>>>>>>>>> flexible to
>>>>>>>>>>>>>>>> enhance content, specially those unstructured ones.
>>>>>>>>>>>>>>>> For that regards connections with other projects, we have done
>>>>>>>> some
>>>>>>>>>>>>>>>> thoughts about the use with Apache Marmotta with nosql
>>>>> backends
>>>>>>>> for
>>>>>>>>>>>>> big
>>>>>>>>>>>>>>>> data scenarios. Also with Apache Manifold to implement
>>>>>> enrichment
>>>>>>>> of
>>>>>>>>>>>>>>>> existent document repositories, that's a common request.
>>>>>>>>>>>>>>>> Regards,
>>>>>>>>>>>>>>>> Raffaele.
>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>> 2017-01-16 22:41 GMT+01:00 Antero Duarte <
>>>>> a.fduarte1@gmail.com
>>>>>>> :
>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>> Hi there,
>>>>>>>>>>>>>>>>> Stanbol is very useful for me! Greatest and easiest tool for
>>>>> us
>>>>>>>> to
>>>>>>>>>>> do
>>>>>>>>>>>>>> NLP
>>>>>>>>>>>>>>>>> and linked data. Has there been any discussion to move
>>>>> towards
>>>>>> a
>>>>>>>>>>>>> nosql
>>>>>>>>>>>>>>>>> storage solution, or is solr still the best thing for us?
>>>>> Also,
>>>>>>>> what
>>>>>>>>>>>>>>>> about
>>>>>>>>>>>>>>>>> upgrading solr? How much work would be involved in that?
>>>>>> Anyway,
>>>>>>>>>>>>> great
>>>>>>>>>>>>>>>>> tool, really hope this doesn't die!!!
>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>> Regards,
>>>>>>>>>>>>>>>>> Antero
>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>> On Mon, 16 Jan 2017, 4:43 p.m. Bertrand Delacretaz, <
>>>>>>>>>>>>>>>>> bdelacretaz@apache.org>
>>>>>>>>>>>>>>>>> wrote:
>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>> On Mon, Jan 16, 2017 at 2:28 PM, Rafa Haro <
>>>>> rharo@apache.org>
>>>>>>>>>>>>> wrote:
>>>>>>>>>>>>>>>>>>> ...I participated
>>>>>>>>>>>>>>>>>>> also in the development of the Java client, so I could take
>>>>>> the
>>>>>>>>>>>>>>>>>>> responsibility of bringing that one as well...
>>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>> FWIW, I won't be involved in decisions about this as I left
>>>>>> the
>>>>>>>>>>>>>>>>>> Stanbol PMC a while ago - I'm just commenting from a
>>>>> community
>>>>>>>>>>> point
>>>>>>>>>>>>>>>>>> of view, as an experienced Apache member.
>>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>> A while ago Stanbol was "larger" and more focused on its
>>>>> core,
>>>>>>>> but
>>>>>>>>>>>>> as
>>>>>>>>>>>>>>>>>> its community becomes smaller (IIUC) it's probably good to
>>>>>> bring
>>>>>>>>>>>>>>>>>> everyone here, as much as possible, even it it means a
>>>>>> slightly
>>>>>>>>>>> less
>>>>>>>>>>>>>>>>>> focused codebase. This can also help recruiting more active
>>>>>>>>>>>>> committers
>>>>>>>>>>>>>>>>>> and PMC members by involving them directly here.
>>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>> That might make Stanbol more sustainable, as a community of
>>>>>>>> people
>>>>>>>>>>>>> who
>>>>>>>>>>>>>>>>>> need similar functionality.
>>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>> -Bertrand
>>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>> --
>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>> Andrew Valencik
>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>> Data Scientist
>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>> Affinio <http://www.affinio.com> | Twitter <
>>>>>>>>>>>>> http://twitter.com/valencik>
>>>>>>>>>>>>>> |
>>>>>>>>>>>>>>> LinkedIn <https://www.linkedin.com/in/andrew-valencik-472b2aa4
>>>>>>
>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>> Discover your personal brand for free with Discovery by Affinio
>>>>>>>>>>>>>>> <
>>>>>>>>>>>>>>
>>>>>>>>>>>>>
>>>>>>>>>>>
>>>>>>>>
>>>>>>
>>>>> http://t.sidekickopen36.com/e1t/c/5/f18dQhb0S7lC8dDMPbW2n0x6l2B9nMJW7t5XZs2zGp1-W3Ljs1C5vfVNRVQJSG056dPwzdmbQgC02?t=http%3A%2F%2Fdiscovery.affinio.com%2F&si=5220160139427840&pi=8070b80d-4591-486c-891e-f9743f8a6c3f
>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>
>>>>>>>>>>>>>>
>>>>>>>>>>>>>
>>>>>>>>>>>> --
>>>>>>>>>>>>
>>>>>>>>>>>> Andrew Valencik
>>>>>>>>>>>>
>>>>>>>>>>>> Data Scientist
>>>>>>>>>>>>
>>>>>>>>>>>> Affinio <http://www.affinio.com> | Twitter <
>>>>>>>> http://twitter.com/valencik>
>>>>>>>>>>> |
>>>>>>>>>>>> LinkedIn <https://www.linkedin.com/in/andrew-valencik-472b2aa4>
>>>>>>>>>>>>
>>>>>>>>>>>> Discover your personal brand for free with Discovery by Affinio
>>>>>>>>>>>> <
>>>>>>>>>>>
>>>>>>>>
>>>>>>
>>>>> http://t.sidekickopen36.com/e1t/c/5/f18dQhb0S7lC8dDMPbW2n0x6l2B9nMJW7t5XZs2zGp1-W3Ljs1C5vfVNRVQJSG056dPwzdmbQgC02?t=http%3A%2F%2Fdiscovery.affinio.com%2F&si=5220160139427840&pi=8070b80d-4591-486c-891e-f9743f8a6c3f
>>>>>>>>>>>>
>>>>>>>>>>>
>>>>>>>>>>>
>>>>>>>>>>>
>>>>>>>>>>>
>>>>>>>>>
>>>>>>>>
>>>>>>>>
>>>>>>>>
>>>>>>>>
>>>>>>>>
>>>>>>>>
>>>>>>
>>>>>>
>>>>>
>>>
>>
>
>
>
> --
> Fabian
> http://twitter.com/fctwitt