You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@ctakes.apache.org by Benjamin hansen <be...@gmail.com> on 2021/07/29 07:24:35 UTC

ctakes activity gauge

Hi all,


I hope its okay i send this mail here - i was not sure where else to pose
my question.


We are considering to use cTakes in our applications - however I got
concerned when i saw the lack of activity in the github repository which is
why I want to ask -



Is cTakes still being actively developed and maintained or have developers
gone to develop other systems instead?


Is there any kind of up 2 date roadmap for the activity development of
cTakes?


What is the cTakes userbase like these days? Is it growing?, dwindling?
stable? non-existent?




Thanks in advance.

Re: ctakes activity gauge

Posted by gandhi rajan <ga...@gmail.com>.
You are right Peter. The latest changes are getting into Apache SVN
repository and this repo is very much active according to me.

On Thu, Jul 29, 2021 at 9:51 PM Peter Abramowitsch <pa...@gmail.com>
wrote:

> Hi Ben,
>
> I can only speak for myself, but I am using cTakes extensively at two major
> California Universities in multiple projects. The kind of customizations I
> am doing are mostly specific to the facility and to the project and
> therefore wouldn't be for inclusion in the source repository.    We have
> just finished using it to extract concepts from 102 million notes
>
> Unless I am wrong, updates are going into the Apache SVN repository and
> Github has acted as a backup repo.  It's true that there isn't a well
> organized update bugfix & release team and schedule.  Others can speak to
> that too, but I would suspect that part of this is due to the fact that the
> core is very stable and most modifications and enhancements are, like mine,
> local to a project.    However, there has been talk but not much definitive
> action on two initiatives - to upgrade to the current version of UIMA and
> to include the Ruta engine as one of the pluggable components.
>
> The user base is fairly substantial for a project this specialized.  I
> suggest you have a look at the presentations at the cTakes thread of the
> 2020 ApacheCon conference.
>
> You'll have to search through this list:
> https://www.youtube.com/playlist?list=PLU2OcwpQkYCy_awEe5xwlxGTk5UieA37m
>
> By all means, come and join us!
>
> Regards, Peter
>
>
>
> On Thu, Jul 29, 2021 at 12:24 AM Benjamin hansen <
> benjaminkakkesen@gmail.com>
> wrote:
>
> > Hi all,
> >
> >
> > I hope its okay i send this mail here - i was not sure where else to pose
> > my question.
> >
> >
> > We are considering to use cTakes in our applications - however I got
> > concerned when i saw the lack of activity in the github repository which
> is
> > why I want to ask -
> >
> >
> >
> > Is cTakes still being actively developed and maintained or have
> developers
> > gone to develop other systems instead?
> >
> >
> > Is there any kind of up 2 date roadmap for the activity development of
> > cTakes?
> >
> >
> > What is the cTakes userbase like these days? Is it growing?, dwindling?
> > stable? non-existent?
> >
> >
> >
> >
> > Thanks in advance.
> >
>


-- 
Regards,
Gandhi

"The best way to find urself is to lose urself in the service of others !!!"

Re: ctakes activity gauge

Posted by Peter Abramowitsch <pa...@gmail.com>.
Hi Ben
If you watch my presentation from ApacheCon you'll see how we went about
mass extraction of notes.   This video contains two presentations and mine
starts about halfway through:   https://www.youtube.com/watch?v=F5WCCPWz7Z0

But in the same conference thread, there were two other groups working on
similar projects, but using different approaches.
here's one of them  https://www.youtube.com/watch?v=kZw42pGzyHs

Peter

On Thu, Jul 29, 2021 at 11:25 AM Benjamin hansen <be...@gmail.com>
wrote:

> Thank you for these insights Peter.
> Your project sounds very interesting. Are you using the uima pipeline on a
> cluster to process that many notes? And how long does it take?
>
> I have been considering to use uimafit+ctakes together with apache spark
> for distributed computations. I saw a video from Philip Ogren from 2014
> describing this - but unfortunately he does not give any details on how he
> did this.
> Would you by any chance know where i can find more information about how to
> achieve this?
>
> Best regards
>
> On Thu, Jul 29, 2021 at 6:21 PM Peter Abramowitsch <
> pabramowitsch@gmail.com>
> wrote:
>
> > Hi Ben,
> >
> > I can only speak for myself, but I am using cTakes extensively at two
> major
> > California Universities in multiple projects. The kind of customizations
> I
> > am doing are mostly specific to the facility and to the project and
> > therefore wouldn't be for inclusion in the source repository.    We have
> > just finished using it to extract concepts from 102 million notes
> >
> > Unless I am wrong, updates are going into the Apache SVN repository and
> > Github has acted as a backup repo.  It's true that there isn't a well
> > organized update bugfix & release team and schedule.  Others can speak to
> > that too, but I would suspect that part of this is due to the fact that
> the
> > core is very stable and most modifications and enhancements are, like
> mine,
> > local to a project.    However, there has been talk but not much
> definitive
> > action on two initiatives - to upgrade to the current version of UIMA and
> > to include the Ruta engine as one of the pluggable components.
> >
> > The user base is fairly substantial for a project this specialized.  I
> > suggest you have a look at the presentations at the cTakes thread of the
> > 2020 ApacheCon conference.
> >
> > You'll have to search through this list:
> > https://www.youtube.com/playlist?list=PLU2OcwpQkYCy_awEe5xwlxGTk5UieA37m
> >
> > By all means, come and join us!
> >
> > Regards, Peter
> >
> >
> >
> > On Thu, Jul 29, 2021 at 12:24 AM Benjamin hansen <
> > benjaminkakkesen@gmail.com>
> > wrote:
> >
> > > Hi all,
> > >
> > >
> > > I hope its okay i send this mail here - i was not sure where else to
> pose
> > > my question.
> > >
> > >
> > > We are considering to use cTakes in our applications - however I got
> > > concerned when i saw the lack of activity in the github repository
> which
> > is
> > > why I want to ask -
> > >
> > >
> > >
> > > Is cTakes still being actively developed and maintained or have
> > developers
> > > gone to develop other systems instead?
> > >
> > >
> > > Is there any kind of up 2 date roadmap for the activity development of
> > > cTakes?
> > >
> > >
> > > What is the cTakes userbase like these days? Is it growing?, dwindling?
> > > stable? non-existent?
> > >
> > >
> > >
> > >
> > > Thanks in advance.
> > >
> >
>

Re: ctakes activity gauge

Posted by Benjamin hansen <be...@gmail.com>.
Thank you for these insights Peter.
Your project sounds very interesting. Are you using the uima pipeline on a
cluster to process that many notes? And how long does it take?

I have been considering to use uimafit+ctakes together with apache spark
for distributed computations. I saw a video from Philip Ogren from 2014
describing this - but unfortunately he does not give any details on how he
did this.
Would you by any chance know where i can find more information about how to
achieve this?

Best regards

On Thu, Jul 29, 2021 at 6:21 PM Peter Abramowitsch <pa...@gmail.com>
wrote:

> Hi Ben,
>
> I can only speak for myself, but I am using cTakes extensively at two major
> California Universities in multiple projects. The kind of customizations I
> am doing are mostly specific to the facility and to the project and
> therefore wouldn't be for inclusion in the source repository.    We have
> just finished using it to extract concepts from 102 million notes
>
> Unless I am wrong, updates are going into the Apache SVN repository and
> Github has acted as a backup repo.  It's true that there isn't a well
> organized update bugfix & release team and schedule.  Others can speak to
> that too, but I would suspect that part of this is due to the fact that the
> core is very stable and most modifications and enhancements are, like mine,
> local to a project.    However, there has been talk but not much definitive
> action on two initiatives - to upgrade to the current version of UIMA and
> to include the Ruta engine as one of the pluggable components.
>
> The user base is fairly substantial for a project this specialized.  I
> suggest you have a look at the presentations at the cTakes thread of the
> 2020 ApacheCon conference.
>
> You'll have to search through this list:
> https://www.youtube.com/playlist?list=PLU2OcwpQkYCy_awEe5xwlxGTk5UieA37m
>
> By all means, come and join us!
>
> Regards, Peter
>
>
>
> On Thu, Jul 29, 2021 at 12:24 AM Benjamin hansen <
> benjaminkakkesen@gmail.com>
> wrote:
>
> > Hi all,
> >
> >
> > I hope its okay i send this mail here - i was not sure where else to pose
> > my question.
> >
> >
> > We are considering to use cTakes in our applications - however I got
> > concerned when i saw the lack of activity in the github repository which
> is
> > why I want to ask -
> >
> >
> >
> > Is cTakes still being actively developed and maintained or have
> developers
> > gone to develop other systems instead?
> >
> >
> > Is there any kind of up 2 date roadmap for the activity development of
> > cTakes?
> >
> >
> > What is the cTakes userbase like these days? Is it growing?, dwindling?
> > stable? non-existent?
> >
> >
> >
> >
> > Thanks in advance.
> >
>

Re: ctakes activity gauge

Posted by Peter Abramowitsch <pa...@gmail.com>.
Hi Ben,

I can only speak for myself, but I am using cTakes extensively at two major
California Universities in multiple projects. The kind of customizations I
am doing are mostly specific to the facility and to the project and
therefore wouldn't be for inclusion in the source repository.    We have
just finished using it to extract concepts from 102 million notes

Unless I am wrong, updates are going into the Apache SVN repository and
Github has acted as a backup repo.  It's true that there isn't a well
organized update bugfix & release team and schedule.  Others can speak to
that too, but I would suspect that part of this is due to the fact that the
core is very stable and most modifications and enhancements are, like mine,
local to a project.    However, there has been talk but not much definitive
action on two initiatives - to upgrade to the current version of UIMA and
to include the Ruta engine as one of the pluggable components.

The user base is fairly substantial for a project this specialized.  I
suggest you have a look at the presentations at the cTakes thread of the
2020 ApacheCon conference.

You'll have to search through this list:
https://www.youtube.com/playlist?list=PLU2OcwpQkYCy_awEe5xwlxGTk5UieA37m

By all means, come and join us!

Regards, Peter



On Thu, Jul 29, 2021 at 12:24 AM Benjamin hansen <be...@gmail.com>
wrote:

> Hi all,
>
>
> I hope its okay i send this mail here - i was not sure where else to pose
> my question.
>
>
> We are considering to use cTakes in our applications - however I got
> concerned when i saw the lack of activity in the github repository which is
> why I want to ask -
>
>
>
> Is cTakes still being actively developed and maintained or have developers
> gone to develop other systems instead?
>
>
> Is there any kind of up 2 date roadmap for the activity development of
> cTakes?
>
>
> What is the cTakes userbase like these days? Is it growing?, dwindling?
> stable? non-existent?
>
>
>
>
> Thanks in advance.
>