You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@uima.apache.org by Richard Eckart de Castilho <ec...@ukp.informatik.tu-darmstadt.de> on 2012/10/23 16:39:34 UTC

DKPro Core 1.4.0 released via Maven Central

We are pleased to announce the release of DKPro Core 1.4.0 - 
a collection of software components for natural language processing
(NLP) based on the Apache UIMA framework.

   http://code.google.com/p/dkpro-core-asl

This is the first release on Maven Central. DKPro Core can now be
used easier than ever without any need for a special Maven configuration!

There are new modules in DKPro Core:

* OpenNLP parser, POS tagger and tokenizers (ASL)
* MaltParser dependency parser (ASL)
* MeCab part-of-speech tagger for Japanese (ASL)
* Berkeley parser (GPL)
* GATE lemmatizer (GPL)
* Binary CAS (de)serialization (ASL)
* Generic JDBC reader (ASL)

Further highlights in this release are:

* New infrastructure and parameters for loading models and configuring
  type mapping.
* New versioning scheme for models and standardized model artifact names.
* Changed various parameters in all components to follow a common
  naming scheme.
* Added support to print tagset information when a model is loaded in
  most components
* Added various new and updated mappings for POS tags (Chinese, English,
  Estonian, French, German, ...).
* Added modules to support unit tests and to measure performance
* Fixed problems with paths containing spaces and paths on Windows systems
* Added support for POS mapping in various readers
* ... and many more updates and improvements in various modules!

For a more complete overview see:

http://code.google.com/p/dkpro-core-asl/wiki/ReleaseNotes_1_4_0
http://code.google.com/p/dkpro-core-gpl/wiki/ReleaseNotes_1_4_0

DKPro Core consists of a number of pre-processing components for NLP
tasks, often wrapping existing libraries or tools for easy use in an
UIMA pipeline.

* tokenization/segmentation
* compound splitting 
* stemming 
* part-of-speech tagging 
* lemmatization
* parsing 
* named entity recognition
* coreference resolution
* language identification 
* spelling correction
* grammar checking
* support for various data types

A basic UIMA type system is provided with which all of the components
work out-of-the-box. Some components can be configured for use with
other type systems.

DKPro Core builds heavily on uimaFIT, making use of features such as
injection of configuration parameters and automatic type detection.
Because using DKPro in Java code with uimaFIT is so easy, we do not
provide traditional UIMA XML descriptors for our analysis engines,
readers and consumers - only for the type systems.
We offer two sets of components with DKPro Core:

* DKPro Core ASL provides components under the Apache Software License 2.0
  http://code.google.com/p/dkpro-core-asl
* DKPro Core GPL provides components under the GNU Public License 3.0
  http://code.google.com/p/dkpro-core-gpl

DKPro Core is meant to be used with Apache Maven. The main components
are hosted on Maven Central while distributable models are downloaded
automatically from our public Maven repository. 

This project was initiated by the Ubiquitous Knowledge Processing Lab
(UKP) at the Technische Universität Darmstadt, Germany under the
auspices of Prof. Dr. Iryna Gurevych. All former and current member
of the UKP Lab have contributed in code, as testers or in spirit to
this project. It constitutes an essential cornerstone for our research
environment at the UKP Lab.

DKPro Core requires Java 1.6, UIMA 2.4.0 and uimaFIT 1.4.0 (amongst
other component-specific dependencies).

An introduction to DKPro Core can be found at

http://code.google.com/p/dkpro-core-asl/wiki/MyFirstDKProProject

Please direct any questions or suggestions to

dkpro-core-user@googlegroups.com

Best,

Richard

--
------------------------------------------------------------------- 
Richard Eckart de Castilho
Technical Lead
Ubiquitous Knowledge Processing Lab (UKP-TUD) 
FB 20 Computer Science Department      
Technische Universität Darmstadt 
Hochschulstr. 10, D-64289 Darmstadt, Germany 
phone [+49] (0)6151 16-7477, fax -5455, room S2/02/B117
eckart@ukp.informatik.tu-darmstadt.de 
www.ukp.tu-darmstadt.de 
Web Research at TU Darmstadt (WeRC) www.werc.tu-darmstadt.de
-------------------------------------------------------------------







Re: DKPro Core 1.4.0 released via Maven Central

Posted by Marshall Schor <ms...@schor.com>.
On 10/25/2012 11:29 AM, Richard Eckart de Castilho wrote:
> Hello Marshal,
>
> is it ok to provide text that is a bit more longish than what is currently on that page, more resembling the contents/size of the release announcement?

Hmmm, I think that this page would be scanned by people looking for things of
interest, and therefore should have shorter descriptions, with links to details.

I could be a little bit longer, though, I think.

-Marshall
>
> -- Richard
>
> Am 24.10.2012 um 21:40 schrieb Marshall Schor <ms...@schor.com>
>
>> Hi Richard,
>>
>> Would you like to make a patch to update the UIMA Website entry for DKPro, found
>> here:
>>
>> http://uima.apache.org/external-resources.html
>>
>> Thanks.  -Marshall
>


Re: DKPro Core 1.4.0 released via Maven Central

Posted by Richard Eckart de Castilho <ec...@ukp.informatik.tu-darmstadt.de>.
Hello Marshal,

is it ok to provide text that is a bit more longish than what is currently on that page, more resembling the contents/size of the release announcement?

-- Richard

Am 24.10.2012 um 21:40 schrieb Marshall Schor <ms...@schor.com>

> Hi Richard,
> 
> Would you like to make a patch to update the UIMA Website entry for DKPro, found
> here:
> 
> http://uima.apache.org/external-resources.html
> 
> Thanks.  -Marshall


-- 
------------------------------------------------------------------- 
Richard Eckart de Castilho
Technical Lead
Ubiquitous Knowledge Processing Lab (UKP-TUD) 
FB 20 Computer Science Department      
Technische Universität Darmstadt 
Hochschulstr. 10, D-64289 Darmstadt, Germany 
phone [+49] (0)6151 16-7477, fax -5455, room S2/02/B117
eckart@ukp.informatik.tu-darmstadt.de 
www.ukp.tu-darmstadt.de 
Web Research at TU Darmstadt (WeRC) www.werc.tu-darmstadt.de
------------------------------------------------------------------- 







Re: DKPro Core 1.4.0 released via Maven Central

Posted by Marshall Schor <ms...@schor.com>.
Hi Richard,

Would you like to make a patch to update the UIMA Website entry for DKPro, found
here:

http://uima.apache.org/external-resources.html

Thanks.  -Marshall

On 10/23/2012 10:39 AM, Richard Eckart de Castilho wrote:
> We are pleased to announce the release of DKPro Core 1.4.0 - 
> a collection of software components for natural language processing
> (NLP) based on the Apache UIMA framework.
>
>    http://code.google.com/p/dkpro-core-asl
>
> This is the first release on Maven Central. DKPro Core can now be
> used easier than ever without any need for a special Maven configuration!
>
> There are new modules in DKPro Core:
>
> * OpenNLP parser, POS tagger and tokenizers (ASL)
> * MaltParser dependency parser (ASL)
> * MeCab part-of-speech tagger for Japanese (ASL)
> * Berkeley parser (GPL)
> * GATE lemmatizer (GPL)
> * Binary CAS (de)serialization (ASL)
> * Generic JDBC reader (ASL)
>
> Further highlights in this release are:
>
> * New infrastructure and parameters for loading models and configuring
>   type mapping.
> * New versioning scheme for models and standardized model artifact names.
> * Changed various parameters in all components to follow a common
>   naming scheme.
> * Added support to print tagset information when a model is loaded in
>   most components
> * Added various new and updated mappings for POS tags (Chinese, English,
>   Estonian, French, German, ...).
> * Added modules to support unit tests and to measure performance
> * Fixed problems with paths containing spaces and paths on Windows systems
> * Added support for POS mapping in various readers
> * ... and many more updates and improvements in various modules!
>
> For a more complete overview see:
>
> http://code.google.com/p/dkpro-core-asl/wiki/ReleaseNotes_1_4_0
> http://code.google.com/p/dkpro-core-gpl/wiki/ReleaseNotes_1_4_0
>
> DKPro Core consists of a number of pre-processing components for NLP
> tasks, often wrapping existing libraries or tools for easy use in an
> UIMA pipeline.
>
> * tokenization/segmentation
> * compound splitting 
> * stemming 
> * part-of-speech tagging 
> * lemmatization
> * parsing 
> * named entity recognition
> * coreference resolution
> * language identification 
> * spelling correction
> * grammar checking
> * support for various data types
>
> A basic UIMA type system is provided with which all of the components
> work out-of-the-box. Some components can be configured for use with
> other type systems.
>
> DKPro Core builds heavily on uimaFIT, making use of features such as
> injection of configuration parameters and automatic type detection.
> Because using DKPro in Java code with uimaFIT is so easy, we do not
> provide traditional UIMA XML descriptors for our analysis engines,
> readers and consumers - only for the type systems.
> We offer two sets of components with DKPro Core:
>
> * DKPro Core ASL provides components under the Apache Software License 2.0
>   http://code.google.com/p/dkpro-core-asl
> * DKPro Core GPL provides components under the GNU Public License 3.0
>   http://code.google.com/p/dkpro-core-gpl
>
> DKPro Core is meant to be used with Apache Maven. The main components
> are hosted on Maven Central while distributable models are downloaded
> automatically from our public Maven repository. 
>
> This project was initiated by the Ubiquitous Knowledge Processing Lab
> (UKP) at the Technische Universität Darmstadt, Germany under the
> auspices of Prof. Dr. Iryna Gurevych. All former and current member
> of the UKP Lab have contributed in code, as testers or in spirit to
> this project. It constitutes an essential cornerstone for our research
> environment at the UKP Lab.
>
> DKPro Core requires Java 1.6, UIMA 2.4.0 and uimaFIT 1.4.0 (amongst
> other component-specific dependencies).
>
> An introduction to DKPro Core can be found at
>
> http://code.google.com/p/dkpro-core-asl/wiki/MyFirstDKProProject
>
> Please direct any questions or suggestions to
>
> dkpro-core-user@googlegroups.com
>
> Best,
>
> Richard
>
> --
> ------------------------------------------------------------------- 
> Richard Eckart de Castilho
> Technical Lead
> Ubiquitous Knowledge Processing Lab (UKP-TUD) 
> FB 20 Computer Science Department      
> Technische Universität Darmstadt 
> Hochschulstr. 10, D-64289 Darmstadt, Germany 
> phone [+49] (0)6151 16-7477, fax -5455, room S2/02/B117
> eckart@ukp.informatik.tu-darmstadt.de 
> www.ukp.tu-darmstadt.de 
> Web Research at TU Darmstadt (WeRC) www.werc.tu-darmstadt.de
> -------------------------------------------------------------------
>
>
>
>
>
>
>