You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@stanbol.apache.org by harish suvarna <hs...@gmail.com> on 2013/03/05 02:52:27 UTC
Stanbol myLauncher
I followed the directions to create a new stanbol launcher to have only
enhancer/entityhub components. In addition to the 7 components, listed, I
retained chinese engines and opennlp.
That gave me a luncher of size 169MB. The full launcher was around 189 MB.
So removing ontologymgr, rules, reasoner, contenthub saved me only 20MB?
Ami doing it right?
Ref: http://stanbol.apache.org/docs/trunk/production-mode/your-launcher.html
--
Thanks
Harish
Re: Stanbol myLauncher
Posted by Rupert Westenthaler <ru...@gmail.com>.
Hi
For excluding (1) and (2) just remove
* org.apache.stanbol.launchers.bundlelists.defaultdata
* org.apache.stanbol.launchers.bundlelists.opennlp
Related to (3) and (4) you will need to either
* adjust the according bundlelists (stanbolcommons and enhancer).
* create your own bundle lists.
* exclude those bundlelsit and add the required modules directly to
the list.xml of your launcher
* use the "bundleExclusions" directive of the Sling Launchpad plugin
to exclude bundles you do not need (see [1])
For (3) you will just need to exclude "tika-bundle" and
"org.apache.stanbol.enhancer.engines.tika" - both in the enhancer
list. For (4) you need to replace
<bundle>
<groupId>org.apache.clerezza.ext</groupId>
<artifactId>com.hp.hpl.jena.tdb</artifactId>
<version>0.3-incubating</version>
</bundle>
with
<bundle> <!-- the Apache Jena Bundle provided by Clerezza -->
<groupId>org.apache.clerezza.ext</groupId>
<artifactId>com.hp.hpl.jena</artifactId>
<version>0.6-incubating</version>
</bundle>
in the stanbolcommons bundlelist.
best
Rupert
[1] http://sling.apache.org/site/maven-launchpad-plugin.html
On Wed, Mar 13, 2013 at 12:45 AM, harish suvarna <hs...@gmail.com> wrote:
> Thanks Rupert.
> How to remove 1? I am not using the default dbpedia index anyway. I use the
> bigger index.
> 2. Opennlp models. I guees take out the opennlp related stuff in pom.xml of
> myLauncher.
> 3. Apache tika bundle. Should I remove it from list.xml
> of enhancer/src/main/bundles/list.xml.
> 4. whare to remove it?
>
> -harish
>
> On Mon, Mar 4, 2013 at 10:22 PM, Rupert Westenthaler <
> rupert.westenthaler@gmail.com> wrote:
>
>> Hi
>>
>> The biggest modules of the launcher are
>>
>> 1. the dbpedia default data index (~46MByte)
>> 2. the OpenNLP models (~45MByte)
>> 3. the Apache Tika Bundle (~22MByte)
>> 4. the Jena TDB bundle for Clerezza (~14MByte)
>>
>> 1 and 2 can be easily excluded as they are only needed by the default
>> configuration. 3 is only needed by the Tika Engine. So users that only
>> need to process plain text file can exclude the Tika bundle. The
>> (small) tike-core module is also used by some other components (e.g.
>> the langid engine). 4 is also not needed by if you only use the
>> Enhancer and the Entityhub. However as 4 also exports the Jena API you
>> will need to include the Clerezza Jena module instead.
>>
>> If you really want to build a highly customized launcher
>> configuration, than you will need to create your own bundlelist
>> (list.xml) file. Note that there is some ongoing work to make creating
>> customized Launcher configuration more efficient (see STANBOL-915 [1]
>> and [2]). However the completion of this work depends on a feature
>> request to the Sling launchpad plugin.
>>
>> best
>> Rupert
>>
>>
>> [1] https://issues.apache.org/jira/browse/STANBOL-915
>> [2] http://svn.apache.org/repos/asf/stanbol/branches/new_bundlelists/
>>
>> On Tue, Mar 5, 2013 at 2:52 AM, harish suvarna <hs...@gmail.com> wrote:
>> > I followed the directions to create a new stanbol launcher to have only
>> > enhancer/entityhub components. In addition to the 7 components, listed, I
>> > retained chinese engines and opennlp.
>> > That gave me a luncher of size 169MB. The full launcher was around 189
>> MB.
>> > So removing ontologymgr, rules, reasoner, contenthub saved me only 20MB?
>> > Ami doing it right?
>> >
>> > Ref:
>> http://stanbol.apache.org/docs/trunk/production-mode/your-launcher.html
>> >
>> > --
>> > Thanks
>> > Harish
>>
>>
>>
>> --
>> | Rupert Westenthaler rupert.westenthaler@gmail.com
>> | Bodenlehenstraße 11 ++43-699-11108907
>> | A-5500 Bischofshofen
>>
>
>
>
> --
> Thanks
> Harish
--
| Rupert Westenthaler rupert.westenthaler@gmail.com
| Bodenlehenstraße 11 ++43-699-11108907
| A-5500 Bischofshofen
Re: Stanbol myLauncher
Posted by harish suvarna <hs...@gmail.com>.
Thanks Rupert.
How to remove 1? I am not using the default dbpedia index anyway. I use the
bigger index.
2. Opennlp models. I guees take out the opennlp related stuff in pom.xml of
myLauncher.
3. Apache tika bundle. Should I remove it from list.xml
of enhancer/src/main/bundles/list.xml.
4. whare to remove it?
-harish
On Mon, Mar 4, 2013 at 10:22 PM, Rupert Westenthaler <
rupert.westenthaler@gmail.com> wrote:
> Hi
>
> The biggest modules of the launcher are
>
> 1. the dbpedia default data index (~46MByte)
> 2. the OpenNLP models (~45MByte)
> 3. the Apache Tika Bundle (~22MByte)
> 4. the Jena TDB bundle for Clerezza (~14MByte)
>
> 1 and 2 can be easily excluded as they are only needed by the default
> configuration. 3 is only needed by the Tika Engine. So users that only
> need to process plain text file can exclude the Tika bundle. The
> (small) tike-core module is also used by some other components (e.g.
> the langid engine). 4 is also not needed by if you only use the
> Enhancer and the Entityhub. However as 4 also exports the Jena API you
> will need to include the Clerezza Jena module instead.
>
> If you really want to build a highly customized launcher
> configuration, than you will need to create your own bundlelist
> (list.xml) file. Note that there is some ongoing work to make creating
> customized Launcher configuration more efficient (see STANBOL-915 [1]
> and [2]). However the completion of this work depends on a feature
> request to the Sling launchpad plugin.
>
> best
> Rupert
>
>
> [1] https://issues.apache.org/jira/browse/STANBOL-915
> [2] http://svn.apache.org/repos/asf/stanbol/branches/new_bundlelists/
>
> On Tue, Mar 5, 2013 at 2:52 AM, harish suvarna <hs...@gmail.com> wrote:
> > I followed the directions to create a new stanbol launcher to have only
> > enhancer/entityhub components. In addition to the 7 components, listed, I
> > retained chinese engines and opennlp.
> > That gave me a luncher of size 169MB. The full launcher was around 189
> MB.
> > So removing ontologymgr, rules, reasoner, contenthub saved me only 20MB?
> > Ami doing it right?
> >
> > Ref:
> http://stanbol.apache.org/docs/trunk/production-mode/your-launcher.html
> >
> > --
> > Thanks
> > Harish
>
>
>
> --
> | Rupert Westenthaler rupert.westenthaler@gmail.com
> | Bodenlehenstraße 11 ++43-699-11108907
> | A-5500 Bischofshofen
>
--
Thanks
Harish
Re: Stanbol myLauncher
Posted by Rupert Westenthaler <ru...@gmail.com>.
Hi
The biggest modules of the launcher are
1. the dbpedia default data index (~46MByte)
2. the OpenNLP models (~45MByte)
3. the Apache Tika Bundle (~22MByte)
4. the Jena TDB bundle for Clerezza (~14MByte)
1 and 2 can be easily excluded as they are only needed by the default
configuration. 3 is only needed by the Tika Engine. So users that only
need to process plain text file can exclude the Tika bundle. The
(small) tike-core module is also used by some other components (e.g.
the langid engine). 4 is also not needed by if you only use the
Enhancer and the Entityhub. However as 4 also exports the Jena API you
will need to include the Clerezza Jena module instead.
If you really want to build a highly customized launcher
configuration, than you will need to create your own bundlelist
(list.xml) file. Note that there is some ongoing work to make creating
customized Launcher configuration more efficient (see STANBOL-915 [1]
and [2]). However the completion of this work depends on a feature
request to the Sling launchpad plugin.
best
Rupert
[1] https://issues.apache.org/jira/browse/STANBOL-915
[2] http://svn.apache.org/repos/asf/stanbol/branches/new_bundlelists/
On Tue, Mar 5, 2013 at 2:52 AM, harish suvarna <hs...@gmail.com> wrote:
> I followed the directions to create a new stanbol launcher to have only
> enhancer/entityhub components. In addition to the 7 components, listed, I
> retained chinese engines and opennlp.
> That gave me a luncher of size 169MB. The full launcher was around 189 MB.
> So removing ontologymgr, rules, reasoner, contenthub saved me only 20MB?
> Ami doing it right?
>
> Ref: http://stanbol.apache.org/docs/trunk/production-mode/your-launcher.html
>
> --
> Thanks
> Harish
--
| Rupert Westenthaler rupert.westenthaler@gmail.com
| Bodenlehenstraße 11 ++43-699-11108907
| A-5500 Bischofshofen