You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@stanbol.apache.org by og...@apache.org on 2011/06/30 22:06:37 UTC

svn commit: r1141693 - in /incubator/stanbol/trunk/defaultdata: README.md download_models.sh src/main/resources/org/apache/stanbol/defaultdata/site/dbpedia/index/

Author: ogrisel
Date: Thu Jun 30 20:06:36 2011
New Revision: 1141693

URL: http://svn.apache.org/viewvc?rev=1141693&view=rev
Log:
STANBOL-92: download a prebuilt solr index of Wikipedia to be included in the defaultdata artifact

Modified:
    incubator/stanbol/trunk/defaultdata/README.md
    incubator/stanbol/trunk/defaultdata/download_models.sh
    incubator/stanbol/trunk/defaultdata/src/main/resources/org/apache/stanbol/defaultdata/site/dbpedia/index/   (props changed)

Modified: incubator/stanbol/trunk/defaultdata/README.md
URL: http://svn.apache.org/viewvc/incubator/stanbol/trunk/defaultdata/README.md?rev=1141693&r1=1141692&r2=1141693&view=diff
==============================================================================
--- incubator/stanbol/trunk/defaultdata/README.md (original)
+++ incubator/stanbol/trunk/defaultdata/README.md Thu Jun 30 20:06:36 2011
@@ -8,12 +8,14 @@ To avoid loading subversion repository w
 to be build and deployed manually to retrieve precomputed models from other
 sites.
 
-## Downloading the OpenNLP statistical model files
 
-Use the `download_models.sh` script.
+## Downloading the OpenNLP statistical model files and pre-built Solr Index
 
-## Building Entity Hub indices
+Under Unix, use the `download_models.sh` script and then run `mvn install`.
+
+Under windows, read the script content and do the same operations manually :)
 
-TODO
 
+## Building Entity Hub indices
 
+See the online documentation: (TODO: put the URL here when no longer staging)

Modified: incubator/stanbol/trunk/defaultdata/download_models.sh
URL: http://svn.apache.org/viewvc/incubator/stanbol/trunk/defaultdata/download_models.sh?rev=1141693&r1=1141692&r2=1141693&view=diff
==============================================================================
--- incubator/stanbol/trunk/defaultdata/download_models.sh (original)
+++ incubator/stanbol/trunk/defaultdata/download_models.sh Thu Jun 30 20:06:36 2011
@@ -2,12 +2,19 @@
 
 OPENNLP_DATA=src/main/resources/org/apache/stanbol/defaultdata/opennlp
 MODELS_URL="http://opennlp.sourceforge.net/models-1.5"
-
+DBPEDIDA_SOLR_DATA=src/main/resources/org/apache/stanbol/defaultdata/site/dbpedia/index
+DBPEDIA_SOLR_URL="http://dl.dropbox.com/u/5743203/IKS/dbpedia/3.6/dbpedia_43k.solrindex.zip"
 
 rm -rf $OPENNLP_DATA/*.bin
 
 (cd $OPENNLP_DATA && wget $MODELS_URL/en-sent.bin)
+(cd $OPENNLP_DATA && wget $MODELS_URL/en-pos-perceptron.bin)
+(cd $OPENNLP_DATA && wget $MODELS_URL/en-chunker.bin)
 (cd $OPENNLP_DATA && wget $MODELS_URL/en-ner-person.bin)
 (cd $OPENNLP_DATA && wget $MODELS_URL/en-ner-location.bin)
 (cd $OPENNLP_DATA && wget $MODELS_URL/en-ner-organization.bin)
 
+
+rm -rf $DBPEDIDA_SOLR_DATA/*.zip
+
+(cd $DBPEDIDA_SOLR_DATA && wget $DBPEDIA_SOLR_URL)

Propchange: incubator/stanbol/trunk/defaultdata/src/main/resources/org/apache/stanbol/defaultdata/site/dbpedia/index/
------------------------------------------------------------------------------
--- svn:ignore (added)
+++ svn:ignore Thu Jun 30 20:06:36 2011
@@ -0,0 +1 @@
+*.zip