You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@tika.apache.org by Apache Wiki <wi...@apache.org> on 2015/02/02 23:16:48 UTC

[Tika Wiki] Update of "VirtualMachine" by TylerPalsulich

Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Tika Wiki" for change notification.

The "VirtualMachine" page has been changed by TylerPalsulich:
https://wiki.apache.org/tika/VirtualMachine

New page:
= Notes on setting up the Tika Virtual Machine (hosted by Rackspace) =
=== See TIKA-1331 ===

==== install software ====

1. `yum install nano perl java-1.7.0-openjdk-devel unzip git svn vim`

2. `curl -s get.gvmtool.net | bash`

3. `source "/root/.gvm/bin/gvm-init.sh"`

4. `gvm install groovy`

5. `mv groovy to /usr/share/groovy`

6. change symbolic link

7. added /usr/share/groovy/current/bin to personal path via .bashrc

==== config/admin stuff ====

`nano /etc/ssh/sshd_config`
  * disallow root to log in
  * added self to AllowUsers

==== need to add documentation for opening port 80 ====

==== permission management ====

1. `adduser <self> sudo`

2. `groupadd <admingroup>`

3. `useradd -g <admingroup> <self>`

==== mkdirs ====

/public/corpora/govdocs1
...


====prep govdocs1====

1. move zipfilelist to govdocs1/archive

2. wget -i zipfilelist.txt

3. (go get some coffee)

4. cd govdocs1/scripts

5. groovy unzip.groovy 0

6. (go get some more coffee)

7. groovy rmBugged.groovy