You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@lucene.apache.org by "Shawn Heisey (JIRA)" <ji...@apache.org> on 2016/08/27 16:41:21 UTC

[jira] [Commented] (SOLR-6806) Reduce the size of the main Solr binary download

    [ https://issues.apache.org/jira/browse/SOLR-6806?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15441878#comment-15441878 ] 

Shawn Heisey commented on SOLR-6806:
------------------------------------

A potential baby step is to do something different with docs.  I see two good options:

 * Remove the docs from the main download entirely, create a separate downloadable archive for them.  For 6.2, this archive would be about 13MB, making the main download nearly 10 percent lighter.
 * Put the compressed docs archive in the main archive.  This wouldn't reduce the size of the download, but it *WOULD* make the archive extract a LOT faster.  See screenshots that I attached earlier.

I think it's a good idea to do the second option to the licenses directory.  This directory doesn't have as many small files in it as the docs directory, but it still has a significant impact on main archive extraction speed.  These files need to be in the main archive, but IMHO it's perfectly acceptable to have them contained in an inner archive.


> Reduce the size of the main Solr binary download
> ------------------------------------------------
>
>                 Key: SOLR-6806
>                 URL: https://issues.apache.org/jira/browse/SOLR-6806
>             Project: Solr
>          Issue Type: Task
>          Components: Build
>    Affects Versions: 5.0
>            Reporter: Shawn Heisey
>         Attachments: solr-zip-docs-extracted.png, solr-zip-extract-graph.png
>
>
> There has been a lot of recent discussion about how large the Solr download is, and how to reduce its size.  The last release (4.10.2) weighs in at 143MB for the tar and 149MB for the zip.
> Most users do not need the full download.  They may never need contrib features, or they may only need one or two, with DIH being the most likely choice.  They could likely get by with a download that's less than 40 MB.
> Our primary competition has a 29MB zip download for the release that's current right now, and not too long ago, that was about 20MB.  I didn't look very deep, but any additional features that might be available for download were not immediately apparent on their website.  I'm sure they exist, but I would guess that most users never need those features, so most users never even see them.
> Solr, by contrast, has everything included ... a "kitchen sink" approach. Once you get past the long download time and fire up the example, you're presented with configs that include features you're likely to never use.
> Although this offers maximum flexibility, I think it also serves to cause confusion in a new user.
> A much better option would be to create a core download that includes only a minimum set of features, probably just the war, the example servlet container, and an example config that only uses the functionality present in the war.  We can create additional downloads that offer additional functionality and configs ... DIH would be a very small addon that would likely be downloaded frequently.
> SOLR-5103 describes a plugin infrastructure which would make it very easy to offer a small core download and then let the user download additional functionality using scripts or the UI.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org