You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@poi.apache.org by edwgiz <gi...@git.apache.org> on 2016/01/25 16:25:56 UTC

[GitHub] poi pull request: Extends parsing for old-format Excel files

GitHub user edwgiz opened a pull request:

    https://github.com/apache/poi/pull/28

    Extends parsing for old-format Excel files

    Apache POI  has 'OldExcelExtractor' class to extract textual data from old Excel formats.
    
    It's helpful, but insufficiently when 
    1. required a parsing not from all columns, or data of different columns must be differently processed for example they can have a specific boost in a full-text indexing.
    3. numeric data is desirable by reason of performance
    
    When Apache POI is already included in project let's don't add other libraries for parsing old Excel files !

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/edwgiz/poi trunk

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/poi/pull/28.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #28
    
----
commit 1cfccbc3ade05ac60741f086f2552f461c36c039
Author: Eduard Gizatullin <ed...@gmail.com>
Date:   2016-01-25T11:06:51Z

    Extends parsing for old-format Excel files

commit 9878b31ef27833286d200f2f4651fa5d16c48e4c
Author: Eduard Gizatullin <ed...@gmail.com>
Date:   2016-01-25T11:09:02Z

    Extends parsing for old-format Excel files

----


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@poi.apache.org
For additional commands, e-mail: dev-help@poi.apache.org


Re: [GitHub] poi pull request: Extends parsing for old-format Excel files

Posted by Nick Burch <ap...@gagravarr.org>.
On Mon, 25 Jan 2016, edwgiz wrote:
> GitHub user edwgiz opened a pull request:
>
>    https://github.com/apache/poi/pull/28
>
>    Extends parsing for old-format Excel files

Has anyone else had a chance to look at this? Seems very interesting to 
me, and has unit tests, which is good!

Could someone who's quite familiar with the current HSSF event stuff take 
a quick look, and confirm this old-format stuff is good + follows a 
similar enough model?

Nick

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@poi.apache.org
For additional commands, e-mail: dev-help@poi.apache.org