Posted to dev@poi.apache.org by bu...@apache.org on 2011/02/10 17:55:09 UTC

DO NOT REPLY [Bug 50750] New: Support MS OneNote file format

https://issues.apache.org/bugzilla/show_bug.cgi?id=50750

           Summary: Support MS OneNote file format
           Product: POI
           Version: unspecified
          Platform: All
        OS/Version: All
            Status: NEW
          Severity: enhancement
          Priority: P2
         Component: POI Overall
        AssignedTo: dev@poi.apache.org
        ReportedBy: jan.asf@cominvent.com


Support extracting text content from .one files, as per this file format spec:
http://msdn.microsoft.com/en-us/library/dd924743(v=office.12).aspx

-- 
Configure bugmail: https://issues.apache.org/bugzilla/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@poi.apache.org
For additional commands, e-mail: dev-help@poi.apache.org


DO NOT REPLY [Bug 50750] Support MS OneNote file format

Posted by bu...@apache.org.
https://issues.apache.org/bugzilla/show_bug.cgi?id=50750

janhoy <ja...@cominvent.com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|NEEDINFO                    |NEW

--- Comment #2 from janhoy <ja...@cominvent.com> 2011-02-14 13:15:11 EST ---
Here are some sample OneNote files in a zip file:

https://docs.google.com/leaf?id=0B5l8CG0AFbx2ZWRiYjRiY2QtYzAzOC00ODgxLWIwZGEtNGRlOTdlYzRmNDQ5&hl=no

Zip contains:
sample-onenote-2007.one
sample-onenote-2010.one
sample-onenote-package.onepkg
sample-onenote.pdf
sample-onenote.txt

The files are the default sample document in OneNote 2010. The document has one
section with 2 pages and was created with OneNote 2010. The 2007 file was
exported from OneNote 2010. The .onepkg file has the same contents as the other
files, but saved as a package. The txt file was created by selecting all text on
the page and copying it, so you get an idea of what is graphics and what is
text. The PDF gives a visual impression of the original notebook.



DO NOT REPLY [Bug 50750] Support MS OneNote file format

Posted by bu...@apache.org.
https://issues.apache.org/bugzilla/show_bug.cgi?id=50750

--- Comment #3 from Nick Burch <ni...@alfresco.com> 2011-02-14 14:24:10 EST ---
Thanks for these.

I can't promise I'll be able to work on this very soon, but I should be able to
add in Tika support just as soon as I've done the POI bit...



DO NOT REPLY [Bug 50750] Support MS OneNote file format

Posted by bu...@apache.org.
https://issues.apache.org/bugzilla/show_bug.cgi?id=50750

Nick Burch <ni...@alfresco.com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|NEW                         |NEEDINFO

--- Comment #1 from Nick Burch <ni...@alfresco.com> 2011-02-10 13:12:59 EST ---
Any chance you could create a few sample documents and upload them?

Ideally we'd want, say, 2 or 3 files. For each one, we'd also want a text file
with the textual contents of the file (so we can make sure we get most of the
contents), and possibly also a screenshot of the file when it's open in OneNote
(so we can get a feel for how the text might come out).
