You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Allison, Timothy B." <ta...@mitre.org> on 2017/12/05 20:45:45 UTC

RE: Tika 1.17?

I kicked off the regression tests towards the end of last week.  I'm getting permanent churns while executing some of the sql on this size data.  I think we've maxed out H2 for our dataset...or I'm doing something inelegant/ill advised w H2.

I've trimmed out the reports that were causing infinite(ish) hangs, and I'm now getting most of the reports that we care about.

I should have the reports ready by this evening/tomorrow.

-----Original Message-----
From: Allison, Timothy B. [mailto:tallison@mitre.org] 
Sent: Wednesday, November 29, 2017 1:08 PM
To: dev@tika.apache.org
Subject: RE: Tika 1.17?

+1

-----Original Message-----
From: Chris Mattmann [mailto:mattmann@apache.org] 
Sent: Wednesday, November 29, 2017 12:57 PM
To: dev@tika.apache.org
Subject: Re: Tika 1.17?

Thanks so much for fixing this. It worked during MEMEX and then I think has since fallen out of date and perhaps I committed Zarana’s code wrong or something. Will be great to get this working!



On 11/29/17, 9:54 AM, "David Meikle" <lo...@gmail.com> wrote:

    I am thinking TIKA-2385. I've got a resized image that I can commit tonight
    that should close this one off.
    
    Cheers,
    Dave
    
    
    On 29 Nov 2017 14:42, "Allison, Timothy B." <ta...@mitre.org> wrote:
    
    Many thanks to Bob for help on TIKA-2502!
    
    Anything else we want to put into 1.17 before I run the regression tests?
    
    -----Original Message-----
    From: Allison, Timothy B. [mailto:tallison@mitre.org]
    Sent: Monday, November 13, 2017 1:42 PM
    To: dev@tika.apache.org
    Subject: RE: Tika 1.17?
    
    Y.  You're right.  Thank you!
    
     I think I've been avoiding that because there were some regressions in
    metadata-extractor last I looked at this.  Let's hope those are gone in
    2.10.1.
    
    -----Original Message-----
    From: Tyler Bui-Palsulich [mailto:tpalsulich@apache.org]
    Sent: Sunday, November 12, 2017 2:54 PM
    To: dev@tika.apache.org
    Subject: RE: Tika 1.17?
    
    TIKA-2486 might be worth blocking on since there is a CVE.
    
    Tyler
    
    On Nov 6, 2017 5:26 AM, "Allison, Timothy B." <ta...@mitre.org> wrote:
    
    > Y.  I'm happy enough  to wait a few more days.  I wasn't able to kick
    > off the regression tests last week.  Should I wait for the new parsers
    > to run the regression tests?
    >
    > -----Original Message-----
    > From: David Meikle [mailto:loompa@gmail.com]
    > Sent: Friday, November 3, 2017 7:42 PM
    > To: dev@tika.apache.org
    > Subject: Re: Tika 1.17?
    >
    > Sounds good. I have a couple of new parsers I would like to slot in
    > but not had a chance the last few months. Will go for it over the
    > weekend, if that works for you Tim.
    >
    > Cheers,
    > Dave
    >
    >
    >
    > On 3 November 2017 at 15:19, Mattmann, Chris A (3010) <
    > chris.a.mattmann@jpl.nasa.gov> wrote:
    >
    > > Let’s make it so (
    > >
    > > ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
    > ++++++++++++++
    > > Chris Mattmann, Ph.D.
    > > Principal Data Scientist, Engineering Administrative Office (3010)
    > > Manager, NSF & Open Source Projects Formulation and Development
    > > Offices
    > > (8212)
    > > NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
    > > Office: 180-503E, Mailstop: 180-503
    > > Email: chris.a.mattmann@nasa.gov
    > > WWW:  http://sunset.usc.edu/~mattmann/
    > > ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
    > ++++++++++++++
    > > Director, Information Retrieval and Data Science Group (IRDS)
    > > Adjunct Associate Professor, Computer Science Department University
    > > of Southern California, Los Angeles, CA 90089 USA
    > > WWW: http://irds.usc.edu/
    > > ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
    > ++++++++++++++
    > >
    > >
    > >
    > > On 11/3/17, 7:35 AM, "Allison, Timothy B." <ta...@mitre.org> wrote:
    > >
    > >     All,
    > >
    > >     PDFBox 2.0.8 is now integrated.  I want to fix TIKA-2490 before
    > > we release 1.17.  Are there other issues that are blockers or you'd
    > > like to fix before 1.17 (TIKA-2471, maybe?)?
    > >
    > >     I plan to run initial large scale regression tests shortly for
    > > rfc822 and mbox because of TIKA-2478.  I'll run the full regression
    > > tests before cutting the RC, but I want to focus on those for now.
    Other requests?
    > >
    > >     Cheers,
    > >
    > >                 Tim
    > >
    > >
    > >
    >