Posted to user@tika.apache.org by Jim Garrison <jh...@jhmg.net> on 2020/07/13 00:53:06 UTC

Test Failure building 1.25_SNAPSHOT [was: Tika App 1.24.1 NPE in AbstractPDF2XHTML.extractXMPXFA()]

On 7/11/2020 12:42 PM, Jim Garrison wrote:
> On 7/11/2020 1:43 AM, Tilman Hausherr wrote:
>> [2nd attempt]
>>
>> I think that one is this:
>> https://issues.apache.org/jira/browse/TIKA-3112
>>
>> Tilman
>>
>> Am 11.07.2020 um 00:32 schrieb Jim Garrison:
>>> Tika App started with
>>>
>>>      java -jar tika-app-1.24.1.jar -g
>>>
>>> Fails the same way no matter what I try to parse.
>>>
>>> Checking here before I submit an issue...
>>>
>>> Stack Trace:
>>>
>>> Apache Tika was unable to parse the document
>>> at D:\Users\jim\Data\Scans\Receipts\20200706-WinCo.pdf.
>>>
> [snip]
> 
> I don't see a TIKA-3112 branch in git, and also see that the commit for
> the fix appears only in branch_1x.  Is that branch currently buildable
> and at least reasonably stable?
> 
> Thanks

I tried cloning branch_1x and building with Maven on Windows 10; the
tests failed with:

-------------------------------------------------------------------------------
Test set: org.apache.tika.parser.fork.ForkParserIntegrationTest
-------------------------------------------------------------------------------
Tests run: 7, Failures: 0, Errors: 1, Skipped: 1, Time elapsed: 5.101 s <<< FAILURE! - in org.apache.tika.parser.fork.ForkParserIntegrationTest
org.apache.tika.parser.fork.ForkParserIntegrationTest.testAttachingADebuggerOnTheForkedParserShouldWork  Time elapsed: 0.182 s  <<< ERROR!
java.io.IOException: EOF while waiting for start beacon
	at org.apache.tika.parser.fork.ForkParserIntegrationTest.testAttachingADebuggerOnTheForkedParserShouldWork(ForkParserIntegrationTest.java:245)

Suggestions?  I couldn't find anything in the issue tracker that matches
this failure.


-- 
Jim Garrison jhg@acm.org

Re: Test Failure building 1.25_SNAPSHOT [was: Tika App 1.24.1 NPE in AbstractPDF2XHTML.extractXMPXFA()]

Posted by Tilman Hausherr <TH...@t-online.de>.
And now the whole 1x branch builds successfully for me (Windows 10).

Tilman


Re: Test Failure building 1.25_SNAPSHOT [was: Tika App 1.24.1 NPE in AbstractPDF2XHTML.extractXMPXFA()]

Posted by Tilman Hausherr <TH...@t-online.de>.
Could somebody please replace the current parseHeif method in
ImageMetadataExtractor.java with the following? (I can't commit it
myself; I'm not yet up to date on the master / main change.) With this
change the parser module builds on 1x:


     public void parseHeif(File file) throws IOException, TikaException {
         InputStream is = null;
         try {
             is = new FileInputStream(file);
             com.drew.metadata.Metadata heifMetadata = HeifMetadataReader.readMetadata(is);
             handle(heifMetadata);
         } catch (IOException e) {
             throw e;
         } catch (MetadataException e) {
             throw new TikaException("Can't process Heif data", e);
         } finally {
             IOUtils.closeQuietly(is);
         }
     }
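
For comparison, the same guarantee (the stream is closed on every path) can
be written with try-with-resources. The following is only a sketch using
JDK classes: readAll is a hypothetical stand-in for the metadata reader
call, and CloseStreamSketch is not part of Tika. One difference from the
method above: a failure in close() propagates here instead of being
swallowed by closeQuietly.

```java
import java.io.File;
import java.io.FileInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.nio.file.Files;

class CloseStreamSketch {

    /** Hypothetical stand-in for a metadata reader: counts the bytes it is given. */
    static long readAll(InputStream is) throws IOException {
        long total = 0;
        byte[] buffer = new byte[4096];
        int n;
        while ((n = is.read(buffer)) != -1) {
            total += n;
        }
        return total;
    }

    /** Opens the file in try-with-resources so the stream is closed on every path. */
    static long parse(File file) throws IOException {
        try (InputStream is = new FileInputStream(file)) {
            return readAll(is);
        }
    }

    public static void main(String[] args) throws IOException {
        File tmp = File.createTempFile("close-sketch-", ".tmp");
        Files.write(tmp.toPath(), new byte[] {1, 2, 3});
        long bytesRead = parse(tmp);
        // The stream is closed by now, so no handle of ours blocks the delete.
        Files.delete(tmp.toPath());
        System.out.println("read " + bytesRead + " bytes, temp file still exists: " + tmp.exists());
    }
}
```

On Windows, an open FileInputStream on the temporary file is enough to make
the delete fail, which matches the "used by another process" error in the
HeifParserTest report.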


Re: Test Failure building 1.25_SNAPSHOT [was: Tika App 1.24.1 NPE in AbstractPDF2XHTML.extractXMPXFA()]

Posted by Tilman Hausherr <TH...@t-online.de>.
The FileInputStream is never closed here:


     public void parseHeif(File file) throws IOException, TikaException {

         try {
             com.drew.metadata.Metadata heifMetadata = new com.drew.metadata.Metadata();
             heifMetadata = HeifMetadataReader.readMetadata(new FileInputStream(file));
             handle(heifMetadata);
         } catch (IOException e) {
             throw e;
         } catch (MetadataException e) {
             throw new TikaException("Can't process Heif data", e);
         }

     }




Re: Test Failure building 1.25_SNAPSHOT [was: Tika App 1.24.1 NPE in AbstractPDF2XHTML.extractXMPXFA()]

Posted by Tilman Hausherr <TH...@t-online.de>.
I tried to build 1x, then forgot about it and went to sleep. The build
stopped with this:


Running org.apache.tika.parser.image.HeifParserTest
Tests run: 1, Failures: 0, Errors: 1, Skipped: 0, Time elapsed: 0.12 s <<< FAILURE! - in org.apache.tika.parser.image.HeifParserTest
org.apache.tika.parser.image.HeifParserTest.testSimple  Time elapsed: 0.118 s  <<< ERROR!
org.apache.tika.exception.TikaException: Failed to close temporary resources
     at org.apache.tika.parser.image.HeifParserTest.testSimple(HeifParserTest.java:48)
Caused by: java.nio.file.FileSystemException: C:\Users\XXXXX\AppData\Local\Temp\apache-tika-1560891050378389098.tmp: The process cannot access the file because it is being used by another process.
     at org.apache.tika.parser.image.HeifParserTest.testSimple(HeifParserTest.java:48)


This also happens when running that test on its own.


Re: Test Failure building 1.25_SNAPSHOT [was: Tika App 1.24.1 NPE in AbstractPDF2XHTML.extractXMPXFA()]

Posted by Tim Allison <ta...@apache.org>.
Yes, branch_1x and main should have the fix.  I'm not able to replicate this
failure on Mac or Linux on branch_1x.  :(

When this test has failed in the past, it was caused by a new
parser/component not being serializable.  Judging from the "EOF while
waiting for start beacon" message, that does not look like the cause in
your case.

Is there any chance that Windows is preventing the child process from
starting?
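
One rough way to check that, independent of Tika: a small sketch that
starts a child JVM from the same Java installation and reports its exit
code. ChildJvmCheck and its method names are made up for this example, and
it does not reproduce what ForkParser does; it only shows whether a plain
child java process can start at all on the machine.

```java
import java.io.BufferedReader;
import java.io.File;
import java.io.IOException;
import java.io.InputStreamReader;
import java.nio.charset.StandardCharsets;

class ChildJvmCheck {

    /** Path of the java launcher belonging to the JVM that runs this check. */
    static String javaLauncher() {
        boolean windows = System.getProperty("os.name").toLowerCase().contains("win");
        String exe = windows ? "java.exe" : "java";
        return System.getProperty("java.home") + File.separator + "bin" + File.separator + exe;
    }

    /** Starts "java -version" as a child process and returns its exit code. */
    static int runChildJvm() throws IOException, InterruptedException {
        ProcessBuilder pb = new ProcessBuilder(javaLauncher(), "-version");
        pb.redirectErrorStream(true); // -version writes to stderr; merge it into stdout
        Process child = pb.start();
        try (BufferedReader out = new BufferedReader(
                new InputStreamReader(child.getInputStream(), StandardCharsets.UTF_8))) {
            String line;
            while ((line = out.readLine()) != null) {
                System.out.println("child> " + line);
            }
        }
        return child.waitFor();
    }

    public static void main(String[] args) throws Exception {
        System.out.println("child JVM exit code: " + runChildJvm());
    }
}
```

If this fails with an IOException or a non-zero exit code, something outside
Tika (security software, a policy, a broken JAVA_HOME) is probably getting
in the way. If it succeeds, the problem is more likely specific to how the
forked parser is started in that test.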


> Tried cloning branch_1x and building with maven on Windows 10, tests
> failed with:
>
> -------------------------------------------------------------------------------
> Test set: org.apache.tika.parser.fork.ForkParserIntegrationTest
> -------------------------------------------------------------------------------
> Tests run: 7, Failures: 0, Errors: 1, Skipped: 1, Time elapsed: 5.101 s <<< FAILURE! - in org.apache.tika.parser.fork.ForkParserIntegrationTest
> org.apache.tika.parser.fork.ForkParserIntegrationTest.testAttachingADebuggerOnTheForkedParserShouldWork  Time elapsed: 0.182 s  <<< ERROR!
> java.io.IOException: EOF while waiting for start beacon
>         at org.apache.tika.parser.fork.ForkParserIntegrationTest.testAttachingADebuggerOnTheForkedParserShouldWork(ForkParserIntegrationTest.java:245)
>
> Suggestions?  I couldn't find anything in the issue tracker that matches
> this failure.
>
>
> --
> Jim Garrison jhg@acm.org
>