You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@tika.apache.org by Jim Garrison <jh...@jhmg.net> on 2020/07/13 00:53:06 UTC
Test Failure building 1.25_SHAPSHOT [was: Tika App 1.24.1 NPE in
AbstractPDF2XHTML.extractXMPXFA()]
On 7/11/2020 12:42 PM, Jim Garrison wrote:
> On 7/11/2020 1:43 AM, Tilman Hausherr wrote:
>> [2nd attempt]
>>
>> I think that one is this:
>> https://issues.apache.org/jira/browse/TIKA-3112
>>
>> Tilman
>>
>> Am 11.07.2020 um 00:32 schrieb Jim Garrison:
>>> Tika App started with
>>>
>>> java -jar tika-app-1.24.1.jar -g
>>>
>>> Fails the same way no matter what I try to parse.
>>>
>>> Checking here before I submit an issue...
>>>
>>> Stack Trace:
>>>
>>> Apache Tika was unable to parse the document
>>> at D:\Users\jim\Data\Scans\Receipts\20200706-WinCo.pdf.
>>>
> [snip]
>
> I don't see a TIKA-3112 branch in git, and also see that the commit for
> the fix appears only in branch_1x. Is that branch currently buildable
> and at least reasonably stable?
>
> Thanks
Tried cloning branch_1x and building with maven on Windows 10, tests
failed with:
-------------------------------------------------------------------------------
Test set: org.apache.tika.parser.fork.ForkParserIntegrationTest
-------------------------------------------------------------------------------
Tests run: 7, Failures: 0, Errors: 1, Skipped: 1, Time elapsed: 5.101 s
<<< FAILURE! - in org.apache.tika.parser.fork.ForkParserIntegrationTest
org.apache.tika.parser.fork.ForkParserIntegrationTest.testAttachingADebuggerOnTheForkedParserShouldWork
Time elapsed: 0.182 s <<< ERROR!
java.io.IOException: EOF while waiting for start beacon
at
org.apache.tika.parser.fork.ForkParserIntegrationTest.testAttachingADebuggerOnTheForkedParserShouldWork(ForkParserIntegrationTest.java:245)
Suggestions? I couldn't find anything in the issue tracker that matches
this failure.
--
Jim Garrison jhg@acm.org
Re: Test Failure building 1.25_SHAPSHOT [was: Tika App 1.24.1 NPE in
AbstractPDF2XHTML.extractXMPXFA()]
Posted by Tilman Hausherr <TH...@t-online.de>.
And now the whole 1x build successfully for me (W10).
Tilman
Re: Test Failure building 1.25_SHAPSHOT [was: Tika App 1.24.1 NPE in
AbstractPDF2XHTML.extractXMPXFA()]
Posted by Tilman Hausherr <TH...@t-online.de>.
Somebody please replace that one with this in
ImageMetadataExtractor.java (I can't, I'm not yet up to date on that
master / main change), but now the parser builds with 1x:
public void parseHeif(File file) throws IOException, TikaException {
InputStream is = null;
try {
is = new FileInputStream(file);
com.drew.metadata.Metadata heifMetadata =
HeifMetadataReader.readMetadata(is);
handle(heifMetadata);
} catch (IOException e) {
throw e;
} catch (MetadataException e) {
throw new TikaException("Can't process Heif data", e);
}
finally {
IOUtils.closeQuietly(is);
}
}
Re: Test Failure building 1.25_SHAPSHOT [was: Tika App 1.24.1 NPE in
AbstractPDF2XHTML.extractXMPXFA()]
Posted by Tilman Hausherr <TH...@t-online.de>.
the FileInputStream is never closed here:
public void parseHeif(File file) throws IOException, TikaException {
try {
com.drew.metadata.Metadata heifMetadata = new
com.drew.metadata.Metadata();
heifMetadata = HeifMetadataReader.readMetadata(new
FileInputStream(file));
handle(heifMetadata);
} catch (IOException e) {
throw e;
} catch (MetadataException e) {
throw new TikaException("Can't process Heif data", e);
}
}
Re: Test Failure building 1.25_SHAPSHOT [was: Tika App 1.24.1 NPE in
AbstractPDF2XHTML.extractXMPXFA()]
Posted by Tilman Hausherr <TH...@t-online.de>.
I tried to build 1x, then forgot about it and went to sleep, it stopped
with this:
Running org.apache.tika.parser.image.HeifParserTest
Tests run: 1, Failures: 0, Errors: 1, Skipped: 0, Time elapsed: 0.12 s
<<< FAILURE! - in org.apache.tika.parser.image.HeifParserTest
org.apache.tika.parser.image.HeifParserTest.testSimple Time elapsed:
0.118 s <<< ERROR!
org.apache.tika.exception.TikaException: Failed to close temporary resources
at
org.apache.tika.parser.image.HeifParserTest.testSimple(HeifParserTest.java:48)
Caused by: java.nio.file.FileSystemException:
C:\Users\XXXXX\AppData\Local\Temp\apache-tika-1560891050378389098.tmp:
Der Prozess kann nicht auf die Datei zugreifen, da sie von einem anderen
Prozess verwendet wird.
at
org.apache.tika.parser.image.HeifParserTest.testSimple(HeifParserTest.java:48)
This happens also when running that test as single.
Re: Test Failure building 1.25_SHAPSHOT [was: Tika App 1.24.1 NPE in AbstractPDF2XHTML.extractXMPXFA()]
Posted by Tim Allison <ta...@apache.org>.
Y, branch_1x and main should have the fix. I'm not able to replicate this
failure on Mac or Linux on branch_1x. :(
When this test has failed in the past, it was caused by a new
parser/component not being serializable. From the "EOF while waiting for
start beacon", this is not the case for you.
Is there any chance that Windows is preventing the child process from
starting?
> Tried cloning branch_1x and building with maven on Windows 10, tests
> failed with:
>
>
> -------------------------------------------------------------------------------
> Test set: org.apache.tika.parser.fork.ForkParserIntegrationTest
>
> -------------------------------------------------------------------------------
> Tests run: 7, Failures: 0, Errors: 1, Skipped: 1, Time elapsed: 5.101 s
> <<< FAILURE! - in org.apache.tika.parser.fork.ForkParserIntegrationTest
>
> org.apache.tika.parser.fork.ForkParserIntegrationTest.testAttachingADebuggerOnTheForkedParserShouldWork
> Time elapsed: 0.182 s <<< ERROR!
> java.io.IOException: EOF while waiting for start beacon
> at
>
> org.apache.tika.parser.fork.ForkParserIntegrationTest.testAttachingADebuggerOnTheForkedParserShouldWork(ForkParserIntegrationTest.java:245)
>
> Suggestions? I couldn't find anything in the issue tracker that matches
> this failure.
>
>
> --
> Jim Garrison jhg@acm.org
>