Posted to dev@tika.apache.org by "Gregory Lepore (Jira)" <ji...@apache.org> on 2023/04/21 11:55:00 UTC

[jira] [Commented] (TIKA-4022) Tika not parsing AVI files

    [ https://issues.apache.org/jira/browse/TIKA-4022?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17714963#comment-17714963 ] 

Gregory Lepore commented on TIKA-4022:
--------------------------------------

See the attachment (tika-comparison-avi.jpeg) for the difference between Tika 2.7 and 1.28 when parsing the same AVI file.

> Tika not parsing AVI files
> --------------------------
>
>                 Key: TIKA-4022
>                 URL: https://issues.apache.org/jira/browse/TIKA-4022
>             Project: Tika
>          Issue Type: Bug
>    Affects Versions: 2.7.0
>            Reporter: Gregory Lepore
>            Priority: Major
>         Attachments: tika-comparison-avi.jpeg
>
>
> I have tried a variety of .AVI files and none have been correctly parsed. This is all I get:
>  
> Content-Length: 190976
> Content-Type: video/x-msvideo
> X-TIKA:Parsed-By: org.apache.tika.parser.EmptyParser
> X-TIKA:Parsed-By-Full-Set: org.apache.tika.parser.EmptyParser
> X-TIKA:digest:MD5: 057a6067822dabd02d5613bb45296b22
> X-TIKA:digest:SHA256: 8156b8870311a2e1afb1b07417724df0816d6dbf2fa48072896e8db32796b77e
> resourceName: ACADABOUT.AVI
>  
> I have ffmpeg installed:
> ffmpeg -version 
> ffmpeg version 4.4.3-0ubuntu1~20.04.sav2 Copyright (c) 2000-2022 the FFmpeg developers 
> built with gcc 9 (Ubuntu 9.4.0-1ubuntu1~20.04.1) 
> configuration: --prefix=/usr --extra-version='0ubuntu1~20.04.sav2' --toolchain=hardened --libdir=/usr/lib/x86_64-linux-gnu --incdir=/usr/include/x86_64-linux-gnu --arch=amd64
> --enable-gpl --disable-stripping --enable-amf --enable-gnutls --enable-ladspa --enable-libaom --enable-libass --enable-libbluray --enable-libbs2b --enable-libcaca --enable-libcdio
> --enable-libcodec2 --enable-libflite --enable-libfontconfig --enable-libfreetype --enable-libfribidi --enable-libgme --enable-libgsm --enable-libjack --enable-libmp3lame
> --enable-libmysofa --enable-libopenjpeg --enable-libopenmpt --enable-libopus --enable-libpulse --enable-librabbitmq --enable-librubberband --enable-libshine --enable-libsnappy
> --enable-libsoxr --enable-libspeex --enable-libsrt --enable-libssh --enable-libtheora --enable-libtwolame --enable-libvidstab --enable-libvorbis --enable-libvpx --enable-libwebp
> --enable-libx265 --enable-libxml2 --enable-libxvid --enable-libzmq --enable-libzvbi --enable-lv2 --enable-omx --enable-openal --enable-opencl --enable-opengl --enable-sdl2
> --enable-pocketsphinx --enable-librsvg --enable-libdav1d --enable-librist --enable-libvmaf --enable-libzimg --enable-crystalhd --enable-libmfx --enable-libsvtav1 --enable-libdc1394
> --enable-libdrm --enable-libiec61883 --enable-chromaprint --enable-frei0r --enable-libx264 --enable-librav1e --enable-shared
> libavutil      56. 70.100 / 56. 70.100 
> libavcodec     58.134.100 / 58.134.100 
> libavformat    58. 76.100 / 58. 76.100 
> libavdevice    58. 13.100 / 58. 13.100 
> libavfilter     7.110.100 /  7.110.100 
> libswscale      5.  9.100 /  5.  9.100 
> libswresample   3.  9.100 /  3.  9.100 
> libpostproc    55.  9.100 / 55.  9.100
> ffmpeg -i  ACADABOUT.AVI 
> [snip]
> Input #0, avi, from 'ACADABOUT.AVI': 
>  Duration: 00:00:02.03, start: 0.000000, bitrate: 751 kb/s 
>  Stream #0:0: Video: indeo3 (IV32 / 0x32335649), yuv410p, 576x104, 749 kb/s, 30 fps, 30 tbr, 30 tbn, 30 tbc 
>    Metadata: 
>      title           : Video Track
> Any suggestions for further troubleshooting?
>  
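
When checking many AVI files for the symptom quoted above, it may help to scan the metadata output automatically. The following is only a minimal sketch, not part of Tika: it assumes the metadata is available as plain "Key: value" text with one value per key, as in the quoted report, and the helper names parse_tika_metadata and fell_back_to_empty_parser are hypothetical.

```python
def parse_tika_metadata(text):
    """Parse 'Key: value' lines into a dict (assumes one value per key)."""
    metadata = {}
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        # Split on the first ": " so that keys which themselves contain
        # colons (for example X-TIKA:Parsed-By) stay intact.
        key, sep, value = line.partition(": ")
        if sep:
            metadata[key] = value
    return metadata


def fell_back_to_empty_parser(metadata):
    """Return True if the X-TIKA:Parsed-By value names EmptyParser."""
    return metadata.get("X-TIKA:Parsed-By", "").endswith("EmptyParser")


# Sample copied from the metadata quoted in the report above.
SAMPLE = """\
Content-Length: 190976
Content-Type: video/x-msvideo
X-TIKA:Parsed-By: org.apache.tika.parser.EmptyParser
X-TIKA:Parsed-By-Full-Set: org.apache.tika.parser.EmptyParser
resourceName: ACADABOUT.AVI
"""

meta = parse_tika_metadata(SAMPLE)
if fell_back_to_empty_parser(meta):
    print(f"{meta['resourceName']}: no parser handled {meta['Content-Type']}")
```

A file flagged this way was detected (here as video/x-msvideo) but handed to EmptyParser, which matches the behaviour described in the report.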
