You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by GitBox <gi...@apache.org> on 2021/01/25 16:08:50 UTC

[GitHub] [nutch] sebastian-nagel commented on pull request #564: NUTCH-2845 Complete rules of urlfilter-suffix

sebastian-nagel commented on pull request #564:
URL: https://github.com/apache/nutch/pull/564#issuecomment-766924760


   Below the mapping of file suffixes to file formats or MIME types:
   ```
   .3g2	3GPP2 Multimedia
   .3gp	video/3gpp
   .afm	application/x-font-adobe-metric
   .aif	audio/x-aiff
   .aiff	audio/x-aiff
   .apk	application/vnd.android.package-archive
   .avif	video/quicktime
   .bz2	application/x-bzip2
   .crx	application/x-chrome-package
   .dll	Microsoft dynamic-link library
   .dmg	Apple Disk Image
   .egg	Egg Python package
   .f4v	video/quicktime
   .flac	audio/x-flac
   .gblorb	audio/x-aiff
   .gifv	image/gif
   .icns	Apple Icon Image
   .ipk	Debian software installation package
   .jp2	image/jp2
   .jpf	image/jpx
   .jpx	image/jpx
   .js	application/javascript
   .kdc	image/x-raw-kodak
   .lzma	application/x-lzma
   .m3u8	audio/x-mpegurl
   .m4a	video/mp4
   .m4r	iPhone Ringtone
   .m4v	video/x-m4v
   .mar	Mozilla Archive
   .mk3d	Matroska 3D Video
   .mka	Matroska Audio
   .mks	Matroska Elementary Stream
   .mkv	video/x-matroska
   .movie	video/x-sgi-movie
   .mp4	video/mp4
   .mp4a	audio/mp4
   .mpga	audio/mpeg
   .oga	audio/vorbis
   .opus	audio/opus
   .otf	application/x-font-otf
   .pfb	application/x-font-type1
   .qt	Apple QuickTime Movie
   .rar	application/x-rar-compressed
   .svg	image/svg+xml
   .svgz	image/svg+xml
   .ttf	application/x-font-ttf
   .udeb	application/x-debian-package
   .webm	video/webm
   .webp	image/webp
   .whl	Wheel Python package
   .woff	application/font-woff
   .woff2	application/font-woff2
   .xm	audio/xm
   .xz	application/x-xz
   ```


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org