Posted to dev@tika.apache.org by "Lewis John McGibbney (Jira)" <ji...@apache.org> on 2021/05/03 23:33:00 UTC

[jira] [Comment Edited] (TIKA-94) Speech-to-text transcription

    [ https://issues.apache.org/jira/browse/TIKA-94?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17338669#comment-17338669 ] 

Lewis John McGibbney edited comment on TIKA-94 at 5/3/21, 11:32 PM:
--------------------------------------------------------------------

4923 days after this ticket was originally created, I'm glad to say that it's been merged into main!
The implementation can be improved. For example, we may deprecate the tika.transcribe API and simply "hide" the transcription functionality behind tika.parse. At the end of the day, we just want to interpret a file and get text back, so why introduce a new API to do that?
That, however, remains follow-on work. :)

Excellent work and kudos to the following contributors (in alphabetical order):

Ashank Behara
Minh Phan
Nic Prate
Rohan Surana
Sahan Kumarasinghe

This feature was developed through the HackIllinois program (https://www.hackillinois.org/), which was very well organized and executed.


> Speech-to-text transcription
> ----------------------------
>
>                 Key: TIKA-94
>                 URL: https://issues.apache.org/jira/browse/TIKA-94
>             Project: Tika
>          Issue Type: New Feature
>          Components: parser
>            Reporter: Jukka Zitting
>            Assignee: Lewis John McGibbney
>            Priority: Minor
>              Labels: new-parser
>             Fix For: 1.27
>
>
> Like OCR for image files (TIKA-93), we could try using speech recognition to extract text content (where available) from audio (and video!) files.
> The CMU Sphinx engine (http://cmusphinx.sourceforge.net/) looks promising and comes with a friendly license.


