You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Amit Baghel (JIRA)" <ji...@apache.org> on 2018/08/19 12:43:00 UTC

[jira] [Created] (SPARK-25157) Streaming of image files from directory

Amit Baghel created SPARK-25157:
-----------------------------------

             Summary: Streaming of image files from directory
                 Key: SPARK-25157
                 URL: https://issues.apache.org/jira/browse/SPARK-25157
             Project: Spark
          Issue Type: New Feature
          Components: ML, Structured Streaming
    Affects Versions: 2.3.1
            Reporter: Amit Baghel


We are doing video analytics for video streams using Spark. At present there is no direct way to stream video frames or image files to Spark and process using Structured Streaming and Dataset. We are using Kafka to stream images and then doing processing at spark. We need a method in Spark to stream images from directory. Currently *{{DataStreamReader}}* doesn't support Images. With the introduction of *org.apache.spark.ml.image.ImageSchema* class, we think streaming capabilities can be added for images. It is fine if it won't support some of the structured streaming features as it is a binary file. Schema used in ImageSchema class for image can be used in Dataset. This feature could be similar to *mmlspark* *streamImages* method. ([https://github.com/Azure/mmlspark/blob/4413771a8830e4760f550084da60ea0616bf80b9/src/io/image/src/main/python/ImageReader.py] )



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org