You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Amit Baghel (JIRA)" <ji...@apache.org> on 2018/08/19 12:43:00 UTC
[jira] [Created] (SPARK-25157) Streaming of image files from
directory
Amit Baghel created SPARK-25157:
-----------------------------------
Summary: Streaming of image files from directory
Key: SPARK-25157
URL: https://issues.apache.org/jira/browse/SPARK-25157
Project: Spark
Issue Type: New Feature
Components: ML, Structured Streaming
Affects Versions: 2.3.1
Reporter: Amit Baghel
We are doing video analytics for video streams using Spark. At present there is no direct way to stream video frames or image files to Spark and process using Structured Streaming and Dataset. We are using Kafka to stream images and then doing processing at spark. We need a method in Spark to stream images from directory. Currently *{{DataStreamReader}}* doesn't support Images. With the introduction of *org.apache.spark.ml.image.ImageSchema* class, we think streaming capabilities can be added for images. It is fine if it won't support some of the structured streaming features as it is a binary file. Schema used in ImageSchema class for image can be used in Dataset. This feature could be similar to *mmlspark* *streamImages* method. ([https://github.com/Azure/mmlspark/blob/4413771a8830e4760f550084da60ea0616bf80b9/src/io/image/src/main/python/ImageReader.py] )
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org