Posted to dev@sedona.apache.org by "Magdalena Wójciak (Jira)" <ji...@apache.org> on 2021/11/15 08:11:00 UTC

[jira] [Commented] (SEDONA-18) error reading shapefile

    [ https://issues.apache.org/jira/browse/SEDONA-18?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17443624#comment-17443624 ] 

Magdalena Wójciak commented on SEDONA-18:
-----------------------------------------

Is anyone working on this?

> error reading shapefile
> -----------------------
>
>                 Key: SEDONA-18
>                 URL: https://issues.apache.org/jira/browse/SEDONA-18
>             Project: Apache Sedona
>          Issue Type: Bug
>    Affects Versions: 1.0.0
>            Reporter: Brian Lockwood
>            Priority: Minor
>
> Get the following error when calling:
> {code:java}
> ShapefileReader.readToGeometryRDD(spark.sparkContext, path)
> {code}
>  
> {code:java}
> java.io.IOException: Can't find .shp file.
>  at org.apache.sedona.core.formatMapper.shapefileParser.shapes.CombineShapeReader.initialize(CombineShapeReader.java:107)
>  at org.apache.spark.rdd.NewHadoopRDD$$anon$1.liftedTree1$1(NewHadoopRDD.scala:216)
>  at org.apache.spark.rdd.NewHadoopRDD$$anon$1.<init>(NewHadoopRDD.scala:213)
>  at org.apache.spark.rdd.NewHadoopRDD.compute(NewHadoopRDD.scala:168)
>  at org.apache.spark.rdd.NewHadoopRDD.compute(NewHadoopRDD.scala:71)
>  at org.apache.spark.rdd.RDD.computeOrReadCheckpoint(RDD.scala:349)
>  at org.apache.spark.rdd.RDD.iterator(RDD.scala:313)
>  at org.apache.spark.rdd.MapPartitionsRDD.compute(MapPartitionsRDD.scala:52)
>  at org.apache.spark.rdd.RDD.computeOrReadCheckpoint(RDD.scala:349)
>  at org.apache.spark.rdd.RDD.iterator(RDD.scala:313)
>  at org.apache.spark.scheduler.ResultTask.runTask(ResultTask.scala:90)
>  at org.apache.spark.scheduler.Task.run(Task.scala:127)
>  at org.apache.spark.executor.Executor$TaskRunner.$anonfun$run$3(Executor.scala:446)
>  at org.apache.spark.util.Utils$.tryWithSafeFinally(Utils.scala:1377)
>  at org.apache.spark.executor.Executor$TaskRunner.run(Executor.scala:449)
>  at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
>  at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
>  at java.lang.Thread.run(Thread.java:748){code}
> The path does contain a .shp file, but it also contains a few XML files whose names include .shp. If I rename these files, I can load the shapefile.
> Example shapefile: [tl_2020_us_zcta510.zip|https://www2.census.gov/geo/tiger/TIGER2020/ZCTA5/tl_2020_us_zcta510.zip]. If I rename the following files so that their names no longer contain .shp, or delete them, everything works as expected:
> {code:java}
> tl_2020_us_zcta510.shp.ea.iso.xml
> tl_2020_us_zcta510.shp.iso.xml{code}
>  
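Until this is fixed, the workaround described in the report (moving the XML sidecar files out of the way) could be scripted roughly as below. This is only a sketch under some assumptions: the class `ShapefileSidecarCleaner` and its methods are hypothetical names, not part of Sedona; the folder is assumed to be on the local filesystem (HDFS or S3 would need the Hadoop FileSystem API instead); and the rule "name contains .shp but does not end with .shp" is inferred from the reporter's observation, not from the reader's source code.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.StandardCopyOption;
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;
import java.util.stream.Collectors;
import java.util.stream.Stream;

// Hypothetical helper, not part of Apache Sedona.
final class ShapefileSidecarCleaner {

    // True for names such as "x.shp.iso.xml": they contain ".shp"
    // but are not the .shp file itself.
    static boolean isMisleadingSidecar(String fileName) {
        String lower = fileName.toLowerCase(Locale.ROOT);
        return lower.contains(".shp") && !lower.endsWith(".shp");
    }

    // Moves every misleading sidecar file from shapefileDir into quarantineDir
    // and returns the new locations. Local filesystem only (assumption).
    static List<Path> moveSidecars(Path shapefileDir, Path quarantineDir) throws IOException {
        Files.createDirectories(quarantineDir);

        // Collect candidates first so the directory stream is closed before moving files.
        List<Path> candidates;
        try (Stream<Path> entries = Files.list(shapefileDir)) {
            candidates = entries
                    .filter(Files::isRegularFile)
                    .filter(p -> isMisleadingSidecar(p.getFileName().toString()))
                    .collect(Collectors.toList());
        }

        List<Path> moved = new ArrayList<>();
        for (Path source : candidates) {
            Path target = quarantineDir.resolve(source.getFileName());
            Files.move(source, target, StandardCopyOption.REPLACE_EXISTING);
            moved.add(target);
        }
        return moved;
    }
}
```

Calling `ShapefileSidecarCleaner.moveSidecars(dir, dir.resolveSibling("sidecars"))` before `ShapefileReader.readToGeometryRDD` would leave only the actual shapefile components in the folder, which matches the manual rename/delete workaround from the report.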



--
This message was sent by Atlassian Jira
(v8.20.1#820001)