You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Sean R. Owen (Jira)" <ji...@apache.org> on 2021/04/23 18:15:00 UTC

[jira] [Resolved] (SPARK-35193) Scala/Java compatibility issue Re: how to use externalResource in java transformer from Scala Transformer?

     [ https://issues.apache.org/jira/browse/SPARK-35193?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Sean R. Owen resolved SPARK-35193.
----------------------------------
    Resolution: Invalid

I think this should be a question to the user@ list - I don't see reason to believe it's a Spark issue.

There are several things that could be wrong, like, ExternalResourceParam not extending Param or not having the right name, etc.

> Scala/Java compatibility issue Re: how to use externalResource in java transformer from Scala Transformer?
> ----------------------------------------------------------------------------------------------------------
>
>                 Key: SPARK-35193
>                 URL: https://issues.apache.org/jira/browse/SPARK-35193
>             Project: Spark
>          Issue Type: Bug
>          Components: Java API, ML
>    Affects Versions: 3.1.1
>            Reporter: Arthur
>            Priority: Major
>
> I am trying to make a custom transformer use an externalResource, as it requires a large table to do the transformation. I'm not super familiar with scala syntax, but from snippets found on the internet I think I've made a proper java implementation. I am running into the following error:
> Exception in thread "main" java.lang.IllegalArgumentException: requirement failed: Param HardMatchDetector_d95b8f699114__externalResource does not belong to HardMatchDetector_d95b8f699114.
>  at scala.Predef$.require(Predef.scala:281)
>  at org.apache.spark.ml.param.Params.shouldOwn(params.scala:851)
>  at org.apache.spark.ml.param.Params.set(params.scala:727)
>  at org.apache.spark.ml.param.Params.set$(params.scala:726)
>  at org.apache.spark.ml.PipelineStage.set(Pipeline.scala:41)
>  at org.apache.spark.ml.param.Params.set(params.scala:713)
>  at org.apache.spark.ml.param.Params.set$(params.scala:712)
>  at org.apache.spark.ml.PipelineStage.set(Pipeline.scala:41)
>  at HardMatchDetector.setResource(HardMatchDetector.java:45)
>  
> Code as follows:
> {code:java}
> public class HardMatchDetector extends Transformer implements DefaultParamsWritable, DefaultParamsReadable, Serializable {
> public String inputColumn = "value";
>  public String outputColumn = "hardMatches";
>  private ExternalResourceParam resourceParam = new ExternalResourceParam(this, "externalResource", "external resource, parquet file with 2 columns, one names and one wordcount");;
>  private String uid;
> public HardMatchDetector setResource(final ExternalResource value)
> { return (HardMatchDetector)this.set(this.resourceParam, value); }
> public HardMatchDetector setResource(final String path)
> { return this.setResource(new ExternalResource(path, ReadAs.TEXT(), new HashMap())); }
> @Override
>  public String uid()
> { return getUid(); }
> private String getUid() {
>  if (uid == null)
> { uid = Identifiable$.MODULE$.randomUID("HardMatchDetector"); }
> return uid;
>  }
> @Override
>  public Dataset<Row> transform(final Dataset<?> dataset)
> { return dataset; }
> @Override
>  public StructType transformSchema(StructType schema)
> { return schema.add(DataTypes.createStructField(outputColumn, DataTypes.StringType, true)); }
> @Override
>  public Transformer copy(ParamMap extra)
> { return new HardMatchDetector(); }
> }
> public class HardMatcherTest extends AbstractSparkTest
> { @Test 
> public void test() 
> { 
> var hardMatcher = new HardMatchDetector().setResource(pathName); }
> }
> {code}
>  
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org