You are viewing a plain text version of this content. The canonical link for it is here.
Posted to github@arrow.apache.org by GitBox <gi...@apache.org> on 2022/07/13 18:57:44 UTC

[GitHub] [arrow] jvanstraten commented on a diff in pull request #13537: ARROW-16988: [C++] Introduce Substrait ToProto/FromProto conversion options

jvanstraten commented on code in PR #13537:
URL: https://github.com/apache/arrow/pull/13537#discussion_r920406800


##########
cpp/src/arrow/engine/substrait/options.h:
##########
@@ -0,0 +1,55 @@
+// Licensed to the Apache Software Foundation (ASF) under one
+// or more contributor license agreements.  See the NOTICE file
+// distributed with this work for additional information
+// regarding copyright ownership.  The ASF licenses this file
+// to you under the Apache License, Version 2.0 (the
+// "License"); you may not use this file except in compliance
+// with the License.  You may obtain a copy of the License at
+//
+//   http://www.apache.org/licenses/LICENSE-2.0
+//
+// Unless required by applicable law or agreed to in writing,
+// software distributed under the License is distributed on an
+// "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY
+// KIND, either express or implied.  See the License for the
+// specific language governing permissions and limitations
+// under the License.
+
+// This API is EXPERIMENTAL.
+
+#pragma once
+
+namespace arrow {
+namespace engine {
+
+/// How strictly to adhere to the input structure when converting between Substrait and
+/// Acero representations of a plan. This allows the user to trade conversion accuracy
+/// for performance and lenience.
+enum class ConversionStrictness {
+  /// Prevent information loss by rejecting incoming plans that use features or contain
+  /// metadata that cannot be exactly represented in the output format in a way that
+  /// will round-trip. Relations/nodes must map one-to-one.
+  PEDANTIC,

Review Comment:
   > "Pedantic" conveys an idea of spurious or arbitrary constraints. Is this the case here?
   
   I'd say so, actually. "Pedantic" includes constraints such as rejecting an Acero plan for conversion to Substrait if the internal column names are not exactly what you'd get after round-tripping through Substrait (which cannot represent inner column names), even though the plan can be conveyed in Substrait perfectly well.
   
   > Otherwise "strict" would sound better IMHO.
   
   Strict implies to me to strictly conform to all the rules of the output format, i.e. don't make anything that's out of spec. But none of the options do that, so it feels kind of meaningless.



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org