You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@drill.apache.org by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2020/10/16 21:50:00 UTC

[jira] [Commented] (DRILL-7763) Add Limit Pushdown to File Based Storage Plugins

    [ https://issues.apache.org/jira/browse/DRILL-7763?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17215670#comment-17215670 ] 

ASF GitHub Bot commented on DRILL-7763:
---------------------------------------

cgivre commented on a change in pull request #2092:
URL: https://github.com/apache/drill/pull/2092#discussion_r506735879



##########
File path: contrib/format-spss/src/test/java/org/apache/drill/exec/store/spss/TestSpssReader.java
##########
@@ -108,7 +108,7 @@ public void testStarQuery() throws Exception {
 
   @Test
   public void testExplicitQuery() throws Exception {
-    String sql = "SELECT ID, Urban, Urban_value FROM dfs.`spss/testdata.sav` WHERE d16=4";
+    String sql = "SELECT ID, Urban, Urban_value FROM dfs.`spss/testdata.sav` WHERE d16=4 LIMIT 5";

Review comment:
       Fixed.




----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


> Add Limit Pushdown to File Based Storage Plugins
> ------------------------------------------------
>
>                 Key: DRILL-7763
>                 URL: https://issues.apache.org/jira/browse/DRILL-7763
>             Project: Apache Drill
>          Issue Type: Improvement
>    Affects Versions: 1.17.0
>            Reporter: Charles Givre
>            Assignee: Charles Givre
>            Priority: Major
>             Fix For: 1.19.0
>
>
> As currently implemented, when querying a file, Drill will read the entire file even if a limit is specified in the query.  This PR does a few things:
>  # Refactors the EasyGroupScan, EasySubScan, and EasyFormatConfig to allow the option of pushing down limits.
>  # Applies this to all the EVF based format plugins which are: LogRegex, PCAP, SPSS, Esri, Excel and Text (CSV). 
> Due to JSON's fluid schema, it would be unwise to adopt the limit pushdown as it could result in very inconsistent schemata.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)