You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@drill.apache.org by GitBox <gi...@apache.org> on 2022/07/12 15:57:23 UTC

[GitHub] [drill] CarusoGuillaume opened a new issue, #2595: Unexpected behavior with Cassandra connector

CarusoGuillaume opened a new issue, #2595:
URL: https://github.com/apache/drill/issues/2595

   **Describe the bug**
   We are using Apache Drill to add ANSI SQL capabilities to cassandra, but when using the '>', '<' or 'IN' operators when filtering data, the query plan switch from a CassandraFilter to a regular Filter, meaning all the cassandra table data is scanned, fetched, then filtered, which is not the expected behavior, as the Apache Calcite plugin supports those operators.
   This results in very slow queries, and high resources consumptions.
   
   **Screenshots**
   
   *(boitier_id, libelle, unite and periode are keys)*
   
   *Expected behavior (using a CassandraFilter)*
   ![image](https://user-images.githubusercontent.com/62694436/178532574-c276851a-cc23-46fa-8ba9-6f04cebf2658.png)
   ![image](https://user-images.githubusercontent.com/62694436/178532335-e20d4486-d4d5-4337-8cec-6ed6c178051e.png)
   
   *Unexpected behavior (When using lt, gt operator)*
   ![image](https://user-images.githubusercontent.com/62694436/178535991-30dc3731-00d8-4a9d-8765-88d10dc2df4a.png)
   ![image](https://user-images.githubusercontent.com/62694436/178532850-7cfef8b1-7699-4532-af3e-602f0bc4eb37.png)
   
   *Possible solution to get a correct behavior, but not completely, as the whole dataset for (76, '3dProd_C1','W') is loaded insted of just the portion which we would like to use*
   ![image](https://user-images.githubusercontent.com/62694436/178533991-bd9c3288-f905-406d-bc1e-101a12ecd9a2.png)
   ![image](https://user-images.githubusercontent.com/62694436/178534184-265eca54-4d7e-401b-80fb-b4d7d8cdd575.png)
   
   
   **Expected behavior**
   The Query should use a CassandraFilter in order to fetch data efficiently, even when using '>', '<' operators, and not use a normal filter, which requires to fetch all the data from the queried table.
   
   
   Should our question not belong here, feel free to remove it, but please point us to where we could ask it.
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscribe@drill.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


[GitHub] [drill] CarusoGuillaume commented on issue #2595: Unexpected behavior with Cassandra connector

Posted by GitBox <gi...@apache.org>.
CarusoGuillaume commented on issue #2595:
URL: https://github.com/apache/drill/issues/2595#issuecomment-1216235037

   @cgivre Is the update available for docker ? The `latest` tag pulls the 1.20.2 version. And in this version, it still performs a full table scan when using a cassandra operator that should result in a cassandra filter.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscribe@drill.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


[GitHub] [drill] CarusoGuillaume commented on issue #2595: Unexpected behavior with Cassandra connector

Posted by GitBox <gi...@apache.org>.
CarusoGuillaume commented on issue #2595:
URL: https://github.com/apache/drill/issues/2595#issuecomment-1225825361

   Hi ! The issue is still here in Apache Drill 2.0. 


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscribe@drill.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


[GitHub] [drill] jnturton commented on issue #2595: Unexpected behavior with Cassandra connector

Posted by GitBox <gi...@apache.org>.
jnturton commented on issue #2595:
URL: https://github.com/apache/drill/issues/2595#issuecomment-1216292882

   @CarusoGuillaume he'll be referring to the snaphot builds from the master branch. Look for Docker Hub tags starting with "master"


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscribe@drill.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


[GitHub] [drill] cgivre commented on issue #2595: Unexpected behavior with Cassandra connector

Posted by GitBox <gi...@apache.org>.
cgivre commented on issue #2595:
URL: https://github.com/apache/drill/issues/2595#issuecomment-1214559054

   @CarusoGuillaume Thanks for reporting this.  One thing... we recently merged a PR which updated the Drill query planner and likely includes updates to the Cassandra adapter.  I'd be curious if trying this query with the current master of Drill 2.0 has any improvement.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscribe@drill.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org