You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@iceberg.apache.org by GitBox <gi...@apache.org> on 2021/03/31 08:48:25 UTC

[GitHub] [iceberg] openinx commented on issue #2397: When querying data by using the Flink DataStream API , the result is not correct.Is it a bug or is my usage wrong?

openinx commented on issue #2397:
URL: https://github.com/apache/iceberg/issues/2397#issuecomment-810894095


   @wuwangben  In this lines: 
   
   ```java
   FlinkSource.forRowData().env(bsEnv).filters(Collections.singletonList(filter)).project(flinkSchema)
   ```
   
   The `filters` are used to push down filters to data files, then we don't have read all the data files from iceberg table.  But those filters won't be applied in row-level,  that means  we may still read more rows from the filtered data files. In you case,  I think we will still need to filter the unexpected rows in `DataStream` by:
   
   ```java
   dataStream.filter(FilterFunction func);
   ```
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For additional commands, e-mail: issues-help@iceberg.apache.org