Posted to dev@pinot.apache.org by Pinot Slack Email Digest <sn...@apache.org> on 2021/06/20 02:00:19 UTC

Apache Pinot Daily Email Digest (2021-06-19)

### _#general_

  
 **@gqian3:** We are currently deploying Pinot for customer-facing online
queries. We also have a use case that stores 2 years of data, possibly hundreds
of millions of records per day, and want to build an offline report generator
that queries the offline data, aggregates on different dimensions, and converts
the result to a CSV report. Is Pinot able to handle this kind of use case?
Would the offline report queries affect online customer query latency? How
cost-efficient is it to host a Pinot cluster for this kind of use case?  
**@mayanks:** At a high level, what you have described is possible using Pinot.
We would need more concrete information to suggest how to make it work  
**@gqian3:** Great to know. Basically, for the offline report generator, we are
building a backend service that queries Pinot to aggregate on time and other
dimensions over at most 2 years of data. Users are notified when a report is
ready. Should we separate the Pinot cluster serving offline reports from the
Pinot cluster serving online web queries?  
**@g.kishore:** Try the same cluster for now. You can try a star-tree index to
minimize the impact  
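
For readers unfamiliar with the star-tree suggestion, here is a minimal sketch of what such an index entry in a Pinot table config might look like, built as a Python dict and printed as JSON. The column names (`customer_id`, `region`, `event_day`, `amount`) and the helper function are hypothetical and not from the thread; the right split order and metrics depend on the actual report queries.

```python
import json


def star_tree_table_index_config(dimensions, metric_pairs, max_leaf_records=10000):
    """Build the tableIndexConfig fragment that declares one star-tree index.

    dimensions: columns the reports group or filter by, in split order.
    metric_pairs: pre-aggregations written as "<FUNCTION>__<column>".
    """
    return {
        "tableIndexConfig": {
            "starTreeIndexConfigs": [
                {
                    "dimensionsSplitOrder": list(dimensions),
                    "skipStarNodeCreationForDimensions": [],
                    "functionColumnPairs": list(metric_pairs),
                    "maxLeafRecords": max_leaf_records,
                }
            ]
        }
    }


# Hypothetical columns, chosen only for illustration.
config = star_tree_table_index_config(
    dimensions=["customer_id", "region", "event_day"],
    metric_pairs=["COUNT__*", "SUM__amount"],
)
print(json.dumps(config, indent=2))
```

The fragment would be merged into the table config of the table backing the reports. Aggregation queries whose filters and group-by columns are covered by the split order can then be served from pre-aggregated records rather than raw rows, which is how the index is expected to limit the impact on online queries.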
**@atri.sharma:** @atri.sharma has joined the channel  
 **@atri.sharma:** Hello!!  
**@xiangfu0:** hi!  
**@mayanks:** Welcome to the Apache Pinot Community!  
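
The report-generator flow described in the thread (aggregate over a time range, then convert to CSV) could be sketched roughly as below. This is an illustration under assumptions, not something posted in the channel: the table and column names are hypothetical, and the sample response only mimics the `resultTable` shape returned by the Pinot broker's SQL endpoint, so no network call is made.

```python
import csv
import io


def build_report_query(table, dimensions, metric, start_ms, end_ms,
                       time_column="event_time_ms", limit=100000):
    """Compose an aggregation query over a time range (names are hypothetical)."""
    dims = ", ".join(dimensions)
    return (
        f"SELECT {dims}, SUM({metric}) AS total "
        f"FROM {table} "
        f"WHERE {time_column} BETWEEN {start_ms} AND {end_ms} "
        f"GROUP BY {dims} LIMIT {limit}"
    )


def result_table_to_csv(response):
    """Convert a broker-style response with a resultTable into CSV text."""
    table = response["resultTable"]
    buf = io.StringIO()
    writer = csv.writer(buf, lineterminator="\n")
    writer.writerow(table["dataSchema"]["columnNames"])  # header row
    writer.writerows(table["rows"])                      # one line per group
    return buf.getvalue()


query = build_report_query("events", ["region"], "amount",
                           1560902400000, 1624060800000)

# Stand-in for the JSON a broker would return for the query above.
sample_response = {
    "resultTable": {
        "dataSchema": {
            "columnNames": ["region", "total"],
            "columnDataTypes": ["STRING", "DOUBLE"],
        },
        "rows": [["us", 12.5], ["eu", 7.0]],
    }
}

report = result_table_to_csv(sample_response)
print(report)
```

In a real service the query string would be sent to the broker and the parsed JSON passed to `result_table_to_csv`. Running the report in the background and notifying the user on completion, as described in the thread, keeps long scans off the interactive request path.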

### _#random_

  
 **@atri.sharma:** @atri.sharma has joined the channel  

### _#troubleshooting_

  
 **@atri.sharma:** @atri.sharma has joined the channel  