Posted to dev@pinot.apache.org by Pinot Slack Email Digest <sn...@apache.org> on 2021/05/29 02:00:18 UTC

Apache Pinot Daily Email Digest (2021-05-28)

### _#general_

  
 **@keweishang:** Hi team, we’re considering Pinot for our customer-facing analytics (mainly dashboards). We also want to use the same data store (Pinot table) to export denormalized reports (CSV files of 10k-100k rows filtered from the table, with ~20 selected columns) to our clients. Is Pinot also a good fit for the report-generating use case? For example, our backend service could query the Pinot table and generate the CSV report from the query result. Thanks!  
**@mayanks:** Yes, 10k-100k rows should be fine for report-download kinds of use cases  
**@keweishang:** Thank you @mayanks for your quick reply. Hopefully the Pinot
PoC will work out nicely for us :+1:  
**@mayanks:** Sure, let us know if you have questions  

###  _#troubleshooting_

  
 **@shaileshjha061:** Hi @mayanks @dlavoie Our API Team is not able to connect
to Pinot Broker. Can you help me with this? I am attaching the broker log.
```SEVERE: An I/O error has occurred while writing a response message entity
to the container output stream.
org.glassfish.jersey.server.internal.process.MappableException:
java.io.IOException: Connection closed``` CC: @mohamed.sultan @nadeemsadim  
**@shaileshjha061:** @dlavoie @mayanks Can you please help me troubleshoot
this issue?  
**@mohamed.sultan:** we are receiving a 404 error when we try to use the broker IP  
**@mohamed.sultan:** not sure why?  
**@mayanks:** Does query via query console work?  
**@mohamed.sultan:** yes. it is working  
**@mayanks:** And can you query the broker using curl?  
**@mayanks:** If the query console works, that means the broker is up and running, which also implies you can query the broker using curl. I’d suggest looking at your client-side code; if you are getting a 404, it is probably not hitting the right URL.  
**@mohamed.sultan:** ```time="2021-05-28T04:59:26Z" level=error msg="Caught
exception to execute SQL query select count(*) from vae_alert, Error: caught
http exception when querying Pinot: 404 Not Found\n"```  
**@dlavoie:** Which endpoint are you calling on the broker end?  
**@mohamed.sultan:** load balancer ip provided by GCP  
**@dlavoie:** Yes, but what URL path?  
**@mohamed.sultan:** @mags.carlin Can you post the URL path that you append in
webservice code?  
**@mohamed.sultan:** CC: @nadeemsadim  
**@mohamed.sultan:** @sandeep ?  
**@mags.carlin:** @dlavoie We are using the Pinot client, passing the broker host and port  
**@mags.carlin:** It returns the connection with which we execute the query directly  
**@mags.carlin:** This is how we used it with the older Pinot instance, and it works fine.  
**@dlavoie:** The controller and broker have different endpoints for queries  
**@mags.carlin:** We are using the broker IP, right Sultan?  
**@dlavoie:** Brokers are queried using `/query/sql`.  
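For reference, a minimal sketch of posting a query to that broker endpoint, assuming the default broker port 8099 and the `vae_alert` table from this thread (host, port, and query are placeholders to adjust):
```
package main

import (
	"bytes"
	"fmt"
	"io"
	"log"
	"net/http"
)

func main() {
	// The broker answers SQL on POST /query/sql with a JSON body; 8099 is the
	// default broker port. "broker-host" is a placeholder for the broker address.
	body := []byte(`{"sql": "select count(*) from vae_alert"}`)
	resp, err := http.Post("http://broker-host:8099/query/sql", "application/json", bytes.NewReader(body))
	if err != nil {
		log.Fatalf("request failed: %v", err)
	}
	defer resp.Body.Close()
	out, _ := io.ReadAll(resp.Body)
	// A 404 here usually means the path (or the service behind it) is wrong.
	fmt.Printf("HTTP %d: %s\n", resp.StatusCode, out)
}
```
An equivalent curl test would POST the same JSON body to the same URL.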
**@mohamed.sultan:** yes @mags.carlin  
**@nadeemsadim:** We are not querying the Pinot broker directly using curl, but via the golang Pinot connector, i.e.  
**@mohamed.sultan:** seems like /query/sql is not being used here  
**@nadeemsadim:** doing it programmatically, i.e. querying programmatically by importing the pinot-client library  
**@nadeemsadim:** one old Pinot cluster is working, but the latest one is not  
**@mohamed.sultan:** Exactly!  
**@nadeemsadim:** here I don’t think we need to append /query/sql, because we just need to specify the host and port  
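For context, the Go client usage under discussion boils down to something like the sketch below; the import path reflects the pinot-client-go module as of this thread, and the broker address, port, and table are placeholders. The client is given only host:port and is expected to build the query URL itself:
```
package main

import (
	"fmt"
	"log"

	// Go client import path as of this thread; newer releases may live at a
	// different module path (assumption).
	"github.com/fx19880617/pinot-client-go/pinot"
)

func main() {
	// Only the broker host:port is supplied; no /query/sql path is appended here.
	conn, err := pinot.NewFromBrokerList([]string{"broker-host:8099"})
	if err != nil {
		log.Fatalf("failed to create Pinot connection: %v", err)
	}
	resp, err := conn.ExecuteSQL("vae_alert", "select count(*) from vae_alert")
	if err != nil {
		// A 404 wrapped in this error, while curl works, usually means the client
		// was pointed at the wrong host or port.
		log.Fatalf("query failed: %v", err)
	}
	fmt.Println("rows returned:", resp.ResultTable.GetRowCount())
}
```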
**@dlavoie:** Have you tested the query with curl first?  
**@nadeemsadim:** it is working with the broker IP of the old cluster but not with the broker IP of the new cluster  
**@dlavoie:** Test with curl first please, let’s eliminate the go client from
the equation  
**@mayanks:** Yes, this ^^  
**@mayanks:**  
**@mags.carlin:** Does curl need to be used with /query and the broker IP? What is the endpoint?  
**@mayanks:** Check the link ^^  
**@mags.carlin:** Okay  
**@nadeemsadim:** we tried curl with the broker endpoints of both Pinot clusters and it worked with both! @dlavoie @mayanks  
**@nadeemsadim:** so the issue seems to be only with the golang connector for one of the Pinot clusters  
**@nadeemsadim:**  
 **@patidar.rahul8392:** Hello everyone, I am trying to use HDFS as deep storage for Pinot. I followed all the steps mentioned in the Pinot documentation and everything worked fine. Then I created a Pinot table, and on the Pinot UI I am able to see the tables and segments. Now I expect the same segments to also be available at the location we configured as controller.data.dir (default fs / HDFS dir), but that directory is empty. Kindly suggest if I need to do anything else.  
 **@patidar.rahul8392:** I followed this link.  
 **@patidar.rahul8392:** @fx19880617 @sleepythread  
 **@patidar.rahul8392:** These are the config file details. I also checked the classpath_prefix value; it points to the correct jar locations. Kindly help.  
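For reference, the controller-side settings for HDFS deep storage typically look like the sketch below (roughly per the Pinot HDFS deep-storage guide; the namenode address, HDFS path, and Hadoop conf directory are placeholders to adjust for your cluster):
```
controller.data.dir=hdfs://namenode:9000/user/pinot/controller/data
controller.local.temp.dir=/tmp/pinot/controller
pinot.controller.storage.factory.class.hdfs=org.apache.pinot.plugin.filesystem.HadoopPinotFS
pinot.controller.storage.factory.hdfs.hadoop.conf.path=/path/to/hadoop/conf
pinot.controller.segment.fetcher.protocols=file,http,hdfs
pinot.controller.segment.fetcher.hdfs.class=org.apache.pinot.common.utils.fetcher.PinotFSSegmentFetcher
```
Note that segments only land in controller.data.dir once a consuming segment is completed and persisted, which is what the rest of this thread covers.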
**@fx19880617:** Have you pushed the segments to an offline table, or is it a realtime table?  
**@patidar.rahul8392:** It's a realtime table  
**@fx19880617:** ok, so you can query the data now  
**@patidar.rahul8392:** Yes I can query  
**@fx19880617:** are there any segments in `ONLINE` status, or are they all in `CONSUMING` status?  
**@patidar.rahul8392:** All in consuming  
**@fx19880617:** ah  
**@fx19880617:** then you need to wait for a while  
**@fx19880617:** those segments are in memory while in consuming status  
**@fx19880617:** once they reach the threshold, Pinot will persist and upload them  
**@fx19880617:** then those segments will be in `ONLINE` status  
**@patidar.rahul8392:** Ok, is there any time limit we can set? So far all of ours are in consuming status  
**@fx19880617:** in table config, you can set flush threshold  
**@fx19880617:**  
**@patidar.rahul8392:**  
**@fx19880617:** You can modify those: ```
"realtime.segment.flush.threshold.size": "0",
"realtime.segment.flush.threshold.time": "24h",
"realtime.segment.flush.desired.size": "50M",```  
**@patidar.rahul8392:** This is the segment metadata I can see  
**@fx19880617:** set `realtime.segment.flush.threshold.time` to `5m`, then it will persist every 5 minutes  
**@patidar.rahul8392:** Ohh ok got it..thanks a lot @fx19880617.let me update
this.  
**@fx19880617:** I think your data volume is not very high, so the consuming segment needs to wait a long time  
**@fx19880617:** also, we don’t recommend having too many segments in a table  
**@fx19880617:** so I suggest changing back to the default config once you finish the test  
**@patidar.rahul8392:** Oh ok I have only 4 segments  
**@patidar.rahul8392:** Ok  
**@fx19880617:** I think you mean 4 kafka partitions?  
**@patidar.rahul8392:** Yes  
**@fx19880617:** right, a segment every 5 minutes means Pinot will generate 12 * 24 * 4 = 1152 segments per day  
**@fx19880617:** I suggest keeping the per-table segment count < 20k  
**@patidar.rahul8392:** Ohh ok, so for realtime it’s better to keep a high threshold value.  
**@patidar.rahul8392:** Ok @fx19880617  
**@fx19880617:** right, try to keep segment sizes in the hundreds of MB  
**@patidar.rahul8392:** Okay  
**@patidar.rahul8392:** @fx19880617 Thanks a lot, deep storage works now.  
**@patidar.rahul8392:** @fx19880617 What is the process to load these HDFS segment files into Pinot again, in case I want to recover lost segments in Pinot? Do I need to follow the same steps as a batch load?  
 **@nadeemsadim:** Suppose I created a realtime table against a Kafka with SSL-enabled configs. Now my Kafka (Strimzi Kafka) crashed, or its SSL configs changed along with the broker IP. Can I point the same Pinot table to the new Kafka broker IP and the updated SSL configs without losing the old data? Is backup & restore necessary while doing this, or can I directly update the table creation configs? @fx19880617 @mayanks cc: @mohamed.sultan @shaileshjha061 @mohamedkashifuddin @hussain @pugal.selvan  
**@fx19880617:** if Kafka topic partition offsets are persisted then you are
fine  
**@nadeemsadim:** you mean I don’t need to delete the table, just update the table configs where we give the broker URL and SSL configs  
**@nadeemsadim:** and it will automatically start fetching from the new Kafka with the same topic name and append to the existing data loaded into segments from the old Kafka  
**@mayanks:** As Xiang mentioned, if the offsets are consistent between the old and new Kafka then you are good. For example, let's say Pinot consumed until offset xx in the old Kafka. Does the new Kafka have the same offsets for those messages? If so, yes. If not, then you need to redo the table  
**@nadeemsadim:** ok, but as Xiang mentioned earlier, Pinot stores offsets in the ZooKeeper that comes with the Pinot Helm installation. So even if the Kafka gets replaced, the Pinot ZooKeeper will still be there with the offsets, so we wouldn't need to delete the table, since nothing changed in Pinot's ZooKeeper  
**@nadeemsadim:** for the same topic  
**@mayanks:** Pinot does do that. What I am saying is that if the old and new Kafka are not consistent (e.g. message m1 at offset o1 in the old Kafka is not the same as message m2 at offset o1 in the new Kafka), then Pinot won't know about it. It will consume from the offset it saved in ZK, but that offset may not be correct for the new Kafka  
**@nadeemsadim:** ok, got it. You mean the ordering of events in both Kafkas should exactly match for a particular topic. But since the old Kafka is lost and the new Kafka will only get the latest data (the old data was already consumed into Pinot, and the Kafka retention period was only 7 days), the same ordering of events can't be present on the new Kafka; it will only have the latest data starting from the time it was created, which is after the old Kafka crashed / its SSL config or broker URL changed  
**@mayanks:** I am also talking about offsets. Those numbers are not contiguous like 1, 2, 3, 4, ...; they are only monotonically increasing. So one Kafka can have 1, 3, 5, ... and another can have 2, 8, 9, ...  
**@nadeemsadim:** these few highlighted configs in the table creation script got changed  
**@nadeemsadim:** yes Mayank, offsets can be anything; they aren't necessarily contiguous  
**@mayanks:** Ok, let me ask you this. Are you doing this in production, or
just playing around?  
**@nadeemsadim:** got your point. So in a way Kafka is a single point of failure: if Kafka fails, we have to take a backup and restore it into a new realtime table, since I don't think it's possible to create a new Kafka instance having the same offsets for the same messages  
**@mayanks:** If you have set up Kafka with enough fault tolerance, then it should not fail  
**@nadeemsadim:** we are about to go into prod, so we need to ensure that whatever we are planning is rock solid  
**@nadeemsadim:** right Mayank, 100% agreed  
**@mayanks:** :+1:  

###  _#aggregators_

  
 **@harrynet222:** @harrynet222 has joined the channel  

###  _#community_

  
 **@harrynet222:** @harrynet222 has joined the channel  

###  _#getting-started_

  
 **@harrynet222:** @harrynet222 has joined the channel  

###  _#minion-improvements_

  
 **@laxman:**
• All in production now. Conversion is at a good pace.
• After several rounds of tuning, 15 days of REALTIME data (out of 90 days total) got converted to OFFLINE in the last 24 hours.
• By mid/end of next week all data should be migrated. :crossed_fingers: Early next week I will try my Purger task implementation.
Thanks @jackie.jxt @fx19880617 @npawar @g.kishore for all your help and support.  
 **@laxman:** I have a few observations on this task and the task framework in general.
• Lack of monitoring and alerting. The task and task framework don't expose any status/failure/progress metrics, so we have to monitor this manually. If conversion gets stuck for any reason, there is no way for the user to know directly. Task status also says SUCCEEDED even when 1 out of 5 tables failed.
• The conversion task implementation is heavily memory intensive; a disk+memory based implementation would be helpful. For most of our tables, I had to configure 1 hour as the bucket period due to this limitation.
• A data gap larger than the bucket completely stalls the task.
• Default null value vector information is lost in the conversion. It's a major BLOCKER for tables with `nullValueHandling` enabled.  
**@npawar:** what's the size of the data per day in the realtime table?  
**@laxman:** Do you mean compressed on-disk size or number of records?
• We have 8 tables; the conversion task is enabled for 7 of them.
• They have different incoming rates.
• Total data size is 15 TB altogether, including 2 replicas, for 90 days (90 days * 2 replicas = 2.7 TB occupied out of 5 TB total capacity on each Pinot server).  
**@laxman:**  
**@laxman:** logEventView and spanEventView have 7 days of retention; all other tables have 90 days of retention  
**@npawar:** I'm trying to understand why the window had to be made just 1 hour. For a specific table, do you know the size of the segments for 1 day on the deep store? I'm trying to calculate how much data the task has to process at once if the bucket is configured to 12h.  
**@npawar:** and how many minions? and what configuration?  
**@laxman:** 4 minions pods, 6GB memory for each pod (max heap=3GB, direct
memory=2GB)  
**@laxman:** For serviceCallView, we had to turn it down to 1 hr. Otherwise the conversion tasks were timing out and very slow, stalling conversion of other tables as well.  
**@laxman:** ```469M
/var/pinot/server/data/index/serviceCallView_OFFLINE/serviceCallView_1614726000000_1614729599998_0
489M
/var/pinot/server/data/index/serviceCallView_OFFLINE/serviceCallView_1614736800000_1614740399998_0
450M
/var/pinot/server/data/index/serviceCallView_OFFLINE/serviceCallView_1614740400000_1614743999999_0
353M
/var/pinot/server/data/index/serviceCallView_OFFLINE/serviceCallView_1614751200000_1614754799993_0
358M
/var/pinot/server/data/index/serviceCallView_OFFLINE/serviceCallView_1614762000000_1614765599999_0```  
**@laxman:** Latest segment samples for service call view :point_up_2:  
**@laxman:** I made sure segments for all tables are in the recommended range (150 MB to 500 MB)  
**@laxman:** ```
backendEntityView:   pinotRealtime.taskConfigs.RealtimeToOfflineSegmentsTask.bucketTimePeriod = "1h"
domainEventView:     pinotRealtime.taskConfigs.RealtimeToOfflineSegmentsTask.bucketTimePeriod = "12h"
domainEventSpanView: pinotRealtime.taskConfigs.RealtimeToOfflineSegmentsTask.bucketTimePeriod = "3h"
rawServiceView:      pinotRealtime.taskConfigs.RealtimeToOfflineSegmentsTask.bucketTimePeriod = "3h"
rawTraceView:        pinotRealtime.taskConfigs.RealtimeToOfflineSegmentsTask.bucketTimePeriod = "3h"
serviceCallView:     pinotRealtime.taskConfigs.RealtimeToOfflineSegmentsTask.bucketTimePeriod = "1h"
logEventView:        pinotRealtime.taskConfigs.RealtimeToOfflineSegmentsTask.bucketTimePeriod = "1h"
```  
**@laxman:** ```518M
/var/pinot/server/data/index/backendEntityView_OFFLINE/backendEntityView_1614816000000_1614819599960_0
356M
/var/pinot/server/data/index/backendEntityView_OFFLINE/backendEntityView_1614826800000_1614830399951_0
524M
/var/pinot/server/data/index/backendEntityView_OFFLINE/backendEntityView_1614837600000_1614841199992_0
404M
/var/pinot/server/data/index/backendEntityView_OFFLINE/backendEntityView_1614855600000_1614859199995_0
367M
/var/pinot/server/data/index/backendEntityView_OFFLINE/backendEntityView_1614866400000_1614869999989_0```  
**@laxman:** Sorry for spamming :grimacing:  
**@npawar:** oh, so even per hour some tables are creating 500M segments on the offline side  
**@laxman:** Yes.  
**@npawar:** and you have several such minion tasks running together, up to 7  
**@npawar:** we need a better system to help determine minion provisioning and capacity :sweat_smile:  
**@npawar:** and to be able to control parallelism across tables  
**@laxman:** Yes. After all this tuning, one cycle of all tables takes nearly 20 minutes, so I configured the controller scheduler frequency to 20 minutes  
**@laxman:** Conversion progress so far in the last 24 hours: ```
➜ pinot-converter ./progress.sh    [21/05/29| 2:51AM]
Thu Mar 4  20:30:00 IST 2021   backendEntityView
Mon Apr 5  05:30:00 IST 2021   domainEventView
Tue Mar 9  14:30:00 IST 2021   domainEventSpanView
Mon May 24 14:30:00 IST 2021   logEventView
Tue Mar 16 02:30:00 IST 2021   rawServiceView
Mon Mar 15 14:30:00 IST 2021   rawTraceView
Wed Mar 3  16:30:00 IST 2021   serviceCallView
```  
**@npawar:** thanks for sharing all these things. will add it to our notes  
 **@jackie.jxt:** Great feedback! We will track these issues and fix them shortly  
 **@mayanks:** @mayanks has joined the channel  

###  _#pinot-docsrus_

  
 **@mohamed.sultan:** @mohamed.sultan has joined the channel  
 **@mohamed.sultan:** Hi team  
 **@mayanks:** @mohamed.sultan Check the channel description for links on how
to add docs  
 **@shaileshjha061:** @shaileshjha061 has joined the channel  