You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@ignite.apache.org by "Abhishek Gupta (BLOOMBERG/ 731 LEX)" <ag...@bloomberg.net> on 2019/10/21 20:17:45 UTC

Intermittent "Partition states validation has failed for group" issues

In my otherwise stably running grid (on 2.7.5) I sometimes see intermittent  GridDhtPartitionsExchangeFuture  warning. This warning the occurs periodically and then goes away after some time. I couldn't find any documentation or other threads about this warning and its implications. 
* What is the trigger for this warning? 
* What are the implications?
* Is there any recommendation around fixing this issue?


2019-10-21 16:09:44.378 [WARN ] [sys-#26240] GridDhtPartitionsExchangeFuture - Partition states validation has failed for group: mainCache. Partitions cache sizes are inconsistent for Part 0: [id-dgcasp-ob-398-csp-drp-ny-1=43417 id-dgcasp-ob-080-csp-drp-ny-1=43416 ] Part 1: [id-dgcasp-ob-080-csp-drp-ny-1=43720 id-dgcasp-ob-471-csp-drp-ny-1=43724 ] Part 2: [id-dgcasp-ob-762-csp-drp-ny-1=43388 id-dgcasp-ob-471-csp-drp-ny-1=43376 ] Part 3: [id-dgcasp-ob-775-csp-drp-ny-1=43488 id-dgcasp-ob-403-csp-drp-ny-1=43484 ] Part 4: [id-dgcasp-ob-080-csp-drp-ny-1=43338 id-dgcasp-ob-471-csp-drp-ny-1=43339 ] Part 5: [id-dgcasp-ob-398-csp-drp-ny-1=43105 id-dgcasp-ob-471-csp-drp-ny-1=43106 ] Part 7: [id-dgcasp-ob-775-csp-drp-ny-1=43151 id-dgcasp-ob-762-csp-drp-ny-1=43157 ] Part 8: [id-dgcasp-ob-398-csp-drp-ny-1=42975 id-dgcasp-ob-471-csp-drp-ny-1=42976 ] Part 10: [id-dgcasp-ob-775-csp-drp-ny-1=43033 id-dgcasp-ob-471-csp-drp-ny-1=43036 ] Part 11: [id-dgcasp-ob-762-csp-drp-ny-1=43303 id-dgcasp-ob-471-csp-drp-ny-1=43299 ] Part 12: [id-dgcasp-ob-398-csp-drp-ny-1=43262 id-dgcasp-ob-471-csp-drp-ny-1=43265 ] Part 13: [id-dgcasp-ob-762-csp-drp-ny-1=43123 id-dgcasp-ob-471-csp-drp-ny-1=43120 ] Part 15: [id-dgcasp-ob-775-csp-drp-ny-1=43412 id-dgcasp-ob-398-csp-drp-ny-1=43413 ] Part 16: [id-dgcasp-ob-471-csp-drp-ny-1=43934 id-dgcasp-ob-403-csp-drp-ny-1=43933 ] Part 20: [id-dgcasp-ob-080-csp-drp-ny-1=43146 id-dgcasp-ob-471-csp-drp-ny-1=43148 ] Part 21: [id-dgcasp-ob-762-csp-drp-ny-1=43196 id-dgcasp-ob-080-csp-drp-ny-1=43197 ] Part 22: [id-dgcasp-ob-398-csp-drp-ny-1=43233 id-dgcasp-ob-762-csp-drp-ny-1=43234 ] Part 23: [id-dgcasp-ob-398-csp-drp-ny-1=43127 id-dgcasp-ob-471-csp-drp-ny-1=43128 ] Part 24: [id-dgcasp-ob-775-csp-drp-ny-1=43144 id-dgcasp-ob-398-csp-drp-ny-1=43142 ]  ... TRUNCATED


Thanks,
Abhishek


Re: Intermittent "Partition states validation has failed for group" issues

Posted by Ilya Kasnacheev <il...@gmail.com>.
Hello!

I think this means that backup/primary contents are inconsistent.

The implications is that in case of node failure there will be data
inconsistency (or maybe it's already there).

The recommendation is to a) check logs for any oddities/exceptions, and b)
maybe remove problematic partitions' files from persistence and/or restart
problematic nodes.

Regards,
-- 
Ilya Kasnacheev


пн, 21 окт. 2019 г. в 23:17, Abhishek Gupta (BLOOMBERG/ 731 LEX) <
agupta726@bloomberg.net>:

> In my otherwise stably running grid (on 2.7.5) I sometimes see
> intermittent GridDhtPartitionsExchangeFuture warning. This warning the
> occurs periodically and then goes away after some time. I couldn't find any
> documentation or other threads about this warning and its implications.
> * What is the trigger for this warning?
> * What are the implications?
> * Is there any recommendation around fixing this issue?
>
>
>
>
> 2019-10-21 16:09:44.378 [WARN ] [sys-#26240]
> GridDhtPartitionsExchangeFuture - Partition states validation has failed
> for group: mainCache. Partitions cache sizes are inconsistent for Part 0:
> [id-dgcasp-ob-398-csp-drp-ny-1=43417 id-dgcasp-ob-080-csp-drp-ny-1=43416 ]
> Part 1: [id-dgcasp-ob-080-csp-drp-ny-1=43720
> id-dgcasp-ob-471-csp-drp-ny-1=43724 ] Part 2:
> [id-dgcasp-ob-762-csp-drp-ny-1=43388 id-dgcasp-ob-471-csp-drp-ny-1=43376 ]
> Part 3: [id-dgcasp-ob-775-csp-drp-ny-1=43488
> id-dgcasp-ob-403-csp-drp-ny-1=43484 ] Part 4:
> [id-dgcasp-ob-080-csp-drp-ny-1=43338 id-dgcasp-ob-471-csp-drp-ny-1=43339 ]
> Part 5: [id-dgcasp-ob-398-csp-drp-ny-1=43105
> id-dgcasp-ob-471-csp-drp-ny-1=43106 ] Part 7:
> [id-dgcasp-ob-775-csp-drp-ny-1=43151 id-dgcasp-ob-762-csp-drp-ny-1=43157 ]
> Part 8: [id-dgcasp-ob-398-csp-drp-ny-1=42975
> id-dgcasp-ob-471-csp-drp-ny-1=42976 ] Part 10:
> [id-dgcasp-ob-775-csp-drp-ny-1=43033 id-dgcasp-ob-471-csp-drp-ny-1=43036 ]
> Part 11: [id-dgcasp-ob-762-csp-drp-ny-1=43303
> id-dgcasp-ob-471-csp-drp-ny-1=43299 ] Part 12:
> [id-dgcasp-ob-398-csp-drp-ny-1=43262 id-dgcasp-ob-471-csp-drp-ny-1=43265 ]
> Part 13: [id-dgcasp-ob-762-csp-drp-ny-1=43123
> id-dgcasp-ob-471-csp-drp-ny-1=43120 ] Part 15:
> [id-dgcasp-ob-775-csp-drp-ny-1=43412 id-dgcasp-ob-398-csp-drp-ny-1=43413 ]
> Part 16: [id-dgcasp-ob-471-csp-drp-ny-1=43934
> id-dgcasp-ob-403-csp-drp-ny-1=43933 ] Part 20:
> [id-dgcasp-ob-080-csp-drp-ny-1=43146 id-dgcasp-ob-471-csp-drp-ny-1=43148 ]
> Part 21: [id-dgcasp-ob-762-csp-drp-ny-1=43196
> id-dgcasp-ob-080-csp-drp-ny-1=43197 ] Part 22:
> [id-dgcasp-ob-398-csp-drp-ny-1=43233 id-dgcasp-ob-762-csp-drp-ny-1=43234 ]
> Part 23: [id-dgcasp-ob-398-csp-drp-ny-1=43127
> id-dgcasp-ob-471-csp-drp-ny-1=43128 ] Part 24:
> [id-dgcasp-ob-775-csp-drp-ny-1=43144 id-dgcasp-ob-398-csp-drp-ny-1=43142 ]
> ... TRUNCATED
>
>
> Thanks,
> Abhishek
>
>