Posted to user@ignite.apache.org by eugene miretsky <eu...@gmail.com> on 2018/08/17 15:15:31 UTC

Confusion/inaccurate visor stats

Hello,

I am running a single Ignite node on an r4.8xlarge EC2 instance. I am using
the default settings, with 132G allocated for the default memory region. So
far I have uploaded one large table (60M rows) using Spark.

The output of the node and cache commands is pasted below.

A few questions
1) In Data region metrics, why is everything 0?
2) How come Non-heap memory usage is minimal?
3) How can I tell how much memory the table is consuming?
4) Total busy time is 15s, the upload took longer than that.

Cheers,
Eugene


*Visor output*

Time of the snapshot: 2018-08-17 15:06:37

+-------------------------------------------------------------------------------------------+
| ID                          | b19229cd-c6c2-4cfe-ba82-3fb0ac0946b9                        |
| ID8                         | B19229CD                                                    |
| Node Type                   | Server                                                      |
| Order                       | 1                                                           |
| Address (0)                 | 172.17.0.1                                                  |
| Address (1)                 | 172.21.85.213                                               |
| Address (2)                 | 127.0.0.1                                                   |
| Address (3)                 | 0:0:0:0:0:0:0:1%lo                                          |
| OS info                     | Linux amd64 4.4.0-1062-aws                                  |
| OS user                     | root                                                        |
| Deployment mode             | SHARED                                                      |
| Language runtime            | Java Platform API Specification ver. 1.8                    |
| Ignite version              | 2.5.0                                                       |
| Ignite instance name        | <default>                                                   |
| JRE information             | HotSpot 64-Bit Tiered Compilers                             |
| JVM start time              | 2018-08-17 13:52:05                                         |
| Node start time             | 2018-08-17 13:52:07                                         |
| Up time                     | 01:02:17.619                                                |
| CPUs                        | 32                                                          |
| Last metric update          | 2018-08-17 14:54:23                                         |
| Non-loopback IPs            | 172.17.0.1, 172.21.85.213, fe80:0:0:0:59:71ff:fe32:36e%ens3 |
| Enabled MACs                | 0242652B992A, 02597132036E                                  |
| Maximum active jobs         | 1                                                           |
| Current active jobs         | 0                                                           |
| Average active jobs         | 0.00                                                        |
| Maximum waiting jobs        | 0                                                           |
| Current waiting jobs        | 0                                                           |
| Average waiting jobs        | 0.00                                                        |
| Maximum rejected jobs       | 0                                                           |
| Current rejected jobs       | 0                                                           |
| Average rejected jobs       | 0.00                                                        |
| Maximum cancelled jobs      | 0                                                           |
| Current cancelled jobs      | 0                                                           |
| Average cancelled jobs      | 0.00                                                        |
| Total rejected jobs         | 0                                                           |
| Total executed jobs         | 9                                                           |
| Total cancelled jobs        | 0                                                           |
| Maximum job wait time       | 0ms                                                         |
| Current job wait time       | 0ms                                                         |
| Average job wait time       | 0.00ms                                                      |
| Maximum job execute time    | 0ms                                                         |
| Current job execute time    | 0ms                                                         |
| Average job execute time    | 0.00ms                                                      |
| Total busy time             | 15724ms                                                     |
| Busy time %                 | 0.42%                                                       |
| Current CPU load %          | 0.03%                                                       |
| Average CPU load %          | 5.03%                                                       |
| Heap memory initialized     | 32gb                                                        |
| Heap memory used            | 57gb                                                        |
| Heap memory committed       | 113gb                                                       |
| Heap memory maximum         | 178gb                                                       |
| Non-heap memory initialized | 2mb                                                         |
| Non-heap memory used        | 75mb                                                        |
| Non-heap memory committed   | 76mb                                                        |
| Non-heap memory maximum     | 496mb                                                       |
| Current thread count        | 230                                                         |
| Maximum thread count        | 233                                                         |
| Total started thread count  | 391                                                         |
| Current daemon thread count | 13                                                          |
+-------------------------------------------------------------------------------------------+


Data region metrics:

+====================================================================================================================+
|      Name      | Page size |       Pages        |  Memory   |      Rates       | Checkpoint buffer | Large entries |
+====================================================================================================================+
| Default_Region | 0         | Total:  0          | Total:  0 | Allocation: 0.00 | Pages: 0          | 0.00%         |
|                |           | Dirty:  0          | In RAM: 0 | Eviction:   0.00 | Size:  0          |               |
|                |           | Memory: 0          |           | Replace:    0.00 |                   |               |
|                |           | Fill factor: 0.00% |           |                  |                   |               |
+----------------+-----------+--------------------+-----------+------------------+-------------------+---------------+
| sysMemPlc      | 0         | Total:  0          | Total:  0 | Allocation: 0.00 | Pages: 0          | 0.00%         |
|                |           | Dirty:  0          | In RAM: 0 | Eviction:   0.00 | Size:  0          |               |
|                |           | Memory: 0          |           | Replace:    0.00 |                   |               |
|                |           | Fill factor: 0.00% |           |                  |                   |               |
+--------------------------------------------------------------------------------------------------------------------+


visor> cache

Time of the snapshot: 2018-08-17 15:07:24

+=======================================================================================================================================+
|         Name(@)         |    Mode     | Nodes |       Entries (Heap / Off-heap)       |   Hits    |  Misses   |   Reads   |  Writes   |
+=======================================================================================================================================+
| cache1(@c0)             | PARTITIONED | 1     | min: 0 (0 / 0)                        | min: 0    | min: 0    | min: 0    | min: 0    |
|                         |             |       | avg: 0.00 (0.00 / 0.00)               | avg: 0.00 | avg: 0.00 | avg: 0.00 | avg: 0.00 |
|                         |             |       | max: 0 (0 / 0)                        | max: 0    | max: 0    | max: 0    | max: 0    |
+-------------------------+-------------+-------+---------------------------------------+-----------+-----------+-----------+-----------+
| cache2(@c1)             | PARTITIONED | 1     | min: 0 (0 / 0)                        | min: 0    | min: 0    | min: 0    | min: 0    |
|                         |             |       | avg: 0.00 (0.00 / 0.00)               | avg: 0.00 | avg: 0.00 | avg: 0.00 | avg: 0.00 |
|                         |             |       | max: 0 (0 / 0)                        | max: 0    | max: 0    | max: 0    | max: 0    |
+-------------------------+-------------+-------+---------------------------------------+-----------+-----------+-----------+-----------+
| SQL_PUBLIC_CITY(@c3)    | PARTITIONED | 1     | min: 0 (0 / 0)                        | min: 0    | min: 0    | min: 0    | min: 0    |
|                         |             |       | avg: 0.00 (0.00 / 0.00)               | avg: 0.00 | avg: 0.00 | avg: 0.00 | avg: 0.00 |
|                         |             |       | max: 0 (0 / 0)                        | max: 0    | max: 0    | max: 0    | max: 0    |
+-------------------------+-------------+-------+---------------------------------------+-----------+-----------+-----------+-----------+
| SQL_PUBLIC_COUNTRY(@c4) | PARTITIONED | 1     | min: 0 (0 / 0)                        | min: 0    | min: 0    | min: 0    | min: 0    |
|                         |             |       | avg: 0.00 (0.00 / 0.00)               | avg: 0.00 | avg: 0.00 | avg: 0.00 | avg: 0.00 |
|                         |             |       | max: 0 (0 / 0)                        | max: 0    | max: 0    | max: 0    | max: 0    |
+-------------------------+-------------+-------+---------------------------------------+-----------+-----------+-----------+-----------+
| SQL_PUBLIC_GAL3EC1(@c2) | PARTITIONED | 1     | min: 63991599 (0 / 63991599)          | min: 0    | min: 0    | min: 0    | min: 0    |
|                         |             |       | avg: 63991599.00 (0.00 / 63991599.00) | avg: 0.00 | avg: 0.00 | avg: 0.00 | avg: 0.00 |
|                         |             |       | max: 63991599 (0 / 63991599)          | max: 0    | max: 0    | max: 0    | max: 0    |
+---------------------------------------------------------------------------------------------------------------------------------------+

Re: Confusion/inaccurate visor stats

Posted by Alex Plehanov <pl...@gmail.com>.
Hello, Eugene!

> 2) How come Non-heap memory usage is minimal?
"Non-heap memory" here means the JVM-managed memory regions other than the
heap that are used for internal JVM purposes (JIT compiler, method area,
etc.). It is not the memory Ignite uses to store data (information about
that memory can be obtained from the data region metrics).
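
To illustrate the distinction, here is a minimal sketch that uses only the
standard java.lang.management API. It reads the JVM's own non-heap usage
before and after an off-heap allocation. The direct ByteBuffer is just a
stand-in chosen for illustration (an assumption; it is not how Ignite
allocates data regions), and the class name NonHeapDemo is hypothetical.
The change in "non-heap used" should stay far below the allocated size,
because that figure only covers the JVM's internal pools.

```java
import java.lang.management.ManagementFactory;
import java.nio.ByteBuffer;

public class NonHeapDemo {
    /** Bytes currently used in the JVM's non-heap pools (metaspace, code cache, ...). */
    static long nonHeapUsed() {
        return ManagementFactory.getMemoryMXBean().getNonHeapMemoryUsage().getUsed();
    }

    public static void main(String[] args) {
        long before = nonHeapUsed();

        // Stand-in for off-heap data storage (an assumption for illustration;
        // this is not how Ignite allocates its data regions).
        ByteBuffer offHeap = ByteBuffer.allocateDirect(64 * 1024 * 1024);

        long after = nonHeapUsed();

        System.out.println("Off-heap bytes allocated: " + offHeap.capacity());
        System.out.println("Non-heap used, change:    " + (after - before));
    }
}
```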

> 3) How can I tell how much memory the table is consuming?
AFAIK there is no such functionality in Visor now. There is a JMX metric,
CacheGroupMetricsMXBean#getTotalAllocatedSize, which can help you, but
unfortunately there is a bug in the current implementation of this metric
with persistent store enabled (ticket [1], already fixed; the fix will be
available in Ignite 2.7), and there is still no implementation of this
metric with persistence disabled (ticket [2]).

[1]: https://issues.apache.org/jira/browse/IGNITE-8515
[2]: https://issues.apache.org/jira/browse/IGNITE-8517


2018-08-17 18:37 GMT+03:00 Alexey Kuznetsov <ak...@apache.org>:

> Hi!
>
> > 1) In Data region metrics, why is everything 0?
> Did you enable metrics?
>
> See:
>   DataRegionConfiguration dataRegionCfg = new DataRegionConfiguration();
>   dataRegionCfg.setMetricsEnabled(true);
>
> > 4) Total busy time is 15s, the upload took longer than that.
> This is actually the time spent in the compute engine; see
> org.apache.ignite.cluster.ClusterMetrics:
>     /**
>      * Gets total time this node spent executing jobs.
>      * @return Total time this node spent executing jobs.
>      */
>     public long getTotalBusyTime();
>
> I hope someone who knows about 2) & 3) will answer them.
>
> --
> Alexey Kuznetsov
>
>

Re: Confusion/inaccurate visor stats

Posted by Alexey Kuznetsov <ak...@apache.org>.
Hi!

> 1) In Data region metrics, why is everything 0?
Did you enable metrics?

See:
  DataRegionConfiguration dataRegionCfg = new DataRegionConfiguration();
  dataRegionCfg.setMetricsEnabled(true);

> 4) Total busy time is 15s, the upload took longer than that.
This is actually the time spent in the compute engine; see
org.apache.ignite.cluster.ClusterMetrics:
    /**
     * Gets total time this node spent executing jobs.
     * @return Total time this node spent executing jobs.
     */
    public long getTotalBusyTime();
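
As a quick cross-check against the snapshot above, the reported "Busy time %"
appears to be simply the total busy time divided by the node up time. The
sketch below redoes that arithmetic with the values from the Visor output;
the class and method names are hypothetical, and the formula is an inference
from the numbers, not taken from the Ignite source.

```java
import java.time.Duration;
import java.util.Locale;

public class BusyTimeCheck {
    /** Share of the up time spent executing compute jobs, in percent. */
    static double busyPercent(long busyMillis, long upTimeMillis) {
        return 100.0 * busyMillis / upTimeMillis;
    }

    public static void main(String[] args) {
        // "Total busy time" from the snapshot: 15724ms.
        long busyMillis = 15_724L;

        // "Up time" from the snapshot: 01:02:17.619.
        long upTimeMillis = Duration.ofHours(1)
            .plusMinutes(2)
            .plusSeconds(17)
            .plusMillis(619)
            .toMillis();

        // Prints 0.42%, matching the "Busy time %" row.
        System.out.printf(Locale.ROOT, "%.2f%%%n", busyPercent(busyMillis, upTimeMillis));
    }
}
```

So the 15 seconds cover only compute jobs (9 executed on this node), not the
time the Spark upload spent writing to the cache.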

I hope someone who knows about 2) & 3) will answer them.

-- 
Alexey Kuznetsov