Infinispan Monitoring: Difference between revisions

From NovaOrdis Knowledge Base
Jump to navigation Jump to search
Line 24: Line 24:
The number of hits, misses and stores at the cache container level (JDG 7 JMX [[Infinispan_Cache_Container_Metrics#hits|hits]], [[Infinispan_Cache_Container_Metrics#misses|misses]], [[Infinispan_Cache_Container_Metrics#stores|stores]]). This is an aggregation of the corresponding values for the underlying caches. It is preferable to expose these metrics as a rate per second. Some monitoring solutions allow the rate to be calculated automatically (see [[DataDog_and_JMX#Counter|Data Dog counters]]).
The number of hits, misses and stores at the cache container level (JDG 7 JMX [[Infinispan_Cache_Container_Metrics#hits|hits]], [[Infinispan_Cache_Container_Metrics#misses|misses]], [[Infinispan_Cache_Container_Metrics#stores|stores]]). This is an aggregation of the corresponding values for the underlying caches. It is preferable to expose these metrics as a rate per second. Some monitoring solutions allow the rate to be calculated automatically (see [[DataDog_and_JMX#Counter|Data Dog counters]]).


The total number of entries at cache container level. This is an aggregation of the corresponding values for the underlying caches.
The total number of entries at cache container level. This is an aggregation of the corresponding values for the underlying caches. For more details on the cache number of entries see [[Infinsipan Cache Number of Entries]].


The average read, write and remove time for the caches managed by this container (JDG 7 JMX [[Infinispan_Cache_Container_Metrics#averageReadTime|averageReadTime]], [[Infinispan_Cache_Container_Metrics#averageWriteTime|averageWriteTime]], [[Infinispan_Cache_Container_Metrics#averageRemoveTime|averageRemoveTime]]).
The average read, write and remove time for the caches managed by this container (JDG 7 JMX [[Infinispan_Cache_Container_Metrics#averageReadTime|averageReadTime]], [[Infinispan_Cache_Container_Metrics#averageWriteTime|averageWriteTime]], [[Infinispan_Cache_Container_Metrics#averageRemoveTime|averageRemoveTime]]).

Revision as of 18:09, 20 October 2016

Internal

Relevance

  • JDG 7

Overview

This article describes a possible set up for Infinispan production monitoring.

Metric Collection

HotRod Endpoint

Bytes read count and bytes written count, which can be used to infer the read and write rate, in bytes/second (JDG 6 JMX totalBytesRead, totalBytesWritten, JDG 7 JMX bytesRead, bytesWritten. It is preferable to expose these metrics as a rate per second. Some monitoring solutions allow the rate to be calculated automatically (see Data Dog counters).

The number of worker threads (JDG 6 JMX numberWorkerThreads: JDG 7 JMX workerThreads).

The Cache Container

The number of hits, misses and stores at the cache container level (JDG 7 JMX hits, misses, stores). This is an aggregation of the corresponding values for the underlying caches. It is preferable to expose these metrics as a rate per second. Some monitoring solutions allow the rate to be calculated automatically (see Data Dog counters).

The total number of entries at cache container level. This is an aggregation of the corresponding values for the underlying caches. For more details on the cache number of entries see Infinsipan Cache Number of Entries.

The average read, write and remove time for the caches managed by this container (JDG 7 JMX averageReadTime, averageWriteTime, averageRemoveTime).

Individual Caches

TODO

      conf:
        - include:
            domain: jboss.as
            bean:
               - jboss.as:subsystem=datagrid-jgroups,channel=cluster,protocol=UNICAST3
               - jboss.as:subsystem=datagrid-infinispan-endpoint,hotrod-connector=hotrod-connector
               - jboss.as:subsystem=datagrid-infinispan,cache-container=clustered,distributed-cache=gameStateCache
               - jboss.as:subsystem=datagrid-infinispan,cache-container=clustered,distributed-cache=conditionVariableCache
               - jboss.as:subsystem=datagrid-infinispan,cache-container=clustered,distributed-cache=eventLogCache
            attribute:
               averageReplicationTime:
                 alias: jmx.averageReplicationTime
                 metric_type: counter
               replicationCount:
                 alias: jmx.replicationCount
                 metric_type: counter
               replicationFailures:
                 alias: jmx.replicationFailures
                 metric_type: counter
               successRatio:
                 alias: jmx.successRatio
                 metric_type: counter
        - include:
            domain: java.lang
            bean:
               - java.lang:type=OperatingSystem
               - java.lang:type=Threading
            attribute:
               OpenFileDescriptorCount:
                 alias: jmx.OpenFileDescriptorCount
                 metric_type: counter
               MaxFileDescriptorCount:
                 alias: jmx.MaxFileDescriptorCount
                 metric_type: counter
               TotalStartedThreadCount:
                 alias: jmx.TotalStartedThreadCount
                 metric_type: counter
               PeakThreadCount:
                 alias: jmx.PeakThreadCount
                 metric_type: counter