Infinispan Monitoring
External
- JBoss Operations Network Metrics https://access.redhat.com/documentation/en-US/Red_Hat_JBoss_Data_Grid/6.6/html-single/Administration_and_Configuration_Guide/index.html#JBoss_Operations_Network_Plugin_Metrics
Internal
Relevance
- JDG 6
- JDG 7
Overview
This article describes choices of metrics for Infinispan production monitoring.
By default, Infinispan statistics are disabled, because enabling them may have a negative impact on the performance of the subsystem. Thus, statistics should be enabled only when required. For more details on how to enable statistics, see "Enabling Cache Statistics".
Metrics
Everything Below Must Be Reviewed
TO REVIEW
clusterwide-hits
A long representing the clustered number of cache attribute hits. May return null if the cache is not started. Also see hits.
hits
A long representing the number of cache attribute hits for a specific node. May return null if the cache is not started. Also see clusterwide-hits.
misses
A long representing the number of cache attribute misses for a specific node. May return null if the cache is not started. Also see clusterwide-misses.
stores
A long representing the number of put operations for this specific node. May return null if the cache is not started. Also see clusterwide-stores.
average-write-time
A long representing the average time (in ms) a cache write operation requires to complete, for this specific node. May return null if the cache is not started.
The metric is also available as the "averageWriteTime" JMX Attribute of the MBean
jboss.infinispan:type=Cache,name="<cache-name>(dist_sync)",manager="clustered",component=Statistics
average-replication-time
A long representing the average time spent in the transport layer to duplicate data around the cluster, in milliseconds. Maintained by the RpcManager. Applies to both replicated and distributed caches.
The metric is also available as the "averageReplicationTime" JMX Attribute of the MBean
jboss.infinispan:type=Cache,name="<cache-name>(dist_sync)",manager="clustered",component=RpcManager
replication-failures
A long representing the number of data duplication failures. May return null if the cache is not started. Maintained by the RpcManager. Applies to both replicated and distributed caches.
The metric is also available as the "replicationFailures" JMX Attribute of the MBean
jboss.infinispan:type=Cache,name="<cache-name>(dist_sync)",manager="clustered",component=RpcManager
success-ratio
A percentage (double) representing successful duplications as a ratio of total duplications (successes/successes+failures). Maintained by the RpcManager. Applies to both replicated and distributed caches.
The metric is also available as the "successRatio" JMX Attribute of the MBean
jboss.infinispan:type=Cache,name="<cache-name>(dist_sync)",manager="clustered",component=RpcManager
capacity-factor
A read-write double that controls the proportion of entries that will reside on the local node, compared to the other nodes in the cluster. Value must be positive. This element is only used in 'distributed' cache instances. By default is undefined, which corresponds to a logical value of 1.0.
clusterwide-hit-ratio
A read-only double representing the cluster-wide hit/miss ratio for the cache (hits/hits+misses). May return null if the cache is not started. Also see read-write-ratio.
hit-ratio
A double representing the hit/miss ratio for the this node (hits/hits+misses), where the number of successful attempts is divided by the total number of attempts. Expressed in percentage. May return null if the cache is not started. Also see clusterwide-read-write-ratio.
clusterwide-read-write-ratio
A read-only double representing the cluster-wide read/write ratio of the cache ((hits+misses)/stores). May return null if the cache is not started.
read-write-ratio
A double representing the read/write ratio of the cache ((hits+misses)/stores) for this specific node. May return null if the cache is not started.
clusterwide-remove-misses
A read-only long representing the cluster-wide number of cache attribute remove misses. May return null if the cache is not started.
clusterwide-remove-hits
A read-only long representing the cluster-wide number of cache attribute remove hits. May return null if the cache is not started. Also see remove-hits.
remove-hits
A long representing the number of cache attribute remove hits for the particular node in question. Also see clusterwide-remove-hits.
remove-misses
A long representing the number of cache removal misses, where the removal was attempted but the key was not found. May return null if the cache is not started.
activations
A string representing the number of cache node activation events (bringing a node into memory from a cache store). May return null if the cache is not started.
For more details about cache activation see Cache Store Activation.
passivations
A string representing the number of cache node passivation events (writing an entry from memory into a cache store). May return null if the cache is not started.
For more details about cache passivation see Cache Store Passivation.
invalidations
A string representing the number of cache invalidations. May return null if the cache is not started.
For more details about cache invalidation see Invalidation Mode.
cache-loader-stores
The number (as long) of cache loader store operations, for this specific node. May return null if the cache is not started.
cache-loader-loads
The number (as long) of cache loader load operations, for this specific node. May return null if the cache is not started.
cache-loader-misses
The long representing the cache loader miss operation, for this specific node. May return null if the cache is not started.
prepares
The long representing the number of transaction prepares, since the last reset. May return null if the cache is not started.
commits
The long representing the number of transaction commits, since the last reset. May return null if the cache is not started.
rollbacks
The long representing the number of transaction rollbacks, since the last reset. May return null if the cache is not started.
number-of-locks-available
An integer representing the number of exclusive locks available to this cache. Maintained by LockManager.