[Kafka] A collection of commonly used Kafka JMX monitoring metrics

1 Overview

Reprinted: https://cloud.tencent.com/developer/article/1554002

1. System-related metrics

1. System information

java.lang:type=OperatingSystem

{"freePhysicalMemorySize":"806023168","maxFileDescriptorCount":"4096","openFileDescriptorCount":"283","processCpuLoad":"0.0017562901839817224","systemCpuLoad":"0.014336627412954635","systemLoadAverage":"0.37"}
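As a minimal sketch, the attributes above can be read from the java.lang:type=OperatingSystem MBean with the standard javax.management API. The class name OsMetricsSketch is hypothetical, and the code reads the JVM it runs in; reading a broker's values would need a remote JMX connection instead. Attributes such as ProcessCpuLoad and the file descriptor counts are JVM- and OS-specific, so the sketch skips any that are unavailable.

```java
import java.lang.management.ManagementFactory;
import java.util.LinkedHashMap;
import java.util.Map;
import javax.management.MBeanServer;
import javax.management.ObjectName;

// Hypothetical helper class, not part of Kafka.
class OsMetricsSketch {
    // JMX attribute names start with an upper-case letter.
    static final String[] ATTRIBUTES = {
        "FreePhysicalMemorySize", "MaxFileDescriptorCount", "OpenFileDescriptorCount",
        "ProcessCpuLoad", "SystemCpuLoad", "SystemLoadAverage"
    };

    static Map<String, Object> collect() throws Exception {
        MBeanServer server = ManagementFactory.getPlatformMBeanServer();
        ObjectName name = new ObjectName("java.lang:type=OperatingSystem");
        Map<String, Object> result = new LinkedHashMap<>();
        for (String attribute : ATTRIBUTES) {
            try {
                result.put(attribute, server.getAttribute(name, attribute));
            } catch (Exception e) {
                // Attribute not offered by this JVM or OS; leave it out.
            }
        }
        return result;
    }

    public static void main(String[] args) throws Exception {
        collect().forEach((key, value) -> System.out.println(key + " = " + value));
    }
}
```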

2. Thread information collection

java.lang:type=Threading

{"peakThreadCount":"88","threadCount":"74"}
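The same two values are available through ThreadMXBean, the typed view of the java.lang:type=Threading MBean. The following is a sketch only; ThreadMetricsSketch is a hypothetical name and the values come from the local JVM rather than from a broker.

```java
import java.lang.management.ManagementFactory;
import java.lang.management.ThreadMXBean;
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical helper class, not part of Kafka.
class ThreadMetricsSketch {
    static Map<String, Integer> snapshot() {
        ThreadMXBean threads = ManagementFactory.getThreadMXBean();
        Map<String, Integer> values = new LinkedHashMap<>();
        // Read the live count first so the peak read afterwards is never smaller.
        values.put("threadCount", threads.getThreadCount());
        values.put("peakThreadCount", threads.getPeakThreadCount());
        return values;
    }

    public static void main(String[] args) {
        System.out.println(snapshot());
    }
}
```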

3. Mapped and direct buffer space

Obtain the used memory, total capacity, and buffer count of the mapped and direct buffer pools through BufferPoolMXBean.
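A short sketch of that lookup, assuming a HotSpot-style JVM where the pools are named "direct" and "mapped". BufferPoolSketch is a hypothetical class name, and the numbers describe the JVM running the code.

```java
import java.lang.management.BufferPoolMXBean;
import java.lang.management.ManagementFactory;
import java.util.List;

// Hypothetical helper class, not part of Kafka.
class BufferPoolSketch {
    static List<BufferPoolMXBean> pools() {
        return ManagementFactory.getPlatformMXBeans(BufferPoolMXBean.class);
    }

    public static void main(String[] args) {
        for (BufferPoolMXBean pool : pools()) {
            // getMemoryUsed() and getTotalCapacity() are in bytes; getCount() is the number of buffers.
            System.out.printf("%s: used=%d, capacity=%d, count=%d%n",
                    pool.getName(), pool.getMemoryUsed(), pool.getTotalCapacity(), pool.getCount());
        }
    }
}
```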

2. GC-related metrics

1. Young GC

java.lang:type=GarbageCollector,name=G1 Young Generation

{"collectionCount":"534","collectionTime":"8258"}

2. Old GC

java.lang:type=GarbageCollector,name=G1 Old Generation

{"collectionCount":"0","collectionTime":"0"}
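collectionCount and collectionTime are cumulative, so a derived value such as the average time per collection is often easier to read. The sketch below lists the local JVM's collectors and computes that average; GcMetricsSketch and averageMs are hypothetical names, and collectionTime is taken to be in milliseconds as documented for GarbageCollectorMXBean.

```java
import java.lang.management.GarbageCollectorMXBean;
import java.lang.management.ManagementFactory;

// Hypothetical helper class, not part of Kafka.
class GcMetricsSketch {
    // Average milliseconds per collection; 0 when nothing has been collected
    // (the MXBean reports -1 when the count is undefined).
    static double averageMs(long collectionCount, long collectionTimeMs) {
        return collectionCount <= 0 ? 0.0 : (double) collectionTimeMs / collectionCount;
    }

    public static void main(String[] args) {
        for (GarbageCollectorMXBean gc : ManagementFactory.getGarbageCollectorMXBeans()) {
            long count = gc.getCollectionCount();
            long timeMs = gc.getCollectionTime();
            System.out.printf("%s: count=%d, time=%d ms, average=%.2f ms%n",
                    gc.getName(), count, timeMs, averageMs(count, timeMs));
        }
    }
}
```

With the Young GC sample above, averageMs(534, 8258) comes to roughly 15.5 ms per collection.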

3. JVM-related metrics

Obtain HeapMemoryUsage and NonHeapMemoryUsage through MemoryMXBean; obtain the usage of other JVM memory spaces, such as Metaspace and the code cache, through MemoryPoolMXBean.
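A sketch of both lookups against the local JVM follows. MemoryMetricsSketch is a hypothetical name, and the pool names printed (for example Metaspace) depend on the JVM and the garbage collector in use.

```java
import java.lang.management.ManagementFactory;
import java.lang.management.MemoryMXBean;
import java.lang.management.MemoryPoolMXBean;
import java.lang.management.MemoryUsage;

// Hypothetical helper class, not part of Kafka.
class MemoryMetricsSketch {
    static long heapUsedBytes() {
        return ManagementFactory.getMemoryMXBean().getHeapMemoryUsage().getUsed();
    }

    public static void main(String[] args) {
        MemoryMXBean memory = ManagementFactory.getMemoryMXBean();
        System.out.println("HeapMemoryUsage    = " + memory.getHeapMemoryUsage());
        System.out.println("NonHeapMemoryUsage = " + memory.getNonHeapMemoryUsage());

        for (MemoryPoolMXBean pool : ManagementFactory.getMemoryPoolMXBeans()) {
            MemoryUsage usage = pool.getUsage(); // null if the pool is no longer valid
            if (usage != null) {
                System.out.println(pool.getName() + " (" + pool.getType() + ") = " + usage);
            }
        }
    }
}
```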

4. Topic-related metrics

1. Topic inbound byte rate

kafka.server:type=BrokerTopicMetrics,name=BytesInPerSec,topic=<topic>

{"count":"0","fifteenMinuteRate":"0.0","fiveMinuteRate":"0.0","meanRate":"0.0","oneMinuteRate":"0.0"}

2. Topic outbound byte rate

kafka.server:type=BrokerTopicMetrics,name=BytesOutPerSec,topic=<topic>

{"count":"0","fifteenMinuteRate":"0.0","fiveMinuteRate":"0.0","meanRate":"0.0","oneMinuteRate":"0.0"}

3. Topic rejected byte rate

kafka.server:type=BrokerTopicMetrics,name=BytesRejectedPerSec,topic=<topic>

{"count":"0","fifteenMinuteRate":"0.0","fiveMinuteRate":"0.0","meanRate":"0.0","oneMinuteRate":"0.0"}

4. Topic failed fetch request rate

kafka.server:type=BrokerTopicMetrics,name=FailedFetchRequestsPerSec,topic=<topic>

{"count":"0","fifteenMinuteRate":"0.0","fiveMinuteRate":"0.0","meanRate":"0.0","oneMinuteRate":"0.0"}

5. Topic failed produce request rate

kafka.server:type=BrokerTopicMetrics,name=FailedProduceRequestsPerSec,topic=<topic>

{"count":"0","fifteenMinuteRate":"0.0","fiveMinuteRate":"0.0","meanRate":"0.0","oneMinuteRate":"0.0"}

6. Topic inbound message rate (number of messages)

kafka.server:type=BrokerTopicMetrics,name=MessagesInPerSec,topic=<topic>

{"count":"0","fifteenMinuteRate":"0.0","fiveMinuteRate":"0.0","meanRate":"0.0","oneMinuteRate":"0.0"}
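The per-topic MBean names in this section differ only in the name and topic keys, so they can be built programmatically. The sketch below is one way to do that; TopicMetricNameSketch and topicMetric are hypothetical names. It relies on Kafka topic names being limited to ASCII letters, digits, '.', '_' and '-', none of which need quoting in a JMX ObjectName.

```java
import javax.management.MalformedObjectNameException;
import javax.management.ObjectName;

// Hypothetical helper class, not part of Kafka.
class TopicMetricNameSketch {
    static final String[] METRICS = {
        "BytesInPerSec", "BytesOutPerSec", "BytesRejectedPerSec",
        "FailedFetchRequestsPerSec", "FailedProduceRequestsPerSec", "MessagesInPerSec"
    };

    static ObjectName topicMetric(String metric, String topic) throws MalformedObjectNameException {
        // Reject anything outside the character set Kafka allows in topic names.
        if (!topic.matches("[a-zA-Z0-9._-]+")) {
            throw new IllegalArgumentException("Illegal topic name: " + topic);
        }
        return new ObjectName(
                "kafka.server:type=BrokerTopicMetrics,name=" + metric + ",topic=" + topic);
    }

    public static void main(String[] args) throws Exception {
        String topic = args.length > 0 ? args[0] : "example-topic"; // placeholder topic name
        for (String metric : METRICS) {
            System.out.println(topicMetric(metric, topic));
        }
    }
}
```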

5. Broker-related metrics

1. Log flush rate and time

kafka.log:type=LogFlushStats,name=LogFlushRateAndTimeMs

{"50thPercentile":"1.074103","75thPercentile":"1.669793","95thPercentile":"6.846556","98thPercentile":"6.846556","999thPercentile":"6.846556","99thPercentile":"6.846556","count":"19","max":"6.846556","mean":"1.628646052631579","min":"0.512879","stdDev":"1.6007003364105892"}

2. Number of under-replicated partitions

kafka.server:type=ReplicaManager,name=UnderReplicatedPartitions

{"value":"0"}

3. Message inbound rate (number of messages)

kafka.server:type=BrokerTopicMetrics,name=MessagesInPerSec

{"count":"86845","fifteenMinuteRate":"0.6456600497006455","fiveMinuteRate":"0.6444164288097876","meanRate":"0.5314899330400695","oneMinuteRate":"0.6494649408329609"}

4. Message inbound rate (Byte)

kafka.server:type=BrokerTopicMetrics,name=BytesInPerSec

{"count":"57302357","fifteenMinuteRate":"379.11342092748146","fiveMinuteRate":"371.8482236385939","meanRate":"351.37122686037435","oneMinuteRate":"351.8348952308101"}

5. Message outbound rate (Byte)

kafka.server:type=BrokerTopicMetrics,name=BytesOutPerSec

{"count":"246","fifteenMinuteRate":"4.508738367219028E-34","fiveMinuteRate":"1.4721921790135324E-98","meanRate":"0.0015031168286836175","oneMinuteRate":"2.964393875E-314"}

6. Rejected byte rate

kafka.server:type=BrokerTopicMetrics,name=BytesRejectedPerSec

{"count":"0","fifteenMinuteRate":"0.0","fiveMinuteRate":"0.0","meanRate":"0.0","oneMinuteRate":"0.0"}

7. Failed fetch request rate

kafka.server:type=BrokerTopicMetrics,name=FailedFetchRequestsPerSec

{"count":"0","fifteenMinuteRate":"0.0","fiveMinuteRate":"0.0","meanRate":"0.0","oneMinuteRate":"0.0"}

8. Failed produce request rate

kafka.server:type=BrokerTopicMetrics,name=FailedProduceRequestsPerSec

{"count":"0","fifteenMinuteRate":"0.0","fiveMinuteRate":"0.0","meanRate":"0.0","oneMinuteRate":"0.0"}

9. Number of leader replicas

kafka.server:type=ReplicaManager,name=LeaderCount

{"value":"92"}

10. Number of Partitions

kafka.server:type=ReplicaManager,name=PartitionCount

{"value":"135"}

11. Number of offline Partitions

kafka.controller:type=KafkaController,name=OfflinePartitionsCount

{"value":"0"}

12. Broker request handler (I/O) thread idle ratio

kafka.server:type=KafkaRequestHandlerPool,name=RequestHandlerAvgIdlePercent

{"count":"164506926671008","fifteenMinuteRate":"0.9999327359820058","fiveMinuteRate":"1.0000290054537715","meanRate":"0.9998854371393514","oneMinuteRate":"1.0007836499581673"}

13. Leader election rate and time

kafka.controller:type=ControllerStats,name=LeaderElectionRateAndTimeMs

{"count":"7","fifteenMinuteRate":"5.134993718576819E-82","fiveMinuteRate":"6.882658450509451E-240","meanRate":"4.2525243043608314E-5","oneMinuteRate":"2.964393875E-314"}

14. Unclean leader election rate

kafka.controller:type=ControllerStats,name=UncleanLeaderElectionsPerSec

{"count":"0","fifteenMinuteRate":"0.0","fiveMinuteRate":"0.0","meanRate":"0.0","oneMinuteRate":"0.0"}

15. Number of active controllers

kafka.controller:type=KafkaController,name=ActiveControllerCount

{"value":"1"}

16. Produce request rate

kafka.network:type=RequestMetrics,name=RequestsPerSec,request=Produce

{"count":"83233","fifteenMinuteRate":"0.6303485369828705","fiveMinuteRate":"0.6357199085092445","meanRate":"0.5046486472186744","oneMinuteRate":"0.6563203475530601"}

17. Consumer fetch request rate

kafka.network:type=RequestMetrics,name=RequestsPerSec,request=FetchConsumer

{"count":"125796","fifteenMinuteRate":"1.14193044007404E-33","fiveMinuteRate":"7.699516480260211E-100","meanRate":"0.7623419964866819","oneMinuteRate":"2.964393875E-314"}

18. Follower fetch request rate

kafka.network:type=RequestMetrics,name=RequestsPerSec,request=FetchFollower

{"count":"375108","fifteenMinuteRate":"2.302746562040189","fiveMinuteRate":"2.292459728166488","meanRate":"2.2721808581484693","oneMinuteRate":"2.2814260196672973"}

19. Produce request total time

kafka.network:type=RequestMetrics,name=TotalTimeMs,request=Produce

{"50thPercentile":"1.0","75thPercentile":"1.0","95thPercentile":"2.0","98thPercentile":"2.0","999thPercentile":"28.0","99thPercentile":"4.0","count":"83384","max":"48.0","mean":"1.2344934279957787","min":"0.0","stdDev":"1.1783192073287214"}

20. Consumer fetch total time

kafka.network:type=RequestMetrics,name=TotalTimeMs,request=FetchConsumer

{"50thPercentile":"500.0","75thPercentile":"501.0","95thPercentile":"501.0","98thPercentile":"501.0","999thPercentile":"501.971","99thPercentile":"501.0","count":"125796","max":"535.0","mean":"499.83123469744663","min":"0.0","stdDev":"17.138716708632025"}

21. Follower fetch total time

kafka.network:type=RequestMetrics,name=TotalTimeMs,request=FetchFollower

{"50thPercentile":"500.0","75thPercentile":"500.0","95thPercentile":"501.0","98thPercentile":"501.0","999thPercentile":"507.826","99thPercentile":"501.0","count":"375564","max":"532.0","mean":"437.79763502359117","min":"0.0","stdDev":"148.25999023472986"}

22. Time the follower fetch request waits in the request queue

kafka.network:type=RequestMetrics,name=RequestQueueTimeMs,request=FetchFollower

{"50thPercentile":"0.0","75thPercentile":"0.0","95thPercentile":"0.0","98thPercentile":"0.0","999thPercentile":"0.0","99thPercentile":"0.0","count":"376206","max":"28.0","mean":"0.0010260336092459982","min":"0.0","stdDev":"0.1282889653905258"}

23. Time the consumer fetch request waits in the request queue

kafka.network:type=RequestMetrics,name=RequestQueueTimeMs,request=FetchConsumer

{"50thPercentile":"0.0","75thPercentile":"0.0","95thPercentile":"0.0","98thPercentile":"0.0","999thPercentile":"0.0","99thPercentile":"0.0","count":"125796","max":"24.0","mean":"0.0018124582657636174","min":"0.0","stdDev":"0.18122860552537737"}

24. Time the produce request waits in the request queue

kafka.network:type=RequestMetrics,name=RequestQueueTimeMs,request=Produce

{"50thPercentile":"0.0","75thPercentile":"0.0","95thPercentile":"0.0","98thPercentile":"0.0","999thPercentile":"0.0","99thPercentile":"0.0","count":"83704","max":"12.0","mean":"2.6283092803211315E-4","min":"0.0","stdDev":"0.042892540270754634"}

25. Broker network processor thread idle ratio

kafka.network:type=SocketServer,name=NetworkProcessorAvgIdlePercent

{"value":"1.0015540075894207"}

26. ISR shrink rate

kafka.server:type=ReplicaManager,name=IsrShrinksPerSec

{"count":"0","fifteenMinuteRate":"0.0","fiveMinuteRate":"0.0","meanRate":"0.0","oneMinuteRate":"0.0"}
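To collect the broker metrics listed above from outside the broker process, a client would typically open a remote JMX connection and read attributes by MBean name. The sketch below is an illustration under several assumptions: the broker was started with remote JMX enabled, the host:port argument points at that JMX port, no authentication or SSL is configured, and the metrics expose capitalized attribute names such as Value, Count and OneMinuteRate. JmxReaderSketch and read are hypothetical names. Without an argument it reads from the local JVM so that it still runs.

```java
import java.lang.management.ManagementFactory;
import java.util.LinkedHashMap;
import java.util.Map;
import javax.management.Attribute;
import javax.management.MBeanServerConnection;
import javax.management.ObjectName;
import javax.management.remote.JMXConnector;
import javax.management.remote.JMXConnectorFactory;
import javax.management.remote.JMXServiceURL;

// Hypothetical helper class, not part of Kafka.
class JmxReaderSketch {
    // Reads the requested attributes of one MBean; attributes that cannot be
    // retrieved are silently omitted by getAttributes().
    static Map<String, Object> read(MBeanServerConnection conn, String mbean, String... attributes)
            throws Exception {
        Map<String, Object> values = new LinkedHashMap<>();
        for (Attribute attribute : conn.getAttributes(new ObjectName(mbean), attributes).asList()) {
            values.put(attribute.getName(), attribute.getValue());
        }
        return values;
    }

    public static void main(String[] args) throws Exception {
        if (args.length == 0) {
            // No broker address given: demonstrate against this JVM instead.
            System.out.println(read(ManagementFactory.getPlatformMBeanServer(),
                    "java.lang:type=Threading", "ThreadCount", "PeakThreadCount"));
            return;
        }
        // args[0] is assumed to be <host>:<jmx-port> of a broker.
        JMXServiceURL url = new JMXServiceURL("service:jmx:rmi:///jndi/rmi://" + args[0] + "/jmxrmi");
        try (JMXConnector connector = JMXConnectorFactory.connect(url)) {
            MBeanServerConnection conn = connector.getMBeanServerConnection();
            System.out.println(read(conn,
                    "kafka.server:type=ReplicaManager,name=UnderReplicatedPartitions", "Value"));
            System.out.println(read(conn,
                    "kafka.server:type=BrokerTopicMetrics,name=MessagesInPerSec",
                    "Count", "OneMinuteRate"));
        }
    }
}
```

Taking an MBeanServerConnection as a parameter keeps the reading logic identical for local and remote servers, which also makes it easy to try out without a running broker.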

This article is shared from the WeChat official account Watermelon old beams (gh_01130ae30a83). Author: Liang Yong

Original publication time: 2019-12-08

Origin blog.csdn.net/qq_21383435/article/details/109103515