JITStack Unified Monitoring Platform and Event Management

Event Management is one of the main processes in the ITIL operations management framework. An event is a change of state that has significance for the management of a configuration item or IT service: for example, a server in an IT system going from powered on to shut down, or an application service changing from Up to Down. The term "event" is also used for any notification created by an IT service, configuration item or monitoring tool. Events typically require IT operations staff to take action, and they often lead to incidents being logged. In ITIL 4, Event Management has been updated to Monitoring and Event Management.

Efficient IT service operation depends on timely knowledge of the state of the infrastructure, operating systems, applications and other IT systems, and on detecting any deviation from normal, expected operation. Correcting a deviation as early as possible requires this capability, and it is a good monitoring system that provides it.

People tend to confuse monitoring with event management. The two are closely related, but they are fundamentally different. Monitoring is usually highly automated and collects the state of monitored items either actively or passively. Event management focuses on monitoring and recording those state changes that the organization has defined as events. Its emphasis is on deciding how significant a state change is for operations management, and on identifying and initiating the correct action to handle it.

Monitoring is necessary for event management, but not all monitoring results in the detection of an event, and not all events have the same significance or require the same response. Events can be graded, generally into three classes: information, warning and exception. Informational events require no action when they are identified, but they provide data that supports service improvement during later analysis. Warnings are usually raised when a trigger condition is reached, so that the team can act before any negative impact on the business actually occurs. Exceptions indicate that a predefined norm has actually been violated, and action must be taken.
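The three grades can be illustrated with a minimal Python sketch. It is not JITStack code: the `Thresholds` type, the `grade_event` function and the disk-usage limits are all hypothetical, and it assumes a metric where higher readings are worse.

```python
from dataclasses import dataclass
from enum import Enum


class Grade(Enum):
    INFORMATION = "information"
    WARNING = "warning"
    EXCEPTION = "exception"


@dataclass(frozen=True)
class Thresholds:
    # Hypothetical limits for one monitored item, e.g. disk usage in percent.
    warning: float    # trigger condition: act before the business is affected
    exception: float  # predefined norm violated: action is mandatory


def grade_event(value: float, limits: Thresholds) -> Grade:
    """Grade a single reading; higher values are assumed to be worse."""
    if value >= limits.exception:
        return Grade.EXCEPTION
    if value >= limits.warning:
        return Grade.WARNING
    return Grade.INFORMATION


# Illustrative limits only; real values come from the organization's policy.
disk = Thresholds(warning=80.0, exception=95.0)
for reading in (42.0, 85.5, 97.0):
    print(reading, grade_event(reading, disk).value)
```

Readings below the warning limit are still recorded as information, which matches the point that such events need no action but remain useful for later analysis.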

Automated monitoring tools and monitoring practices can produce large amounts of data, but without a clear policy and strategy for limiting, filtering and using that data, it is worthless.

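JITStack brings together mainstream open-source monitoring platforms and practical implementation experience in the monitoring field to build a flexible, mature, visual unified monitoring solution for customer organizations, layered vertically and highly scalable horizontally. The solution uses Zabbix, Prometheus and ELK as its open-source monitoring platforms and Grafana as its open-source visualization platform, combined with Ansible for open-source automation. Vertically, it can monitor the whole information system, from hardware infrastructure, operating systems, application status and business data to virtualized environments, containers and logs, and it analyzes and displays the monitoring data. Horizontally, it scales from centralized high-availability deployments that monitor a few machines or a few dozen, up to distributed deployments that monitor several thousand devices.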

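Customer organizations use the JITStack monitoring platform to carry out the key activities of the monitoring and event management process:

Define monitored items: decide which configuration items, devices, systems, services and their components are to be monitored, and determine the monitoring strategy.

Implement and maintain monitoring: monitoring can be done with the built-in monitoring functions of devices and systems or with dedicated monitoring tools. Different systems generate large volumes of monitoring data, and events end up scattered across them. Hosts and network devices, for example, often each have their own monitoring system, with monitoring information and event alerts kept separately in each one. The JITStack unified monitoring system gathers this monitoring data in one place, which reduces the complexity of event management and improves operations efficiency.

Correction and noise reduction: because systems are coupled, a single fault can cause related systems at different layers to produce a whole series of related events, alerts and exceptions. The operations team is flooded with alerts, and troubleshooting becomes harder. JITStack's noise-reduction scheme merges event alerts that share the same cause and shows only a limited number of event notifications, helping the operations team concentrate on meaningful alerts and work more efficiently.

Establish and maintain thresholds: decide which state changes are treated as events, and choose the criteria for grading them. The JITStack monitoring system supports six severity levels by default, which allows finer-grained and more flexible response handling.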
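The merging step in the noise-reduction activity can be sketched in Python as follows. The article does not say how JITStack decides that alerts share a cause, so this sketch assumes each alert already carries a hypothetical `cause_key`; the `Alert`, `Notification` and `merge_by_cause` names and the sample alerts are likewise illustrative only.

```python
from dataclasses import dataclass, field


@dataclass
class Alert:
    source: str     # system that raised the alert
    cause_key: str  # hypothetical identifier of the suspected common cause
    message: str


@dataclass
class Notification:
    cause_key: str
    first_message: str
    sources: list = field(default_factory=list)
    count: int = 0


def merge_by_cause(alerts):
    """Collapse alerts that share a cause_key into one notification each."""
    merged = {}  # dicts keep insertion order, so first-seen causes come first
    for alert in alerts:
        note = merged.get(alert.cause_key)
        if note is None:
            note = Notification(alert.cause_key, alert.message)
            merged[alert.cause_key] = note
        note.count += 1
        if alert.source not in note.sources:
            note.sources.append(alert.source)
    return list(merged.values())


# Made-up example: one failed switch triggers alerts in three systems.
alerts = [
    Alert("network", "switch-01", "Switch switch-01 unreachable"),
    Alert("web", "switch-01", "HTTP check timed out"),
    Alert("database", "switch-01", "Replica lost contact with primary"),
    Alert("storage", "disk-07", "Disk usage above threshold"),
]
notes = merge_by_cause(alerts)
for note in notes:
    print(f"{note.cause_key}: {note.count} alert(s) from {', '.join(note.sources)}")
```

Four raw alerts become two notifications here. Working out the shared cause in the first place is the hard part in practice, and this sketch deliberately leaves it out.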

The JITStack monitoring system supports tiered, multi-channel notification. Based on the customer organization's actual circumstances, policies are established and maintained for how events of each grade are handled and managed. The thresholds, standards, policies and required processes defined in the JITStack monitoring platform are then enforced, and combined with automation tools to automate operations and maintenance.
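Tiered, multi-channel notification can be pictured as a policy table that maps severity levels to channels. The Python sketch below is an assumption-laden illustration, not the JITStack configuration format: the levels are numbered 0 to 5 only because the platform is described as having six of them, and the channel names and cut-off values are made up.

```python
# Hypothetical policy: the minimum severity level (0-5) at which each
# notification channel is used. Channel names and cut-offs are illustrative.
POLICY = {
    "dashboard": 0,  # every event is shown on the dashboard
    "email": 2,      # mid-level events are also mailed to the team
    "sms": 4,        # only the most severe events page someone
}


def channels_for(level: int, policy=POLICY) -> list:
    """Return the channels that should receive an event of this level."""
    if not 0 <= level <= 5:
        raise ValueError(f"severity level must be 0-5, got {level}")
    return sorted(name for name, minimum in policy.items() if level >= minimum)


for level in (0, 3, 5):
    print(level, channels_for(level))
```

Keeping the policy in data rather than in code means the organization can adjust which grade reaches which channel without touching the routing logic.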

The value to the business and to operations management of using the JITStack monitoring platform for monitoring and event management:

A monitoring system combined with an event management process provides a mechanism for early fault detection: faults can be detected and assigned to the relevant team for action before an actual service disruption occurs. When integrated with other service management processes, such as incident management and problem management, event management can take the underlying monitoring data as input and surface state changes and anomalies, so that the relevant people or teams respond as soon as possible. This improves response efficiency and lets the business benefit from more efficient operations overall. Monitoring and event management also lay the foundation for automation. Operations automation improves operational efficiency and frees expensive human resources for more innovative, higher-value work.

Origin blog.51cto.com/14258464/2435056