JAVA development and maintenance (software integrated observable platform functions)

Software observability is a type of software measurement. It aims to monitor and describe the software's digital experience, business operations, network performance, application performance, infrastructure, and IT processes. Enable developers and operation and maintenance personnel to better optimize and maintain the software.

1. Digital experience:

user session

Understand the user's usage path, and track down problems such as slow request, slow loading, slow interaction, and crash errors that affect the user experience during use.

Crash, Caton, ANR, Abnormal

View crashes, freezes, ANRs, exceptions and other problems that affect the experience in multiple dimensions, gain insights into the characteristics of the problems, and locate and solve problems through information such as code stacks and operating environments.

network request

Multi-dimensional analysis of request performance of CDN, third-party service, server interface, etc., to locate problems of slow requests and wrong requests.

start analysis

Divide the start-up time into multiple stages: system preparation, application preparation, view value, and interaction preparation, and locate and solve the slow start problem.

Web page load

Multi-dimensional analysis of the number of slow page loads, the number of affected users, etc., analysis, connection establishment, first package, white screen, first screen, interactive, full loading and other multi-stage positioning reasons for slow positioning, to ensure fast loading of key business elements.

CDN coverage

Analyze and evaluate the coverage quality of CDN from the dimensions of regions, access points, hosts, operators, resources, etc., to improve user experience.

User Operations

Analyze user behaviors such as activity, retention loss, conversion funnel, user path, etc., and data-driven experience improvement and operational growth

Data retrieval and visualization

Quickly retrieve data such as sessions, requests, crashes, and pages through users, devices, and business IDs to quickly locate specific problems. Visualize multidimensional data to quickly discover quality and experience issues.

2. Business operation:

Take the brokerage business as an example

Core Business Evaluation

Launch a brand-new health evaluation system based on the core business of securities companies to help you establish a scientific business SLO.

Market business evaluation

Provide a brand-new market business evaluation system, and scientifically evaluate the market business of securities companies in the form of "market quadrants"; for market business, launch a new market service measurement index: "push rate", objectively and accurately evaluate the stability of securities companies' market services.

site analysis

Automatically capture the customer's site during the transaction process, and analyze the relationship between the user and the site, and help the customer analyze whether the site coverage strategy is reasonable

site monitoring

Based on the site that is automatically obtained from the market and transaction business, it automatically connects to the Bonree Net platform, and automatically dials to test the availability and delay of the site. The customer does not need to intervene in the whole process, and it is ready to use out of the box

Trading session details

Provide comprehensive and rich on-site data, involving terminal attributes, site attributes, action details, event details, screen recording and packet capture data

Business big screen

The observable platform for the core terminal business of securities companies must be able to monitor the large screen, overview the market and trading business, and intuitively perceive the operation of the terminal business

3. Network performance:

Monitor network performance.

Network Monitoring

Real-time monitoring of the enterprise network environment, and obtain the latest network operation status, to realize the operation status display, fault discovery, fault warning, fault location, etc. of network/security equipment and lines in the network environment

topology monitoring

Automatically generate network topology, intelligently discover network loopbacks and multi-links, filter, select, drag and drop to generate the required topology, and display static information and dynamic indicators of related devices and network connections on the network topology map, such as If there is an exception, it can also be directly displayed on the topology map and give an alarm notification

configuration monitoring

Supports monitoring of network device configuration changes, and notifies relevant operation and maintenance personnel if there is any change; at the same time, it supports security configuration baseline verification management, and conducts compliance security checks on the target system to find out non-compliant items and choose to implement security measures to control Security Risk

4. Application performance:

Distributed link retrieval and analysis

Through the code-level call tracking technology, the service topology is automatically drawn, and the call chain is retrieved through traceID and business data to quickly locate slow calls, slow methods, slow SQL, wrong calls, and method exceptions.

Automatic Topology Discovery

Supports monitoring and analysis at various levels such as system, application, service, interface, and instance. Automatic topology discovery, the influence and dependence of upstream and downstream are clear at a glance

Intelligent alarm

Through intelligent time series data anomaly detection and prediction, problems can be found, and alarm policies and alarm notification channels can be flexibly defined.

Host and instance monitoring

Resource utilization analysis of hosts and containers, and JVM memory analysis.

Flexible configuration

Flexible configuration such as service and interface identification, custom hotspot method, parameter collection, health degree, black and white list, etc.

5. Infrastructure:

host monitoring

Supports the management of multiple types of host devices, through professional host device performance monitoring, real-time monitoring of the host's CPU, memory, disk capacity, disk IO, process, number of TCP connections, network port traffic, temperature, power and other performance monitoring indicators

IDC (computer room monitoring)

IDC machine room monitoring mainly focuses on the comprehensive monitoring of the power environment of the machine room (power distribution, site security, site environment, etc.), cabinet capacity, PDU usage, etc., as well as equipment asset management of the machine room, to realize the integration of IT infrastructure and the power environment of the machine room monitoring and management, so that the IDC computer room can be operated with high quality and refined operation and maintenance

Guess you like

Origin blog.csdn.net/dongjing991/article/details/130519395