The operation and maintenance observation capability of log service helps new retail containerized deployment and upgrade

Alibaba Cloud's SLS provides Yuanfuda with high-quality, high-capacity log storage services and efficient log query and analysis services. The one-stop service of SLS is easy to operate, intuitive and clear, and can quickly trace the source of Yuanfuda product online problems at any time, which greatly reduces the input cost of server storage resources and human resources. At the same time, it also has the function of analyzing logs from multi-dimensional statistics, which is also very helpful for business improvement.

Hebei Yuanfuda Trading Group was established in 2017. It has developed from a single traditional retail enterprise to a new retail group integrating production, logistics, planting, catering, e-commerce + physical retail. There are currently five major business segments (chain supermarkets, online shopping malls, processing and production, scientific planting, and cultural catering).

Yuanfuda's main business focuses on order transactions, and has high requirements for system stability and reliability. It needs comprehensive, efficient and flexible operation and maintenance support without dead ends. At the operation and maintenance level, the log system of each level and each functional module is used to monitor the system operation, locate problems, and optimize performance. In the early days, the system used the open source ELK system to build the entire operation and maintenance system, and collected logs into the ELK system to provide unified retrieval.

Container deployment maintenance service architecture puts forward higher requirements for system operation and maintenance

Yuanfuda's overall IT system is built based on the microservice architecture of containers. Because containers can be opened and closed flexibly, with the development of business, the pressure of operation and maintenance has also increased exponentially. The original operation and maintenance structure can no longer meet the impact of high-frequency switching containers, so Yuanfuda needs a reliable operation and maintenance system to ensure business stability.

Driven by the rapid advancement of cloud-native technology, resources on the cloud are becoming more and more complex, and the architecture is more diverse. From the initial server, database, to the combined use of load balancing, WAF, CDN, OSS and other products, a unified monitoring is urgently needed The platform provides insight into the operation of resources on the cloud.

Since Yuanfuda belongs to the retail industry, the volume of online and offline orders will increase significantly with factors such as festivals, promotional activities, and popular products. Therefore, the number of visits and orders of C-end users will fluctuate greatly within a certain period of time. Therefore, Yuanfuda Fuda needs an observable platform to predict business trends in real time, adjust the size of cloud resources in a timely manner, and optimize cloud costs.

One-stop observable O&M solution to ease the increasing pressure on the system

The operation and maintenance cost of traditional operation and maintenance IT systems is too high and the pressure is too high. After adopting the micro-service architecture of containers, its lightweight characteristics make operation and maintenance more flexible and efficient. Compared with traditional operation and maintenance, this method can achieve faster deployment and delivery. However, with the growth of Yuanfuda's business, the number of containers has gradually increased, and with the high frequency of container switching, the traditional way of using the open source ELK system to build the entire operation and maintenance system has been unable to meet the growing business needs.
insert image description here

Based on the above situation, Alibaba Cloud provides it with a one-stop cloud-native observable operation and maintenance solution. The solution is based on the SLS cloud-native observable platform, supported by big data sources, compatible with open source standards, and can adapt AI algorithms to multiple scenarios for large-scale data processing and analysis. It is a solution launched by Alibaba Cloud for enterprise-level big data operation and maintenance scenarios, helping enterprises to easily realize anomaly detection, root cause analysis, second-level response, and real-time prediction in daily operation and maintenance work.

To address the performance bottlenecks faced by Yuanfuda, Alibaba Cloud's cloud-native observable O&M solution provides O&M-free, high-performance log data storage, query, and modeling services. It can support real-time query and analysis of PB-level data, and provides more than 10 query operators, more than 10 machine learning functions, and more than 100 SQL functions. Help Yuanfuda easily realize log data query in high-concurrency and high-traffic scenarios such as festival activities and e-commerce promotions. Even if there is a sudden problem, you can quickly query the problem log, quickly locate the problem point, and greatly improve the user's product experience.

Moreover, since most of Yuanfuda's systems have been migrated to the cloud, different types of cloud products often need to be combined to meet different business needs, which will further increase the difficulty of cost management and resource management. Alibaba Cloud's cloud-native observable operation and maintenance solution, relying on the one-stop observation capability of the log service SLS, can seamlessly connect to various cloud products, realize one-stop visual management of resources and usage costs, customize charts, and real-time data Capabilities such as monitoring and data exception alarming help Yuanfuda to supervise and alarm abnormal business data, detect abnormal changes in time, and achieve the fastest system troubleshooting and optimal cloud product resource usage planning.
insert image description here

The operation and maintenance observation capability of log service helps new retail containerized deployment and upgrade

Guess you like

Origin blog.csdn.net/bjchenxu/article/details/130320272