ClickHouse technology selection for big data analysis

The company's strategy now needs more data to back it up, so we are planning to build a user data analysis platform. The team is short-handed and has no engineers experienced with the HDFS ecosystem, so the platform will be implemented in stages. The first stage covers data collection, cleaning, and storage of user behavior data. The second stage implements the query features required by the analysis models.

After some research, ClickHouse looks like a good fit for high-performance queries over large data volumes, and its query language is very close to standard SQL, so we decided to evaluate it. Here are some high-quality articles for reference:

1. Quick Start

Covers the basics of ClickHouse: use cases, table engines, usage guidelines, real-time synchronization, and so on.

ClickHouse Cloud Native User Manual

ClickHouse & StarRocks experience sharing

ClickHouse Getting Started, Tuning, and Hands-On Practice: A One-Stop Complete Guide

ClickHouse usage practices and specifications

Analysis of ClickHouse basics, practice, and tuning from all perspectives

2. Enterprise application and practice

Building a ClickHouse-based real-time data warehouse for an advertising platform at the tens-of-billions scale

Bilibili's Mass User Behavior Analysis Application Practice Based on ClickHouse

NetEase Experience and Guidelines: ClickHouse Development and Usage Specifications

Supporting frequent updates and ad hoc queries: ClickHouse's application in iQiyi video production

The practice of ClickHouse in Bilibili user behavior analysis

Jiguang's journey with ClickHouse for 100-billion-scale data analysis

God knows what I went through from MongoDB to ClickHouse

Application of ClickHouse on a Big Data Analysis Platform: Retention Analysis

Demystifying how ByteDance solves complex-query problems in ClickHouse

Jingdong ClickHouse High Availability Practice

Talking about Tick Market Data Cleaning: Application of ClickHouse Analysis Functions

Best practice of WeChat ClickHouse real-time data warehouse

ClickHouse in practice | retention, path, funnel, session

Practical application of ClickHouse in self-service behavior analysis scenarios

User Behavior Analysis System Based on ClickHouse

The application practice of ClickHouse in the billion-level user portrait platform

Practice of building a user behavior analysis system in the game industry based on the cloud database ClickHouse

Building an audience segmentation system based on the cloud database ClickHouse

Big data architecture for user behavior analysis based on ClickHouse

Practice of Multidimensional User Behavior Analysis Based on ClickHouse

Exploration and practice of ClickHouse in self-service analysis scenarios

3. Pitfalls

As the largest ClickHouse user in China, what pitfalls has ByteDance run into?

A record of ClickHouse cluster failure caused by DDL

4. Optimization

ClickHouse Enhancement Plan: "Upsert"

Detailed introduction to ClickHouse query optimization

Performance increased by 5 times! ClickHouse Enhancement Plan: "Resource Isolation"

Volcano Engine's ClickHouse JOIN Optimization in Behavior Analysis Scenarios

Origin blog.csdn.net/oschina_41731918/article/details/128911682