Kubernetes + Prometheus + Grafana 集群监控

地址: http://www.mknight.cn/post/631/

简介: Welcome to Prometheus! Prometheus is a monitoring platform that collects metrics from monitored targets by scraping metrics HTTP endpoints on these targets.

Prometheus

下载

wget https://github.com/prometheus/prometheus/releases/download/v1.5.2/prometheus-1.5.2.linux-amd64.tar.gz
tar -zxvf prometheus-1.5.2.linux-amd64.tar.gz -C /opt/prometheus --strip-components=1
cd /opt/prometheus
# 备份一下配置文件
mv prometheus.yml prometheus.yml-bak

修改配置文件,替换为自己的IP。

static_configs:
- targets: ['10.10.30.102:9090']

启动

/opt/prometheus/prometheus --config.file=prometheus.yml

检查9090端口是否监听，则正常。访问9090端口

Grafana

下载

wget https://s3-us-west-2.amazonaws.com/grafana-releases/release/grafana-4.0.1-1480694114.x86_64.rpm
yum localinstall grafana-4.0.1-1480694114.x86_64.rpm
service grafana-server start

配置数据源

保存后测试一下，如果不通过说明刚才的9090端口有问题。

增加dashboards模板

下载dashboards
(https://github.com/percona/grafana-dashboards)
git clone https://github.com/percona/grafana-dashboards.git
cp -r grafana-dashboards/dashboards /var/lib/grafana/
编辑 Grafana config
vi /etc/grafana/grafana.ini
[dashboards.json]
enabled = true
path = /var/lib/grafana/dashboards
systemctl restart grafana-server

Kubernetes监控

导入json模板

下载地址json模板。
然后Import

配置prometheus

最终配置文件如下所示：

# my global config
global:
scrape_interval: 15s # Set the scrape interval to every 15 seconds. Default is every 1 minute.
evaluation_interval: 15s # Evaluate rules every 15 seconds. The default is every 1 minute.
# scrape_timeout is set to the global default (10s).
# Attach these labels to any time series or alerts when communicating with
# external systems (federation, remote storage, Alertmanager).
external_labels:
monitor: 'codelab-monitor'
# Load rules once and periodically evaluate them according to the global 'evaluation_interval'.
rule_files:
# - "first.rules"
# - "second.rules"
# A scrape configuration containing exactly one endpoint to scrape:
# Here it's Prometheus itself.
scrape_configs:
# The job name is added as a label `job=<job_name>` to any timeseries scraped from this config.
- job_name: 'prometheus'
# metrics_path defaults to '/metrics'
# scheme defaults to 'http'.
static_configs:
- targets: ['10.10.30.102:9090']
#以下部分为配置kubernetes,注意修改地址
- job_name: 'kubernetes-nodes-cadvisor'
kubernetes_sd_configs:
- api_server: 'http://10.10.30.102:8080'
role: node
relabel_configs:
- action: labelmap
regex: __meta_kubernetes_node_label_(.+)
- source_labels: [__meta_kubernetes_role]
action: replace
target_label: kubernetes_role
- source_labels: [__address__]
regex: '(.*):10250'
replacement: '${1}:10255'
target_label: __address__
- job_name: 'kubernetes_node'
kubernetes_sd_configs:
- role: node
api_server: 'http://10.10.30.102:8080'
relabel_configs:
- source_labels: [__address__]
regex: '(.*):10250'
replacement: '${1}:9100'
target_label: __address__

重启服务，选择kubernetes的Dashboard

Kubernetes + Prometheus + Grafana 集群监控

Prometheus

下载

启动

Grafana

下载

配置数据源

增加dashboards模板

Kubernetes监控

导入json模板

配置prometheus

猜你喜欢