0%

prometheus监控实践

发表于 2019-07-20 更新于 2021-01-04 分类于 prometheus 阅读次数：

Power your metrics and alerting with a leading open-source monitoring solution.

prometheus server端配置范例

# my global config
global:
  scrape_interval:     60s # Set the scrape interval to every 15 seconds. Default is every 1 minute.
  evaluation_interval: 60s # Evaluate rules every 15 seconds. The default is every 1 minute.
  # scrape_timeout is set to the global default (10s).

# Alertmanager configuration
alerting:
  alertmanagers:
  - static_configs:
    - targets:
      # - alertmanager:9093

# Load rules once and periodically evaluate them according to the global 'evaluation_interval'.
rule_files:
  # - "first_rules.yml"
  # - "second_rules.yml"
  # - compute_metrics.rules

# A scrape configuration containing exactly one endpoint to scrape:
# Here it's Prometheus itself.
# ================================================================
scrape_configs:
  # The job name is added as a label `job=<job_name>` to any timeseries scraped from this config.
  - job_name: 'prometheus'

    # metrics_path defaults to '/metrics'
    # scheme defaults to 'http'.

    # static_configs:

    file_sd_configs:
      - files: ['/opt/prometheus/conf/node.d/*.yml']

  - job_name: 'metricslogfile'
    metrics_path: '/metrics/logfile'
    # metrics_path defaults to '/metrics'
    # scheme defaults to 'http'.

    # static_configs:

    file_sd_configs:
      - files: ['/opt/prometheus/conf/node.d/*.yml']

# ================================================================
  - job_name: 'blackbox'
    metrics_path: /probe
    params:
      module: [http_2xx]  # Look for a HTTP 200 response.
    # static_configs:
    #   - targets:
    #     - https://123.xxxx.com    # Target to probe with http.
    #     - http://map.xxxx.com
    file_sd_configs:
      - files: ['/opt/prometheus/conf/blackbox.d/*.yml']
    relabel_configs:
      - source_labels: [__address__]
        target_label: __param_target
      - source_labels: [__param_target]
        target_label: instance
      - target_label: __address__
        replacement: blackbox.server.local:19115  # The blackbox exporter's real hostname:port.
# ================================================================
  - job_name: 'mobile.xxxx.com'
    metrics_path: /probe
    params:
      module: [http_2xx_mobile_xxxx_com]  # Look for a HTTP 200 response.
    file_sd_configs:
      - files: ['/opt/prometheus/conf/importantServices.d/xxxx.yml']
    relabel_configs:
      - source_labels: [__address__]
        target_label: __param_target
      - source_labels: [__param_target]
        target_label: instance
      - target_label: __address__
        replacement: blackbox.server.local:19115  # The blackbox exporter's real hostname:port.
# ================================================================
  - job_name: 'nginxstatus'
    file_sd_configs:
      - files: ['/opt/prometheus/conf/nginx.d/*.yml']
# ================================================================
  - job_name: 'jenkins'
    metrics_path: /prometheus
    static_configs:
    - targets:
      - '10.142.112.144:8080'
      labels:
        cluster: '业务'
        group: '产品'
# ================================================================
  - job_name: 'prometheustatus'
    metrics_path: /metrics
    static_configs:
    - targets:
      - '127.0.0.1:19090'
      labels:
        cluster: 'Prometheus'
        group: '生产环境'
# ================================================================
  - job_name: 'etcd'
    metrics_path: /metrics
    file_sd_configs:
      - files: ['/opt/prometheus/conf/etcd.d/*.yml']

子配置范例

node

- targets: [
        '10.0.0.1:9100',
        '10.0.0.2:9100',
        '10.0.0.3:9100',
]
  labels:
    cluster: '业务'
    group: '产品'

blackbox

- targets: [
'http://www.xxxx.com/',
]
  labels:
    cluster: '业务'
    group: '产品'

etcd监控

- targets: [
    'etcd01.localhost:2379',
    'etcd02.localhost:2379',
    'etcd03.localhost:2379',
]
  labels:
    cluster: 'etcd.业务'
    group: '产品'

相关资料

prometheus

exporters
grafana模板
prometheus
Node Exporter 0.16 + for Prometheus 监控展示看板
Node Exporter Full
- CentOS5版本监控
Node Exporter Server Metrics
jenkins状态监控

blackbox

telegraf + influxdb

推荐文章（由hexo文章推荐插件驱动）