Ultimate Monitoring Using Prometheus: Ensuring Optimal Performance & Reliability
Components:
   •   Prometheus: An open-source monitoring and alerting toolkit. It works by scraping: at regular
       intervals it contacts each target system's metrics endpoint and pulls in the data (a minimal
       scrape configuration is sketched after this list).
   •   Node exporter: A monitoring agent installed on each target machine to expose host-level
       metrics (CPU, memory, disk, and so on) on an endpoint that Prometheus can scrape.
   •   Blackbox exporter: Probes endpoints from the outside over HTTP/HTTPS (and other protocols),
       so Prometheus can tell whether a website is reachable and responding.
   •   Alert manager: The Alertmanager handles alerts sent by client applications such as the
       Prometheus server. We use it to send notifications when alert conditions are met, for example
       when the website has been down continuously for 1-5 minutes or when a service is unavailable.
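To make the scraping model concrete, here is a minimal prometheus.yml sketch (the job name and target address are placeholders for illustration, not values from this project):
global:
  scrape_interval: 15s                    # how often Prometheus pulls metrics from each target

scrape_configs:
  - job_name: "example_job"               # placeholder job name
    static_configs:
      - targets: ["<target-host>:9100"]   # placeholder host exposing a /metrics endpoint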
Pre-requisites to start:
Created a security group with the following ports open:
      • 22 for SSH
      • 80 for HTTP
      • 443 for HTTPS
      • 25 for SMTP
      • 465 for SMTPS
      • 587 for SMTP (submission)
      • 9090 for Prometheus
      • 9093 for Alert manager
      • 9115 for Blackbox Exporter
      • 9100 for Node Exporter
      • 8080 for the Boardgame application (used in Step 4)
Project steps:
Step 1: Launched two EC2 instances with the Ubuntu AMI (instance type = t2.medium, storage = 20 GB)
and named them Virtual Machine 1 and Virtual Machine 2.
The Prometheus component and exporter tar files are available at: https://prometheus.io/download/
Step 2: On Virtual Machine 1:
Download and start Node Exporter.
→ sudo apt update
## Download Node Exporter
→ wget https://github.com/prometheus/node_exporter/releases/download/v1.8.1/node_exporter-1.8.1.linux-amd64.tar.gz
## Extract Node Exporter
→ tar xvfz node_exporter-1.8.1.linux-amd64.tar.gz
→ mv node_exporter-1.8.1.linux-amd64 node_exporter
## Start Node Exporter
→ cd node_exporter
→ ./node_exporter &
Step 3:
On Virtual Machine 2, install Prometheus, Alertmanager, and Blackbox Exporter.
Install Prometheus
→ sudo apt update
→ wget https://github.com/prometheus/prometheus/releases/download/v2.52.0/prometheus-2.52.0.linux-amd64.tar.gz
→ tar xvfz prometheus-2.52.0.linux-amd64.tar.gz
→ mv prometheus-2.52.0.linux-amd64 prometheus
→ cd prometheus
→ ./prometheus --config.file=prometheus.yml &
Alert Manager
→ wget https://github.com/prometheus/alertmanager/releases/download/v0.27.0/alertmanager-0.27.0.linux-amd64.tar.gz
→ tar xvfz alertmanager-0.27.0.linux-amd64.tar.gz
→ mv alertmanager-0.27.0.linux-amd64 alertmanager
→ cd alertmanager
→ ./alertmanager --config.file=alertmanager.yml &
Blackbox Exporter
→ wget https://github.com/prometheus/blackbox_exporter/releases/download/v0.25.0/blackbox_exporter-0.25.0.linux-amd64.tar.gz
→ tar xvfz blackbox_exporter-0.25.0.linux-amd64.tar.gz
→ mv blackbox_exporter-0.25.0.linux-amd64 blackbox_exporter
→ cd blackbox_exporter
→ ./blackbox_exporter &
Once the above steps are complete, the prometheus, alertmanager, and blackbox_exporter folders are all in place on VM-2.
Once the Node Exporter on VM-1 is up and running, its web page is reachable on port 9100.
Step 4:
Now let's run a simple game application to monitor.
To build and run the Boardgame application (with its source already cloned onto the VM), we need Java
and Maven, so install them using the commands below:
→ cd Boardgame
→ sudo apt install openjdk-11-jdk-headless -y     // JDK (not just the JRE) so Maven can compile
→ sudo apt install maven -y
→ mvn package                                    // to build the project
We can then execute the jar file to run the application and open it in a browser:
→ cd target
→ ls          // can see .jar file
→ java -jar database_service_project-0.0.4.jar
Now we can access the game application at: http://3.135.20.106:8080/
Step 5:
Next, go to VM-2 to configure the Prometheus server by defining alert rules for the different
scenarios; based on these rules we will receive alerts.
→ cd prometheus
→ ./prometheus &
The Prometheus server can be accessed at: http://3.145.128.69:9090/graph
For now we can't see any alert rules, so let's create a new alert_rules.yaml file to configure alert
rules on the Prometheus server.
vi alert_rules.yaml
groups:
- name: alert_rules                       # Name of the alert rules group
  rules:
    - alert: InstanceDown
      expr: up == 0                       # Expression to detect instance down
      for: 1m
      labels:
        severity: critical
      annotations:
        summary: "Endpoint {{ $labels.instance }} down"
        description: "{{ $labels.instance }} of job {{ $labels.job }} has been down for more than 1 minute."

    - alert: WebsiteDown
      expr: probe_success == 0            # Expression to detect website down
      for: 1m
      labels:
        severity: critical
      annotations:
        summary: "Website down"
        description: "The website at {{ $labels.instance }} is down."

    - alert: HostOutOfMemory
      expr: node_memory_MemAvailable_bytes / node_memory_MemTotal_bytes * 100 < 25   # Expression to detect low memory
      for: 5m
      labels:
        severity: warning
      annotations:
        summary: "Host out of memory (instance {{ $labels.instance }})"
        description: "Node memory is filling up (< 25% left)\n VALUE = {{ $value }}\n LABELS: {{ $labels }}"

    - alert: HostOutOfDiskSpace
      expr: (node_filesystem_avail_bytes{mountpoint="/"} * 100) / node_filesystem_size_bytes{mountpoint="/"} < 50   # Expression to detect low disk space
      for: 1s
      labels:
        severity: warning
      annotations:
        summary: "Host out of disk space (instance {{ $labels.instance }})"
        description: "Disk is almost full (< 50% left)\n VALUE = {{ $value }}\n LABELS: {{ $labels }}"

    - alert: HostHighCpuLoad
      expr: 100 - (avg by (instance) (irate(node_cpu_seconds_total{mode="idle"}[5m])) * 100) > 80   # Expression to detect high CPU load
      for: 5m
      labels:
        severity: warning
      annotations:
        summary: "Host high CPU load (instance {{ $labels.instance }})"
        description: "CPU load is > 80%\n VALUE = {{ $value }}\n LABELS: {{ $labels }}"

    - alert: ServiceUnavailable
      expr: up{job="node_exporter"} == 0   # Expression to detect service unavailability
      for: 2m
      labels:
        severity: critical
      annotations:
        summary: "Service Unavailable (instance {{ $labels.instance }})"
        description: "The service {{ $labels.job }} is not available\n VALUE = {{ $value }}\n LABELS: {{ $labels }}"

    - alert: HighMemoryUsage
      expr: (node_memory_Active_bytes / node_memory_MemTotal_bytes) * 100 > 90   # Expression to detect high memory usage
      for: 10m
      labels:
        severity: critical
      annotations:
        summary: "High Memory Usage (instance {{ $labels.instance }})"
        description: "Memory usage is > 90%\n VALUE = {{ $value }}\n LABELS: {{ $labels }}"

    - alert: FileSystemFull
      expr: (node_filesystem_avail_bytes / node_filesystem_size_bytes) * 100 < 10   # Expression to detect file system almost full
      for: 5m
      labels:
        severity: critical
      annotations:
        summary: "File System Almost Full (instance {{ $labels.instance }})"
        description: "File system has < 10% free space\n VALUE = {{ $value }}\n LABELS: {{ $labels }}"
Now we need to point the Prometheus server at the above rules file by updating the prometheus.yml
file (see the snippet below).
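A minimal sketch of the change, assuming alert_rules.yaml sits in the same directory as the prometheus binary (adjust the path if it is stored elsewhere):
rule_files:
  - "alert_rules.yaml"      # load the alert rules created above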
Now, to view these alert rules on the Prometheus web page, we need to restart the Prometheus server:
→ pgrep prometheus                    // to get the process id
→ kill <process-id>
→ ./prometheus &
Step 6:
Now we need to connect both the Alertmanager and the VM-1 Node Exporter to the Prometheus server by
updating the prometheus.yml file (a sketch follows).
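A sketch of the relevant prometheus.yml sections, assuming Alertmanager runs locally on VM-2 (port 9093) and Node Exporter runs on VM-1 (port 9100); <VM-1-IP> is a placeholder for the actual address:
alerting:
  alertmanagers:
    - static_configs:
        - targets: ["localhost:9093"]      # Alertmanager on VM-2

scrape_configs:
  - job_name: "prometheus"
    static_configs:
      - targets: ["localhost:9090"]        # Prometheus scrapes itself
  - job_name: "node_exporter"              # job name referenced by the ServiceUnavailable rule
    static_configs:
      - targets: ["<VM-1-IP>:9100"]        # Node Exporter on VM-1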
After restarting the Prometheus server, we should be able to see the Node Exporter in the Prometheus
Targets section.
Next, we need to configure the Blackbox Exporter to probe the website application, so let's update
the scrape configs in the prometheus.yml file (a sketch follows):
vi prometheus.yml
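A sketch of the additional job appended under scrape_configs, assuming the Blackbox Exporter runs locally on VM-2 (port 9115) with its default http_2xx module and probes the Boardgame application on VM-1 (the URL is a placeholder):
  - job_name: "blackbox"
    metrics_path: /probe
    params:
      module: [http_2xx]                   # probe over HTTP and expect a 2xx response
    static_configs:
      - targets:
          - http://<VM-1-IP>:8080          # the Boardgame application
    relabel_configs:
      - source_labels: [__address__]
        target_label: __param_target       # pass the target URL as the ?target= parameter
      - source_labels: [__param_target]
        target_label: instance             # show the probed URL as the instance label
      - target_label: __address__
        replacement: localhost:9115        # the Blackbox Exporter's own address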
Restart the Prometheus server to reflect the changes
We also need the Blackbox Exporter running.
When we start Alertmanager, we won't see any notifications yet, because Alertmanager has not been
configured.
So, let's configure it.
Now we need to configure email notifications so that we get emails when the defined alert conditions
are met.
To receive email notifications via Gmail, we need to enable 2-step verification on the Gmail account.
Step 7:
Next, go to https://myaccount.google.com/apppasswords, enter an app name, and generate an app
password, which will be used in the Alertmanager routing configuration.
cd alertmanager
vi alertmanager.yml
---
route:
  group_by:
    - alertname
  group_wait: 30s
  group_interval: 5m
  repeat_interval: 1h
  receiver: email-notifications

receivers:
  - name: email-notifications
    email_configs:
      - to: jayasample1234@gmail.com
        from: monitor@example.com
        smarthost: smtp.gmail.com:587
        auth_username: jayasample1234@gmail.com
        auth_identity: jayasample1234@gmail.com
        auth_password: "xxxx xxxx xxxx xxxx"   # the 16-character Gmail app password from Step 7
        send_resolved: true

inhibit_rules:
  - source_match:
      severity: critical
    target_match:
      severity: warning
    equal:
      - alertname
      - dev
      - instance
Now, restart the Alertmanager and check.
Hurray, the monitoring setup is complete!
Everything looks fine now.
Step 8:
Next, we will test the end-to-end functionality by shutting down the game application.
The alert status first shows as Pending.
After 1 minute the status changes to Firing, and soon we receive an email notification.
The notification can also be viewed in Alertmanager.
Next, we will try terminating the Node Exporter.
Terminating the Node Exporter sends notifications for both the EC2 instance (InstanceDown) and the
service (ServiceUnavailable).