mirror of
https://gerrit.hackerspace.pl/hscloud
synced 2024-10-18 03:07:44 +00:00
viq
30a563c49f
This adds automatic scraping of pods and services based on presence of annotations: - prometheus.io/scrape - prometheus.io/port - prometheus.io/path Change-Id: I1c1afecc75c30278889de1f6ca0b17da69997295 Reviewed-on: https://gerrit.hackerspace.pl/c/hscloud/+/1850 Reviewed-by: implr <implr@hackerspace.pl> |
||
---|---|---|
.. | ||
doc | ||
lib | ||
secrets | ||
k0.jsonnet | ||
OWNERS | ||
README.md |
hscloud monitoring
Quick links
- Old Global Dashboard: monitoring.hackerspace.pl - old monitoring system, unrelated to this one, configured using Chef at management.hackerspace.pl (long since dead). This setup is supposed to replace it.
Architecture
The hscloud monitoring solution is two-tiered:
- at the global tier we run metrics aggregation, long-term storage, dashboard and alerting.
- at the agent tier we collect metrics from various sources (possibly even lower tiered agents).
All agent-tier agents send metrics to all global instances.
.--------. .--------. '.
| global | | global | > - global tier
'--------' '--------' .' (contains 'global instances')
| '---. .---' |
| X |
| .---' '---. |
| | | |
.--------------. .--------------------. '.
| cluster | | hswaw-proxy | |
| k0.hswaw.net | | waw.hackerspace.pl | > - agent tier
'--------------' '--------------------' .' (contains 'agents')
Agent - cluster
Cluster agents are responsible from collecting Kubernetes cluster metrics. They run a prometheus server that scrapes kubelet/cadvisor/... metrics and send them off to global instances.
Global Instances
Global agents run Victoria Metrics, ingest metrics from all agents, and perform long-term storage. In the future they will also run Grafana and AlertManager.