hscloud

cheshire

hscloud

forked from hswaw/hscloud

Fork 0

Commit Graph

Author	SHA1	Message	Date
q3k	9848e7e15f	cluster: deploy NixOS-based ceph First pass at a non-rook-managed Ceph cluster. We call it k0 instead of ceph-waw4, as we pretty much are sure now that we will always have a one-kube-cluster-to-one-ceph-cluster correspondence, with different Ceph pools for different media kinds (if at all). For now this has one mon and spinning rust OSDs. This can be iterated on to make it less terrible with time. See b/6 for more details. Change-Id: Ie502a232c700af93f33fcad9fa1c57058161aa11	2021-09-11 20:33:24 +00:00
q3k	1257389d3d	k0: expose controller-manager and scheduler metrics We want to be able to scrape controller-manager and scheduler metrics into Prometheus. For that, each of them needs to: 1) listen on a secure port 2) have authn enabled With this, any k8s user with the right permissions (and a bearer token or TLS certificate) can come in and access metrics over a node's public IP address. Access without a certificate/token gets thrown into the system:anonymous user, which as no access to any API. Change-Id: I267680f92f748ba63b6762e6aaba3c417446e50b	2020-10-10 16:00:15 +00:00
q3k	c78cc13528	cluster/nix: locally build nixos derivations We change the existing behaviour (copy files & run nixos-rebuild switch) to something closer to nixops-style. This now means that provisioning admin machines need Nix installed locally, but that's probably an okay choice to make. The upside of this approach is that it's easier to debug and test derivations, as all data is local to the repo and the workstation, and deploying just means copying a configuration closure and switching the system to it. At some point we should even be able to run the entire cluster within a set of test VMs. We also bump the kubernetes control plane to 1.14. Kubelets are still at 1.13 and their upgrade is comint up today too. Change-Id: Ia9832c47f258ee223d93893d27946d1161cc4bbd	2020-02-02 22:31:53 +01:00

Author

SHA1

Message

Date

q3k

9848e7e15f

cluster: deploy NixOS-based ceph

First pass at a non-rook-managed Ceph cluster. We call it k0 instead of
ceph-waw4, as we pretty much are sure now that we will always have a
one-kube-cluster-to-one-ceph-cluster correspondence, with different Ceph
pools for different media kinds (if at all).

For now this has one mon and spinning rust OSDs. This can be iterated on
to make it less terrible with time.

See b/6 for more details.

Change-Id: Ie502a232c700af93f33fcad9fa1c57058161aa11

2021-09-11 20:33:24 +00:00

q3k

1257389d3d

k0: expose controller-manager and scheduler metrics

We want to be able to scrape controller-manager and scheduler metrics
into Prometheus. For that, each of them needs to:

 1) listen on a secure port
 2) have authn enabled

With this, any k8s user with the right permissions (and a bearer token
or TLS certificate) can come in and access metrics over a node's public
IP address. Access without a certificate/token gets thrown into the
system:anonymous user, which as no access to any API.

Change-Id: I267680f92f748ba63b6762e6aaba3c417446e50b

2020-10-10 16:00:15 +00:00

q3k

c78cc13528

cluster/nix: locally build nixos derivations

We change the existing behaviour (copy files & run nixos-rebuild switch)
to something closer to nixops-style. This now means that provisioning
admin machines need Nix installed locally, but that's probably an okay
choice to make.

The upside of this approach is that it's easier to debug and test
derivations, as all data is local to the repo and the workstation, and
deploying just means copying a configuration closure and switching the
system to it. At some point we should even be able to run the entire
cluster within a set of test VMs.

We also bump the kubernetes control plane to 1.14. Kubelets are still at
1.13 and their upgrade is comint up today too.

Change-Id: Ia9832c47f258ee223d93893d27946d1161cc4bbd

2020-02-02 22:31:53 +01:00

3 Commits (523df5c2353bec74f6c4be74713a02e170a4f079)