3.2. 配置机器健康检查以使用 Self Node Remediation Operator
使用以下步骤将 worker 或 control-plane 机器健康检查配置为使用 Self Node Remediation Operator 作为补救供应商。
先决条件
-
安装 OpenShift CLI(
oc)。 -
以具有
cluster-admin特权的用户身份登录。
流程
创建
SelfNodeRemediationTemplateCR:定义
SelfNodeRemediationTemplateCR:apiVersion: self-node-remediation.medik8s.io/v1alpha1 kind: SelfNodeRemediationTemplate metadata: namespace: openshift-machine-api name: selfnoderemediationtemplate-sample spec: template: spec: remediationStrategy: ResourceDeletion 1- 1
- 指定补救策略。默认策略是
ResourceDeletion。
要创建
SelfNodeRemediationTemplateCR,请运行以下命令:$ oc create -f <snrt-name>.yaml
创建或更新
MachineHealthCheckCR 以指向SelfNodeRemediationTemplateCR:定义或更新
MachineHealthCheckCR:apiVersion: machine.openshift.io/v1beta1 kind: MachineHealthCheck metadata: name: machine-health-check namespace: openshift-machine-api spec: selector: matchLabels: 1 machine.openshift.io/cluster-api-machine-role: "worker" machine.openshift.io/cluster-api-machine-type: "worker" unhealthyConditions: - type: "Ready" timeout: "300s" status: "False" - type: "Ready" timeout: "300s" status: "Unknown" maxUnhealthy: "40%" nodeStartupTimeout: "10m" remediationTemplate: 2 kind: SelfNodeRemediationTemplate apiVersion: self-node-remediation.medik8s.io/v1alpha1 name: selfnoderemediationtemplate-sample要创建
MachineHealthCheckCR,请运行以下命令:$ oc create -f <mhc-name>.yaml
要更新
MachineHealthCheckCR,请运行以下命令:$ oc apply -f <mhc-name>.yaml