Help us improve your experience.

Let us know what you think.

Do you have time for a two-minute survey?

external-header-nav
keyboard_arrow_up
close
keyboard_arrow_left
list Table of Contents
file_download PDF
keyboard_arrow_right

Health Check

date_range 26-Sep-22

SUMMARY In Cloud-Native Contrail Networking (CN2) Release 22.3, a new health check custom resource object is introduced that associates the virtual machine interface (VMI) to the pod creation and update workflow. The health check resource is a namespace-scoped resource.

Health Check Overview

The health check functionality is provided by the Contrail vRouter agent. You can associate a ping or HTTP health check to an interface. If the health check fails, based on the timers and intervals configured in the health check object, the interface is set as administratively down and associated routes are withdrawn. Health check traffic continues to be transmitted in an administratively down state to allow for an interface to recover.

Create a Health Check Object

Use this procedure to create a health check object.

  1. In the deployment manifests from the Contrail Networking download page, use the hc.yaml file (shown below) for the YML definition for health check objects. The same folder also includes the hc_pod.yaml which has the YML definition to associate the health check object with VMI by means of pod definitions.

    Sample hc.yaml file:

    content_copy zoom_out_map
    apiVersion: v1
    kind: Namespace
    metadata:
      name: healthcheck
    ---
    apiVersion: core.contrail.juniper.net/v1alpha1
    kind: ServiceHealthCheck
    metadata:
      name: ping-hc
      namespace: healthcheck
    spec:
      serviceHealthCheckProperties:
        delay: 2
        enabled: true
        healthCheckType: end-to-end
        maxRetries: 5
        monitorType: PING
        timeout: 5
    ---
    apiVersion: core.contrail.juniper.net/v1alpha1
    kind: ServiceHealthCheck
    metadata:
      name: bfd-hc
      namespace: healthcheck
    spec:
      serviceHealthCheckProperties:
        delay: 2
        enabled: true
        healthCheckType: link-local
        maxRetries: 5
        monitorType: BFD
        timeout: 5
    ---
    apiVersion: core.contrail.juniper.net/v1alpha1
    kind: ServiceHealthCheck
    metadata:
      name: http-hc
      namespace: healthcheck
    spec:
      serviceHealthCheckProperties:
        delay: 2
        enabled: true
        healthCheckType: end-to-end
        maxRetries: 5
        monitorType: HTTP
        timeout: 5
        httpMethod: GET
        expectedCodes: 200
        urlPath: /health
    
  2. Complete the parameters to define the health check. Table 1 lists and explains the parameters.
    Table 1: Health Check Configurable Parameters
    Field Description
    Delay The delay, in seconds, to repeat the health check.
    DelayUsecs Time in micro seconds at which health check is repeated.
    Enabled Indicates that health check is enabled. The default is False.
    ExpectedCodes When the monitor protocol is HTTP, the expected return code for HTTP operations. Must be in the range of 200-299.
    HealthCheckType Indicates the health check type: link-local, end-to-end, segment, vn-ip-list, and end2end. The default is link-local.

    In both link-local and end-to-end modes, health check is executed for the pod on the vRouter where the VMI is running.

    HttpMethod When the monitor protocol is HTTP, the type of HTTP method used is GET.
    MaxRetries The number of retries to attempt before declaring an instance health down.
    MonitorType The protocol type to be used: PING, BFD, or TCP.
    Timeout The number of seconds to wait for a response.
    TimeoutUsecs Time in micro seconds to wait for response.
    UrlPath Must be a valid URL. For example, http://172.16.0.1/<path>, The IP address can be a placeholder which will be replaced with the pod link-local IP address or metadata IP address.

    Following is an abstract Golang schema for the health check resource.

    content_copy zoom_out_map
    type ServiceHealthCheckProperties struct {
        Delay              *int
        DelayUsecs         *int
        Enabled            boolean
        ExpectedCodes      int // Only for http
        HealthCheckType    (link-local | end-to-end |segment | vn-ip-list) //end2end
        HttpMethod          *string
        MaxRetries          int
        MonitorType         (ping | BFD |TCP)
        Timeout             int
        TimeoutUsecs        int
    
    }
    
    type ServiceHealthCheckSpec struct {
        ServiceHealthCheckProperties *ServiceHealthCheckProperties
    }
    
    type ServiceHealthCheckStatus struct {
        uuid *string
    }
    
    type ServiceHealthCheck struct {
        <kube_specific_objetcs>
        Spec     ServiceHealthCheckSpec
        Status   ServiceHealthCheckStatus
    }
    

    The YML representation for the Golang schema is:

    content_copy zoom_out_map
    ```
    apiVersion: core.contrail.juniper.net/v1alpha1
    kind: ServiceHealthCheck
    metadata:
      name: ping-hc
      namespace: healthcheck
    spec:
      serviceHealthCheckProperties:
        delay: 2
        enabled: true
        healthCheckType: end-to-end  #valid values are link-local|end-to-end|segment|vn-ip-list
        maxRetries: 5
        monitorType: PING  #valid are PING|HTTP|BFD
        timeout: 5
    
    ```
  3. Link the health check object to the VMI by means of the pod annotation reference value core.juniper.net/health-check. The default behavior is to associate the health check with the primary interface.
    content_copy zoom_out_map
    apiVersion: v1
    kind: Pod
    metadata:
      name: hc_pod
      namespace: hc_ns
      annotations:
         core.juniper.net/health-check: '[{"name": "ping-hc", "namespace": "healthcheck"}]'
    spec:
      <>
    
  4. (Optional) To link the health check with multiple interfaces (attached to different NAD or VN), you can refer the health check object within the cni-args section. Following is an example of configured cni-args in annotations.
    content_copy zoom_out_map
    apiVersion: v1
    kind: Pod
    metadata:
      name: hc_pod
      namespace: hc_ns
      annotations:
              k8s.v1.cni.cncf.io/networks: |
          [
            {
              "name": "hc-vn",
              "namespace": "healthcheck",
              "cni-args": {
                "core.juniper.net/health-check": "[{\"name\": \"ping-hc\", \"namespace\": \"healthcheck\"}]"
              }
            }
          ]
    spec:
      <>
    

    Existing VMI objects will have a new field to reference the HealthCheck object.

    content_copy zoom_out_map
    type VirtualMachineInterfaceStatus struct {
        <existing_vmi_status_attributes>
        ServiceHealthCheckReference *ResourceReference
    }
    
    
    type VirtualMachineInterface struct {
        <other VMI attributes>
        Status   VirtualMachineInterfaceStatus
    }
    
    For the PING or HTTP monitoring-based health check minimum interval is 1second. If you need a sub-second level health check for critical applications,you can opt for the BFD-based monitoring type.

Health Check Process

The Contrail vRouter agent is responsible for providing the health check service. The agent spawns a health check probe process to monitor the status of a service hosted on the same compute node, and the process updates the status to the vRouter agent.

The vRouter agent acts on the status provided by the script to withdraw or restore the exported interface routes. The agent is also responsible for providing a link-local metadata IP address for allowing the script to communicate with the destination IP address from the underlay network, using appropriate NAT translations. In a running system, this information is displayed in the vRouter agent introspect at:

content_copy zoom_out_map
http://<compute-node-ip>:8085/Snh_HealthCheckSandeshReq?uuid=
external-footer-nav