Installing a cluster quickly on oVirt

    The installation program uses installer-provisioned infrastructure to automate creating and deploying the cluster.

    To install a default cluster, you prepare the environment, run the installation program and answer its prompts. Then, the installation program creates the OKD cluster.

    For an alternative to installing a default cluster, see Installing a cluster with customizations.

    Requirements for the oVirt environment

    To install and run an OKD version 4.13 cluster, the oVirt environment must meet the following requirements.

    Not meeting these requirements can cause the installation or process to fail. Additionally, not meeting these requirements can cause the OKD cluster to fail days or weeks after installation.

    The following requirements for CPU, memory, and storage resources are based on default values multiplied by the default number of virtual machines the installation program creates. These resources must be available in addition to what the oVirt environment uses for non-OKD operations.

    By default, the installation program creates seven virtual machines during the installation process. First, it creates a bootstrap virtual machine to provide temporary services and a control plane while it creates the rest of the OKD cluster. When the installation program finishes creating the cluster, deleting the bootstrap machine frees up its resources.

    If you increase the number of virtual machines in the oVirt environment, you must increase the resources accordingly.

    Requirements

    • The oVirt version is 4.4.

    • The oVirt environment has one data center whose state is Up.

    • The oVirt data center contains an oVirt cluster.

    • The oVirt cluster has the following resources exclusively for the OKD cluster:

      • Minimum 28 vCPUs: four for each of the seven virtual machines created during installation.

      • 112 GiB RAM or more, including:

        • 16 GiB or more for the bootstrap machine, which provides the temporary control plane.

        • 16 GiB or more for each of the three control plane machines which provide the control plane.

        • 16 GiB or more for each of the three compute machines, which run the application workloads.

    • The oVirt storage domain must meet .

    • For affinity group support: Three or more hosts in the oVirt cluster. If necessary, you can disable affinity groups. For details, see Example: Removing all affinity groups for a non-production lab setup in Installing a cluster on oVirt with customizations

    • In production environments, each virtual machine must have 120 GiB or more. Therefore, the storage domain must provide 840 GiB or more for the default OKD cluster. In resource-constrained or non-production environments, each virtual machine must have 32 GiB or more, so the storage domain must have 230 GiB or more for the default OKD cluster.

    • To download images from the Red Hat Ecosystem Catalog during installation and update procedures, the oVirt cluster must have access to an internet connection. The Telemetry service also needs an internet connection to simplify the subscription and entitlement process.

    • The oVirt cluster must have a virtual network with access to the REST API on the oVirt Engine. Ensure that DHCP is enabled on this network, because the VMs that the installer creates obtain their IP address by using DHCP.

    • A user account and group with the following least privileges for installing and managing an OKD cluster on the target oVirt cluster:

      • DiskOperator

      • DiskCreator

      • UserTemplateBasedVm

      • TemplateOwner

      • TemplateCreator

      • ClusterAdmin on the target cluster

    Apply the principle of least privilege: Avoid using an administrator account with SuperUser privileges on oVirt during the installation process. The installation program saves the credentials you provide to a temporary ovirt-config.yaml file that might be compromised.

    Additional resources

    Verifying the requirements for the oVirt environment

    Verify that the oVirt environment meets the requirements to install and run an OKD cluster. Not meeting these requirements can cause failures.

    These requirements are based on the default resources the installation program uses to create control plane and compute machines. These resources include vCPUs, memory, and storage. If you change these resources or increase the number of OKD machines, adjust these requirements accordingly.

    Procedure

    1. Check that the oVirt version supports installation of OKD version 4.13.

      1. In the oVirt Administration Portal, click the ? help icon in the upper-right corner and select About.

      2. In the window that opens, make a note of the oVirt Software Version.

      3. Confirm that the oVirt version is 4.4. For more information about supported version combinations, see Support Matrix for OKD on oVirt.

    2. Inspect the data center, cluster, and storage.

      1. In the oVirt Administration Portal, click ComputeData Centers.

      2. Confirm that the data center where you plan to install OKD is accessible.

      3. Click the name of that data center.

      4. In the data center details, on the Storage tab, confirm the storage domain where you plan to install OKD is Active.

      5. Record the Domain Name for use later on.

      6. Confirm Free Space has at least 230 GiB.

      7. Confirm that the storage domain meets , which you can measure by using the fio performance benchmarking tool.

      8. In the data center details, click the Clusters tab.

      9. Find the oVirt cluster where you plan to install OKD. Record the cluster name for use later on.

    3. Inspect the oVirt host resources.

      1. In the oVirt Administration Portal, click Compute > Clusters.

      2. Click the cluster where you plan to install OKD.

      3. In the cluster details, click the Hosts tab.

      4. Inspect the hosts and confirm they have a combined total of at least 28 Logical CPU Cores available exclusively for the OKD cluster.

      5. Record the number of available Logical CPU Cores for use later on.

      6. Confirm that these CPU cores are distributed so that each of the seven virtual machines created during installation can have four cores.

      7. Confirm that, all together, the hosts have 112 GiB of Max free Memory for scheduling new virtual machines distributed to meet the requirements for each of the following OKD machines:

        • 16 GiB required for the bootstrap machine

        • 16 GiB for each of the three compute machines

      8. Record the amount of Max free Memory for scheduling new virtual machines for use later on.

    4. Verify that the virtual network for installing OKD has access to the oVirt Engine’s REST API. From a virtual machine on this network, use curl to reach the oVirt Engine’s REST API:

      1For <username>, specify the user name of an oVirt account with privileges to create and manage an OKD cluster on oVirt. For <profile>, specify the login profile, which you can get by going to the oVirt Administration Portal login page and reviewing the Profile dropdown list. For <password>, specify the password for that user name.
      2For <engine-fqdn>, specify the fully qualified domain name of the oVirt environment.

      For example:

      1. $ curl -k -u admin@internal:pw123 \
      2. https://ovirtlab.example.com/ovirt-engine/api

    Preparing the network environment on oVirt

    Configure two static IP addresses for the OKD cluster and create DNS entries using these addresses.

    Procedure

    1. Reserve two static IP addresses

      1. On the network where you plan to install OKD, identify two static IP addresses that are outside the DHCP lease pool.

        1. $ arp 10.35.1.19

        Example output

        1. 10.35.1.19 (10.35.1.19) -- no entry
      2. Reserve two static IP addresses following the standard practices for your network environment.

      3. Record these IP addresses for future reference.

    2. Create DNS entries for the OKD REST API and apps domain names using this format:

      1. api.<cluster-name>.<base-domain> <ip-address> (1)
      2. *.apps.<cluster-name>.<base-domain> <ip-address> (2)
      1For <cluster-name>, <base-domain>, and <ip-address>, specify the cluster name, base domain, and static IP address of your OKD API.
      2Specify the cluster name, base domain, and static IP address of your OKD apps for Ingress and the load balancer.

      For example:

      1. api.my-cluster.virtlab.example.com 10.35.1.19
      2. *.apps.my-cluster.virtlab.example.com 10.35.1.20

    By default, the installer creates a CA certificate, prompts you for confirmation, and stores the certificate to use during installation. You do not need to create or install one manually.

    Although it is not recommended, you can override this functionality and install OKD without verifying a certificate by installing OKD on oVirt in insecure mode.

    Installing in insecure mode is not recommended, because it enables a potential attacker to perform a Man-in-the-Middle attack and capture sensitive credentials on the network.

    Procedure

    1. Create a file named ~/.ovirt/ovirt-config.yaml.

    2. Add the following content to ovirt-config.yaml:

      1. ovirt_url: https://ovirt.example.com/ovirt-engine/api (1)
      2. ovirt_fqdn: ovirt.example.com (2)
      3. ovirt_pem_url: ""
      4. ovirt_username: admin@internal
      5. ovirt_password: super-secret-password (3)
      6. ovirt_insecure: true
      1Specify the hostname or address of your oVirt engine.
      2Specify the fully qualified domain name of your oVirt engine.
      3Specify the admin password for your oVirt engine.
    3. Run the installer.

    Generating a key pair for cluster node SSH access

    During an OKD installation, you can provide an SSH public key to the installation program. The key is passed to the Fedora CoreOS (FCOS) nodes through their Ignition config files and is used to authenticate SSH access to the nodes. The key is added to the ~/.ssh/authorized_keys list for the core user on each node, which enables password-less authentication.

    After the key is passed to the nodes, you can use the key pair to SSH in to the FCOS nodes as the user core. To access the nodes through SSH, the private key identity must be managed by SSH for your local user.

    If you want to SSH in to your cluster nodes to perform installation debugging or disaster recovery, you must provide the SSH public key during the installation process. The ./openshift-install gather command also requires the SSH public key to be in place on the cluster nodes.

    Do not skip this procedure in production environments, where disaster recovery and debugging is required.

    On clusters running Fedora CoreOS (FCOS), the SSH keys specified in the Ignition config files are written to the /home/core/.ssh/authorized_keys.d/core file. However, the Machine Config Operator manages SSH keys in the /home/core/.ssh/authorized_keys file and configures sshd to ignore the /home/core/.ssh/authorized_keys.d/core file. As a result, newly provisioned OKD nodes are not accessible using SSH until the Machine Config Operator reconciles the machine configs with the authorized_keys file. After you can access the nodes using SSH, you can delete the /home/core/.ssh/authorized_keys.d/core file.

    Procedure

    1. If you do not have an existing SSH key pair on your local machine to use for authentication onto your cluster nodes, create one. For example, on a computer that uses a Linux operating system, run the following command:

      1. $ ssh-keygen -t ed25519 -N '' -f <path>/<file_name> (1)

      If you plan to install an OKD cluster that uses FIPS Validated / Modules in Process cryptographic libraries on the x86_64 architecture, do not create a key that uses the ed25519 algorithm. Instead, create a key that uses the rsa or ecdsa algorithm.

    2. View the public SSH key:

      1. $ cat <path>/<file_name>.pub

      For example, run the following to view the ~/.ssh/id_ed25519.pub public key:

      1. $ cat ~/.ssh/id_ed25519.pub
    3. Add the SSH private key identity to the SSH agent for your local user, if it has not already been added. SSH agent management of the key is required for password-less SSH authentication onto your cluster nodes, or if you want to use the ./openshift-install gather command.

      On some distributions, default SSH private key identities such as ~/.ssh/id_rsa and ~/.ssh/id_dsa are managed automatically.

      1. If the ssh-agent process is not already running for your local user, start it as a background task:

        Example output

        1. Agent pid 31874

        If your cluster is in FIPS mode, only use FIPS-compliant algorithms to generate the SSH key. The key must be either RSA or ECDSA.

    4. Add your SSH private key to the ssh-agent:

      1. $ ssh-add <path>/<file_name> (1)
      1Specify the path and file name for your SSH private key, such as ~/.ssh/id_ed25519

      Example output

    Next steps

    • When you install OKD, provide the SSH public key to the installation program.

    Obtaining the installation program

    Before you install OKD, download the installation file on the host you are using for installation.

    Prerequisites

    • You have a computer that runs Linux or macOS, with 500 MB of local disk space.

    Procedure

    1. Download installer from

      The installation program creates several files on the computer that you use to install your cluster. You must keep the installation program and the files that the installation program creates after you finish installing the cluster. Both files are required to delete the cluster.

      Deleting the files created by the installation program does not remove your cluster, even if the cluster failed during installation. To remove your cluster, complete the OKD uninstallation procedures for your specific cloud provider.

    2. Extract the installation program. For example, on a computer that uses a Linux operating system, run the following command:

      1. $ tar -xvf openshift-install-linux.tar.gz
    3. Download your installation pull secret from the Red Hat OpenShift Cluster Manager. This pull secret allows you to authenticate with the services that are provided by the included authorities, including Quay.io, which serves the container images for OKD components.

      Using a is not required. You can use a pull secret for another private registry. Or, if you do not need the cluster to pull images from a private registry, you can use {"auths":{"fake":{"auth":"aWQ6cGFzcwo="}}} as the pull secret when prompted during the installation.

      If you do not use the pull secret from the Red Hat OpenShift Cluster Manager:

      • Red Hat Operators are not available.

      • The Telemetry and Insights operators do not send data to Red Hat.

      • Content from the registry, such as image streams and Operators, are not available.

    Deploying the cluster

    You can install OKD on a compatible cloud platform.

    You can run the create cluster command of the installation program only once, during initial installation.

    Prerequisites

    • Open the ovirt-imageio port to the Engine from the machine running the installer. By default, the port is 54322.

    • Obtain the OKD installation program and the pull secret for your cluster.

    • Verify the cloud provider account on your host has the correct permissions to deploy the cluster. An account with incorrect permissions causes the installation process to fail with an error message that displays the missing permissions.

    Procedure

    1. Change to the directory that contains the installation program and initialize the cluster deployment:

      1. $ ./openshift-install create cluster --dir <installation_directory> \ (1)
      2. --log-level=info (2)
      1For <installation_directory>, specify the directory name to store the files that the installation program creates.
      2To view different installation details, specify warn, debug, or error instead of info.

      When specifying the directory:

      • Verify that the directory has the execute permission. This permission is required to run Terraform binaries under the installation directory.

      • Use an empty directory. Some installation assets, such as bootstrap X.509 certificates, have short expiration intervals, therefore you must not reuse an installation directory. If you want to reuse individual files from another cluster installation, you can copy them into your directory. However, the file names for the installation assets might change between releases. Use caution when copying installation files from an earlier OKD version.

    2. Respond to the installation program prompts.

      1. Optional: For SSH Public Key, select a password-less public key, such as ~/.ssh/id_rsa.pub. This key authenticates connections with the new OKD cluster.

      2. For Platform, select ovirt.

      3. For Engine FQDN[:PORT], enter the fully qualified domain name (FQDN) of the oVirt environment.

        For example:

        1. $ curl -k -u admin@internal:pw123 \
        2. https://ovirtlab.example.com/ovirt-engine/api
      4. The installation program automatically generates a CA certificate. For Would you like to use the above certificate to connect to the Engine?, answer y or N. If you answer N, you must install OKD in insecure mode.

      5. For Engine username, enter the user name and profile of the oVirt administrator using this format:

        1. <username>@<profile> (1)
        1For <username>, specify the user name of an oVirt administrator. For <profile>, specify the login profile, which you can get by going to the oVirt Administration Portal login page and reviewing the Profile dropdown list. For example: admin@internal.
      6. For Engine password, enter the oVirt admin password.

      7. For Cluster, select the oVirt cluster for installing OKD.

      8. For Storage domain, select the storage domain for installing OKD.

      9. For Network, select a virtual network that has access to the oVirt Engine REST API.

      10. For Internal API Virtual IP, enter the static IP address you set aside for the cluster’s REST API.

      11. For Base Domain, enter the base domain of the OKD cluster. If this cluster is exposed to the outside world, this must be a valid domain recognized by DNS infrastructure. For example, enter: virtlab.example.com

      12. For Cluster Name, enter the name of the cluster. For example, my-cluster. Use cluster name from the externally registered/resolvable DNS entries you created for the OKD REST API and apps domain names. The installation program also gives this name to the cluster in the oVirt environment.

      13. For Pull Secret, copy the pull secret from the pull-secret.txt file you downloaded earlier and paste it here. You can also get a copy of the same .

    Verification

    When the cluster deployment completes successfully:

    • Credential information also outputs to <installation_directory>/.openshift_install.log.

    Do not delete the installation program or the files that the installation program creates. Both are required to delete the cluster.

    Example output

    1. ...
    2. INFO Install complete!
    3. INFO To access the cluster as the system:admin user when using 'oc', run 'export KUBECONFIG=/home/myuser/install_dir/auth/kubeconfig'
    4. INFO Access the OpenShift web-console here: https://console-openshift-console.apps.mycluster.example.com
    5. INFO Login to the console with user: "kubeadmin", and password: "4vYBz-Ee6gm-ymBZj-Wt5AL"
    6. INFO Time elapsed: 36m22s
    • The Ignition config files that the installation program generates contain certificates that expire after 24 hours, which are then renewed at that time. If the cluster is shut down before renewing the certificates and the cluster is later restarted after the 24 hours have elapsed, the cluster automatically recovers the expired certificates. The exception is that you must manually approve the pending certificate signing requests (CSRs) to recover kubelet certificates. See the documentation for Recovering from expired control plane certificates for more information.

    • It is recommended that you use Ignition config files within 12 hours after they are generated because the 24-hour certificate rotates from 16 to 22 hours after the cluster is installed. By using the Ignition config files within 12 hours, you can avoid installation failure if the certificate update runs during installation.

    You have completed the steps required to install the cluster. The remaining steps show you how to verify the cluster and troubleshoot the installation.

    You can install the OpenShift CLI (oc) to interact with OKD from a command-line interface. You can install oc on Linux, Windows, or macOS.

    If you installed an earlier version of oc, you cannot use it to complete all of the commands in OKD 4.13. Download and install the new version of oc.

    You can install the OpenShift CLI (oc) binary on Linux by using the following procedure.

    Procedure

    1. Navigate to https://mirror.openshift.com/pub/openshift-v4/clients/oc/latest/ and choose the folder for your operating system and architecture.

    2. Download oc.tar.gz.

    3. Unpack the archive:

      1. $ tar xvf <file>
    4. Place the oc binary in a directory that is on your PATH.

      To check your PATH, execute the following command:

      1. $ echo $PATH

    After you install the OpenShift CLI, it is available using the oc command:

    1. $ oc <command>

    You can install the OpenShift CLI (oc) binary on Windows by using the following procedure.

    Procedure

    1. Navigate to https://mirror.openshift.com/pub/openshift-v4/clients/oc/latest/ and choose the folder for your operating system and architecture.

    2. Download oc.zip.

    3. Unzip the archive with a ZIP program.

    4. Move the oc binary to a directory that is on your PATH.

      To check your PATH, open the command prompt and execute the following command:

      1. C:\> path

    After you install the OpenShift CLI, it is available using the oc command:

    1. C:\> oc <command>

    You can install the OpenShift CLI (oc) binary on macOS by using the following procedure.

    Procedure

    1. Navigate to https://mirror.openshift.com/pub/openshift-v4/clients/oc/latest/ and choose the folder for your operating system and architecture.

    2. Download oc.tar.gz.

    3. Unpack and unzip the archive.

    4. Move the oc binary to a directory on your PATH.

      To check your PATH, open a terminal and execute the following command:

      1. $ echo $PATH

    After you install the OpenShift CLI, it is available using the oc command:

    1. $ oc <command>

    To learn more, see .

    Logging in to the cluster by using the CLI

    You can log in to your cluster as a default system user by exporting the cluster kubeconfig file. The kubeconfig file contains information about the cluster that is used by the CLI to connect a client to the correct cluster and API server. The file is specific to a cluster and is created during OKD installation.

    Prerequisites

    • You deployed an OKD cluster.

    • You installed the oc CLI.

    Procedure

    1. Export the kubeadmin credentials:

      1For <installation_directory>, specify the path to the directory that you stored the installation files in.
    2. Verify you can run oc commands successfully using the exported configuration:

      1. $ oc whoami

      Example output

      1. system:admin

    Additional resources

    • See for more details about accessing and understanding the OKD web console.

    Verifying cluster status

    You can verify your OKD cluster’s status during or after installation.

    Procedure

    1. In the cluster environment, export the administrator’s kubeconfig file:

      1. $ export KUBECONFIG=<installation_directory>/auth/kubeconfig (1)
      1For <installation_directory>, specify the path to the directory that you stored the installation files in.

      The kubeconfig file contains information about the cluster that is used by the CLI to connect a client to the correct cluster and API server.

    2. View the control plane and compute machines created after a deployment:

      1. $ oc get nodes
    3. View your cluster’s version:

      1. $ oc get clusterversion
    4. View your Operators’ status:

      1. $ oc get clusteroperator
    5. View all running pods in the cluster:

      1. $ oc get pods -A

    Troubleshooting

    If the installation fails, the installation program times out and displays an error message. To learn more, see .

    Accessing the OKD web console on oVirt

    After the OKD cluster initializes, you can log in to the OKD web console.

    Procedure

    1. Optional: In the oVirt Administration Portal, open ComputeCluster.

    2. Verify that the installation program creates the virtual machines.

    3. Return to the command line where the installation program is running. When the installation program finishes, it displays the user name and temporary password for logging into the OKD web console.

    4. In a browser, open the URL of the OKD web console. The URL uses this format:

      1. console-openshift-console.apps.<clustername>.<basedomain> (1)
      1For <clustername>.<basedomain>, specify the cluster name and base domain.

      For example:

      1. console-openshift-console.apps.my-cluster.virtlab.example.com

    Additional resources

    • See for more information about the Telemetry service

    Here are some common issues you might encounter, along with proposed causes and solutions.

    • Symptom: CPU load increases significantly and nodes start going into a Not Ready state.

    • Cause: The storage domain latency might be too high, especially for control plane nodes.

    • Solution:

      Make the nodes ready again by restarting the kubelet service:

      1. $ systemctl restart kubelet

      Inspect the OKD metrics service, which automatically gathers and reports on some valuable data such as the etcd disk sync duration. If the cluster is operational, use this data to help determine whether storage latency or throughput is the root issue. If so, consider using a storage resource that has lower latency and higher throughput.

      To get raw metrics, enter the following command as kubeadmin or user with cluster-admin privileges:

      1. $ oc get --insecure-skip-tls-verify --server=https://localhost:<port> --raw=/metrics

      To learn more, see Exploring Application Endpoints for the purposes of Debugging with OpenShift 4.x

    • Symptom: The installation program completes but the OKD cluster API is not available. The bootstrap virtual machine remains up after the bootstrap process is complete. When you enter the following command, the response will time out.

      1. $ oc login -u kubeadmin -p *** <apiurl>
    • Cause: The bootstrap VM was not deleted by the installation program and has not released the cluster’s API IP address.

    • Solution: Use the wait-for subcommand to be notified when the bootstrap process is complete:

      When the bootstrap process is complete, delete the bootstrap virtual machine:

      1. $ ./openshift-install destroy bootstrap

    Post-installation tasks

    After the OKD cluster initializes, you can perform the following tasks.

    • Optional: Remove the kubeadmin user. Instead, use the authentication provider to create a user with cluster-admin privileges.