Troubleshooting installation issues

    • You attempted to install an OKD cluster and the installation failed.

    If you gave an SSH key to your installation program, you can gather data about your failed installation.

    Prerequisites

    • Your OKD installation failed before the bootstrap process finished. The bootstrap node is running and accessible through SSH.

    • The ssh-agent process is active on your computer, and you provided the same SSH key to both the ssh-agent process and the installation program.

    • If you tried to install a cluster on infrastructure that you provisioned, you must have the fully qualified domain names of the bootstrap and control plane nodes (also known as the master nodes).

    Procedure

    1. Generate the commands that are required to obtain the installation logs from the bootstrap and control plane machines:

      • If you used installer-provisioned infrastructure, change to the directory that contains the installation program and run the following command:

        1installation_directory is the directory you specified when you ran ./openshift-install create cluster. This directory contains the OKD definition files that the installation program creates.

        For installer-provisioned infrastructure, the installation program stores information about the cluster, so you do not specify the hostnames or IP addresses.

        1. --bootstrap <bootstrap_address> \ (2)
        2. --master <master_1_address> \ (3)
        3. --master <master_2_address> \ (3)
        4. --master <master_3_address>" (3)

        A default cluster contains three control plane machines. List all of your control plane machines as shown, no matter how many your cluster uses.

      Example output

      1. INFO Pulling debug logs from the bootstrap machine
      2. INFO Bootstrap gather logs captured here "<installation_directory>/log-bundle-<timestamp>.tar.gz"

      If you open a Red Hat support case about your installation failure, include the compressed logs in the case.

    Manually gather logs in situations where must-gather or automated collection methods do not work.

    Prerequisites

    • You must have SSH access to your host(s).

    Procedure

    1. Collect the bootkube.service service logs from the bootstrap host using the journalctl command by running:

      1. $ journalctl -b -f -u bootkube.service
    2. Collect the bootstrap host’s container logs using the podman logs. This is shown as a loop to get all of the container logs from the host:

    3. Alternatively, collect the host’s container logs using the tail command by running:

      1. # tail -f /var/lib/containers/storage/overlay-containers/*/userdata/ctr.log
      1. $ sudo tail -f /var/log/containers/*

    Manually gather logs in situations where must-gather or automated collection methods do not work.

    If you do not have SSH access to your node, you can access the systems journal to investigate what is happening on your host.

    Prerequisites

    • Your OKD installation must be complete.

    • Your API service is still functional.

    • You have system administrator privileges.

    Procedure

    1. Access journald unit logs under /var/log by running:

    2. Access host file paths under /var/log by running:

      1. $ oc adm node-logs --role=master --path=openshift-apiserver

    You can use any of the following actions to get debug information from the installation program.

      1. $ cat ~/<installation_directory>/.openshift_install.log (1)
    • Change to the directory that contains the installation program and re-run it with --log-level=debug:

        1For , specify the same directory you specified when you ran ./openshift-install create cluster.