Skip to main content

Troubleshooting Guide

About 1 min

Troubleshooting Guide

This guide is created to aid users in resolving issues with their GPU-equipped Nosana Node configuration on both Linux and Windows operating systems.

Error Messages

Nvidia

nvidia-smi: command not found

Solution

It means that you do not have NVIDIA drivers installed. To install them, download and install the correct drivers from the NVIDIA website: https://www.nvidia.com/download/index.aspxopen in new window

Error: setting up CDI devices: unresolvable CDI devices nvidia.com/gpu=all

Solution

It means that you did not install and configure the Nvidia Container Toolkit correctly:

Docker

The command 'docker' could not be found in this WSL 2 distro.

Solution

Ensure that you have Docker Desktopopen in new window installed and that it is running. If you still have this error message, check if Docker Desktop is using the WSL2 Backend (not Hyper-V). Follow this guide to turn on the WSL2 backend for Docker Desktop: https://docs.docker.com/desktop/wsl/open in new window.

Also check with Command Prompt or Powershell to make sure you have WSL version 2 installed:

wsl -l -v

Also make sure your Ubuntu 22.04 distro is the default WSL distribution. The Docker-WSL integration is enabled on the default WSL distribution. To change your default WSL distro, run:

wsl --set-default <distro name>

Podman

Error: Could not connect to Podman

Solution

When you see this error go in Docker Desktop to -> Settings -> Docker engine, please add this line "bip":"192.168.200.1/24", somewhere after the first bracket, like this:

{
  "bip":"192.168.200.1/24",
}

Then click appy and restart Docker.

Error: container create failed (no logs from conmon): conmon bytes "": readObjectStart: expect { or n, but found , error found in #0 byte of ...||..., bigger context ...||...

Solution

This error is caused by the latest version of conmon having known issues, downgrade conmon to resolve this, like this:

wget https://download.opensuse.org/repositories/devel:/kubic:/libcontainers:/stable/xUbuntu_22.04/amd64/conmon_2.1.2~0_amd64.deb -O /tmp/conmon_2.1.2.deb
sudo apt install /tmp/conmon_2.1.2.deb

Then you can rerun the podmon command

podman run --rm --device nvidia.com/gpu=all --security-opt=label=disable ubuntu nvidia-smi -L
Last update: