Ollama cannot use GPU in Docker on QNAP (RTX 3090, CUDA init fails)

Hello,

Thank you. By “Docker: Container Station + CLI” I mean that I tested both:

  • containers created from QNAP Container Station

  • containers started manually via Docker CLI

Also, to clarify:

nvidia-smi does not exist as a native command in the QNAP shell,
but it works correctly inside Docker containers started with GPU access.

I will now test your suggested minimal Container Station setup and check:

  • nvidia-smi inside the container

  • whether Ollama still detects only CPU during startup
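
For reference, here is a rough sketch of how I could script these two checks. It assumes the container is named `ollama` (a placeholder, not necessarily my actual container name) and that the `docker` binary is on the PATH of the QNAP shell, which may not hold on every setup.

```python
import subprocess


def gpu_check_commands(container: str) -> list[list[str]]:
    """Build the two checks: GPU visibility and Ollama's startup log."""
    return [
        # Is the GPU visible inside the container?
        ["docker", "exec", container, "nvidia-smi"],
        # What did Ollama report during startup?
        ["docker", "logs", container],
    ]


def run_checks(container: str) -> list[str]:
    """Run each command and return its combined stdout and stderr."""
    outputs = []
    for cmd in gpu_check_commands(container):
        result = subprocess.run(cmd, capture_output=True, text=True)
        # Container logs can arrive on stderr, so keep both streams.
        outputs.append(result.stdout + result.stderr)
    return outputs


# Example (needs Docker access on the NAS; "ollama" is a placeholder name):
# for text in run_checks("ollama"):
#     print(text)
```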

My original issue is that even when GPU is visible inside the container, Ollama often reports:

inference compute id=cpu library=cpu
total_vram="0 B"

and sometimes:

ggml_cuda_init: failed to initialize CUDA: initialization error
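
To make the reports comparable between runs, the messages above can be searched for automatically. This is only a sketch: the patterns are copied from the lines I quoted, the function name `classify_ollama_log` is my own, and other Ollama versions may word these messages differently.

```python
import re

# Patterns taken from the log lines quoted above.
CPU_ONLY = re.compile(r"inference compute.*\blibrary=cpu\b")
ZERO_VRAM = re.compile(r'total_vram="0 B"')
CUDA_INIT_FAILED = re.compile(r"ggml_cuda_init: failed to initialize CUDA")


def classify_ollama_log(text: str) -> dict[str, bool]:
    """Report which of the known failure signatures appear in the log text."""
    return {
        "cpu_only": bool(CPU_ONLY.search(text)),
        "zero_vram": bool(ZERO_VRAM.search(text)),
        "cuda_init_failed": bool(CUDA_INIT_FAILED.search(text)),
    }


# Example with the lines from this post:
sample = 'inference compute id=cpu library=cpu\ntotal_vram="0 B"\n'
print(classify_ollama_log(sample))
```

If all three flags are false, Ollama at least did not print any of the failure messages listed here; that alone does not prove the GPU is actually being used.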

I will report the results from your YAML test.