Picar-X+Ollama: Vision Talk Issue

Hey everyone,

I’m a beginner, so apologies if this is a silly question. So far I’ve put together the Picar-X, completed calibration, and tested the camera and TTS/STT. All fine on that front.

I’m currently on the Vision Talk with Ollama section of the documentation. I’ve got Ollama running a 4-bit quantized version of Qwen2.5VL:3B. The model itself seems to work fine, but when I run 17.text_vision_talk.py, the images don’t seem to reach the model; instead it just makes up an answer based on the text of the question alone. Any ideas how I can get it to use the image as input?
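For what it’s worth, my understanding is that Ollama’s /api/chat endpoint only “sees” an image if it’s base64-encoded and included in the message’s `images` list; if that field is missing, the model answers from the text prompt alone, which matches what I’m seeing. Here’s a minimal sketch of the payload I think the script needs to build (the function name, model tag, and file path are just placeholders of mine, not from Sunfounder’s code):

```python
import base64

def build_vision_payload(model, prompt, image_path):
    """Build a request body for Ollama's /api/chat endpoint.

    Vision models only receive the image if it is base64-encoded
    and placed in the user message's 'images' list.
    """
    with open(image_path, "rb") as f:
        image_b64 = base64.b64encode(f.read()).decode("ascii")
    return {
        "model": model,
        "messages": [
            {
                "role": "user",
                "content": prompt,
                # Without this field, the model falls back to
                # answering from the text prompt alone.
                "images": [image_b64],
            }
        ],
        "stream": False,
    }

# Example usage (placeholders):
# payload = build_vision_payload("qwen2.5vl:3b", "What do you see?", "frame.jpg")
# POST this as JSON to http://localhost:11434/api/chat
```

So my first guess is that the edited script is dropping or never populating that `images` field, but I’m not sure where.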

Here’s the code (just Sunfounder’s, edited to change the model). I’m pasting a screenshot from a Google Doc, as the forum wouldn’t let me post the code directly from my phone.