A comprehensive guide to Ollama: from common commands and supported models to the third-party user interfaces (UIs) that enhance the experience, and how to turn it into a local alternative to GitHub Copilot.
Large language models (LLMs) have revolutionized the way we interact with technology. But did you know that you can run these models on your own computer without powerful cloud servers? That's where Ollama comes in! It makes running, creating, and managing various open-source large models on local machines easier than ever.
This article will give you a comprehensive understanding of Ollama: how to install it, the most common commands, the models it supports, how to estimate a model's memory footprint, how to enhance the experience with third-party user interfaces (UIs), and even how to turn it into a local alternative to GitHub Copilot.
Why Choose Ollama?
Local Execution, Data Privacy: All data is processed on your machine without being uploaded to third-party servers, which is crucial for privacy-conscious individual users and businesses.
Offline Availability: Once the model is downloaded locally, it can work normally even without a network connection.
Easy to Use: Ollama provides a concise command-line tool and API, making model downloading, running, and customization incredibly simple.
Hardware-Friendly: By quantizing models (e.g., in GGUF format), Ollama can fully utilize CPUs or even integrated graphics cards, allowing more people to experience LLMs on ordinary laptops.
Rich Model Library: The Ollama community maintains a growing model library, including popular models such as Llama 2, Mistral, CodeLlama, and Gemma.
Ollama Installation Methods
Installing Ollama is very simple and supports macOS, Linux, and Windows.
macOS and Windows
Visit the Ollama official website (ollama.com/download), download the installer for your platform, and run it, following the prompts to complete the installation (on Windows this is an .exe installer).
Linux
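On Linux, the quickest route is the official one-line install script published on ollama.com (review the script first if you prefer not to pipe it straight into a shell):

```bash
curl -fsSL https://ollama.com/install.sh | sh
```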
Verify Installation
After the installation is complete, open the terminal (or command prompt) and run the following command:
```bash
ollama --version
```
If you see the version information, it means Ollama has been successfully installed.
Running Models
After installing Ollama, running a model takes a single command. Take deepseek-r1:8b as an example:
```bash
ollama run deepseek-r1:8b
```
Ollama will automatically detect whether you have downloaded the model. If not, it will download it for you first (which may take some time depending on the model size and your network speed), and then start the model.
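Besides the interactive CLI, Ollama also exposes a local REST API (on port 11434 by default), which is what the third-party UIs and editor plugins discussed later talk to. A quick sanity check once a model is available, assuming the server is running with its default settings:

```bash
# Ask the locally running model a question via Ollama's REST API.
# "stream": false returns one JSON response instead of a token stream.
curl http://localhost:11434/api/generate -d '{
  "model": "deepseek-r1:8b",
  "prompt": "Explain what a context window is in one sentence.",
  "stream": false
}'
```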
Commonly Used Ollama Commands
Ollama is mainly operated through the command-line interface (CLI). Here are the most commonly used commands; a short workflow that combines several of them is sketched after the list:
ollama run <model_name>[:tag]: This is the most direct way to start interacting with a model. If the model has not been downloaded, this command will automatically download and run the specified model. For example, to run the deepseek-r1 model, just enter ollama run deepseek-r1:8b.
ollama pull <model_name>[:tag]: If you only want to download the model for later use, you can use this command. For example, ollama pull deepseek-r1:8b will download the deepseek-r1 model.
ollama list: Lists all the models you have downloaded locally, along with their size and when they were last modified.
ollama ps: Shows the models currently running (loaded into memory).
ollama rm <model_name>[:tag]: Deletes the specified model locally. For example, ollama rm deepseek-r1:8b.
ollama cp <source_model> <destination_model>: Copies an existing local model to a new model for easy modification and experimentation.
ollama create <custom_model_name> -f <Modelfile_path>: Creates a custom model based on the specified Modelfile. The Modelfile allows you to define the model's parameters, system prompts, etc. For example, ollama create mymodel -f ./Modelfile.
ollama show <model_name>[:tag]: Shows detailed information about the specified model, such as its parameters, template, and license; add the --modelfile flag to print the full Modelfile.
ollama help [command]: Displays general help information, or the help for a specific command.
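Putting several of these together, a minimal workflow might look like the following (the model names and Modelfile contents here are only illustrative):

```bash
# Download a model without running it, then check what is installed
ollama pull deepseek-r1:8b
ollama list

# Create a customized variant from a simple Modelfile
cat > Modelfile <<'EOF'
FROM deepseek-r1:8b
PARAMETER temperature 0.3
SYSTEM "You are a concise technical assistant."
EOF
ollama create my-assistant -f ./Modelfile

# Chat with it interactively (type /bye to exit), then see what is
# still loaded in memory and remove the custom model when done
ollama run my-assistant
ollama ps
ollama rm my-assistant
```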
Models Supported by Ollama: Your Local Model Library
One of Ollama's biggest appeals is its broad support for many open-source LLMs. You can find a constantly updated, complete list in Ollama's official model library (ollama.com/library). Here are some popular and commonly used model categories and representatives:
General Chat and Text Generation Models:
Llama Series (Llama 2, Llama 3, Llama 3.1, Llama 3.2): Developed by Meta, powerful in performance, and one of the most popular model series.
Mistral: High-performance model launched by Mistral AI, known for its efficiency and powerful capabilities.
Gemma / Gemma 2 / Gemma 3: Lightweight, high-performance model series developed by Google.
Phi Series (Phi-3, Phi-4): High-performance small language models launched by Microsoft, with strong reasoning capabilities.
Qwen Series (Qwen, Qwen2, Qwen2.5, Qwen3): Multifunctional model series developed by Alibaba, supporting multiple languages and tool usage.
Vicuna: A chat model fine-tuned from Llama, with solid performance.
Orca-Mini / Mistral-OpenOrca: Models focused on instruction following and reasoning.
Code Generation and Assistance Models:
CodeLlama: A model specially trained by Meta for code generation and explanation.
DeepSeek-Coder: A model launched by DeepSeek AI focused on coding.
WizardCoder: A model focused on Python code generation.
Qwen2.5-Coder: A model in the Qwen series focused on code.
Codestral: An excellent code model from Mistral AI.
Vision and Multimodal Models:
LLaVA: A representative multimodal model combining a visual encoder and a language model.
Llama 3.2 Vision / Llama 4: Multimodal models launched by Meta with visual understanding capabilities.
Gemma 3 Vision: Model in the Google Gemma series that supports visual input.
Qwen 2.5 VL: Visual language model in the Qwen series.
Embedding Models:
Nomic Embed Text: High-performance model for generating text embeddings.
MXBAI Embed Large: Excellent embedding model launched by Mixedbread.ai.
Choosing the right model depends on your specific needs (e.g., general chat, code generation, or specific domain question answering) and your hardware configuration (especially RAM and VRAM size). Ollama typically provides model versions with different parameter sizes, which you can choose according to your device's capabilities.
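For example, most entries in the library expose several parameter sizes, and many also publish explicit quantization tags (the exact tag names vary from model to model, so check the model's page on ollama.com/library):

```bash
# Same model family, different parameter counts
ollama pull llama3.1:8b
ollama pull llama3.1:70b

# Many models also offer explicit quantization tags, e.g. a 4-bit build
ollama pull llama3.1:8b-instruct-q4_K_M
```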
Model Accuracy and Memory Footprint Calculation
When running LLMs locally, one of the most important considerations is memory footprint, especially video memory (VRAM). The size of the model and its quantization precision directly determine the memory required.
What is Quantization?
Quantization converts the floating-point weights of a model (usually 16-bit or 32-bit) into smaller integer formats (such as 8-bit, 4-bit, or even 2-bit), significantly reducing the model size and lowering memory and compute requirements. This does have some impact on the model's accuracy, but for local deployment it is usually an acceptable trade-off.
Ollama widely uses the GGUF format (the successor to GGML, from the llama.cpp project), a format designed for local LLM inference that supports multiple quantization levels.
How to Calculate Memory Footprint
The memory footprint (RAM or VRAM) when running a model is more complex than the model file size, as it includes not only the model weights but also:
Model Weights: This is the main part and is closely related to the quantized file size.
KV Cache (Key-Value Cache): When generating text, the model needs to store key-value pairs of previous tokens for calculation in the self-attention mechanism. The size of the KV cache depends on:
Context Length (num_ctx): The longer the context, the more tokens need to be cached (see the example after this list for how to adjust it).
Batch Size: The more sequences processed simultaneously, the larger the KV cache.
Model hidden layer dimension and number of layers.
The KV cache itself can also be quantized (e.g., q8_0 or q4_0), but this usually requires enabling features like Flash Attention.
Activations and Temporary Buffers: Intermediate values generated during calculation.
Ollama and Backend Overhead: Ollama itself and the inference engine it uses (such as llama.cpp) also consume a certain amount of memory.
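Since context length is the factor you control most directly, here is how it can be adjusted. The interactive /set command, the Modelfile PARAMETER directive, and the API options field are all standard Ollama features; the flash-attention and KV-cache-type environment variables at the end are newer and may change between releases, so treat them as assumptions to verify against the current documentation:

```bash
# Inside an interactive `ollama run` session, raise the context window:
#   /set parameter num_ctx 8192

# Or bake it into a custom model via a Modelfile:
cat > Modelfile <<'EOF'
FROM deepseek-r1:8b
PARAMETER num_ctx 8192
EOF
ollama create deepseek-r1-8k -f ./Modelfile

# Or pass it per request through the REST API:
curl http://localhost:11434/api/generate -d '{
  "model": "deepseek-r1:8b",
  "prompt": "Hello",
  "options": { "num_ctx": 8192 }
}'

# Assumption: recent releases read these variables to enable flash attention
# and a quantized KV cache; check the docs for your version before relying on them.
# export OLLAMA_FLASH_ATTENTION=1
# export OLLAMA_KV_CACHE_TYPE=q8_0
```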
A rough VRAM/RAM estimation rule (for the model weight part):
f16: parameters (in billions) × 2 ≈ GB
q8_0: parameters (in billions) × 1 ≈ GB
q4_K_M (or similar 4-bit): parameters (in billions) × 0.5, plus a small amount of extra overhead ≈ GB
The total memory footprint will be noticeably higher than the model weights alone, especially when the context window is large or there are many concurrent users. For example, a 13B model at f16 needs about 26GB just for the weights, and the KV cache (e.g., 10 concurrent users, each with a 2,000-token context) can add tens of gigabytes on top of that. For local Ollama use there is usually a single user, but the context length is still an important factor.
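As a quick sanity check, the rules above can be turned into a tiny shell helper. This is only a rough sketch: it estimates the weight portion and ignores the KV cache and runtime overhead entirely:

```bash
# estimate_weights <params_in_billions> <bytes_per_param>
#   f16 ≈ 2 bytes/param, q8_0 ≈ 1, 4-bit quants (q4_K_M etc.) ≈ 0.5
estimate_weights() {
  awk -v p="$1" -v b="$2" 'BEGIN { printf "~%.1f GB for weights alone\n", p * b }'
}

estimate_weights 8 0.5    # 8B model at 4-bit  -> ~4.0 GB
estimate_weights 13 2     # 13B model at f16   -> ~26.0 GB
```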
Approximate Table of Parameter and Memory Footprint Relationships
The following table provides a rough reference for the approximate memory footprint (mainly the model weights loaded into memory) by parameter count and common quantization method, derived from the estimation rules above, together with the commonly recommended system memory. Please note that these values are approximate; actual requirements vary with the specific model, context length setting (num_ctx), concurrent requests, and the KV cache.

| Parameters | f16 weights | q8_0 weights | 4-bit weights (q4_K_M) | Recommended system RAM (default ~4-bit builds) |
|---|---|---|---|---|
| 7B | ~14 GB | ~7 GB | ~4 GB | 8 GB+ |
| 13B | ~26 GB | ~13 GB | ~7–8 GB | 16 GB+ |
| 33B | ~66 GB | ~33 GB | ~18–20 GB | 32 GB+ |
| 70B | ~140 GB | ~70 GB | ~38–42 GB | 64 GB+ |
Third-Party UI Support: Making Ollama Easier to Use and More Beautiful
While Ollama's CLI is powerful, a graphical user interface (GUI) may be more user-friendly and intuitive for some users. Here are some popular Ollama Web UIs:
Cherry Studio: AI model aggregation client, supports multiple official APIs and locally deployed Ollama models, with a beautiful interface and rich functions.
Open WebUI: (Formerly Ollama WebUI) A feature-rich, user-friendly self-hosted Web UI that supports offline operation, with an interface style similar to ChatGPT, providing various advanced functions including RAG and model management.
Lobe Chat: A UI framework that focuses on local execution and user privacy, with a stylish and modern interface design, supports extending functions through a plugin system, and can be used as a PWA application.
For comprehensive functionality and a ChatGPT-like online experience: Open WebUI is the first choice, with its powerful features, active community, and continuous updates.
For interface aesthetics and multi-API aggregation management: Cherry Studio and Lobe Chat are both good choices, with the former having advantages in aggregating various commercial and local models, and the latter characterized by its stylish UI and plugin extensibility.
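If you want to try Open WebUI, the usual route is to run it with Docker and point it at your local Ollama instance. The command below follows the project's README at the time of writing; the image name and flags may change, so check their documentation for the current version:

```bash
docker run -d \
  -p 3000:8080 \
  --add-host=host.docker.internal:host-gateway \
  -v open-webui:/app/backend/data \
  --name open-webui \
  --restart always \
  ghcr.io/open-webui/open-webui:main
# Then open http://localhost:3000; it should connect to Ollama on the host's default port 11434.
```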
GitHub Copilot Alternative
Using Continue to Achieve Local Code Assistance
GitHub Copilot is undoubtedly a powerful tool for code assistance, but if you are pursuing data privacy or want complete control over the models you use, Ollama combined with Continue is an excellent local alternative.
Localization Advantages:
Privacy: Your code and prompts are kept locally without being sent to the cloud, which is crucial for handling sensitive projects or companies with strict data confidentiality requirements.
Offline Work: Even without a network connection, you can still use the local code assistant.
Cost: Completely free, only requiring your local hardware resources.
Customization: You can choose and switch between different code models according to your needs, and even fine-tune the model's behavior using Modelfile.
The performance of local models may still lag behind top cloud-based code assistants (such as the latest GPT-4-driven Copilot) in some respects, especially for very complex tasks or situations requiring a very large amount of context. However, for many everyday coding tasks, rapid prototyping, and learning new languages or frameworks, local code assistants based on Ollama are already powerful and practical enough. And as open-source models and Ollama itself continue to advance, the capabilities of local code assistance keep improving.
What is Continue?
Continue is an open-source VS Code plugin that lets you connect to various local or remote LLM services, providing code auto-completion, code generation, refactoring, question answering, and other functions similar to GitHub Copilot and ChatGPT, but able to run completely locally.
Configuring Ollama with Continue
Here are the steps to set up Ollama and Continue as a local Copilot alternative:
Choose a suitable code model:
First, select a model that excels at code generation from those supported by Ollama; for example, codellama, deepseek-coder, qwen2.5-coder, codestral, or phi-3 are all good choices. You can download one via ollama pull <model_name>.
Pull code generation models:
The Ollama library has some models specifically optimized for code tasks, such as:
CodeLlama (e.g., codellama:7b-instruct)
DeepSeek Coder (e.g., deepseek-coder:7b-instruct)
Qwen2.5 Coder (e.g., qwen2.5-coder:7b)
WizardCoder (e.g., wizardcoder:7b-python)
Pull through the following commands:
```bash
ollama pull codellama:7b-instruct
ollama pull deepseek-coder:7b-instruct
# Choose other models according to your needs
```
Install the Continue VS Code Plugin:
Search for "Continue" in the VS Code extension marketplace, and then click Install.
Configure Continue to connect to Ollama:
After installing the Continue plugin, it will automatically open a config.json file (or click the Continue icon on the left sidebar, and then click the gear icon to enter settings). You need to modify the models section to connect to Ollama.
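Below is a rough sketch of what the models section might look like; the "provider": "ollama" entries point Continue at the local Ollama server. Continue's configuration schema changes between versions (newer releases use config.yaml instead of config.json), so treat the exact field names here as assumptions and follow the plugin's own documentation:

```json
{
  "models": [
    {
      "title": "CodeLlama 7B (local)",
      "provider": "ollama",
      "model": "codellama:7b-instruct"
    },
    {
      "title": "DeepSeek Coder 7B (local)",
      "provider": "ollama",
      "model": "deepseek-coder:7b-instruct"
    }
  ]
}
```

After saving the configuration, open the Continue panel in VS Code, pick one of the models, and you can chat, generate, and refactor code entirely against your local Ollama instance.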