Ollama - Serverman | Tech Reviews | How-To Guides

How to Use Ollama with VS Code: Continue and Cline Extensions

Ollama brings powerful AI models to your local machine, and VS Code is where most developers spend their working day. Connecting the two gives you free, private AI coding assistance that runs entirely...

Stuart Stafford

Ollama REST API: Complete Developer Guide (2026)

Ollama exposes a straightforward HTTP REST API that lets you generate text, hold multi-turn conversations, create embeddings, and manage models programmatically. Whether you’re building an appli...

Stuart Stafford

Ollama

Ollama MLX: How to Enable Faster Inference on Apple Silicon

Ollama 0.19, released in March 2026, introduced an MLX backend for Apple Silicon Macs. MLX is Apple’s machine learning framework optimised specifically for the M-series chip architecture. Enabli...

Ollama + MCP: Building Local AI Agents Without the Cloud

Running advanced AI agents doesn’t require renting expensive cloud infrastructure or sending your data to third-party providers. With Ollama and the Model Context Protocol (MCP), you can build s...

How to Use Ollama with n8n: Private AI Automation Workflows

n8n is an open-source workflow automation tool — think Zapier but self-hosted. Combined with Ollama, you can build private AI automation workflows that run entirely on your own hardware, with no data ...

Best Ollama Models for 8GB RAM and Low VRAM Hardware

Running Ollama on hardware with 8GB of RAM or VRAM is entirely possible — you just need to pick the right models. The key is choosing quantised versions of smaller models that fit within your memory b...

Ollama OpenAI API Compatibility: Drop-In Replacement Guide

Ollama includes a built-in OpenAI-compatible API endpoint. This means you can take existing code written for the OpenAI API — Python scripts, applications, integrations — and point them at your local ...

Qwen3-Coder vs Llama 4 Scout: Best Local Coding Model

Two models are competing for the title of best local coding AI in 2026: Qwen3-Coder and Llama 4 Scout. Both are available on Ollama, both run on consumer hardware, and both outperform models from a ye...

How to Run Gemma 4 on Ollama (All Sizes Explained)

Gemma 4 is Google’s latest open-weight model family, released in April 2026. It comes in four sizes — E2B, E4B, E12B, and E27B — all natively multimodal, meaning they handle text and images with...
