🔐🦙Private & Local Ollama Self-Hosted + Dynamic LLM Router

Go to Workflow
11,663 views
Built by Joseph LePage Joseph LePage
Created on June 05, 2026

Description

Who is this for?
This workflow template is designed for AI enthusiasts, developers, and privacy-conscious users who want to leverage the power of local large language models (LLMs) without sending data to external services. It's particularly valuable for those running Ollama locally who want intelligent routing between different specialized models.

What problem is this workflow solving?
When working with multiple local LLMs, each with different strengths and capabilities, it can be challenging to manually select the right model for each specific task. This workflow automatically analyzes user prompts and routes them to the most appropriate specialized Ollama model, ensuring optimal performance without requiring technical knowledge from the end user.

What this workflow does
This intelligent router:
Analyzes incoming user prompts to determine the nature of the request
Automatically selects the optimal Ollama model from your local collection based on task requirements
Routes requests between specialized models for different tasks:
Text-only models (qwq, llama3.2, phi4) for various reasoning and conversation tasks
Code-specific models (qwen2.5-coder) for programming assistance
Vision-capable models (granite3.2-vision, llama3.2-vision) for image analysis
Maintains conversation memory for consistent interactions
Processes everything locally for complete privacy and data security

Setup
Ensure you have Ollama installed and running locally
Pull the required models mentioned in the workflow using Ollama CLI (e.g., ollama pull phi4)
Configure the Ollama API credentials in n8n (default: http://127.0.0.1:11434)
Activate the workflow and start interacting through the chat interface

How to customize this workflow to your needs
Add or remove models from the router's decision framework based on your specific Ollama collection
Adjust the system prompts in the LLM Router to prioritize different model selection criteria
Modify the decision tree logic to better suit your specific use cases
Add additional preprocessing steps for specialized inputs


This workflow demonstrates how n8n can be used to create sophisticated AI orchestration systems that respect user privacy by keeping everything local while still providing intelligent model selection capabilities.

Nodes Used (3)

AI Agent
@n8n/n8n-nodes-langchain.agent
Ollama Chat Model
@n8n/n8n-nodes-langchain.lmChatOllama
Simple Memory
@n8n/n8n-nodes-langchain.memoryBufferWindow