An n8n workflow that uses local Ollama LLM models to dynamically route user prompts to the best specialized AI model (text, code, vision) for task-specific responses. It ensures privacy by running entirely on local Ollama API with memory buffers for chat context.