Free Mermaid diagrams covering LLMs, RAG, vector databases, AI agents, prompt engineering, inference pipelines, and more.
Modern AI applications are built from a web of interacting pipelines, models, and data stores that are difficult to reason about without a clear visual model. This collection of 20 free Mermaid diagrams covers the full lifecycle of production AI systems, from the moment a user query enters the LLM Request Flow to the AI Feedback Loop that continuously improves model quality.
Retrieval-Augmented Generation is represented end-to-end: the RAG Architecture diagram connects document ingestion, Embedding Generation Flow, and Vector Database Query into a single coherent picture. For prompt engineering, see Prompt Processing Pipeline and Prompt Cache System, which show how modern inference stacks reduce latency and cost.
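As a rough illustration of how those three pieces fit together, here is a minimal Mermaid sketch. It is not the RAG Architecture diagram from the collection; the node names and the "top-k chunks" label are assumptions chosen for the example.

```mermaid
flowchart LR
    %% Ingestion path: documents are embedded and stored as vectors
    Docs[Documents] --> Ingest[Document ingestion]
    Ingest --> Embed[Embedding generation]
    Embed --> VDB[(Vector database)]

    %% Query path: the query is embedded and matched against stored vectors
    Query[User query] --> QEmbed[Embed query]
    QEmbed --> VDB
    VDB -->|top-k chunks| Prompt[Prompt with retrieved context]
    Query --> Prompt
    Prompt --> LLM[LLM]
    LLM --> Answer[Answer]
```

The left-to-right layout keeps the ingestion and query paths visually separate, with the vector database as the one node they share.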
Agentic systems have dedicated diagrams: AI Agent Workflow maps the plan-act-observe loop, while AI Tool Calling Flow shows how models invoke external functions at runtime. The collection also covers the full MLOps lifecycle: Model Training Pipeline, Feature Engineering Pipeline, Inference Pipeline, and Model Version Deployment. Application-layer patterns round out the set: AI Search System, AI Ranking Pipeline, AI Recommendation System, AI Moderation Pipeline, AI Content Generation Pipeline, and AI Chat Application Architecture. Every diagram opens directly in Graphlet for live editing and export.
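For a sense of what the plan-act-observe loop looks like in Mermaid, the sketch below is one possible rendering. It is a simplified assumption rather than the AI Agent Workflow diagram itself; the "Goal met?" decision node and the other node names are hypothetical.

```mermaid
flowchart TD
    Goal[User goal] --> Plan[Plan next step]
    Plan --> Act["Act: call a tool"]
    Act --> Tool[External function]
    Tool --> Observe[Observe result]
    %% The loop repeats until the agent decides the goal is satisfied
    Observe --> Done{Goal met?}
    Done -->|no| Plan
    Done -->|yes| Reply[Return answer]
```

The edge from the decision node back to the planning step is what makes this a loop rather than a one-way pipeline.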



















