AgentAlapaca

Agents, Models Tools, Generative Systems

AI Landscape: The Four Essential Layers

LLMs, AI Engines, Frameworks, and Agents.

These four categories build on each other. Understanding the difference unlocks the whole ecosystem.


📌 The big picture (bottom to top)

LLM → AI Engine → Framework → Agent

Think of it like computing:

  • LLM = raw processor / brain (text in, text out).
  • AI Engine = packaged product with API / UI / added features.
  • Framework = toolkit to build custom agents (orchestration, tools, memory).
  • Agent = autonomous system that acts, plans, and uses tools.

You can have an LLM without an agent, but you cannot have a modern agent without an LLM or engine underneath.


🧠 1. LLMs (Large Language Models) — the raw “brain”

What they are: a neural network trained on massive text data. It predicts next tokens. No memory by default, no tool use, no autonomy. Just completion / chat completion.

Key characteristics:

  • Stateless (each call is independent unless you feed history).
  • No built-in actions (cannot call APIs, browse web, run code).
  • Pure language understanding & generation.

Examples (well-known LLMs):

  • GPT-4 / GPT-4o (OpenAI) – base model family.
  • Claude 3.5 Sonnet / Opus (Anthropic).
  • Gemini 1.5 Pro / 2.0 (Google).
  • DeepSeek-V3 / DeepSeek-R1 (DeepSeek).
  • Mistral Large / Mixtral (Mistral AI).
  • Grok-2 / Grok-3 (xAI).
  • Llama 3 / 4 (Meta – open weights).

Where they fit: LLMs are the foundation. When someone says “ChatGPT” or “Copilot”, they are using an LLM wrapped inside a product. But the LLM alone cannot take action.


⚙️ 2. AI Engines (public / commercial products) — the “ready-to-use brain + interface”

What they are: production-ready services built on top of one or more LLMs. They add UI, system prompts, safety layers, often tool use, memory, and multimodal capabilities.

Key traits:

  • Directly accessible via web, mobile app, or API.
  • Often include pre-built tools (browsing, code interpreter, file upload).
  • May include limited agentic behavior (e.g. Perplexity doing web research autonomously).
  • Distributed as a service by a company (OpenAI, Anthropic, Google, Microsoft, etc.).

Examples (well-known AI engines / products):

  • ChatGPT (OpenAI) – GPT-4o + tools + memory + voice.
  • Claude (Anthropic) – web & mobile client with artifacts and analysis tool.
  • Gemini (Google) – deep integration with Workspace, YouTube, Search.
  • DeepSeek Chat – free chat interface with file upload and web search.
  • Microsoft Copilot – integrated into Windows, Office 365, GitHub.
  • Perplexity Pro – answer engine that actively searches the web and cites sources.
  • Grok (xAI) – real-time access to X platform, “rebellious” tone.
  • Mistral Chat (Le Chat) – interface for Mistral models with code interpreter.

Relationship to LLMs: Every AI engine contains at least one LLM, but an engine adds product features, usage limits, safety filters, and often agent-like loops (e.g. ChatGPT’s “analyze” or “browse”).


🏗️ 3. Frameworks (agentic infrastructure) — the “toolkit to build custom agents”

What they are: software libraries, SDKs, or orchestration layers that let developers build custom AI agents. They provide memory, tool calling, state management, multi-agent collaboration, and execution loops.

Core capabilities provided by frameworks:

  • Agent loop (ReAct, Plan-and-Execute, etc.).
  • Tool definition & schema (function calling, API integration).
  • Memory (short-term, long-term, vector databases).
  • Multi-agent orchestration (handoffs, supervisor, swarm).
  • State management & checkpointing (pause/resume long tasks).

Examples (popular agent stacks / frameworks):

  • LangGraph (LangChain) – graph-based agent orchestration, cyclic workflows, persistent checkpoints.
  • CrewAI – role-based multi-agent teams (researcher → writer → critic).
  • Microsoft AutoGen – multi-agent conversation framework, agent-to-agent dialogue.
  • OpenAI Agents SDK – lightweight agent building with handoffs (evolved from Swarm).
  • Claude Agent SDK (Anthropic) – subprocess-based agent loop with context compaction.
  • Vercel AI SDK – TypeScript tools for building agentic web apps.
  • LlamaIndex (Workflows / Agents) – data-aware agent frameworks.

What frameworks are NOT: They are not end-user products. You, as a developer, use a framework to create your own agent (e.g. a personal travel assistant, research agent, or DevOps bot). Frameworks wrap around any LLM or AI engine (OpenAI, Anthropic, Gemini, open-source models).


🤖 4. Agents (autonomous systems) — the “doers that plan, act, and remember”

What they are: software entities that use an LLM + tools + memory to achieve goals autonomously. They run loops: think → act → observe → repeat. They can operate without step-by-step human prompting.

Key characteristics of true agents (L3-L4 on autonomy ladder):

  • Autonomous execution – perform multi-step tasks with minimal intervention.
  • Tool use – run code, browse web, call APIs, control files, send messages.
  • Persistent memory – remember across sessions, learn from user feedback.
  • Planning & reflection – break down high-level goals, evaluate outcomes.
  • Self-waking (L4) – can be scheduled or triggered to act on their own (e.g. monitor email at 3 AM).

Examples of well-known agents / autonomous products:

  • OpenClaw / PythonClaw / NanoClaw – autonomous agents that control your computer via chat apps (WhatsApp, Telegram, etc.). Persistent memory, skill library, can run terminal commands.
  • Hermes Desktop (Nous Research) – GUI agent with reflective learning; writes skills automatically; uses “Soul + Persona” system.
  • Devin (Cognition) – software engineering agent that writes code, debugs, deploys.
  • Waymo Driver – autonomous vehicle stack (real‑world agent with perception, planning, control).
  • Taskade Genesis – long‑horizon autonomous team of agents.
  • Custom agents built with LangGraph, CrewAI or AutoGen – if deployed as production systems.

Important distinction: Many AI engines (ChatGPT with “Tasks” or scheduled actions) are becoming agentic. But a pure agent often runs without continuous manual prompting and is designed for a specific domain (coding, research, desktop automation).


🔁 How they relate: concrete example

Example scenario: “Build a competitor analysis agent that scrapes websites, summarizes findings, and emails me a report every morning.”

  1. LLM – GPT-4o / Claude / DeepSeek provides the language reasoning.
  2. AI Engine – you could use the OpenAI API (engine) to call the LLM with structured prompts.
  3. Framework – you use LangGraph or CrewAI to orchestrate steps: scrape → summarize → validate → write email.
  4. Agent – the final deployed system that runs autonomously each morning, calls tools (web scraper, email API), and adjusts its plan if a website changes.

Without a framework, you would hardcode each step. Without an engine or LLM, there is no “intelligence”. The agent ties everything together.


📊 Quick reference: LLM vs Engine vs Framework vs Agent

  • LLM: raw statistical model. Predicts text. No memory, no tools. (e.g. GPT-4 base weights).
  • AI Engine: productized LLM service with UI/API, often includes basic tools & safety. (e.g. ChatGPT, Gemini, Claude.ai).
  • Framework: developer library to orchestrate agents, manage state, tools, and multi-agent logic. (e.g. LangGraph, CrewAI, AutoGen).
  • Agent: autonomous system that uses LLM + framework logic + tools + memory to complete tasks end-to-end. (e.g. OpenClaw, Devin, custom HR assistant).

🧩 Practical overlapping (why confusion happens)

Many products blur the boundaries:

  • ChatGPT with “Code Interpreter” or “Tasks” – behaves like an agent (can plan and execute code) but it’s still an AI engine with agentic features, not a fully persistent L4 agent (unless you use OpenAI’s Assistants API + persistent threads).
  • Perplexity Pro – acts as a research agent (iterative search, reasoning), yet marketed as an “answer engine”.
  • Copilot in GitHub – L2 copilot (suggests actions), but when paired with agents like “Copilot Workspace”, it becomes agentic.
  • OpenClaw – built using node.js and uses LLMs (OpenAI/Anthropic) via their engines, but the final product is a standalone L4 autonomous agent.

Golden rule: Look at autonomy, tool access, and memory. If it executes tasks without asking you every step → it’s an agent. If it helps you but waits for confirmation → copilot/L2. If it only replies → chatbot/engine. If it’s a library to build such things → framework.


📚 Next steps: explore deeper on this site

  • Core concepts deep dive – detailed definitions, memory types, agent loops.
  • Autonomy Ladder (L1 to L4+) – classification table with reactive, deliberative, learning, collaborative, autonomous agents.
  • Product directories – side‑by‑side comparison of AI engines (ChatGPT, Gemini, Grok, Mistral, DeepSeek, etc.).
  • Framework guides – when to use LangGraph vs CrewAI vs AutoGen.
  • Agent catalog – OpenClaw, Hermes Desktop, Devin, open‑source claws and their architectures.

Homepage summary: The AI space stacks cleanly. Start with LLMs (brains). Then AI Engines (accessible products). Then Frameworks (building tools). Then Agents (autonomous workers). Use this map to navigate everything from ChatGPT to OpenClaw to your own custom agent.