Architecture Overview¶

Mamba Agents is a thin wrapper around pydantic-ai that adds enterprise-grade infrastructure.

High-Level Architecture¶

graph TB
    subgraph "User Application"
        APP[Application Code]
    end

    subgraph "Mamba Agents"
        AGENT[Agent]
        CONFIG[AgentSettings]

        subgraph "Core Features"
            CTX[ContextManager]
            TOK[TokenCounter]
            TRACK[UsageTracker]
            COST[CostEstimator]
        end

        subgraph "Extensions"
            WORK[Workflows]
            MCP[MCP Client]
            TOOLS[Built-in Tools]
        end

        subgraph "Infrastructure"
            LOG[Logging]
            TRACE[Tracing]
            ERR[Error Handling]
        end
    end

    subgraph "External"
        PAI[pydantic-ai]
        LLM[LLM Provider]
        MCPS[MCP Servers]
    end

    APP --> AGENT
    AGENT --> CONFIG
    AGENT --> CTX
    AGENT --> TOK
    AGENT --> TRACK
    AGENT --> COST
    AGENT --> PAI
    PAI --> LLM
    WORK --> AGENT
    MCP --> MCPS
    TOOLS --> AGENT
    LOG --> AGENT
    TRACE --> AGENT
    ERR --> AGENT

Module Structure¶

src/mamba_agents/
├── agent/           # Core agent (wraps pydantic-ai)
│   ├── core.py      # Agent class
│   ├── config.py    # AgentConfig
│   └── result.py    # AgentResult
│
├── config/          # Configuration system
│   ├── settings.py  # AgentSettings (root)
│   ├── model_backend.py
│   ├── logging_config.py
│   └── ...
│
├── context/         # Context window management
│   ├── manager.py   # ContextManager
│   ├── config.py    # CompactionConfig
│   └── compaction/  # 5 strategies
│
├── tokens/          # Token counting & cost
│   ├── counter.py   # TokenCounter
│   ├── tracker.py   # UsageTracker
│   └── cost.py      # CostEstimator
│
├── workflows/       # Workflow orchestration
│   ├── base.py      # Workflow ABC
│   ├── config.py    # WorkflowConfig
│   └── react/       # ReAct implementation
│
├── tools/           # Built-in tools
│   ├── filesystem/  # File operations
│   ├── bash.py      # Shell commands
│   ├── glob.py      # Pattern matching
│   └── grep.py      # Content search
│
├── mcp/             # MCP integration
│   ├── client.py    # MCPClientManager
│   └── config.py    # MCPServerConfig
│
├── backends/        # Model backends
│   ├── openai_compat.py
│   └── profiles.py
│
├── observability/   # Logging & tracing
│   ├── logging.py
│   ├── tracing.py
│   └── otel.py
│
└── errors/          # Error handling
    ├── exceptions.py
    ├── circuit_breaker.py
    └── retry.py

Component Relationships¶

Agent Core¶

The Agent class is the central component:

class Agent[DepsT, OutputT]:
    # Delegates to pydantic-ai
    _agent: PydanticAgent

    # Manages context
    context_manager: ContextManager | None

    # Tracks tokens
    token_counter: TokenCounter
    usage_tracker: UsageTracker
    cost_estimator: CostEstimator

    # Configuration
    config: AgentConfig
    settings: AgentSettings

Context Flow¶

sequenceDiagram
    participant App
    participant Agent
    participant Context
    participant PydanticAI
    participant LLM

    App->>Agent: run(prompt)
    Agent->>Context: get_messages()
    Context-->>Agent: history
    Agent->>PydanticAI: run(prompt + history)
    PydanticAI->>LLM: API call
    LLM-->>PydanticAI: response
    PydanticAI-->>Agent: result
    Agent->>Context: add_messages(new)
    Agent->>Context: check threshold
    alt Should Compact
        Agent->>Context: compact()
    end
    Agent-->>App: AgentResult

Workflow Orchestration¶

graph LR
    subgraph "Workflow Layer"
        WF[Workflow]
        REACT[ReActWorkflow]
    end

    subgraph "Agent Layer"
        AGENT[Agent]
    end

    subgraph "Execution Layer"
        PAI[pydantic-ai]
        TOOLS[Tools]
    end

    WF --> AGENT
    REACT --> AGENT
    AGENT --> PAI
    PAI --> TOOLS

Design Principles¶

1. Thin Wrapper¶

Mamba Agents doesn't reinvent pydantic-ai; it extends it:

Core agent logic delegates to pydantic-ai
Infrastructure layers are composable
Components work standalone or integrated

2. Configuration Layering¶

Multiple configuration sources with clear priority:

# Priority: constructor > env > .env > file > default
settings = AgentSettings()  # Loads from all sources
agent = Agent("gpt-4o", settings=settings)  # Constructor overrides

3. Facade Pattern¶

Agent provides a simplified interface to complex subsystems:

# Instead of:
counter = TokenCounter()
tracker = UsageTracker()
count = counter.count(text)
tracker.record_usage(...)

# Just use:
agent.get_token_count(text)
agent.get_usage()

4. Strategy Pattern¶

Interchangeable algorithms for compaction:

# Same interface, different strategies
config = CompactionConfig(strategy="sliding_window")
config = CompactionConfig(strategy="summarize_older")
config = CompactionConfig(strategy="hybrid")

5. Hooks for Extensibility¶

Observable lifecycle events:

hooks = WorkflowHooks(
    on_step_start=my_logger,
    on_step_complete=my_metrics,
)

Data Flow¶

Request Lifecycle¶

Input - User provides prompt
Context - Messages retrieved from history
Execution - pydantic-ai runs with tools
Tracking - Usage recorded
Storage - New messages added to context
Compaction - Context pruned if needed
Output - Result returned to user

Token Tracking Flow¶

Agent.run()
  → pydantic-ai execution
  → Extract usage from response
  → UsageTracker.record_usage()
  → CostEstimator.estimate()
  → TokenUsage/CostBreakdown available

Extension Points¶

Custom Tools¶

@agent.tool
def my_tool(param: str) -> str:
    return "result"

Custom Workflows¶

class MyWorkflow(Workflow[None, str, dict]):
    @property
    def name(self) -> str:
        return "my_workflow"

    async def _execute(self, prompt, state, deps):
        # Custom logic
        pass

Custom Compaction¶

class MyStrategy(CompactionStrategy):
    def compact(self, messages, config):
        # Custom compaction logic
        pass