🤗 smolagents: 用代码思考的AI智能体

Hugging Face轻量级智能体框架源码深度解析

源码级别解析 · ~1000行核心代码 · CodeAgent · 多智能体协作
2026-05-07 | 每日技术深度解读

什么是 smolagents？

Hugging Face出品的极简智能体库

核心逻辑仅约1000行代码 (agents.py)
CodeAgent: 智能体用Python代码表达行动
ToolCallingAgent: 传统工具调用智能体
支持多智能体协作 (ManagedAgent)
模型无关: 支持本地模型、OpenAI、Anthropic等

smol = small, 精简到极致的智能体实现

核心理念: 代码即行动

与传统ReAct智能体的关键区别

传统ReAct: LLM输出文本描述 → 解析为工具调用
CodeAgent: LLM直接输出可执行Python代码
代码中自然包含逻辑判断、循环、变量赋值
工具调用 = 函数调用，在代码中直接写
更灵活、更强大、更少解析错误

代码作为行动表达，比文本指令更精确

CodeAgent ReAct循环

┌──────────────────────────────────────┐ │ User Task │ └──────────────┬───────────────────────┘ ▼ ┌──────────────────────────────────────┐ │ agent.memory (Chat Messages) │ └──────────────┬───────────────────────┘ ▼ ┌──────────────────────────────────────┐ │ ┌────────────────────────────────┐ │ │ │ ReAct Loop │ │ │ │ │ │ │ │ 1. Generate: LLM生成代码 │ │ │ │ 2. Parse: 提取代码action │ │ │ │ 3. Execute: 执行Python代码 │ │ │ │ 4. Observe: 记录执行结果 │ │ │ │ │ │ │ │ → 调用final_answer? → 输出 │ │ │ │ → 否则 → 回到步骤1 │ │ │ └────────────────────────────────┘ │ └──────────────────────────────────────┘

CodeAgent的核心循环: 生成代码→执行→观察→重复

两大智能体类型

CodeAgent vs ToolCallingAgent

CodeAgent: LLM输出Python代码片段，直接执行
ToolCallingAgent: LLM输出结构化工具调用
CodeAgent更灵活: 可以写循环、条件判断
ToolCallingAgent更安全: 不执行任意代码
两者共享相同的Tool、Memory、Model基础设施

选择取决于安全需求和任务复杂度

CodeAgent vs ToolCallingAgent 对比

特性	CodeAgent	ToolCallingAgent
行动表达	Python代码	结构化JSON
灵活性	极高(任意Python)	受限(预定义工具)
安全性	需沙箱	天然安全
多步推理	代码循环/变量	多次工具调用
适用场景	数据分析、编程	信息检索、API调用
代码行数	~500行	~300行

快速上手: 5行代码创建智能体

from smolagents import CodeAgent, WebSearchTool, InferenceClientModel

model = InferenceClientModel()
agent = CodeAgent(
    tools=[WebSearchTool()],
    model=model,
    stream_outputs=True
)
agent.run("How many seconds for a leopard at full speed to run through Pont des Arts?")

smolagents的设计目标: 几行代码即可运行强大智能体

Agent类继承体系 (源码 agents.py)

class Agent(ABC):
    """所有智能体的基类"""
    model: Model
    tools: list[BaseTool]
    memory: AgentMemory
    max_steps: int = 20

    @abstractmethod
    def run(self, task: str) -> str: ...


class ToolCallingAgent(Agent):
    """传统工具调用智能体"""
    def step(self) -> ActionOutput:
        # LLM生成ChatMessageToolCall
        # 解析并执行工具调用
        ...


class CodeAgent(Agent):
    """代码执行智能体 (核心创新)"""
    python_executor: PythonExecutor
    
    def step(self) -> ActionOutput:
        # LLM生成Python代码
        # 通过python_executor执行
        ...

Agent基类 → ToolCallingAgent / CodeAgent 两种实现

CodeAgent.step() 核心逻辑

class CodeAgent(Agent):
    def step(self) -> ActionOutput:
        # 1. 构建消息: system_prompt + memory + tools
        messages = self.memory.get_serialized_messages()
        
        # 2. LLM生成响应 (包含Python代码)
        chat_message = self.model(messages)
        
        # 3. 从响应中提取代码块
        code_action = extract_code_from_text(chat_message.content)
        
        # 4. 在沙箱中执行代码
        observation, is_final = self.python_executor(
            code_action,
            tools=self.tools,
            authorized_imports=self.authorized_imports
        )
        
        # 5. 记录到memory
        self.memory.step(ActionStep(
            tool_calls=[],
            observations=[observation]
        ))
        
        return ActionOutput(observation, is_final)

CodeAgent每一步: 生成代码→提取→执行→记录

代码提取: extract_code_from_text()

def extract_code_from_text(text: str) -> str:
    """从LLM输出中提取Python代码块"""
    # 匹配 ```python ... ``` 代码块
    code_blocks = []
    
    # 尝试解析markdown代码块
    pattern = r"```(?:python)?\s*\n(.*?)```"
    matches = re.findall(pattern, text, re.DOTALL)
    
    if matches:
        return matches[-1]  # 取最后一个代码块
    
    # 回退: 如果没有代码块标记，
    # 尝试解析整个文本为代码
    return parse_code_blobs(text)


def fix_final_answer_code(code: str) -> str:
    """确保代码调用final_answer工具"""
    if 'final_answer' not in code:
        code += f"\nfinal_answer({code_result})"
    return code

从LLM输出中智能提取可执行代码

沙箱代码执行架构

┌──────────────────────────────────────┐ │ CodeAgent.step() │ │ │ │ LLM → Python Code → Executor │ │ │ │ │ ┌───────────┼───────────┐│ │ ▼ ▼ ▼│ │ ┌──────────┐ ┌──────────┐ ┌────┐│ │ │ Local │ │ Docker │ │E2B ││ │ │ Executor │ │ Executor │ │ ││ │ └──────────┘ └──────────┘ └────┘│ │ ▼ ▼ ▼│ │ ┌──────────┐ ┌──────────┐ ┌────┐│ │ │ Pyodide │ │ Modal │ │Wasm││ │ │ + Deno │ │ │ │ ││ │ └──────────┘ └──────────┘ └────┘│ └──────────────────────────────────────┘

6种沙箱执行后端，从本地到云端全覆盖

6种代码沙箱后端

安全性与便利性的平衡

LocalPythonExecutor: 本地直接执行(开发用)
DockerExecutor: Docker容器隔离执行
E2BExecutor: E2B云端沙箱
ModalExecutor: Modal无服务器执行
BlaxelExecutor: Blaxel云平台
WasmExecutor: Pyodide+Deno WebAssembly(浏览器)

生产环境推荐Docker或E2B沙箱

LocalPythonExecutor 源码

class LocalPythonExecutor(PythonExecutor):
    """本地Python代码执行器"""
    
    BASE_BUILTIN_MODULES = [
        'math', 'random', 'datetime', 're', 
        'collections', 'itertools', 'statistics'
    ]
    
    def __init__(self, authorized_imports, tools):
        self.authorized_imports = authorized_imports
        self.tools = {t.name: t for t in tools}
        self.state = {}  # 跨步骤共享变量
    
    def __call__(self, code, tools, authorized_imports):
        # 1. 注入工具为函数
        for tool_name, tool in self.tools.items():
            self.state[tool_name] = tool
        
        # 2. 控制import: 只允许授权模块
        exec(code, {
            '__builtins__': {
                **{m: __import__(m) for m in authorized_imports},
                '__import__': restricted_import,
            }
        }, self.state)
        
        return self._get_observation()

通过受限的exec()实现安全本地执行

DockerExecutor 源码

class DockerExecutor:
    """Docker容器沙箱执行器"""
    
    def __init__(self, image="python:3.11-slim"):
        self.image = image
        self.container = None
    
    def execute(self, code, tools, authorized_imports):
        if not self.container:
            self.container = docker_client.containers.run(
                self.image, detach=True, 
                mem_limit="512m", cpu_period=100000,
                network_disabled=True  # 安全: 禁用网络
            )
        
        # 序列化工具并发送到容器
        exec_result = self.container.exec_run(
            cmd=["python", "-c", code],
            workdir="/workspace"
        )
        
        return exec_result.output.decode()
    
    def close(self):
        if self.container:
            self.container.stop()
            self.container.remove()

Docker提供进程和网络隔离，适合生产环境

Tool系统架构

统一的工具抽象层

BaseTool: 抽象基类，定义__call__接口
Tool: 完整工具类，含name/description/inputs/output_type
ToolCollection: 工具集合，支持从MCP/LangChain导入
FinalAnswerTool: 内置终止工具，调用即结束循环
Hub集成: 工具可发布到HuggingFace Hub

工具是智能体能力的扩展点

Tool基类定义 (源码 tools.py)

class Tool(BaseTool):
    """智能体工具基类"""
    name: str           # 工具名称
    description: str    # 工具描述 (LLM看到)
    inputs: dict        # 输入JSON Schema
    output_type: str    # 输出类型
    output_schema: dict # 输出JSON Schema
    is_initialized: bool = False

    @abstractmethod
    def forward(self, *args, **kwargs) -> Any:
        """工具核心逻辑，子类实现"""
        ...

    def __call__(self, *args, **kwargs) -> Any:
        """延迟初始化 + 执行"""
        if not self.is_initialized:
            self.setup()      # 首次调用时初始化
            self.is_initialized = True
        return self.forward(*args, **kwargs)

    def setup(self):
        """可选: 加载模型等昂贵操作"""
        pass

    def to_dict(self) -> dict:
        """序列化为字典 (Hub分享用)"""
        return {
            'name': self.name,
            'description': self.description,
            'inputs': self.inputs,
            'output_type': self.output_type
        }

延迟初始化模式: setup()只在首次调用时执行

自定义工具示例

from smolagents import Tool

class WeatherTool(Tool):
    name = "get_weather"
    description = "获取指定城市的天气信息"
    inputs = {
        'city': {
            'type': 'string',
            'description': '城市名称'
        }
    }
    output_type = "string"

    def forward(self, city: str) -> str:
        # 调用天气API
        response = requests.get(
            f"https://api.weather.com/v1/{city}"
        )
        data = response.json()
        return f"{city}: {data['temp']}°C, {data['condition']}"

# CodeAgent中，工具直接作为函数调用:
# result = get_weather("Beijing")

CodeAgent中工具=函数，直接在代码中调用

工具生态: 多来源集成

不绑定任何工具生态

MCP Server: 从Model Context Protocol服务器导入工具
LangChain: 兼容LangChain工具
Hub Space: 将HuggingFace Space作为工具
Hub共享: 从HuggingFace Hub加载社区工具
内置工具包: web_search、file_management等

tool-agnostic设计理念

从MCP和LangChain导入工具

# 从MCP服务器导入工具
from smolagents import ToolCollection

tools = ToolCollection.from_mcp(
    server_url="http://localhost:3000"
)

# 从LangChain导入工具
from smolagents import Tool

tool = Tool.from_langchain(
    langchain_tool  # 任何LangChain BaseTool
)

# 从HuggingFace Hub加载工具
tool = Tool.from_hub("user/tool_name")

# 将Hub Space作为工具
tool = Tool.from_space(
    "user/space_name",
    "这个工具可以做XXX"
)

# 发布工具到Hub
my_tool.push_to_hub("user/my_weather_tool")

一个框架打通多个工具生态

内置工具包

工具名	功能	输入	输出
WebSearchTool	DuckDuckGo网络搜索	查询字符串	搜索结果
VisitWebTool	访问网页获取内容	URL	页面文本
FileManagementTool	文件读写管理	文件路径+操作	文件内容
FinalAnswerTool	终止智能体循环	最终答案	终止信号
PythonTool	执行Python代码	代码字符串	执行结果

Memory系统

智能体的工作记忆

AgentMemory: 核心记忆管理器
MemoryStep: 每一步的记录 (ActionStep/TaskStep/...)
CallbackRegistry: 回调注册机制
支持流式输出: 实时显示思考过程
get_serialized_messages(): 序列化为LLM消息格式

Memory是ReAct循环的核心组件

Memory类型体系 (源码 memory.py)

class MemoryStep(TypedDict):
    """所有记忆步骤的基类"""
    timestamp: float
    

class TaskStep(MemoryStep):
    """用户任务"""
    task: str


class ActionStep(MemoryStep):
    """智能体行动"""
    tool_calls: list[ToolCall]
    observations: list[str]


class PlanningStep(MemoryStep):
    """规划步骤"""
    plan: str


class FinalAnswerStep(MemoryStep):
    """最终答案"""
    answer: str


class AgentMemory:
    def __init__(self):
        self.steps: list[MemoryStep] = []
        self.system_prompt: str = ""
    
    def get_serialized_messages(self) -> list[dict]:
        """将记忆序列化为LLM消息格式"""
        ...

类型化的记忆步骤，便于回溯和分析

Model层: 模型无关设计

统一接口，任意LLM后端

Model: 抽象基类，定义chat()接口
InferenceClientModel: HuggingFace推理端点
LiteLLMModel: 100+ LLM提供商统一接入
OpenAIModel: OpenAI兼容API
TransformersModel: 本地transformers模型
AzureOpenAIModel / AmazonBedrockModel

MODEL_REGISTRY注册机制，插件式扩展

Model抽象与实现

class Model(ABC):
    """模型抽象基类"""
    @abstractmethod
    def chat(self, messages, **kwargs) -> ChatMessage:
        ...


nclass InferenceClientModel(Model):
    """HuggingFace推理端点 (免费!)"""
    def __init__(self, model_id=None, provider=None):
        self.client = InferenceClient(
            model=model_id, provider=provider
        )
    
    def chat(self, messages, **kwargs) -> ChatMessage:
        response = self.client.chat_completion(
            messages=messages, **kwargs
        )
        return ChatMessage.from_dict(response)


class LiteLLMModel(Model):
    """100+ LLM提供商"""
    def chat(self, messages, **kwargs):
        import litellm
        response = litellm.completion(
            model=self.model_id,
            messages=messages,
            **kwargs
        )
        return ChatMessage.from_openai(response)

LiteLLM一个库接入几乎所有LLM

模型提供商支持矩阵

模型类	提供商	需要API Key	特色
InferenceClientModel	HF Inference	可选(有免费)	免费推理、HF生态
LiteLLMModel	100+提供商	是	最广泛的覆盖
OpenAIModel	OpenAI/兼容	是	GPT系列
TransformersModel	本地模型	否	离线、隐私
AzureOpenAIModel	Azure	是	企业合规
AmazonBedrockModel	AWS	是	AWS生态集成

多智能体: ManagedAgent

智能体管理智能体

ManagedAgent: 将Agent包装为Tool
主智能体通过函数调用委托任务
子智能体独立运行，返回结果
支持任意层级嵌套
共享或独立的工具集

多智能体协作的核心机制

ManagedAgent实现

from smolagents import CodeAgent, ManagedAgent

# 子智能体1: 搜索专家
web_agent = CodeAgent(
    tools=[WebSearchTool()],
    model=model,
    name="search_agent",
    description="搜索网络获取信息"
)

# 子智能体2: 数据分析专家
data_agent = CodeAgent(
    tools=[PythonTool()],
    model=model,
    name="data_agent", 
    description="分析数据和生成图表"
)

# 主智能体: 协调者
manager = CodeAgent(
    tools=[
        ManagedAgent(web_agent),   # 作为工具!
        ManagedAgent(data_agent),   # 作为工具!
    ],
    model=model,
)

# 主智能体在代码中调用子智能体:
# search_result = search_agent("AI最新进展")
# analysis = data_agent(search_result)

ManagedAgent将Agent→Tool，实现递归委托

多智能体协作架构

┌──────────────────────────────────────┐ │ Manager Agent │ │ (CodeAgent + ManagedAgent tools) │ │ │ │ # Python代码中调用: │ │ result = search_agent(query) │ │ analysis = data_agent(result) │ └──────────┬──────────────┬───────────┘ │ │ ┌─────▼─────┐ ┌─────▼─────┐ │ Search │ │ Data │ │ Agent │ │ Agent │ │ (独立执行) │ │ (独立执行)│ └───────────┘ └───────────┘ │ │ ┌─────▼─────┐ ┌─────▼─────┐ │WebSearch │ │ Python │ │ Tool │ │ Tool │ └───────────┘ └───────────┘

Manager通过代码调用委托任务给子智能体

Planning系统

让智能体先规划再执行

PlanningStep: 规划步骤记忆
initial_plan: 初始规划prompt
update_plan: 动态更新规划
规划与执行交替进行
提高复杂任务的完成率

Plan-and-Execute模式提升可靠性

规划Prompt模板 (源码)

class PlanningPromptTemplate(TypedDict):
    initial_plan: str
    update_plan_pre_messages: str
    update_plan_post_messages: str

DEFAULT_PLANNING_PROMPT = PlanningPromptTemplate(
    initial_plan="""You should first make a plan and 
    clarify your plan. Then carry out the plan step 
    by step. You can revise your plan based on new 
    information you discover along the way.""",
    
    update_plan_pre_messages="""Here is the plan you 
    have been following so far:
    {plan}
    
    Recent steps:
    {memory}""",
    
    update_plan_post_messages="""Now, based on the 
    above, please update your plan. Keep it simple: 
    just state the remaining steps."""
)

Jinja2模板驱动的规划系统

Prompt模板系统

Jinja2模板驱动，高度可定制

system_prompt: 系统提示词模板
planning: 规划相关模板
managed_agent: 子智能体委托模板
final_answer: 最终答案模板
PromptTemplates TypedDict统一管理

所有提示词都可通过模板定制

模板渲染引擎

from jinja2 import Template, StrictUndefined

def populate_template(template: str, variables: dict) -> str:
    """用变量填充Jinja2模板"""
    compiled = Template(template, undefined=StrictUndefined)
    try:
        return compiled.render(**variables)
    except Exception as e:
        raise Exception(
            f"Template rendering error: {type(e).__name__}: {e}"
        )

# 示例: 系统提示词模板
SYSTEM_PROMPT = """
You are a helpful assistant.
You have access to these tools: {{ tool_names }}.

To use a tool, write Python code calling the tool as a function.
Always end with final_answer(your_result).
"""

rendered = populate_template(SYSTEM_PROMPT, {
    'tool_names': 'web_search, visit_webpage'
})

StrictUndefined防止未定义变量导致的静默错误

Hub集成: 分享与复用

智能体和工具的社区共享

agent.push_to_hub(): 发布智能体到Hub
agent.from_hub(): 从Hub加载智能体
Tool.from_hub(): 加载社区工具
tool.push_to_hub(): 发布自定义工具
自动创建Gradio Space作为可交互demo

一键分享，社区驱动增长

Hub分享完整流程

# 1. 发布智能体到Hub
agent.push_to_hub("my-user/my-coder-agent")
# → 自动创建repo，上传代码、工具定义、配置

# 2. 从Hub加载智能体
from smolagents import CodeAgent
agent = CodeAgent.from_hub("my-user/my-coder-agent")
agent.run("帮我写一个快速排序")

# 3. 发布自定义工具
my_tool.push_to_hub("my-user/weather-tool")

# 4. 从Hub加载工具
from smolagents import Tool
tool = Tool.from_hub("my-user/weather-tool")

# 5. push_to_hub内部实现
# → create_repo() + upload_folder()
# → 包含: agent_config.yaml, tools.py, README.md
# → 自动创建Gradio Space demo

基于huggingface_hub的完整分享机制

CLI工具: smolagent & webagent

命令行直接运行智能体

smolagent: 通用代码智能体
webagent: 专门浏览网页的智能体(helium)
交互模式: 引导式配置向导
支持指定模型、工具、imports
适合快速测试和demo

无需写代码，命令行直接用

CLI使用示例

# 通用智能体
smolagent "Plan a trip to Tokyo" \
  --model-type "InferenceClientModel" \
  --model-id "Qwen/Qwen3-Next-80B-A3B-Thinking" \
  --imports pandas numpy \
  --tools web_search

# 交互模式 (无参数启动)
smolagent
# → 引导选择: Agent类型 → 工具 → 模型 → 输入任务

# 网页浏览智能体
webagent "go to amazon.com, find best-selling books" \
  --model-type "LiteLLMModel" \
  --model-id "gpt-4o"

# 使用本地模型 (完全离线)
smolagent "分析这个CSV文件" \
  --model-type "TransformersModel" \
  --model-id "Qwen/Qwen3-Next-80B-A3B-Thinking" \
  --imports pandas

CLI让智能体像命令行工具一样简单

多模态支持

文本、图像、视频、音频

AgentImage: 图像输入输出类型
AgentAudio: 音频输入输出类型
handle_agent_input_types(): 自动转换输入
handle_agent_output_types(): 自动转换输出
视觉智能体: 截图+分析网页

模态无关设计，支持任意输入输出类型

多模态类型处理

# agent_types.py
class AgentImage:
    """智能体图像类型"""
    def __init__(self, path_or_url: str):
        self.path = path_or_url
    
    def to_pil(self) -> "PIL.Image.Image":
        return Image.open(self.path)


class AgentAudio:
    """智能体音频类型"""
    def __init__(self, path: str):
        self.path = path


def handle_agent_input_types(**kwargs):
    """自动将输入转换为模型可处理格式"""
    for key, value in kwargs.items():
        if isinstance(value, AgentImage):
            kwargs[key] = value.to_pil()
        elif isinstance(value, AgentAudio):
            kwargs[key] = load_audio(value.path)
    return kwargs


ndef handle_agent_output_types(output):
    """自动包装模型输出"""
    if isinstance(output, PIL.Image.Image):
        return AgentImage.from_pil(output)
    return output

类型系统确保多模态输入输出的正确处理

监控与日志系统

AgentLogger + Monitor + TokenUsage

AgentLogger: Rich库驱动的终端输出
Monitor: 结构化监控数据收集
TokenUsage: Token使用量跟踪
LogLevel: 日志级别控制
CallbackRegistry: 执行回调注册

完整的可观测性支持

RunResult与监控数据

@dataclass
class RunResult:
    """智能体运行的完整结果"""
    output: Any | None          # 最终输出
    state: Literal["success", "max_steps_error"]  # 运行状态
    steps: list[dict]          # 所有记忆步骤
    token_usage: TokenUsage | None  # Token消耗
    timing: Timing              # 时间统计


nclass TokenUsage(TypedDict):
    input_tokens: int
    output_tokens: int
    total_tokens: int


class Timing(TypedDict):
    start_time: float
    end_time: float
    duration: float


n# 使用示例
result = agent.run("分析数据")
print(f"状态: {result.state}")
print(f"Token: {result.token_usage['total_tokens']}")
print(f"耗时: {result.timing['duration']:.2f}s")
print(f"步骤数: {len(result.steps)}")

RunResult提供运行全貌，便于调试和优化

错误处理体系

7种智能体异常类型

AgentError: 所有异常基类
AgentExecutionError: 代码执行失败
AgentGenerationError: LLM生成失败
AgentMaxStepsError: 超过最大步数
AgentParsingError: 输出解析失败
AgentToolCallError: 工具调用失败
AgentToolExecutionError: 工具执行出错

精确的错误分类便于定位和恢复

流式输出实现

# 启用流式输出
agent = CodeAgent(
    model=model,
    tools=[WebSearchTool()],
    stream_outputs=True  # 关键!
)

# 源码: stream_outputs处理
if self.stream_outputs:
    with Live(
        Group(
            Panel(Markdown(accumulated_output)),
            Rule(),
        ),
        console=console,
        refresh_per_second=4,
    ) as live:
        for delta in model.stream_chat(messages):
            # 实时更新Rich Live显示
            accumulated_output += delta.content
            live.update(
                Panel(Markdown(accumulated_output))
            )

# 使用Rich库实现漂亮的终端实时输出

Rich Live提供流畅的终端流式体验

项目结构与模块划分

清晰的单包架构

agents.py: 智能体核心 (~1000行)
tools.py: 工具系统
models.py: 模型抽象层
memory.py: 记忆管理
local_python_executor.py: 本地执行器
remote_executors.py: 远程执行器
monitoring.py: 监控日志

极简模块划分，每个文件职责明确

核心模块依赖关系

模块	行数	依赖	核心类
agents.py	~1000	memory,tools,models	Agent,CodeAgent,ToolCallingAgent
tools.py	~500	huggingface_hub	Tool,ToolCollection
models.py	~300	litellm,openai	Model,LiteLLMModel
memory.py	~200	无	AgentMemory,MemoryStep
local_python_executor.py	~300	无	LocalPythonExecutor
remote_executors.py	~400	docker,e2b,modal	DockerExecutor,E2BExecutor
monitoring.py	~200	rich	AgentLogger,Monitor

设计哲学: 极简主义

smol = small, 小而强大

1000行核心代码实现完整智能体
最小抽象层: 紧贴原始Python代码
拒绝过度工程: 能用dict就不用class
代码即文档: 源码本身就是最佳文档
可读性优先: 清晰 > 巧妙

与LangChain等重量级框架形成鲜明对比

smolagents vs 其他框架

特性	smolagents	LangChain	CrewAI	Agno
核心代码	~1000行	~50000行	~15000行	~5000行
代码智能体	✅ 一等公民	❌	❌	❌
沙箱执行	6种后端	无内置	无内置	无内置
Hub集成	✅ 原生	❌	❌	❌
CLI工具	✅ 2个	❌	❌	❌
学习曲线	极低	高	中	低
多智能体	ManagedAgent	LangGraph	✅ 核心	✅ 核心

适用场景

什么时候选择smolagents

数据分析: 自然语言驱动数据处理
编程助手: 代码生成和调试
网页浏览: 自动化信息收集
快速原型: 几行代码验证想法
教学演示: 清晰的代码便于理解

最适合需要灵活代码执行的场景

局限性

需要了解的不足

无内置RAG: 需自行集成向量数据库
无内置对话UI: 需搭配Gradio等
多智能体功能简单: 无复杂编排
无工作流引擎: 不支持DAG/状态机
生态较新: 社区工具不如LangChain丰富

极简设计意味着某些高级功能需要自行实现

实战: 数据分析智能体

from smolagents import CodeAgent, InferenceClientModel

data_analyst = CodeAgent(
    model=InferenceClientModel(
        model_id="deepseek-ai/DeepSeek-R1",
        provider="together"
    ),
    tools=[],  # 纯代码智能体，不需要外部工具
    authorized_imports=["pandas", "matplotlib", "numpy", "scipy"],
    max_steps=10,
)

result = data_analyst.run(
    """加载 iris.csv 数据集:
    1. 计算每种花的平均花瓣长度
    2. 找出花瓣最长的种类
    3. 生成散点图保存为 scatter.png
    """
)

# CodeAgent会生成类似这样的代码:
# import pandas as pd
# df = pd.read_csv('iris.csv')
# means = df.groupby('species')['petal_length'].mean()
# max_species = means.idxmax()
# import matplotlib.pyplot as plt
# df.plot.scatter(x='petal_length', y='petal_width')
# plt.savefig('scatter.png')
# final_answer(f"最长花瓣: {max_species}, 均值: {means[max_species]:.2f}")

纯代码智能体: 不需要工具，直接写Python

参考资料

GitHub仓库: https://github.com/huggingface/smolagents
官方文档: https://huggingface.co/docs/smolagents
核心源码 agents.py: https://github.com/huggingface/smolagents/blob/main/src/smolagents/agents.py
HuggingFace Hub: https://huggingface.co/smolagents

感谢阅读！
访问 https://atcfu.com/ai-articles/smolagents-code-agents/ 回顾本文