Xpeech

Xpeech 是一个基于 FastAPI 的 Agent 服务。它提供一个 /chat 接口，可以接收文本、图片和文件，调用大模型生成流式回复，并在需要时调用工具完成任务。

适合用来快速启动一个可扩展的 AI Agent API 服务。

功能

提供 HTTP API 和 SSE 流式响应
支持文本、图片和文件输入
支持多轮会话和独立工作区
支持 LiteLLM 兼容的大模型服务
支持内置工具和自定义 Python 工具
支持飞书消息桥接
使用 conf.toml 管理普通配置，使用 .env 管理密钥
YAML 格式存储会话历史，可读性更好
自动历史消息压缩（三级压缩策略），避免超出上下文限制
内置记忆系统，自动总结和保存关键信息
支持视频输入
Token 使用率实时监控
丰富的内置工具集：文件读写、Shell 执行、Web 搜索、网页抓取、Office 文档读取、文件发送、向用户提问

安装

需要 Python 3.12+ 和 uv。

uv sync

提前安装内置技能依赖：

npm install -g @playwright/cli@latest
npx playwright install chromium

配置

普通配置写在 conf.toml：

[path]
session_path = "session"
session_history_path = "session/history"
workspace_base_path = "workspace_base"

[tool]
restrict_tools_to_workspace = true
allowed_networks = ["10.0.0.0/8"]

[llm]
api_base = "https://api.siliconflow.cn/v1"
default_model = "openai/Pro/moonshotai/Kimi-K2.6"
default_context_token = 256000
default_top_p = 0.7
tools_python_package = "custom_tools"
default_tools = ["echo", "hello"]
system_name = ""
custom_system_prompt = ""
# default_reasoning_effort = "normal"
support_image = true
support_video = true
support_json_output = true
parallel = 4

[feishu]
app_id = "cli_xxx"
idle_timeout = 3

密钥写在 .env：

LLM__API_KEY=your_api_key_here
FEISHU__APP_SECRET=your_feishu_app_secret_here

.env.example 可以作为模板复制：

cp .env.example .env

启动

启动 API 服务：

uv run -m xpeech api

如果不指定服务，默认也是启动 API：

uv run -m xpeech

服务默认运行在：

http://localhost:7878

启动后可以打开：

Swagger UI: http://localhost:7878/docs
ReDoc: http://localhost:7878/redoc

启动飞书桥接：

uv run -m xpeech feishu

飞书桥接会从配置中读取：

feishu.app_id：飞书应用 ID
feishu.idle_timeout：同一会话消息合并等待时间，单位秒
FEISHU__APP_SECRET：飞书应用密钥，建议放在 .env

如需连接非默认 API 地址，可以传入 /chat 地址：

uv run -m xpeech feishu --chat-url http://127.0.0.1:7878/chat

发送消息

/chat 需要通过请求头传入会话 ID：

curl -N -X POST "http://localhost:7878/chat" \
  -H "x-session-id: demo-session" \
  -F 'session_metadata={"channel":"curl"}' \
  -F 'content=[{"text":"你好，介绍一下你自己"}]'

上传文件：

curl -N -X POST "http://localhost:7878/chat" \
  -H "x-session-id: demo-session" \
  -F 'session_metadata={"channel":"curl"}' \
  -F 'content=[{"text":"帮我看看这个文件"}]' \
  -F "files=@example.txt"

响应是 SSE 流，可以边生成边读取。

内置命令

在聊天中输入以下命令可以使用快捷功能：

/help - 显示帮助信息
/new - 开始一个新会话，自动总结并保存当前会话记忆

自定义工具

在 conf.toml 里指定工具包：

[llm]
tools_python_package = "custom_tools"
default_tools = ["echo", "hello"]

工具包示例：

custom_tools/
  __init__.py
  test_tools.py

custom_tools/__init__.py：

from .test_tools import echo, hello

__all__ = ["echo", "hello"]

custom_tools/test_tools.py：

from typing import Annotated

from pydantic import BaseModel, Field


def hello():
    """Return a hello message."""
    return "hello"


class Message(BaseModel):
    content: Annotated[str, Field(description="The content to echo")]


def echo(message: Message):
    """Echo the message content."""
    return message.content

工具函数需要有 docstring。函数可以不接收参数，也可以接收一个 Pydantic BaseModel 参数。

内置工具

Xpeech 提供丰富的内置工具，Agent 可以在对话中自动调用：

工具	说明
`read_file`	读取工作区内的文件内容
`write_file`	向工作区写入文件
`edit_file`	编辑工作区内的文件
`list_dir`	列出工作区目录内容
`shell`	执行 Bash 命令（带安全限制）
`web_fetch`	抓取网页内容并转为 Markdown
`web_search`	搜索网页并返回结果
`office_read`	读取 Office 文档（docx/xlsx/pdf/pptx 等）
`send_file`	向用户发送文件
`ask_user_question`	向用户发送表单提问

工具安全

路径隔离：工具操作默认限制在工作区内，防止访问系统文件
Shell 黑名单：禁止执行 rm -rf、format、dd 等危险命令
路径遍历检测：拦截包含 .. 的路径操作
内网 URL 拦截：防止访问内部网络接口

开发

uv sync
uv run -m xpeech api

如果需要检查配置是否能读取：

uv run python -c "from xpeech.config.settings import settings; print(settings.model_dump())"

TODO

添加 cron
添加心跳
添加飞书 CLI

Name		Name	Last commit message	Last commit date
Latest commit History 246 Commits
custom_tools		custom_tools
screenshot		screenshot
xpeech		xpeech
.env.example		.env.example
.gitignore		.gitignore
.python-version		.python-version
README.md		README.md
build_desktop.bat		build_desktop.bat
conf.toml		conf.toml
desktop_entry.py		desktop_entry.py
main.py		main.py
pyproject.toml		pyproject.toml
ruff.toml		ruff.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Xpeech

功能

安装

配置

启动

发送消息

内置命令

自定义工具

内置工具

工具安全

目录

开发

TODO

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Xpeech

功能

安装

配置

启动

发送消息

内置命令

自定义工具

内置工具

工具安全

目录

开发

TODO

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages