The world of embedding API clients is broken.
- Everyone defaults to OpenAI's client for embeddings, even though it wasn't designed for that purpose
- Provider-specific libraries (VoyageAI, Cohere, etc.) are inconsistent, poorly maintained, or outright broken
- Universal clients like LiteLLM and any-llm-sdk don't focus on embeddings at all: they wrap the native client libraries, inheriting all their problems
- Every provider has different capabilities (some support dimension changes, others don't), with no standardized way to discover what's available
- Most clients lack basic features like retry logic, proper error handling, and usage tracking
- There's no single source of truth for model metadata, pricing, or capabilities
Catsu fixes this. It's a lightweight, unified client built specifically for embeddings with:
- A clean, consistent API across all providers
- Built-in retry logic with exponential backoff
- Automatic usage and cost tracking
- Rich model metadata and capability discovery
- First-class support for both sync and async
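Catsu handles retries for you, but for context, the retry-with-exponential-backoff pattern generally looks like the sketch below. This is a generic illustration, not catsu's internal implementation, and the delay parameters are arbitrary:

```python
import random
import time

def with_retries(fn, max_attempts=4, base_delay=0.5):
    """Call fn(), retrying on failure with exponential backoff plus jitter."""
    for attempt in range(max_attempts):
        try:
            return fn()
        except Exception:
            if attempt == max_attempts - 1:
                raise  # out of attempts: surface the error
            # Delay doubles each attempt (0.5s, 1s, 2s, ...) plus random jitter
            time.sleep(base_delay * (2 ** attempt) + random.uniform(0, 0.1))
```

The jitter term spreads out retries from many concurrent clients so they don't all hammer the provider at the same instant.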
Install with uv (recommended):

```bash
uv pip install catsu
```

Or with pip:

```bash
pip install catsu
```

Get started in seconds! Just import catsu, create a client, and start embedding:

```python
import catsu

# Initialize the client
client = catsu.Client()

# Generate embeddings (auto-detects provider from model name)
response = client.embed(
    model="voyage-3",
    input="Hello, embeddings!",
)

# Access your results
print(f"Dimensions: {response.dimensions}")
print(f"Tokens used: {response.usage.tokens}")
print(f"Cost: ${response.usage.cost:.6f}")
print(f"Embedding: {response.embeddings[0][:5]}...")  # First 5 dims
```

That's it! No configuration needed: catsu picks up your API keys from environment variables automatically (`VOYAGE_API_KEY`, `OPENAI_API_KEY`, etc.).
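The reported cost is simply token usage multiplied by the provider's per-token price. As a rough illustration of the arithmetic (the $0.06-per-million-tokens rate here is hypothetical, not a real quote for any model):

```python
def estimate_cost(tokens: int, price_per_million: float) -> float:
    """Convert a token count into a dollar cost at a per-million-token rate."""
    return tokens / 1_000_000 * price_per_million

# e.g. 1,200 tokens at a hypothetical $0.06 per million tokens
print(f"${estimate_cost(1_200, 0.06):.6f}")  # → $0.000072
```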
Want more control? Specify the provider explicitly:

```python
# Method 1: Separate parameters
response = client.embed(provider="voyageai", model="voyage-3", input="Hello!")

# Method 2: Provider prefix
response = client.embed(model="voyageai:voyage-3", input="Hello!")
```

Need async? Just use `aembed`:
```python
response = await client.aembed(model="voyage-3", input="Hello, async world!")
```

Want to learn more? Check out the complete documentation for detailed guides on all providers, parameters, and best practices.
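Because `aembed` is a regular coroutine, it composes with `asyncio.gather` to embed many inputs concurrently. A sketch of the pattern, using a stand-in coroutine where a real `client.aembed` call would go:

```python
import asyncio

async def fake_aembed(text: str) -> list[float]:
    """Stand-in for client.aembed; returns a dummy one-dimensional vector."""
    await asyncio.sleep(0.01)  # simulate network latency
    return [float(len(text))]

async def embed_all(texts: list[str]) -> list[list[float]]:
    # Launch all requests concurrently; results come back in input order
    return await asyncio.gather(*(fake_aembed(t) for t in texts))

vectors = asyncio.run(embed_all(["a", "bb", "ccc"]))
```

Because `gather` preserves input order, the i-th vector always corresponds to the i-th text, regardless of which request finishes first.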
Can't find your favorite model or provider? Open an issue and we will promptly try to add it! We're constantly expanding support for new embedding providers and models.
For guidelines on contributing, please see CONTRIBUTING.md.
If you found this helpful, consider giving it a ⭐!
made with ❤️ by chonkie, inc.