🤖📊 TrainLoop Evals

TrainLoop Evals is a comprehensive LLM evaluation framework designed for developers who need simple, vendor-independent evaluation tools.

Core Principles

Simplicity First – One environment variable, one function call, one folder of JSON files
Vendor Independence – Everything stored as newline-delimited JSON; no databases required
Type-Safe & Extensible – In-code tests with full TypeScript support and composable system
Meet Developers Where They Are – Works with existing workflows and bespoke loops

Quick Start

# Install the CLI
pipx install trainloop-cli

# Create a workspace
trainloop init

# Set your data path
export TRAINLOOP_DATA_FOLDER=/path/to/data

# Run evaluations
trainloop eval

# View results
trainloop studio

📚 Documentation

For comprehensive documentation, installation guides, tutorials, and API reference:

👉 evals.docs.trainloop.ai

👉 DeepWiki - lets you chat directly with this codebase rather than wading through documentation. It's the purest form of talking to your docs.

Quick Links

Getting Started - Installation and setup
Quick Start Guide - Complete walkthrough
SDK Guides - Python, TypeScript, and Go integration
CLI Reference - Complete command documentation
Contributing - Development setup and guidelines

Demo

Demo Repository: chat-ui-demo
Live Demo: evals.trainloop.ai

Support

GitHub Issues - Bug reports and feature requests
GitHub Discussions - Community support and questions
Documentation - Comprehensive guides and tutorials

License

MIT

Need help? Check out our comprehensive documentation or open an issue.

Name		Name	Last commit message	Last commit date
Latest commit History 126 Commits
.github/workflows		.github/workflows
cli		cli
docs		docs
examples		examples
images		images
infra		infra
registry		registry
releases		releases
runner		runner
scripts		scripts
sdk		sdk
tests		tests
ui		ui
.flake8		.flake8
.gitignore		.gitignore
.pylintrc		.pylintrc
AGENTS.md		AGENTS.md
CHANGELOG.md		CHANGELOG.md
CLAUDE.md		CLAUDE.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
Taskfile.yml		Taskfile.yml
VERSION		VERSION
VERSIONING.md		VERSIONING.md
package-lock.json		package-lock.json
package.json		package.json
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml
pytest.ini		pytest.ini

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

🤖📊 TrainLoop Evals

Core Principles

Quick Start

📚 Documentation

Quick Links

Demo

Support

License

About

Uh oh!

Releases

Packages

Languages

License

NilayYadav/evals

Folders and files

Latest commit

History

Repository files navigation

🤖📊 TrainLoop Evals

Core Principles

Quick Start

📚 Documentation

Quick Links

Demo

Support

License

About

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages