Hi there 👋

We offer DeepSeek-R1-style fine-tuning as a service. You give us an agent and show us where it went wrong, and we improve it.

We provide a turnkey solution for customers who need to act on poorly performing agents: you give the agent feedback in our UI, and we take care of improving the model with reinforcement learning.
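As a rough illustration of how reviewer feedback could become a training signal for reinforcement learning, here is a minimal sketch. It assumes feedback arrives as a simple accept/reject verdict per response. The names `Feedback`, `reward`, and `mean_reward` are hypothetical and chosen for illustration; they are not taken from any published API.

```python
from dataclasses import dataclass
from typing import Sequence


@dataclass
class Feedback:
    """Hypothetical record of one reviewed agent response."""
    completion: str      # the text the agent produced
    accepted: bool       # assumed verdict: did the reviewer approve it?
    comment: str = ""    # optional free-text note from the reviewer


def reward(feedback: Feedback) -> float:
    """Map a reviewer verdict to a scalar reward of +1.0 or -1.0."""
    return 1.0 if feedback.accepted else -1.0


def mean_reward(batch: Sequence[Feedback]) -> float:
    """Average reward over a batch; 0.0 for an empty batch."""
    if not batch:
        return 0.0
    return sum(reward(f) for f in batch) / len(batch)
```

A reinforcement learning trainer would typically consume such scalar rewards per completion; a real reward function would likely be richer than a binary verdict, for example by scoring partial correctness.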

Imagine a startup developing a tool for law firms, enhanced with an AI to assist lawyers in selecting clients. Currently, this startup would face significant challenges in building such a tool, as it’s crucial to provide accurate and consistent responses every time. With our technology, the startup can enhance the initial model by incorporating its own experience and feedback from beta users, eventually surpassing the initial model and delivering reliable results on every query.

Previously this team built: hyveOS

Building a swarm OS for robots seemed like a good idea, and people liked it! [1] For a lot of reasons, this was not sustainable as a business, and at its core this product doesn't have a paying market at the moment. If you would like to use it, write to us; we might give it (and our thoughts) to you! If you would like to talk to us about it, write to us! It's possible that the time for this product just hasn't come (yet).

Footnotes

  1. https://news.ycombinator.com/item?id=42694384

Popular repositories

  1. unsloth (Public)

    Forked from unslothai/unsloth

    Finetune Llama 3.3, DeepSeek-R1 & Reasoning LLMs 2x faster with 70% less memory! 🦥

    Python 1

  2. .github (Public)

  3. trl-augento (Public)

    Forked from huggingface/trl

    Train transformer language models with reinforcement learning at augento.

    Python

  4. xgrammar (Public)

    Forked from mlc-ai/xgrammar

    Fast, Flexible and Portable Structured Generation

    C++

  5. reward-function-python (Public template)

    Dockerfile 2

  6. reward-function-node (Public template)

    Dockerfile
