Stars
🎨 NeMo Data Designer: A general library for generating high-quality synthetic data from scratch or based on seed data.
🚀 Lightweight Python library for building production LLM applications with smart context management and automatic token optimization. Save 10-20% on API costs while fitting RAG docs, chat history, …
Open, Multi-modal Catalog for Data & AI
Omnilingual ASR Open-Source Multilingual SpeechRecognition for 1600+ Languages
🤗 smolagents: a barebones library for agents that think in code.
A website where you can compare every AI Model ✨
A visual playground for agentic workflows: Iterate over your agents 10x faster
Easy token price estimates for 400+ LLMs. TokenOps.
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
🌐 Make websites accessible for AI agents. Automate tasks online with ease.
A collection of projects designed to help developers quickly get started with building deployable applications using the Claude API
Open-source graph database, tuned for dynamic analytics environments. Easy to adopt, scale and own.
Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.
Official Github repository for the SIGCOMM '24 paper "Accelerating Model Training in Multi-cluster Environments with Consumer-grade GPUs"
A streamlit component to embed video and music players from various websites.
Refacer: One-Click Deepfake Multi-Face Swap Tool
IA3방식으로 KoAlpaca를 fine tuning한 한국어 LLM모델
VectorHub is a free, open-source learning website for people (software developers to senior ML architects) interested in adding vector retrieval to their ML stack.
Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.
Convert PDF to HTML without losing text or format.
Practical course about Large Language Models.
LangChain 공식 Document, Cookbook, 그 밖의 실용 예제를 바탕으로 작성한 한국어 튜토리얼입니다. 본 튜토리얼을 통해 LangChain을 더 쉽고 효과적으로 사용하는 방법을 배울 수 있습니다.