Stars
RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs
OpenUI lets you describe UI using your imagination, then see it rendered live.
This repository contains demos I made with the Transformers library by HuggingFace.
MoRA: High-Rank Updating for Parameter-Efficient Fine-Tuning
The easiest way to use Agentic RAG in any enterprise
llama3.cuda is a pure C/CUDA implementation of the Llama 3 model.
[EMNLP'23, ACL'24] To speed up LLM inference and enhance LLMs' perception of key information, compress the prompt and KV-Cache, achieving up to 20x compression with minimal performance loss.
A modern JavaScript library for handling Hangul characters.
A Korean-language tutorial based on the official LangChain documentation, cookbook, and other practical examples. This tutorial helps you learn how to use LangChain more easily and effectively.
The Universe of Evaluation. All about the evaluation for LLMs.
Convert documents to structured data effortlessly. Unstructured is an open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website …
AutoRAG: An Open-Source Framework for Retrieval-Augmented Generation (RAG) Evaluation & Optimization with AutoML-Style Automation
A PyTorch quantization backend for Optimum
Pretrain and finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.
Official PyTorch implementation of LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding (ACL 2022)
👶🏻 An encyclopedia of computer science fundamentals and technical interview questions for junior developers 📖
A Python module that wraps the pdftoppm utility to convert PDFs to PIL Image objects
Tools for merging pretrained large language models.
Sakura-SOLAR-DPO: Merge, SFT, and DPO
A library of data loaders for LLMs made by the community -- to be used with LlamaIndex and/or LangChain
🏆 A ranked list of awesome machine learning Python libraries. Updated weekly.
Official inference library for Mistral models
Official implementation of project Honeybee (CVPR 2024)
🥤🧑🏻🚀 Code and dataset for our EMNLP 2023 paper - "SODA: Million-scale Dialogue Distillation with Social Commonsense Contextualization"
An open source extension that connects AI agents to computational notebooks in JupyterLab.
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)