Stars
Semantic cache for LLMs. Fully integrated with LangChain and llama_index.
A sample app for the Retrieval-Augmented Generation pattern running in Azure, using Azure AI Search for retrieval and Azure OpenAI large language models to power ChatGPT-style and Q&A experiences.
Aim 💫 — An easy-to-use & supercharged open-source experiment tracker.
[EMNLP'23, ACL'24] Compresses the prompt and KV-Cache to speed up LLM inference and improve LLMs' perception of key information, achieving up to 20x compression with minimal performance loss.
Demo of a customer service use case implemented with the OpenAI Agents SDK
CodeGen is a family of open-source models for program synthesis. Trained on TPU-v4. Competitive with OpenAI Codex.
😎 A curated list of awesome MLOps tools
Fara-7B: An Efficient Agentic Model for Computer Use
Stream Framework is a Python library that lets you build news feeds, activity streams, and notification systems using Cassandra and/or Redis. The authors of Stream-Framework also provide a clo…
Aligning pretrained language models with instruction data generated by themselves.
ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)
A system for agentic LLM-powered data processing and ETL
A Python library to extract tabular data from PDFs
Quick illustration of how one can easily read books together with LLMs. It's great and I highly recommend it.
A curated list of awesome Discord communities for programmers
Scalable and efficient data transformation framework - backwards compatible with dbt.
Build, personalize and control your own LLMs. From data pre-processing to fine-tuning, xTuring provides an easy way to personalize open-source LLMs. Join our discord community: https://discord.gg/T…
Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends
CLI tool that uses Codex to turn natural language commands into their Bash/ZShell/PowerShell equivalents
Deploy an ML inference service on a budget in fewer than 10 lines of code.
Sharing the lessons we have been gathering along the way to enable Azure OpenAI at enterprise scale in a secure manner. GPT-RAG core is a Retrieval-Augmented Generation pattern running in Azure, using …
The simplest way to serve AI/ML models in production