saqadri

82 stars written in Python

Semantic cache for LLMs. Fully integrated with LangChain and llama_index.

Python 7,976 574 Updated Jul 11, 2025

A sample app for the Retrieval-Augmented Generation pattern running in Azure, using Azure AI Search for retrieval and Azure OpenAI large language models to power ChatGPT-style and Q&A experiences.

Python 7,619 5,393 Updated Apr 9, 2026

s1: Simple test-time scaling

Python 6,643 764 Updated Jun 25, 2025

Aim 💫 — An easy-to-use & supercharged open-source experiment tracker.

Python 6,076 384 Updated Apr 8, 2026

[EMNLP'23, ACL'24] Compresses the prompt and KV-Cache to speed up LLM inference and improve LLMs' perception of key information, achieving up to 20x compression with minimal performance loss.

Python 6,000 357 Updated Apr 8, 2026

Demo of a customer service use case implemented with the OpenAI Agents SDK

Python 5,956 920 Updated Dec 18, 2025

All things prompt engineering

Python 5,739 328 Updated Jun 4, 2024

AIOS: AI Agent Operating System

Python 5,480 752 Updated Jan 22, 2026

CodeGen is a family of open-source models for program synthesis. Trained on TPU-v4. Competitive with OpenAI Codex.

Python 5,173 418 Updated Oct 27, 2025

😎 A curated list of awesome MLOps tools

Python 5,088 709 Updated Mar 20, 2026

Ecommerce Search and Discovery - marqo.ai

Python 5,022 231 Updated Apr 9, 2026

Fara-7B: An Efficient Agentic Model for Computer Use

Python 4,783 444 Updated Apr 8, 2026

Stream Framework is a Python library that lets you build news feeds, activity streams, and notification systems using Cassandra and/or Redis. The authors of Stream-Framework also provide a clo…

Python 4,752 530 Updated Dec 4, 2025

Aligning pretrained language models with instruction data generated by themselves.

Python 4,587 523 Updated Mar 27, 2023

Zep | Examples, Integrations, & More

Python 4,392 601 Updated Apr 9, 2026

ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)

Python 3,824 469 Updated Oct 14, 2025

Minimal CLI coding agent by Mistral

Python 3,818 417 Updated Apr 3, 2026

A system for agentic LLM-powered data processing and ETL

Python 3,702 387 Updated Mar 27, 2026

A Python library to extract tabular data from PDFs

Python 3,663 534 Updated Apr 3, 2026

Quick illustration of how one can easily read books together with LLMs. It's great and I highly recommend it.

Python 3,480 443 Updated Nov 18, 2025

A curated list of awesome Discord communities for programmers

Python 3,423 211 Updated Nov 18, 2025

Scalable and efficient data transformation framework - backwards compatible with dbt.

Python 3,025 371 Updated Apr 8, 2026

Adaptive Experimentation Platform

Python 2,732 367 Updated Apr 9, 2026

Build, personalize and control your own LLMs. From data pre-processing to fine-tuning, xTuring provides an easy way to personalize open-source LLMs. Join our discord community: https://discord.gg/T…

Python 2,663 212 Updated Mar 4, 2026

Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends

Python 2,370 449 Updated Apr 7, 2026

CLI tool that uses Codex to turn natural language commands into their Bash/ZShell/PowerShell equivalents

Python 2,351 223 Updated Jan 3, 2024

Python 1,939 298 Updated Mar 30, 2026

Deploy an ML inference service on a budget in less than 10 lines of code.

Python 1,345 65 Updated Feb 12, 2024

Sharing the learnings we have been gathering along the way to enable Azure OpenAI at enterprise scale in a secure manner. GPT-RAG core is a Retrieval-Augmented Generation pattern running in Azure, using …

Python 1,140 293 Updated Apr 8, 2026

The simplest way to serve AI/ML models in production

Python 1,135 98 Updated Apr 9, 2026