Stars
Scalene: a high-performance, high-precision CPU, GPU, and memory profiler for Python with AI-powered optimization proposals
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
Simple, unified interface to multiple Generative AI providers
Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)
The official GitHub page for the survey paper "A Survey of Large Language Models".
HunyuanVideo: A Systematic Framework For Large Video Generation Model
gunicorn 'Green Unicorn' is a WSGI HTTP Server for UNIX, fast clients and sleepy applications.
Modin: Scale your Pandas workflows by changing a single line of code
Hackable and optimized Transformers building blocks, supporting a composable construction.
🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading
Manipulate audio with a simple and easy high level interface
The leading native Python SSHv2 protocol library.
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.
Run, manage, and scale AI workloads on any AI infrastructure. Use one system to access & manage all AI compute (Kubernetes, 17+ clouds, or on-prem).
Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activelo…
An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)
An advanced memory forensics framework
Accessible large language models via k-bit quantization for PyTorch.
Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
This repository contains the official implementation of "FastVLM: Efficient Vision Encoding for Vision Language Models" - CVPR 2025
Open-source observability for your GenAI or LLM application, based on OpenTelemetry
arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper to submit to arXiv
📚 Parameterize, execute, and analyze notebooks
The main repo for NLWeb, implemented in Python.
A concise but complete full-attention transformer with a set of promising experimental features from various papers
Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷