Lists (2)
Sort Name ascending (A-Z)
Stars
A unified library of state-of-the-art model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment…
Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your permission every step of the way.
A high-throughput and memory-efficient inference and serving engine for LLMs
Qwen Code is a coding agent that lives in the digital world.
Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models.
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Financial data platform for analysts, quants and AI agents.
Lean 4 programming language and theorem prover
LlamaIndex is the leading framework for building LLM-powered agents over your data.
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.
🍒 Cherry Studio is a desktop client that supports for multiple LLM providers.
For optimization algorithm research and development.
Democratizing Reinforcement Learning for LLMs
Optimizing inference proxy for LLMs
SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2024]
The OWASP Mobile Application Security Testing Guide (MASTG) is a comprehensive manual for mobile app security testing and reverse engineering. It describes technical processes for verifying the OWA…
Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚
Tools for merging pretrained large language models.
Documentation and source code powering Twitter's Community Notes
OCR, layout analysis, reading order, table recognition in 90+ languages
LLMs as Copilots for Theorem Proving in Lean
Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.