-
Harvard University
- Cambridge
- http://jasony.me
- @1a1a11a
Highlights
- Pro
Lists (2)
Sort Name ascending (A-Z)
Starred repositories
FastAPI framework, high performance, easy to learn, fast to code, ready for production
A high-throughput and memory-efficient inference and serving engine for LLMs
openpilot is an operating system for robotics. Currently, it upgrades the driver assistance system on 300+ supported cars.
LlamaIndex is the leading document agent and OCR platform
Making large AI models cheaper, faster and more accessible
Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthr…
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Opiniated RAG for integrating GenAI in your apps 🧠 Focus on your product rather than the RAG. Easy integration in existing products with customisation! Any LLM: GPT4, Groq, Llama. Any Vectorstore: …
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
A book-in-progress about the Linux kernel and its insides.
Python Fire is a library for automatically generating command line interfaces (CLIs) from absolutely any Python object.
Code for the paper "Language Models are Unsupervised Multitask Learners"
Fast and memory-efficient exact attention
Best Practices on Recommendation Systems
The ChatGPT Retrieval Plugin lets you easily find personal or work documents by asking questions in natural language.
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
Tool for producing high quality forecasts for time series data that has multiple seasonality with linear or non-linear growth.
Ongoing research training transformer models at scale
ChatGLM2-6B: An Open Bilingual Chat LLM | 开源双语对话语言模型
Proxy server to bypass Cloudflare protection
Freeze (package) Python programs into stand-alone executables
Modular visual interface for GDB in Python
Retrieval and Retrieval-augmented LLMs
Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform
YOLOv3 in PyTorch > ONNX > CoreML > TFLite
🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading
Run, manage, and scale AI workloads on any AI infrastructure. Use one system to access & manage all AI compute (Kubernetes, Slurm, 20+ clouds, on-prem).