Starred repositories
Real-time face swap and one-click video deepfake with only a single image
A high-throughput and memory-efficient inference and serving engine for LLMs
🤗 smolagents: a barebones library for agents that think in code.
DeepSeek Coder: Let the Code Write Itself
MiniCPM-V 4.5: A GPT-4o Level MLLM for Single Image, Multi Image and High-FPS Video Understanding on Your Phone
A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python
Automated nginx proxy for Docker containers using docker-gen
Janus-Series: Unified Multimodal Understanding and Generation Models
Under 1KB each! Super Tiny Icons are minuscule SVG versions of your favourite website and app logos
Train your AI self, amplify you, bridge the world
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
Run any open-source LLM, such as DeepSeek or Llama, as an OpenAI-compatible API endpoint in the cloud.
Context engineering is the new vibe coding - it's the way to actually make AI coding assistants work. Claude Code is the best for this so that's what this repo is centered around, but you can apply…
Open-source infrastructure for Computer-Use Agents. Sandboxes, SDKs, and benchmarks to train and evaluate AI agents that can control full desktops (macOS, Linux, Windows).
An open source multi-tool for exploring and publishing data
Trae Agent is an LLM-based agent for general-purpose software engineering tasks.
Pocket Flow: 100-line LLM framework. Let Agents build Agents!
This repository contains the official implementation of "FastVLM: Efficient Vision Encoding for Vision Language Models" - CVPR 2025
The tiniest PaaS you've ever seen. Piku allows you to do git push deployments to your own servers.
🎮 ⌨ An easy-to-use tool to change the behaviour of your input devices.
📚 A curated list of Awesome LLM/VLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc. 🎉
Generate audiobooks from EPUBs, PDFs and text with synchronized captions.
The cryptography-based networking stack for building unstoppable networks with LoRa, Packet Radio, WiFi and everything in between.
Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"
【TMM 2025🔥】 Mixture-of-Experts for Large Vision-Language Models
Python CLI utility and library for manipulating SQLite databases