Highlights
- Pro
Stars
Magnificent app which corrects your previous console command.
Python tool for converting files and office documents to Markdown.
Linux, Jenkins, AWS, SRE, Prometheus, Docker, Python, Ansible, Git, Kubernetes, Terraform, OpenStack, SQL, NoSQL, Azure, GCP, DNS, Elastic, Network, Virtualization. DevOps Interview Questions
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
CLI platform to experiment with codegen. Precursor to: https://lovable.dev
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek, Qwen, Llama, Gemma, TTS 2x faster with 70% less VRAM.
The definitive Web UI for local AI, with powerful features and easy setup.
Collection of Summer 2026 tech internships!
aider is AI pair programming in your terminal
A generative world for general-purpose robotics & embodied AI learning.
Official inference repo for FLUX.1 models
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
Open Source AI Platform - AI Chat with advanced features that works with every LLM
pix2tex: Using a ViT to convert images of equations into LaTeX code.
🚀 Level up your GitHub profile readme with customizable cards including LOC statistics!
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs
the first library to let you embed a developer agent in your own app!
Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation" (CVPR'25 Spotlight).
Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.
Count the number of people around you 👨👨👦 by monitoring wifi signals 📡
Game Agent Framework. Helping you create AIs / Bots that learn to play any game you own!
a state-of-the-art-level open visual language model | 多模态预训练模型
An interactive NVIDIA-GPU process viewer and beyond, the one-stop solution for GPU process management.
AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.