Lists (2)
Sort Name ascending (A-Z)
Stars
percept everything and make the 'best' decision for you. Your second 'brain' 感知万物,做最适合你的决策,你的“第二大脑”
openaiotlab / CUHK-X
Forked from siyang-jiang/CUHK-X[MobiCom 2025@ANAI workshop, Best Presentation Award] A large-scale, multimodal dataset and benchmark for Human Action Recognition, Understanding and Reasoning
Build, evaluate, and integrate long-term memory for self-evolving agents.
Memory Sparse Attention - A scalable, end-to-end trainable latent-memory framework for 100M-token contexts.
Real-time AI assistant for Meta Ray-Ban smart glasses -- voice + vision + agentic actions via Gemini Live and OpenClaw
Open source audio recorder and transcriber for MacOS
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
🤖 An automated machine learning framework for audio, text, image, video, or .CSV files (50+ featurizers and 15+ model trainers). Python 3.6 required.
The first Large Audio Language Model that enables native in-depth thinking, which is trained on large-scale audio Chain-of-Thought data.
心理健康大模型 (LLM x Mental Health), Pre & Post-training & Dataset & Evaluation & Depoly & RAG, with InternLM / Qwen / Baichuan / DeepSeek / Mixtral / LLama / GLM series models
Memory engine and app that is extremely fast, scalable. The Memory API for the AI era.
Accepted as [NeurIPS 2024] Spotlight Presentation Paper
Agentic components of the Llama Stack APIs
AI that sees your screen, listens to your conversations and tells you what to do
《开源大模型食用指南》针对中国宝宝量身打造的基于Linux环境快速微调(全参数/Lora)、部署国内外开源大模型(LLM)/多模态大模型(MLLM)教程
ai副业赚钱大集合,教你如何利用ai做一些副业项目,赚取更多额外收益。The Ultimate Guide to Making Money with AI Side Hustles: Learn how to leverage AI for some cool side gigs and rake in some extra cash. Check out the English versi…
Simple d-vector based Speaker Recognition (verification and identification) using Pytorch
A generative speech model for daily dialogue.
A toolkit for fine-tuning, inferencing, and evaluating GreenBitAI's LLMs.
This is the official implementation of the paper Soar: Design and Deployment of A Smart Roadside Infrastructure System for Autonomous Driving. (MobiCom 2024))
Simulation, multi-path estimation, and CBR parsing code of SIGCOMM2023 BeamSense CBR-Sensing
SGLang is a high-performance serving framework for large language models and multimodal models.
An easy and fast way to create a Python GUI 🐍