Lists (1)
Sort Name ascending (A-Z)
Starred repositories
PiD: Fast and High-Resolution Latent Decoding with Pixel Diffusion
Your one-person Wall Street. An AI trading agent covering equities, crypto, commodities, forex, and macro — from research through position entry, ongoing management, to exit.
claude-red is a curated library of offensive security skills designed for the Claude skills system. Each skill is a structured SKILL.md file that primes Claude with expert-level methodology for a s…
[ICML 2026] ByteDance's All-in-One Video Generation Model for Human-Object Interaction Video Generation
AI that sees your screen, listens to your conversations and tells you what to do
Open-source, low-cost 10.5 GHz PLFM phased array RADAR system
[CVPR 2026 Highlight] Vanast: Virtual Try-On with Human Image Animation via Synthetic Triplet Supervision
turboquant-based compression engine for LLM KV cache
Semantic search over videos using Gemini Embedding 2 or Qwen3-VL.
CUDA Agent: Large-Scale Agentic RL for High-Performance CUDA Kernel Generation
Point at any URL/YouTube/Podcast or file. Get the gist. CLI and Chrome Extension.
WiFi-3D-Fusion is an open-source research project that leverages WiFi CSI signals and deep learning to estimate 3D human pose, fusing wireless sensing with computer vision techniques for next-gener…
Fully autonomous AI Agents system capable of performing complex penetration testing tasks
C inference for Qwen3-ASR 0.6b and 1.7b transcriptions models
A set of AI-enabled effects, generators, and analyzers for Audacity®.
A Python pickling decompiler and static analyzer
LLM Transparency Tool (LLM-TT), an open-source interactive toolkit for analyzing internal workings of Transformer-based language models. *Check out demo at* https://huggingface.co/spaces/facebook/l…
An Open Phone Agent Model & Framework. Unlocking the AI Phone for Everyone
A simple web application that can restream and synchonize IPTV streams using HLS & ffmpeg.
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…
"Paper2Slides: From Paper to Presentation in One Click"
Reflection Removal through Efficient Adaptation of Diffusion Transformers
A toolkit for reproducible evaluation, diagnostic, and error analysis of speaker diarization systems
A ComfyUI custom node integration for local multi-engine multi-language Text-to-Speech and Voice Conversion. Supports: RVC, Echo-TTS, Qwen3-TTS, Cozy Voice 3, Step Audio EditX, IndexTTS-2, Chatterb…
Automated Penetration Testing Agentic Framework Powered by Large Language Models
SONIC: Spectral Optimization of Noise for Inpainting with Consistency
Official repo for "GeoVista: Web-Augmented Agentic Visual Reasoning for Geolocalization"