Starred repositories
"Vibe-Trading: Your Personal Trading Agent"
Open-source super AI assistant & Agent Harness. Plans tasks, runs tools and skills, self-evolves with memory and knowledge. Multi-model, multi-channel. Lightweight, extensible, one-line install. (f…
Web-based game library frontend for EmulatorJS cores
Complete setup guide for a 2-node NVIDIA DGX Spark cluster — distributed training, CUDA inference with EXO, NCCL tuning for Grace Blackwell, NVMe-TCP shared storage, and 200 Gb/s direct fabric netw…
Exo distributed inference with NVIDIA CUDA support via tinygrad
Automated LNMP stack builder for Ubuntu — compile-from-source Nginx, MySQL/MariaDB, PHP with auto-tuning
Automation-ready anti-detect browser workspace with isolated profiles, REST API, MCP, Docker, and Playwright control.
SSH workspace, SFTP, and terminals in one
Give your AI agent eyes to see the entire internet. Read & search Twitter, Reddit, YouTube, GitHub, Bilibili, XiaoHongShu — one CLI, zero API fees.
Reverse engineered Windows Copilot into an OpenAI-compatible API. Access GPT-4 and GPT-5 models through a simple REST interface without API keys or billing.
Unlimited FREE AI coding. Connect Claude Code, Codex, Cursor, Cline, Copilot, Antigravity to FREE Claude/GPT/Gemini via 40+ providers. Auto-fallback, RTK -40% tokens, never hit limits.
Compress tool outputs, logs, files, and RAG chunks before they reach the LLM. 60-95% fewer tokens, same answers. Library, proxy, MCP server.
World's first open-source, agentic video production system. 12 pipelines, 52 tools, 500+ agent skills. Turn your AI coding assistant into a full video production studio.
Open-source live-chat, email support, omni-channel desk. An alternative to Intercom, Zendesk, Salesforce Service Cloud etc. 🔥💬
Makes your AI agent think like the laziest senior dev in the room. The best code is the code you never wrote.
An interactive TLS-capable intercepting HTTP proxy for penetration testers and software developers.
TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…
CUDA-compatible fork: fixes non-F32 quantized type support in concat op for NVIDIA GPUs
Bash's powerful command line editing in cmd.exe
Systematic benchmark study of DeepSeek-V4-Flash inference on 4× NVIDIA RTX PRO 6000 Blackwell (TP=4, FP8 KV, MTP=2, 1M context). Sustained decode matrix + Estonia long-context profile.
The original sources of MS-DOS 1.25, 2.0, and 4.0 for reference purposes
JT-PROXENSE: Sci-fi cyberpunk real-time monitoring for Proxmox VE clusters, guests, and Ceph storage. / JT-PROXENSE:為 Proxmox VE 打造的科幻 Cyberpunk 風格即時監控,支援叢集、客體機與 Ceph 儲存視覺化。
A self-hosted, integration-focused IPAM, independently developed with an operation flow familiar to phpIPAM users, deeply integrated with multiple DNS servers, LibreNMS, OPNsense, Proxmox VE, Wazuh…
A cost-efficient toolkit for helping company deploy Large Language Models with memory offload to DRAM or SSD.
Your Personal AI super intelligence. Private, Simple and extremely powerful.
Find the local LLM that actually runs and performs best on your hardware. Ranked by real, recency-aware benchmarks, not parameter count. One command, run it instantly.