-
RLinf Public
Forked from RLinf/RLinfRLinf is a flexible and scalable open-source infrastructure designed for post-training foundation models (LLMs, VLMs, VLAs) via reinforcement learning.
Python Apache License 2.0 UpdatedSep 2, 2025 -
ARSQ Public
Official implementation of the paper "Learning from Suboptimal Data in Continuous Control via Auto-Regressive Soft Q-Network"
-
on-policy Public
Forked from marlbenchmark/on-policyThis is the official implementation of Multi-Agent PPO (MAPPO).
Python MIT License UpdatedNov 4, 2024 -
chatgpt-web Public
Forked from Niek/chatgpt-webChatGPT web interface using the OpenAI API
Svelte GNU General Public License v3.0 UpdatedApr 24, 2024 -
UHUB Public
通用外设 Lua 运行时 An Universal Lua Runtime for Peripherals
-
-
text-generation-webui Public
Forked from oobabooga/textgenA Gradio web UI for Large Language Models. Supports transformers, GPTQ, llama.cpp (ggml/gguf), Llama models.
Python GNU Affero General Public License v3.0 UpdatedSep 6, 2023 -
-
PersonalFirewall Public
A Windows NDIS Filter Driver based toy firewall
-
APEA Public
A Low-latency AI-Assisted Automation Tool implementated with TensorRT and DXGI API
-
TJCS-Course Public
Forked from DTennant/TJCS-Course💡 同济大学计算机科学与技术、信息安全专业课程资源共享仓库。含部分科目介绍、报告模板、实验工具等内容。期待更多课程加入……
-
EQPass Public
A web tool for personalized headphone EQ adjusting