dqwang122

Danqing Wang dqwang122

76 followers · 17 following

CMU
Pennsylvania, USA

Achievements

x2 x2

Achievements

x2 x2

Stars

HQ1995 / vibe-security-radar

Python 95 7 Updated Apr 6, 2026

rllm-org / rllm

Democratizing Reinforcement Learning for LLMs

Python 5,618 578 Updated Jun 15, 2026

Hmbown / CodeWhale

Open-source, community-driven agent harness

Rust 38,389 3,303 Updated Jun 15, 2026

Tencent / AICGSecEval

A.S.E (AICGSecEval) is a repository-level AI-generated code security evaluation benchmark developed by Tencent Wukong Code Security Team.

Python 643 108 Updated May 25, 2026

TeleAI-UAGI / telemem

TeleMem is a high-performance drop-in replacement for Mem0, featuring semantic deduplication, long-term dialogue memory, and multimodal video reasoning.

Python 466 32 Updated Jun 12, 2026

huggingface / parler-tts

Inference and training library for high-quality TTS models.

Python 5,578 590 Updated Dec 10, 2024

openai / mle-bench

MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering

Python 1,585 253 Updated Apr 24, 2026

multi-swe-bench / multi-swe-bench

Multi-SWE-bench: A Multilingual Benchmark for Issue Resolving

Python 339 55 Updated Dec 18, 2025

langgptai / awesome-claude-prompts

This repo includes Claude prompt curation to use Claude better.

5,239 579 Updated Feb 28, 2026

ulab-uiuc / MARBLE

(ACL 2025 Main) Code for MultiAgentBench : Evaluating the Collaboration and Competition of LLM agents https://www.arxiv.org/pdf/2503.01935

Python 265 38 Updated Oct 27, 2025

sail-sg / understand-r1-zero

Understanding R1-Zero-Like Training: A Critical Perspective

Python 1,261 59 Updated Aug 27, 2025

Open-Reasoner-Zero / Open-Reasoner-Zero

Official Repo for Open-Reasoner-Zero

Python 2,097 120 Updated Jun 2, 2025

marlbenchmark / on-policy

This is the official implementation of Multi-Agent PPO (MAPPO).

Python 2,024 378 Updated Jul 18, 2024

areal-project / AReaL

The RL Bridge for LLM-based Agent Applications. Made Simple & Flexible.

Python 5,307 520 Updated Jun 15, 2026

ZhangYiqun018 / agent-for-debate

[ICASSP 2026] Agent4Debate is a dynamic multi-agent framework that leverages LLMs to achieve human-level performance in competitive debate by dynamically coordinating specialized agents to mitigate…

Python 38 6 Updated Jan 19, 2026