Skip to content
View sudanl's full-sized avatar
🟢
🟢

Organizations

@NLP2CT

Block or report sudanl

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Playwright is a framework for Web Testing and Automation. It allows testing Chromium, Firefox and WebKit with a single API.

TypeScript 80,436 4,935 Updated Dec 23, 2025
Python 39 3 Updated Aug 31, 2025

Official implementation for paper "How Far Are We from Genuinely Useful Deep Research Agents?"

Python 55 3 Updated Dec 10, 2025

Processed / Cleaned Data for Paper Copilot

Python 790 36 Updated Dec 4, 2025

A Systematic Survey of Deep Research

217 10 Updated Dec 23, 2025

ATLAS: A High-Difficulty, Multidisciplinary Benchmark for Frontier Scientific Reasoning

6 1 Updated Nov 20, 2025
Python 151 21 Updated Oct 29, 2025

Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…

Shell 48,155 3,378 Updated Dec 20, 2025

Python version of the Playwright testing and automation library.

Python 14,073 1,111 Updated Dec 9, 2025

🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation

Python 18,492 2,142 Updated Nov 24, 2025

Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, and more!

Python 8,066 643 Updated Dec 23, 2025

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python 17,713 1,357 Updated Dec 17, 2025

DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents

Python 521 60 Updated Nov 22, 2025

Awesome List for Agentic RL

HTML 645 26 Updated Dec 9, 2025

Deep Research Agent CognitiveKernel-Pro from Tencent AI Lab. Paper: https://arxiv.org/pdf/2508.00414

Python 475 46 Updated Oct 17, 2025

A simple yet powerful agent framework that delivers with open-source models

Python 3,998 395 Updated Dec 23, 2025

An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models

Python 2,519 184 Updated Dec 23, 2025

A Scientific Multimodal Foundation Model

621 31 Updated Sep 30, 2025

[TACL 2025] RepreGuard: Detecting LLM-Generated Text by Revealing Hidden Representation Patterns

Python 13 1 Updated Nov 2, 2025

Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning

Python 1,058 75 Updated Nov 25, 2025

τ²-Bench: Evaluating Conversational Agents in a Dual-Control Environment

Python 564 124 Updated Dec 18, 2025

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 19,461 1,999 Updated Nov 1, 2025

[EMNLP 2025] CompassVerifier: A Unified and Robust Verifier for LLMs Evaluation and Outcome Reward

Jupyter Notebook 59 2 Updated Aug 10, 2025

Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.

Python 3,260 255 Updated Dec 23, 2025

Public quant internship repository, maintained by NUFT but available for everyone.

OCaml 1,878 132 Updated Oct 19, 2025

[NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards

Python 1,283 106 Updated Dec 15, 2025

[Preprint 2025] Deciphering Trajectory-Aided LLM Reasoning: An Optimization Perspective

Jupyter Notebook 6 2 Updated May 27, 2025

A Survey of Reinforcement Learning for Large Reasoning Models

TeX 2,192 120 Updated Nov 9, 2025
Next