jlidw
  • The Hong Kong University of Science and Technology
  • Hong Kong SAR, China


LLM-powered stock analysis system for A/H/US markets: multi-source market data + real-time news + LLM decision dashboard + multi-channel push notifications, with zero-cost scheduled runs, entirely free to use.

Python 43,114 40,781 Updated Jun 18, 2026

A live reading list for LLM data synthesis (Updated to July, 2025).

483 39 Updated Apr 9, 2026

Segment Policy Optimization: Effective Segment-Level Credit Assignment in RL for Large Language Models

Python 51 7 Updated Sep 19, 2025

The agent that grows with you

Python 196,948 34,765 Updated Jun 18, 2026

The best agent harness.

TypeScript 10,646 592 Updated Jun 18, 2026

A benchmark for LLMs on complicated tasks in the terminal

Python 2,369 544 Updated Jan 22, 2026

KIRA

Python 897 108 Updated May 29, 2026

PinchBench is a benchmarking system for evaluating LLM models as OpenClaw coding agents. Made with 🦀 by the humans at https://kilo.ai

Python 1,240 140 Updated Jun 2, 2026

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 379,390 79,418 Updated Jun 18, 2026

SkillsBench evaluates how well skills work and how effective agents are at using them.

PDDL 1,373 317 Updated Jun 18, 2026

Harbor is a framework for running agent evaluations and creating and using RL environments.

Python 2,541 1,176 Updated Jun 18, 2026

Claw-Eval is a harness for evaluating LLMs as agents. All tasks are verified by humans.

Python 674 59 Updated May 17, 2026

Use Claude Code as the foundation for coding infrastructure, allowing you to decide how to interact with the model while enjoying updates from Anthropic.

TypeScript 35,120 2,880 Updated Jun 18, 2026

Multi-Agent Harness for Production AI

Python 10,561 5,655 Updated May 29, 2026

MCPMark is a comprehensive, stress-testing MCP benchmark designed to evaluate model and agent capabilities in real-world MCP use.

Python 429 37 Updated Jun 12, 2026

Salesforce Enterprise Deep Research

Python 1,182 189 Updated Jun 2, 2026

MCP-Universe is a comprehensive framework designed for RL training, benchmarking, and developing AI agents for general tool-use.

Python 591 85 Updated Jun 2, 2026

A Survey of Reinforcement Learning for Large Reasoning Models

TeX 2,466 131 Updated Nov 9, 2025

MCP-Bench: Benchmarking Tool-Using LLM Agents with Complex Real-World Tasks via MCP Servers

Python 486 60 Updated Oct 7, 2025

A Chinese-language financial trading framework based on multi-agent LLMs - the enhanced Chinese edition of TradingAgents

Python 28,665 6,067 Updated Apr 20, 2026

The evaluation benchmark on MCP servers

Python 247 16 Updated Sep 3, 2025
Python 2 Updated Nov 3, 2025

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python 19,466 1,490 Updated Feb 27, 2026

Eigent: The Open Source Cowork Desktop to Unlock Your Exceptional Productivity. Local and Free Alternative to Claude Cowork.

TypeScript 14,323 1,691 Updated Jun 18, 2026

A collection of MCP servers.

89,421 11,790 Updated Jun 18, 2026

An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & VLM & TIS & vLLM & Ray & Async RL)

Python 9,655 971 Updated Jun 17, 2026

verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework

Python 22,032 4,095 Updated Jun 18, 2026

Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.

Python 16,577 1,648 Updated Mar 4, 2026

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 72,281 8,846 Updated Jun 17, 2026

Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.

Python 53,911 7,545 Updated Jun 18, 2026