Highlights
- Pro
Stars
Filtered - PCMCI (F-PCMCI) causal discovery algorithm. Extension of the PCMCI causal discovery algorithm augmented with a feature selection method.
Library for sequence-to-sequence numeric prediction, applicable to any tokenizable input, and allows pretraining and fine-tuning over multiple tasks.
Github repository for paper: "ToolPRMBench: Evaluating and Advancing Process Reward Models for Tool-using Agents"
The official implementation of "EnvScaler: Scaling Tool-Interactive Environments for LLM Agent via Programmatic Synthesis".
Temporal Causal Discovery Framework (PyTorch): discovering causal relationships between time series
A general purpose scientific writer
Turn Claude Code into a full game dev studio — 49 AI agents, 72 workflow skills, and a complete coordination system mirroring real studio hierarchy.
Academic Research Skills for Claude Code: research → write → review → revise → finalize
A powerful Telegram bot that provides remote access to Claude Code, enabling developers to interact with their projects from anywhere with full AI assistance and session persistence.
The official Python client library for the Massive.com REST and WebSocket API.
A Super AI Lab with massive AI Doctors as Assistants. Best IDE for Research via AI Power.
Persistent file-based planning for AI coding agents and long-running agentic tasks. Crash-proof markdown plans that survive context loss and /clear, plus a deterministic completion gate and multi-a…
Browser automation CLI for AI agents
Fully autonomous & self-evolving research from idea to paper. Chat an Idea. Get a Paper. 🦞
AI agents running research on single-GPU nanochat training automatically
The official code of "Adversarial Counterfactual Environment Model Learning" (NeurIPS'23 spotlight)
Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL
Official code for ACL2025 "🔍 Retrieval Models Aren’t Tool-Savvy: Benchmarking Tool Retrieval for Large Language Models"
[ACL2024] Planning, Creation, Usage: Benchmarking LLMs for Comprehensive Tool Utilization in Real-World Complex Scenarios
The official code repo of paper "Chasing the Tail: Effective Rubric-based Reward Modeling for Large Language Model Post-Training"
Code accompanying the paper "Empirical analysis of model selection for heterogeneous causal effect estimation"