- Shanghai Jiao Tong University (SJTU)
- Shanghai, China
- https://www.linkedin.com/in/tsingz/
Stars
[ACL 2025] Official repo for BOOKWORLD: From Novels to Interactive Agent Societies for Story Creation
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3, Qwen3-MoE, DeepSeek-R1, GLM4.5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, …
No fortress, purely open ground. OpenManus is Coming.
Open source alternative to AWS. Elastic compute, block storage (non replicated), firewall and load balancer, managed Postgres, K8s, AI inference, and IAM services.
Trae Agent is an LLM-based agent for general purpose software engineering tasks.
Awesome curated collection of images and prompts generated by gemini-2.5-flash-image (aka Nano Banana) state-of-the-art image generation and editing model. Explore AI generated visuals created with…
👩🏿💻👨🏾💻👩🏼💻👨🏽💻👩🏻💻 A list of projects by independent developers in China -- sharing what everyone is working on
Research code artifacts for Code World Model (CWM) including inference tools, reproducibility, and documentation.
verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for the paper "Group-in-Group Policy Optimization for LLM Agent Training"
Dataset Cartography: Mapping and Diagnosing Datasets with Training Dynamics
gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI
An Open-source RL System from ByteDance Seed and Tsinghua AIR
verl: Volcano Engine Reinforcement Learning for LLMs
aider is AI pair programming in your terminal
Industrial-level evaluation benchmarks for coding LLMs across the full life cycle of AI-native software development. An enterprise-grade evaluation system for code LLMs, with more being opened up over time.
Official repository for the paper "LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code"
[ICLR'25] BigCodeBench: Benchmarking Code Generation Towards AGI
Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…
Rigorous evaluation of LLM-synthesized code - NeurIPS 2023 & COLM 2024
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, etc.) over 100+ datasets.
FULL Augment Code, Claude Code, Cluely, CodeBuddy, Comet, Cursor, Devin AI, Junie, Kiro, Leap.new, Lovable, Manus, NotionAI, Orchids.app, Perplexity, Poke, Qoder, Replit, Same.dev, Trae, Traycer AI…
DeepSeek Coder: Let the Code Write Itself
Train transformer language models with reinforcement learning.
Seed-Coder is a family of lightweight open-source code LLMs comprising base, instruct and reasoning models, developed by ByteDance Seed.
DOMAINEVAL is an auto-constructed benchmark for multi-domain code generation that consists of 2k+ subjects (i.e., description, reference code and tests) covering six domains (i.e., Computation, Bas…