Skip to content
View TsingZ0's full-sized avatar
📌
writing papers
📌
writing papers

Block or report TsingZ0

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[ACL 2025] Official repo for BOOKWORLD: From Novels to Interactive Agent Societies for Story Creation

Python 144 23 Updated Dec 15, 2025
Python 1,368 120 Updated Sep 12, 2025

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3, Qwen3-MoE, DeepSeek-R1, GLM4.5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, …

Python 11,760 1,071 Updated Dec 21, 2025

No fortress, purely open ground. OpenManus is Coming.

Python 51,389 8,965 Updated Nov 17, 2025

Open source alternative to AWS. Elastic compute, block storage (non replicated), firewall and load balancer, managed Postgres, K8s, AI inference, and IAM services.

Ruby 11,607 529 Updated Dec 21, 2025

Trae Agent is an LLM-based agent for general purpose software engineering tasks.

Python 10,299 1,082 Updated Sep 24, 2025

Awesome curated collection of images and prompts generated by gemini-2.5-flash-image (aka Nano Banana) state-of-the-art image generation and editing model. Explore AI generated visuals created with…

JavaScript 8,164 832 Updated Sep 8, 2025

👩🏿‍💻👨🏾‍💻👩🏼‍💻👨🏽‍💻👩🏻‍💻中国独立开发者项目列表 -- 分享大家都在做什么

45,989 3,897 Updated Dec 21, 2025

Research code artifacts for Code World Model (CWM) including inference tools, reproducibility, and documentation.

Python 772 61 Updated Sep 24, 2025

verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in-Group Policy Optimization for LLM Agent Training"

Python 1,297 116 Updated Dec 11, 2025

Dataset Cartography: Mapping and Diagnosing Datasets with Training Dynamics

Jupyter Notebook 214 66 Updated Jul 19, 2022

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 19,443 1,995 Updated Nov 1, 2025

An Open-source RL System from ByteDance Seed and Tsinghua AIR

Python 1,678 76 Updated May 11, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 17,662 2,860 Updated Dec 21, 2025

aider is AI pair programming in your terminal

Python 39,099 3,755 Updated Dec 18, 2025

Industrial-level evaluation benchmarks for Coding LLMs in the full life-cycle of AI native software developing.企业级代码大模型评测体系,持续开放中

Python 103 16 Updated Apr 28, 2025

Official repository for the paper "LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code"

Python 746 153 Updated Jul 16, 2025

[ICLR'25] BigCodeBench: Benchmarking Code Generation Towards AGI

Python 460 61 Updated Oct 15, 2025

Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…

Shell 47,593 3,334 Updated Dec 20, 2025

Rigourous evaluation of LLM-synthesized code - NeurIPS 2023 & COLM 2024

Python 1,658 186 Updated Oct 2, 2025

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.

Python 6,443 705 Updated Dec 17, 2025

FULL Augment Code, Claude Code, Cluely, CodeBuddy, Comet, Cursor, Devin AI, Junie, Kiro, Leap.new, Lovable, Manus, NotionAI, Orchids.app, Perplexity, Poke, Qoder, Replit, Same.dev, Trae, Traycer AI…

101,324 27,027 Updated Dec 19, 2025

DeepSeek Coder: Let the Code Write Itself

Python 22,524 2,687 Updated Nov 11, 2025

Train transformer language models with reinforcement learning.

Python 16,726 2,371 Updated Dec 20, 2025

Seed-Coder is a family of lightweight open-source code LLMs comprising base, instruct and reasoning models, developed by ByteDance Seed.

715 54 Updated Jun 6, 2025

DOMAINEVAL is an auto-constructed benchmark for multi-domain code generation that consists of 2k+ subjects (i.e., description, reference code and tests) covering six domains (i.e., Computation, Bas…

Python 14 3 Updated Dec 12, 2024

Multilingual Structured CoT

2 Updated Sep 1, 2025
Next