Skip to content
View jnanliu's full-sized avatar
🎯
Focusing
🎯
Focusing
  • NWPU / BUAA / PJLab / Monash
  • Melbourne
  • 20:41 (UTC +10:00)
  • X @jnanliu

Highlights

  • Pro

Block or report jnanliu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends

Python 2,375 450 Updated Apr 13, 2026

Unsloth Studio is a web UI for training and running open models like Gemma 4, Qwen3.5, DeepSeek, gpt-oss locally.

Python 61,304 5,295 Updated Apr 13, 2026

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 25,724 5,318 Updated Apr 13, 2026

verl: Volcano Engine Reinforcement Learning for LLMs

Python 20,639 3,642 Updated Apr 13, 2026

An open-source AI agent that lives in your terminal.

TypeScript 22,960 2,132 Updated Apr 13, 2026

SearXNG is a free internet metasearch engine which aggregates results from various search services and databases. Users are neither tracked nor profiled.

Python 28,255 2,744 Updated Apr 13, 2026

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 76,374 15,514 Updated Apr 13, 2026

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.

Python 6,864 758 Updated Apr 13, 2026

AllenAI's post-training codebase

Python 3,683 534 Updated Apr 13, 2026

A Collection of Competitive Text-Based Games for Language Model Evaluation and Reinforcement Learning

Python 385 89 Updated Apr 13, 2026

⚡️SwanLab - an open-source, modern-design AI training tracking and visualization tool. Supports Cloud / Self-hosted use. Integrated with PyTorch / Transformers / verl / LLaMA Factory / ms-swift / U…

Python 3,802 199 Updated Apr 13, 2026

Democratizing Reinforcement Learning for LLMs

Python 5,419 539 Updated Apr 13, 2026

Our library for RL environments + evals

Python 4,001 531 Updated Apr 12, 2026

Arxiv个性化定制化模版,实现对特定领域的相关内容、作者与学术会议的有效跟进。

CSS 341 27 Updated Apr 12, 2026

An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & VLM & TIS & vLLM & Ray & Async RL)

Python 9,340 917 Updated Apr 12, 2026

阿里云盘命令行客户端,支持JavaScript插件,支持同步备份功能。

Go 4,994 386 Updated Apr 12, 2026

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 70,018 8,554 Updated Apr 12, 2026

A Clash Client For OpenWrt

HTML 25,272 3,850 Updated Apr 11, 2026

A metasearch library that aggregates results from diverse web search services

Python 2,444 238 Updated Apr 11, 2026
Python 21 Updated Apr 11, 2026

20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.

Python 13,297 1,415 Updated Apr 10, 2026

FinGPT: Open-Source Financial Large Language Models! Revolutionize 🔥 We release the trained model on HuggingFace.

Jupyter Notebook 19,062 2,683 Updated Apr 10, 2026

fastllm是后端无依赖的高性能大模型推理库。同时支持张量并行推理稠密模型和混合模式推理MOE模型,任意10G以上显卡即可推理满血DeepSeek。双路9004/9005服务器+单显卡部署DeepSeek满血满精度原版模型,单并发20tps;INT4量化模型单并发30tps,多并发可达60+。

C++ 4,189 418 Updated Apr 10, 2026

Public repository for Agent Skills

Python 116,302 13,348 Updated Apr 9, 2026

Embodied-Reasoner: Synergizing Visual Search, Reasoning, and Action for Embodied Interactive Tasks

Python 192 18 Updated Apr 9, 2026

slime is an LLM post-training framework for RL Scaling.

Python 5,264 713 Updated Apr 9, 2026

🤗 Evaluate: A library for easily evaluating machine learning models and datasets.

Python 2,438 313 Updated Apr 8, 2026

[EMNLP 2025 Demo] PDF scientific paper translation with preserved formats - 基于 AI 完整保留排版的 PDF 文档全文双语翻译,支持 Google/DeepL/Ollama/OpenAI 等服务,提供 CLI/GUI/MCP/Docker/Zotero

Python 33,028 2,980 Updated Apr 8, 2026

Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.

Python 3,160 234 Updated Apr 6, 2026

A compilation of the best multi-agent papers

TeX 1,365 120 Updated Apr 5, 2026
Next