siiRL: Shanghai Innovation Institute RL Framework for Advanced LLMs and Multi-Agent Systems

Python 225 20 Updated Nov 3, 2025
Python 14 4 Updated Jun 18, 2024

slime is an LLM post-training framework for RL Scaling.

Python 2,378 243 Updated Nov 5, 2025

Stay True

Python 20 Updated Aug 20, 2025

RewardBench: the first evaluation tool for reward models.

Python 649 89 Updated Jun 12, 2025

Toolkit to segment text into sentences or other semantic units in a robust, efficient and adaptable way.

Python 1,186 75 Updated Oct 8, 2025

Notes on knowledge and interview questions for large language model (LLM) algorithm (application) engineers.

HTML 10,705 1,092 Updated Apr 30, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 15,138 2,428 Updated Nov 5, 2025

Official implementation of "TimeBridge: Non-Stationarity Matters for Long-term Time Series Forecasting" (ICML 2025)

Python 179 9 Updated May 16, 2025

[NeurIPS 2025 Spotlight] TPA: Tensor ProducT ATTenTion Transformer (T6) (https://arxiv.org/abs/2501.06425)

Python 422 36 Updated Oct 23, 2025

Simple (slightly optimized) implementation of Tensor Product Attention from the T6 paper with a KV cache

Python 4 Updated Jan 23, 2025

[ICLR & NeurIPS 2025] Repository for Show-o series, One Single Transformer to Unify Multimodal Understanding and Generation.

Python 1,762 76 Updated Oct 22, 2025

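A curated collection of open-source Chinese large language models, focusing on smaller models that can be deployed privately and trained at lower cost, including base models, vertical-domain fine-tuning and applications, datasets, and tutorials.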

21,618 2,050 Updated May 19, 2025

Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷

Python 5,475 285 Updated Nov 5, 2025

User-friendly AI Interface (Supports Ollama, OpenAI API, ...)

JavaScript 114,338 15,939 Updated Nov 5, 2025

Build Conversational AI in minutes ⚡️

TypeScript 10,934 1,569 Updated Nov 4, 2025

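Official Docker installation packages, provided to work around being unable to install and use Docker because of network conditions in mainland China.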

2,678 630 Updated Jun 10, 2025

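A project for training a large language model from scratch, covering pre-training, fine-tuning, and direct preference optimization. The model has 1B parameters and supports Chinese and English.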

Python 662 90 Updated Feb 18, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 61,869 7,481 Updated Nov 5, 2025

pyKT: A Python Library to Benchmark Deep Learning based Knowledge Tracing Models

Python 316 98 Updated Sep 18, 2025

EDM 2025, Using Large Multimodal Models to Extract Knowledge Components for Knowledge Tracing from Multimedia Question Information

Python 6 2 Updated Oct 1, 2024

Past postgraduate entrance exam papers from various universities for computer science and software engineering programs.

9,187 1,564 Updated Feb 2, 2023

A programmer's guide to cooking at home (Simplified Chinese only).

Dockerfile 95,585 10,662 Updated Nov 4, 2025