Giantshaco

Septend Giantshaco

0 followers · 2 following

Lists (1)

私藏 (Private Collection)

1 repository

Stars

23 results for source starred repositories

sii-research / siiRL

siiRL: Shanghai Innovation Institute RL Framework for Advanced LLMs and Multi-Agent Systems

Python 225 20 Updated Nov 3, 2025

hhyqhh / LAEA

Python 14 4 Updated Jun 18, 2024

THUDM / slime

slime is an LLM post-training framework for RL Scaling.

Python 2,379 243 Updated Nov 5, 2025

allenai / reward-bench

RewardBench: the first evaluation tool for reward models.

Python 649 89 Updated Jun 12, 2025

segment-any-text / wtpsplit

Toolkit to segment text into sentences or other semantic units in a robust, efficient and adaptable way.

Python 1,186 75 Updated Oct 8, 2025

wdndev / llm_interview_note

Notes on the knowledge and interview questions relevant to large language model (LLM) algorithm (application) engineers.

HTML 10,705 1,093 Updated Apr 30, 2025

volcengine / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 15,141 2,428 Updated Nov 6, 2025

Hank0626 / TimeBridge

Official implementation of "TimeBridge: Non-Stationarity Matters for Long-term Time Series Forecasting" (ICML 2025)

Python 179 9 Updated May 16, 2025

tensorgi / TPA

[NeurIPS 2025 Spotlight] TPA: Tensor ProducT ATTenTion Transformer (T6) (https://arxiv.org/abs/2501.06425)

Python 422 36 Updated Oct 23, 2025

Greg-Tarr / tpa_pytorch

Simple (slightly optimized) implementation of Tensor Product Attention from the T6 paper with a KV cache

Python 4 Updated Jan 23, 2025

UncertaintyForKnowledgeTracing / UKT

Python 13 3 Updated Feb 24, 2025

showlab / Show-o

[ICLR & NeurIPS 2025] Repository for Show-o series, One Single Transformer to Unify Multimodal Understanding and Generation.

Python 1,763 76 Updated Oct 22, 2025

HqWu-HITCS / Awesome-Chinese-LLM

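A curated collection of open-source Chinese large language models, focusing on smaller models that can be deployed privately and are cheaper to train; it covers base models, domain-specific fine-tuning and applications, datasets, and tutorials.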

21,619 2,050 Updated May 19, 2025

datajuicer / data-juicer

Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷

Python 5,475 285 Updated Nov 5, 2025

open-webui / open-webui

User-friendly AI Interface (Supports Ollama, OpenAI API, ...)

JavaScript 114,345 15,939 Updated Nov 5, 2025

Chainlit / chainlit

Build Conversational AI in minutes ⚡️

TypeScript 10,935 1,569 Updated Nov 4, 2025

tech-shrimp / docker_installer

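Official Docker installation packages, provided for users who cannot install or use Docker because of network conditions in mainland China.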

2,678 630 Updated Jun 10, 2025

qiufengqijun / mini_qwen

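A project for training a large language model from scratch, covering pre-training, fine-tuning, and direct preference optimization; the model has 1B parameters and supports Chinese and English.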

Python 662 90 Updated Feb 18, 2025

hiyouga / LLaMA-Factory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 61,877 7,481 Updated Nov 5, 2025

pykt-team / pykt-toolkit

pyKT: A Python Library to Benchmark Deep Learning based Knowledge Tracing Models

Python 316 98 Updated Sep 18, 2025

DoniMoon / LLMKT

EDM 2025, Using Large Multimodal Models to Extract Knowledge Components for Knowledge Tracing from Multimedia Question Information

Python 6 2 Updated Oct 1, 2024

csseky / cskaoyan

Past postgraduate entrance exam papers from various universities for computer science and software engineering programs.

9,187 1,564 Updated Feb 2, 2023

Anduin2017 / HowToCook

A programmer's guide to cooking at home (Simplified Chinese only).

Dockerfile 95,586 10,662 Updated Nov 4, 2025
