Giantshaco

Septend Giantshaco

0 followers · 2 following

Lists (1)

Sort

私藏

1 repository

Stars

open-webui / open-webui

User-friendly AI Interface (Supports Ollama, OpenAI API, ...)

JavaScript 114,411 15,950 Updated Nov 6, 2025

Anduin2017 / HowToCook

程序员在家做饭方法指南。Programmer's guide about how to cook at home (Simplified Chinese only).

Dockerfile 95,606 10,669 Updated Nov 4, 2025

hiyouga / LLaMA-Factory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 61,954 7,494 Updated Nov 6, 2025

HqWu-HITCS / Awesome-Chinese-LLM

整理开源的中文大语言模型，以规模较小、可私有化部署、训练成本较低的模型为主，包括底座模型，垂直领域微调及应用，数据集与教程等。

21,635 2,050 Updated May 19, 2025

volcengine / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 15,174 2,438 Updated Nov 6, 2025

Chainlit / chainlit

Build Conversational AI in minutes ⚡️

TypeScript 10,938 1,569 Updated Nov 4, 2025

wdndev / llm_interview_note

主要记录大语言大模型（LLMs）算法（应用）工程师相关的知识及面试题

HTML 10,714 1,094 Updated Apr 30, 2025

csseky / cskaoyan

提供计算机考研和软件工程考研专业的各个学校考研真题

9,188 1,564 Updated Feb 2, 2023

datajuicer / data-juicer

Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷

Python 5,478 286 Updated Nov 6, 2025

tech-shrimp / docker_installer

Docker官方安装包，用来解决因国内网络无法安装使用Docker的问题

2,680 630 Updated Jun 10, 2025

THUDM / slime

slime is an LLM post-training framework for RL Scaling.

Python 2,390 244 Updated Nov 6, 2025

showlab / Show-o

[ICLR & NeurIPS 2025] Repository for Show-o series, One Single Transformer to Unify Multimodal Understanding and Generation.

Python 1,763 76 Updated Oct 22, 2025

segment-any-text / wtpsplit

Toolkit to segment text into sentences or other semantic units in a robust, efficient and adaptable way.

Python 1,186 75 Updated Oct 8, 2025

qiufengqijun / mini_qwen

这是一个从头训练大语言模型的项目，包括预训练、微调和直接偏好优化，模型拥有1B参数，支持中英文。

Python 664 90 Updated Feb 18, 2025

allenai / reward-bench

RewardBench: the first evaluation tool for reward models.

Python 649 89 Updated Jun 12, 2025

tensorgi / TPA

[NeurIPS 2025 Spotlight] TPA: Tensor ProducT ATTenTion Transformer (T6) (https://arxiv.org/abs/2501.06425)

Python 422 36 Updated Oct 23, 2025

pykt-team / pykt-toolkit

pyKT: A Python Library to Benchmark Deep Learning based Knowledge Tracing Models

Python 316 98 Updated Sep 18, 2025

sii-research / siiRL

siiRL: Shanghai Innovation Institute RL Framework for Advanced LLMs and Multi-Agent Systems

Python 225 20 Updated Nov 3, 2025

Hank0626 / TimeBridge

Official implementation of "TimeBridge: Non-Stationarity Matters for Long-term Time Series Forecasting" (ICML 2025)

Python 179 9 Updated May 16, 2025

sii-research / elmes

Forked from Mars160/elmes

Stay True

Python 20 Updated Aug 20, 2025

hhyqhh / LAEA

Python 14 4 Updated Jun 18, 2024

UncertaintyForKnowledgeTracing / UKT

Python 13 3 Updated Feb 24, 2025

DoniMoon / LLMKT

EDM 2025, Using Large Multimodal Models to Extract Knowledge Components for Knowledge Tracing from Multimedia Question Information

Python 6 2 Updated Oct 1, 2024

Greg-Tarr / tpa_pytorch

Simple (slightly optimized) implementation of Tensor Product Attention from the T6 paper with a KV cache

Python 4 Updated Jan 23, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly