Giantshaco

Septend Giantshaco

0 followers · 2 following

Lists (1)

Sort

私藏

1 repository

Stars

Chainlit / chainlit

Build Conversational AI in minutes ⚡️

TypeScript 10,954 1,570 Updated Nov 8, 2025

open-webui / open-webui

User-friendly AI Interface (Supports Ollama, OpenAI API, ...)

JavaScript 114,615 15,981 Updated Nov 8, 2025

datajuicer / data-juicer

Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷

Python 5,488 287 Updated Nov 7, 2025

volcengine / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 15,239 2,448 Updated Nov 7, 2025

THUDM / slime

slime is an LLM post-training framework for RL Scaling.

Python 2,411 245 Updated Nov 7, 2025

hiyouga / LLaMA-Factory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 62,088 7,513 Updated Nov 6, 2025

Anduin2017 / HowToCook

程序员在家做饭方法指南。Programmer's guide about how to cook at home (Simplified Chinese only).

Dockerfile 95,648 10,673 Updated Nov 4, 2025

sii-research / siiRL

siiRL: Shanghai Innovation Institute RL Framework for Advanced LLMs and Multi-Agent Systems

Python 225 20 Updated Nov 3, 2025

tensorgi / TPA

[NeurIPS 2025 Spotlight] TPA: Tensor ProducT ATTenTion Transformer (T6) (https://arxiv.org/abs/2501.06425)

Python 423 36 Updated Oct 23, 2025

showlab / Show-o

[ICLR & NeurIPS 2025] Repository for Show-o series, One Single Transformer to Unify Multimodal Understanding and Generation.

Python 1,765 76 Updated Oct 22, 2025

segment-any-text / wtpsplit

Toolkit to segment text into sentences or other semantic units in a robust, efficient and adaptable way.

Python 1,186 75 Updated Oct 8, 2025

pykt-team / pykt-toolkit

pyKT: A Python Library to Benchmark Deep Learning based Knowledge Tracing Models

Python 317 98 Updated Sep 18, 2025

sii-research / elmes

Forked from Mars160/elmes

Stay True

Python 20 Updated Aug 20, 2025

allenai / reward-bench

RewardBench: the first evaluation tool for reward models.

Python 649 89 Updated Jun 12, 2025

tech-shrimp / docker_installer

Docker官方安装包，用来解决因国内网络无法安装使用Docker的问题

2,689 632 Updated Jun 10, 2025

HqWu-HITCS / Awesome-Chinese-LLM

整理开源的中文大语言模型，以规模较小、可私有化部署、训练成本较低的模型为主，包括底座模型，垂直领域微调及应用，数据集与教程等。

21,651 2,056 Updated May 19, 2025

Hank0626 / TimeBridge

Official implementation of "TimeBridge: Non-Stationarity Matters for Long-term Time Series Forecasting" (ICML 2025)

Python 179 9 Updated May 16, 2025

wdndev / llm_interview_note

主要记录大语言大模型（LLMs）算法（应用）工程师相关的知识及面试题

HTML 10,753 1,096 Updated Apr 30, 2025

UncertaintyForKnowledgeTracing / UKT

Python 13 3 Updated Feb 24, 2025

qiufengqijun / mini_qwen

这是一个从头训练大语言模型的项目，包括预训练、微调和直接偏好优化，模型拥有1B参数，支持中英文。

Python 669 90 Updated Feb 18, 2025

Greg-Tarr / tpa_pytorch

Simple (slightly optimized) implementation of Tensor Product Attention from the T6 paper with a KV cache

Python 4 Updated Jan 23, 2025

DoniMoon / LLMKT

EDM 2025, Using Large Multimodal Models to Extract Knowledge Components for Knowledge Tracing from Multimedia Question Information

Python 6 2 Updated Oct 1, 2024

hhyqhh / LAEA

Python 14 4 Updated Jun 18, 2024

csseky / cskaoyan

提供计算机考研和软件工程考研专业的各个学校考研真题

9,187 1,564 Updated Feb 2, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly