Skip to content
View mssssss123's full-sized avatar

Block or report mssssss123

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
259 stars written in Python
Clear filter

PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.

Python 8,422 661 Updated Nov 9, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python 8,353 811 Updated Nov 9, 2025

KAG is a logical form-guided reasoning and retrieval framework based on OpenSPG engine and LLMs. It is used to build logical reasoning and factual Q&A solutions for professional domain knowledge ba…

Python 8,171 621 Updated Sep 22, 2025

PyTorch implementation of MAE https//arxiv.org/abs/2111.06377

Python 8,080 1,328 Updated Jul 23, 2024

Large World Model -- Modeling Text and Video with Millions Context

Python 7,368 561 Updated Oct 19, 2024

Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.

Python 7,125 688 Updated Jul 10, 2025

Example models using DeepSpeed

Python 6,713 1,108 Updated Oct 15, 2025

s1: Simple test-time scaling

Python 6,592 762 Updated Jun 25, 2025

Repo for external large-scale work

Python 6,546 721 Updated Apr 27, 2024

Modeling, training, eval, and inference code for OLMo

Python 6,100 670 Updated Oct 24, 2025

[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models

Python 5,659 574 Updated Jan 16, 2025

Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch

Python 5,630 646 Updated Feb 17, 2024

PyTorch native post-training library

Python 5,581 679 Updated Nov 10, 2025

The official PyTorch implementation of Google's Gemma models

Python 5,571 562 Updated May 30, 2025

🚀 「大模型」1小时从0训练26M参数的视觉多模态VLM!🌏 Train a 26M-parameter VLM from scratch in just 1 hours!

Python 5,259 553 Updated Oct 30, 2025

AIOS: AI Agent Operating System

Python 4,772 614 Updated Oct 25, 2025

Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.

Python 4,208 361 Updated Oct 19, 2025
Python 4,164 448 Updated Jul 31, 2025

A series of large language models developed by Baichuan Intelligent Technology

Python 4,122 294 Updated Nov 8, 2024

A unified, comprehensive and efficient recommendation library

Python 4,069 695 Updated Feb 24, 2025

An open-source framework for training large multimodal models.

Python 4,039 315 Updated Aug 31, 2024

崩坏:星穹铁道脚本 | Honkai: Star Rail auto bot (简体中文/繁體中文/English/Español)

Python 4,036 222 Updated Nov 8, 2025

Simple RL training for reasoning

Python 3,784 279 Updated Aug 3, 2025

SOTA search powered LLM

Python 3,719 344 Updated Apr 4, 2025

Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL

Python 3,479 294 Updated Oct 29, 2025

Sky-T1: Train your own O1 preview model within $450

Python 3,351 338 Updated Jul 12, 2025

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

Python 3,325 278 Updated May 4, 2024

🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.

Python 3,277 209 Updated Mar 5, 2024

🦛 CHONK docs with Chonkie ✨ — The no-nonsense RAG library

Python 3,181 200 Updated Nov 5, 2025