csgcmai
Be the fire and wish for the wind

Starred repositories

737 results for source starred repositories

Official repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas"

Python 1,457 119 Updated Feb 19, 2025

Refine high-quality datasets and visual AI models

Python 10,330 712 Updated Feb 6, 2026

A Distributed Attention Towards Linear Scalability for Ultra-Long Context, Heterogeneous Data Training

Python 629 37 Updated Feb 5, 2026

A Lighting Pytorch Framework for Recommendation Models (PyTorch recommendation algorithm framework), Easy-to-use and Easy-to-extend. https://datawhalechina.github.io/torch-rechub/

Python 741 112 Updated Feb 5, 2026

An Open Foundation Model and Benchmark to Accelerate Generative Recommendation

Python 555 83 Updated Feb 3, 2026

Python 38 1 Updated Dec 19, 2025

This repo contains the code for "VLM2Vec: Training Vision-Language Models for Massive Multimodal Embedding Tasks" [ICLR 2025]

Python 564 50 Updated Jan 27, 2026

A curated list of awesome platforms, tools, practices and resources that help run LLMs locally

1,085 88 Updated Feb 5, 2026

CaptionQA: Is Your Caption as Useful as the Image Itself?

Python 32 1 Updated Jan 19, 2026

The repository provides code for running inference and finetuning with the Meta Segment Anything Model 3 (SAM 3), links for downloading the trained model checkpoints, and example notebooks that sho…

Python 7,558 988 Updated Feb 3, 2026

The Open-Source Multimodal AI Agent Stack: Connecting Cutting-Edge AI Models and Agent Infra

TypeScript 26,772 2,620 Updated Jan 14, 2026

BitBLAS is a library to support mixed-precision matrix multiplications, especially for quantized LLM deployment.

Python 750 57 Updated Aug 6, 2025

🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using Agentic Retrieval 🔄.

Python 22,579 2,175 Updated Feb 2, 2026

[ICLR'26] Grasp Any Region: Towards Precise, Contextual Pixel Understanding for Multimodal LLMs

Python 97 10 Updated Jan 26, 2026

[NeurIPS 2025 Spotlight] A Token is Worth over 1,000 Tokens: Efficient Knowledge Distillation through Low-Rank Clone.

Python 45 Updated Oct 29, 2025

Building large models from scratch: a complete practice from pre-training to RLHF

Python 2,356 173 Updated Jan 30, 2026

MISP-Meeting Dataset & Code

Python 2 2 Updated Jan 11, 2026

Tarsier -- a family of large-scale video-language models designed to generate high-quality video descriptions, together with good capability of general video understanding.

Python 515 28 Updated Aug 14, 2025

The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.

Python 1,524 224 Updated Dec 15, 2025

Best practices for training DeepSeek, Mixtral, Qwen and other MoE models using Megatron Core.

Python 161 29 Updated Jan 22, 2026

A Next-Generation Training Engine Built for Ultra-Large MoE Models

Python 5,078 404 Updated Feb 6, 2026

ScalarLM - a unified training and inference stack

Python 97 11 Updated Nov 18, 2025

AllenAI's post-training codebase

Python 3,568 493 Updated Feb 6, 2026

PyTorch building blocks for the OLMo ecosystem

Python 778 138 Updated Feb 5, 2026

Repository containing code and data for the paper "ArgCMV: An Argument Summarization Benchmark for the LLM-era", accepted at EMNLP 2025 Main Conference.

Python 1 Updated Nov 7, 2025

Lumina-DiMOO - An Open-Sourced Multi-Modal Large Diffusion Language Model

Python 934 58 Updated Dec 27, 2025

Awesome LLM pre-training resources, including data, frameworks, and methods.

323 23 Updated Apr 29, 2025

Analyze the inference of Large Language Models (LLMs). Analyze aspects like computation, storage, transmission, and hardware roofline model in a user-friendly interface.

Python 616 79 Updated Sep 11, 2024

Latency and Memory Analysis of Transformer Models for Training and Inference

Python 479 56 Updated Apr 19, 2025