Starred repositories
Official repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas"
Refine high-quality datasets and visual AI models
A Distributed Attention Towards Linear Scalability for Ultra-Long Context, Heterogeneous Data Training
A Lightning PyTorch Framework for Recommendation Models (a PyTorch recommendation algorithm framework), Easy-to-use and Easy-to-extend. https://datawhalechina.github.io/torch-rechub/
An Open Foundation Model and Benchmark to Accelerate Generative Recommendation
This repo contains the code for "VLM2Vec: Training Vision-Language Models for Massive Multimodal Embedding Tasks" [ICLR 2025]
A curated list of awesome platforms, tools, practices, and resources that help run LLMs locally
CaptionQA: Is Your Caption as Useful as the Image Itself?
The repository provides code for running inference and finetuning with the Meta Segment Anything Model 3 (SAM 3), links for downloading the trained model checkpoints, and example notebooks that show how to use the model
The Open-Source Multimodal AI Agent Stack: Connecting Cutting-Edge AI Models and Agent Infra
BitBLAS is a library to support mixed-precision matrix multiplications, especially for quantized LLM deployment; see the GEMM sketch after this list.
🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using Agentic Retrieval 🔄.
[ICLR'26] Grasp Any Region: Towards Precise, Contextual Pixel Understanding for Multimodal LLMs
[NeurIPS 2025 Spotlight] A Token is Worth over 1,000 Tokens: Efficient Knowledge Distillation through Low-Rank Clone.
Tarsier -- a family of large-scale video-language models designed to generate high-quality video descriptions, with strong general video understanding capability.
The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.
Best practices for training DeepSeek, Mixtral, Qwen and other MoE models using Megatron Core.
A Next-Generation Training Engine Built for Ultra-Large MoE Models
ScalarLM - a unified training and inference stack
PyTorch building blocks for the OLMo ecosystem
Repository containing code and data for the paper "ArgCMV: An Argument Summarization Benchmark for the LLM-era", accepted at EMNLP 2025 Main Conference.
Lumina-DiMOO - An Open-Sourced Multi-Modal Large Diffusion Language Model
Awesome LLM pre-training resources, including data, frameworks, and methods.
Analyze the inference of Large Language Models (LLMs): computation, storage, transmission, and the hardware roofline model, in a user-friendly interface; a worked roofline example follows this list.
Latency and Memory Analysis of Transformer Models for Training and Inference
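
For the BitBLAS entry above, a minimal NumPy sketch of what a mixed-precision (W4A16-style) GEMM computes: fp16 activations against group-quantized int4 weights. The shapes, group size, and symmetric quantization scheme are illustrative assumptions, not the BitBLAS API.

```python
import numpy as np

# Hypothetical shapes and values; a toy W4A16-style GEMM, not the BitBLAS API.
rng = np.random.default_rng(0)
M, K, N = 4, 64, 32
group = 16  # quantization group size along K (assumed)

x = rng.standard_normal((M, K)).astype(np.float16)  # fp16 activations
w = rng.standard_normal((K, N)).astype(np.float16)  # original fp16 weights

# Per-group symmetric int4 quantization of the weights.
w_groups = w.reshape(K // group, group, N)
scales = np.abs(w_groups).max(axis=1, keepdims=True) / 7  # symmetric int4 range: [-7, 7]
w_q = np.clip(np.round(w_groups / scales), -7, 7).astype(np.int8)

# "Mixed precision": dequantize int4 weights back to fp16 on the fly, then GEMM.
w_dq = (w_q * scales).astype(np.float16).reshape(K, N)
y = x @ w_dq

print("max abs error vs fp16 GEMM:", np.abs(y - x @ w).max())
```

Libraries like BitBLAS fuse the dequantize-and-multiply step into one kernel instead of materializing `w_dq`; the sketch only shows the arithmetic being approximated.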
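
For the two LLM inference analyzers above, a minimal sketch of the roofline bound they apply: attainable throughput is min(peak FLOP/s, bandwidth x arithmetic intensity). The peak-FLOPs and bandwidth numbers below are hypothetical placeholders, not measured values from either tool.

```python
# Roofline model: attainable FLOP/s = min(peak_flops, bandwidth * arithmetic_intensity).

PEAK_FLOPS = 312e12  # hypothetical fp16 peak, FLOP/s (A100-class GPU assumed)
BANDWIDTH = 2.0e12   # hypothetical HBM bandwidth, bytes/s

def attainable_flops(flops: float, bytes_moved: float) -> float:
    """Roofline bound for a kernel doing `flops` FLOPs over `bytes_moved` bytes."""
    intensity = flops / bytes_moved  # FLOPs per byte of memory traffic
    return min(PEAK_FLOPS, BANDWIDTH * intensity)

# Single-token decode through one [d, d] fp16 weight matrix: 2*d*d FLOPs over
# ~2*d*d bytes of weight reads, so intensity is ~1 FLOP/B -> bandwidth-bound.
d = 4096
flops = 2 * d * d
bytes_moved = 2 * d * d  # fp16 weights dominate traffic at batch size 1
print(f"intensity = {flops / bytes_moved:.1f} FLOP/B, "
      f"bound = {attainable_flops(flops, bytes_moved) / 1e12:.1f} TFLOP/s")
```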