Stars
Free ChatGPT & DeepSeek API key. Free access to the DeepSeek API and GPT-4 API, supporting top-ranked mainstream large models such as gpt | deepseek | claude | gemini | grok.
A Next.js web application that integrates AI capabilities with draw.io diagrams. This app allows you to create, modify, and enhance diagrams through natural language commands and AI-assisted visual…
LangGraph 1.0 Tutorial
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Ola: Pushing the Frontiers of Omni-Modal Language Model
Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2
Fully open reproduction of DeepSeek-R1
Tarsier -- a family of large-scale video-language models designed to generate high-quality video descriptions, with strong general video understanding capability.
One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks
PyTorch code and models for the DINOv2 self-supervised learning method.
A minimal codebase for finetuning large multimodal models, supporting llava-1.5/1.6, llava-interleave, llava-next-video, llava-onevision, llama-3.2-vision, qwen-vl, qwen2-vl, phi3-v, etc.
TransNet V2: Shot Boundary Detection Neural Network
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Official inference repo for FLUX.1 models
An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"
Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
Use PEFT or full-parameter training for CPT/SFT/DPO/GRPO on 600+ LLMs (Qwen3, Qwen3-MoE, DeepSeek-R1, GLM4.5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, …
[ICCV 2025] LLaVA-CoT, a visual language model capable of spontaneous, systematic reasoning
Inpaint anything using Segment Anything and inpainting models.
🔥🔥🔥 [IEEE TCSVT] Latest papers, code, and datasets on Vid-LLMs.
Collection of AWESOME vision-language models for vision tasks
A state-of-the-art open visual language model | multimodal pretrained model
The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more!
PyTorch implementation of MAE: https://arxiv.org/abs/2111.06377
We use MixedWM38, the mixed-type wafer defect pattern dataset, for wafer defect pattern recognition with vision transformers.
willard-yuan / SoTu
Forked from yzhangcs/SoTu. Bag of Visual Features with Hamming Embedding, Reranking.