187 results for source starred repositories

Open-source unified multimodal model

Python 5,268 456 Updated Oct 27, 2025

A SOTA open-source image editing model that aims to match the performance of closed-source models such as GPT-4o and Gemini 2 Flash.

Python 1,725 80 Updated Sep 8, 2025

Let's make video diffusion practical!

Python 16,145 1,552 Updated Oct 16, 2025
Python 2,466 237 Updated Jul 16, 2025

Efficient optimizers

Python 275 25 Updated Oct 16, 2025

A quick intro to IPSec on the kernel side with STUN and UDP hole punching

C 16 Updated Jan 10, 2019

CogView4, CogView3-Plus and CogView3(ECCV 2024)

Python 1,091 79 Updated Mar 29, 2025
Python 2,217 159 Updated Nov 8, 2024

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Python 1,962 131 Updated Nov 7, 2025

Your image is almost there!

Python 7,651 441 Updated Jul 26, 2024

GPT4V-level open-source multi-modal model based on Llama3-8B

Python 2,420 161 Updated Mar 3, 2025

cuDF - GPU DataFrame Library

C++ 9,317 982 Updated Nov 10, 2025

cuML - RAPIDS Machine Learning Library

C++ 5,002 600 Updated Nov 7, 2025

[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". A…

Jupyter Notebook 8,470 543 Updated May 18, 2025

Grok open release

Python 50,565 8,372 Updated Aug 30, 2024

DeepSeek-VL: Towards Real-World Vision-Language Understanding

Python 4,003 580 Updated Apr 24, 2024

A Japanese-to-Chinese translation LLM adapted for light novels and Galgames

Python 4,089 106 Updated Feb 9, 2025

Hackable and optimized Transformers building blocks, supporting a composable construction.

Python 10,086 733 Updated Oct 31, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 62,640 11,147 Updated Nov 10, 2025

A state-of-the-art open visual language model | Multimodal pretrained model

Python 6,691 445 Updated May 29, 2024

The JoyTag Image Tagging Model

Python 528 33 Updated May 18, 2024

Blazing fast concurrent HashMap for Rust.

Rust 3,764 174 Updated Mar 5, 2025

A convenient and user-friendly anime-style image data processing library that integrates various advanced anime-style image processing models

Python 326 22 Updated Oct 11, 2025

A fast & densely stored hashmap and hashset based on robin-hood backward shift deletion

C++ 1,194 91 Updated Nov 2, 2025

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 23,927 2,659 Updated Aug 12, 2024

Xwin-LM: Powerful, Stable, and Reproducible LLM Alignment

Python 1,042 44 Updated May 31, 2024

Focus on prompting and generating

Python 47,004 7,596 Updated Sep 2, 2025

Set prompts for divided regions

Python 1,760 145 Updated Jun 23, 2025

AnimateDiff for AUTOMATIC1111 Stable Diffusion WebUI

Python 3,370 290 Updated Sep 22, 2024