Skip to content
View zyddnys's full-sized avatar

Block or report zyddnys

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Open-source unified multimodal model

Python 5,258 455 Updated Oct 27, 2025

A SOTA open-source image editing model, which aims to provide comparable performance against the closed-source models like GPT-4o and Gemini 2 Flash.

Python 1,723 79 Updated Sep 8, 2025

Lets make video diffusion practical!

Python 16,129 1,548 Updated Oct 16, 2025
Python 2,466 237 Updated Jul 16, 2025

Efficient optimizers

Python 276 25 Updated Oct 16, 2025

A quick intro to IPSec on the kernel side with STUN and UDP hole punching

C 16 Updated Jan 10, 2019

CogView4, CogView3-Plus and CogView3(ECCV 2024)

Python 1,091 79 Updated Mar 29, 2025
Python 2,216 159 Updated Nov 8, 2024

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Python 1,960 131 Updated Nov 7, 2025

Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.

Python 2,064 116 Updated Jul 29, 2024

Your image is almost there!

Python 7,652 441 Updated Jul 26, 2024

GPT4V-level open-source multi-modal model based on Llama3-8B

Python 2,420 161 Updated Mar 3, 2025

cuDF - GPU DataFrame Library

C++ 9,314 982 Updated Nov 8, 2025

cuML - RAPIDS Machine Learning Library

C++ 5,001 601 Updated Nov 7, 2025

[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". A…

Jupyter Notebook 8,471 543 Updated May 18, 2025

Grok open release

Python 50,561 8,371 Updated Aug 30, 2024

DeepSeek-VL: Towards Real-World Vision-Language Understanding

Python 4,002 580 Updated Apr 24, 2024

适配轻小说/Galgame的日中翻译大模型

Python 4,086 106 Updated Feb 9, 2025

Hackable and optimized Transformers building blocks, supporting a composable construction.

Python 10,077 733 Updated Oct 31, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 62,524 11,125 Updated Nov 8, 2025

a state-of-the-art-level open visual language model | 多模态预训练模型

Python 6,689 445 Updated May 29, 2024

The JoyTag Image Tagging Model

Python 528 33 Updated May 18, 2024

Blazing fast concurrent HashMap for Rust.

Rust 3,758 174 Updated Mar 5, 2025

A convenient and user-friendly anime-style image data processing library that integrates various advanced anime-style image processing models

Python 326 22 Updated Oct 11, 2025

A fast & densely stored hashmap and hashset based on robin-hood backward shift deletion

C++ 1,191 91 Updated Nov 2, 2025

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 23,919 2,659 Updated Aug 12, 2024

Xwin-LM: Powerful, Stable, and Reproducible LLM Alignment

Python 1,042 44 Updated May 31, 2024

Focus on prompting and generating

Python 46,994 7,592 Updated Sep 2, 2025

set prompt to divided region

Python 1,760 145 Updated Jun 23, 2025
Next