Skip to content
View hills-code's full-sized avatar

Highlights

  • Pro

Block or report hills-code

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

The author's implementation of FUDOKI, a multimodal large language model purely based on discrete flow matching.

Python 21 Updated Jul 18, 2025

Long-RL: Scaling RL to Long Sequences

Python 506 14 Updated Jul 24, 2025

Locality-aware Parallel Decoding for Efficient Autoregressive Image Generation

Python 59 4 Updated Jul 13, 2025

Official implementation of "Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding"

Python 306 18 Updated Jul 6, 2025

Official PyTorch implementation for "Large Language Diffusion Models"

Python 2,613 175 Updated Jun 17, 2025

New repo collection for NVIDIA Cosmos: https://github.com/nvidia-cosmos

8,058 514 Updated Jun 9, 2025

SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer

Python 4,393 285 Updated Jul 17, 2025

DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding

Python 4,976 1,765 Updated Feb 26, 2025

SEED-Voken: A Series of Powerful Visual Tokenizers

Python 917 35 Updated Jun 27, 2025

Minimal implementation of scalable rectified flow transformers, based on SD3's approach

Jupyter Notebook 599 52 Updated Jul 1, 2024
Python 222 14 Updated May 8, 2025

Next-Token Prediction is All You Need

Python 2,171 81 Updated Mar 17, 2025

Janus-Series: Unified Multimodal Understanding and Generation Models

Python 17,470 2,241 Updated Feb 1, 2025

[TMLR 2025🔥] A survey for the autoregressive models in vision.

656 18 Updated Jul 23, 2025

An open source implementation of CLIP.

Python 12,244 1,134 Updated Jul 23, 2025

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Python 1,815 85 Updated Aug 15, 2024

Code and models for ICML 2024 paper, NExT-GPT: Any-to-Any Multimodal Large Language Model

Python 3,538 359 Updated May 13, 2025

Adapting LLaMA Decoder to Vision Transformer

Python 28 2 Updated May 20, 2024

Taming Transformers for High-Resolution Image Synthesis

Jupyter Notebook 6,264 1,199 Updated Jul 30, 2024

PyTorch package for the discrete VAE used for DALL·E.

Python 10,872 1,916 Updated Jan 31, 2024
Python 20 3 Updated Aug 17, 2024

The official Meta Llama 3 GitHub site

Python 28,858 3,427 Updated Jan 26, 2025

Grok open release

Python 50,384 8,356 Updated Aug 30, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 53,089 8,903 Updated Jul 24, 2025

[ICML'24 Spotlight] LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning

Python 659 60 Updated Jun 1, 2024

MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models

Python 438 41 Updated Feb 1, 2024

Tools for merging pretrained large language models.

Python 6,089 584 Updated Jul 16, 2025
Python 311 18 Updated Jun 9, 2024

Example models using DeepSpeed

Python 6,583 1,100 Updated Jul 8, 2025
Next