View caffeinism's full-sized avatar

"DeepCode: Open Agentic Coding (Paper2Code & Text2Web & Text2Backend)"

Python 13,027 1,744 Updated Dec 11, 2025

Codec for paper: LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis

Python 336 45 Updated Jul 21, 2025

AAAI 2025: Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model

Python 277 19 Updated Oct 12, 2025

TalkVid: A Large-Scale Diversified Dataset for Audio-Driven Talking Head Synthesis

Python 132 6 Updated Nov 19, 2025

[ICML'23] StyleGAN-T: Unlocking the Power of GANs for Fast Large-Scale Text-to-Image Synthesis

Python 1,197 61 Updated Apr 7, 2023

[NeurIPS'21] Projected GANs Converge Faster

Python 900 97 Updated Jun 4, 2024

DiffuCoder: Understanding and Improving Masked Diffusion Models for Code Generation

Python 778 52 Updated Jul 9, 2025

A bridge to use Langchain output as an OpenAI-compatible API

Python 87 18 Updated Jul 11, 2025

PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838

Python 1,829 113 Updated Sep 27, 2024

Efficient vision foundation models for high-resolution generation and perception.

Python 3,184 229 Updated Sep 5, 2025

This repo contains the code for 1D tokenizer and generator

Jupyter Notebook 1,086 59 Updated Mar 20, 2025

SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer

Python 4,841 321 Updated Dec 21, 2025

Official implementation of BLIP3o-Series

Python 1,611 73 Updated Nov 29, 2025

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 5,991 778 Updated Dec 23, 2025

A Flexible Framework for Experiencing Heterogeneous LLM Inference/Fine-tune Optimizations

Python 16,246 1,190 Updated Dec 24, 2025

Writing kubernetes controllers can be simple

Go 974 97 Updated Aug 9, 2025

End-to-end realtime stack for connecting humans and AI

Go 16,219 1,636 Updated Dec 23, 2025

Cutting Edge WebRTC Video Conferencing

C++ 7,018 1,214 Updated Dec 23, 2025

[CVPR 2025] The First Investigation of CoT Reasoning (RL, TTS, Reflection) in Image Generation

Python 844 26 Updated May 23, 2025

An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engines

Jupyter Notebook 533 70 Updated Aug 27, 2024

Windows inside a Docker container.

Shell 49,209 3,825 Updated Nov 22, 2025

Official inference framework for 1-bit LLMs

Python 24,463 1,913 Updated Jun 3, 2025

High performance self-hosted photo and video management solution.

TypeScript 87,249 4,599 Updated Dec 24, 2025

Vim-fork focused on extensibility and usability

Vim Script 95,151 6,477 Updated Dec 24, 2025

Clean, modern, Python 3.6+ code generator & library for Protobuf 3 and async gRPC

Python 1,748 231 Updated Jul 17, 2025

Docker files and images to run Ceph in containers

1,330 519 Updated Dec 12, 2024

Garnet is a remote cache-store from Microsoft Research that offers strong performance (throughput and latency), scalability, storage, recovery, cluster sharding, key migration, and replication feat…

C# 11,623 627 Updated Dec 23, 2025

[ECCV 2024] Official PyTorch implementation code for realizing the technical part of Mixture of All Intelligence (MoAI) to improve performance of numerous zero-shot vision language tasks.

Python 329 32 Updated Mar 28, 2024

The fastest knowledge base for growing teams. Beautiful, realtime collaborative, feature packed, and markdown compatible.

TypeScript 36,418 3,002 Updated Dec 24, 2025