Skip to content
View bitmap2024's full-sized avatar

Block or report bitmap2024

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 2 1 Updated Dec 18, 2025

Extend OpenRLHF to support LMM RL training for reproduction of DeepSeek-R1 on multimodal tasks.

Python 833 54 Updated May 14, 2025

NVIDIA TensorRT-RTX is an SDK for high-performance AI inference on NVIDIA RTX GPUs. This repository contains Open-Source Software components of TensorRT-RTX.

70 9 Updated Dec 19, 2025

LightMem: Lightweight and Efficient Memory-Augmented Generation

Python 457 44 Updated Dec 17, 2025
Python 7,538 444 Updated Dec 14, 2025

User-friendly AI Interface (Supports Ollama, OpenAI API, ...)

Svelte 118,363 16,676 Updated Dec 21, 2025

Tools for merging pretrained large language models.

Python 6,615 648 Updated Dec 17, 2025

Unofficial implementation of Titans, SOTA memory for transformers, in Pytorch

Python 1,751 173 Updated Dec 20, 2025

An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models

Python 2,506 180 Updated Dec 21, 2025

Taming Stable Diffusion for Lip Sync!

Python 5,270 849 Updated Jun 20, 2025

JoyHallo: Digital human model for Mandarin

Python 519 51 Updated Sep 21, 2025

MimicTalk: Mimicking a personalized and expressive 3D talking face in minutes; NeurIPS 2024; Official code

Python 801 100 Updated Oct 16, 2024

[CVPR 2025] EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation

Python 4,425 520 Updated Aug 11, 2025

all of the workflows of n8n i could find (also from the site itself)

Python 48,198 5,567 Updated Dec 5, 2025

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 17,302 1,445 Updated Nov 28, 2025

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 4,290 327 Updated Dec 15, 2025
Python 15 Updated Nov 19, 2025

A Model Context Protocol (MCP) server that provides image generation capabilities using Bytedance's SeedDream 4.0 model via the FAL AI platform.

JavaScript 13 1 Updated Sep 15, 2025

VibeVoice: Expressive, longform conversational speech synthesis. (Community fork)

Python 873 333 Updated Dec 9, 2025

Voice Activity Detector (VAD) : low-latency, high-performance and lightweight

C 1,763 139 Updated Dec 18, 2025

PyTorch code and models for the DINOv2 self-supervised learning method.

Jupyter Notebook 12,097 1,145 Updated Dec 17, 2025

[CVPR'24 Best Student Paper] Mip-Splatting: Alias-free 3D Gaussian Splatting

Python 1,376 108 Updated Dec 17, 2024

Official PyTorch implementation of BigVGAN (ICLR 2023)

Python 1,159 143 Updated Sep 5, 2024

Single Image to 3D using Cross-Domain Diffusion for 3D Generation

Python 5,299 429 Updated Mar 14, 2025

An open-source impl. of Large Reconstruction Models

Python 1,186 72 Updated May 6, 2024

Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation" (CVPR'25 Spotlight).

Python 11,355 1,045 Updated Nov 5, 2025

An LLM base TTS engine

Python 90 7 Updated Dec 25, 2024

Added vLLM support to IndexTTS for faster inference.

Python 960 128 Updated Oct 24, 2025

An interactive NVIDIA-GPU process viewer and beyond, the one-stop solution for GPU process management.

Python 6,396 202 Updated Dec 15, 2025

QLoRA: Efficient Finetuning of Quantized LLMs

Jupyter Notebook 10,790 869 Updated Jun 10, 2024
Next