Official PyTorch implementation for "ReFusion: A Diffusion Large Language Model with Parallel Autoregressive Decoding"

Python 46 4 Updated Dec 16, 2025

Pushing DeepSeek to 10M context window

Python 5 Updated Sep 30, 2025

Electronic Circuit Simulator in the Browser

Java 2,743 759 Updated Jan 22, 2024

Electronic Circuit Simulator in the Browser

Java 2,142 364 Updated Mar 24, 2025

Qwen3-omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, images, and video, as well as generating speech in real time.

Jupyter Notebook 3,161 194 Updated Oct 9, 2025

This isn't abstract: 94% of long-haul operator-hours, 1 million admin jobs, 51% of retail and food service work, and even large swaths of white-collar labor are all vulnerable — and the wave is com…

JavaScript 2 Updated Sep 10, 2025

Code for the paper "Vamba: Understanding Hour-Long Videos with Hybrid Mamba-Transformers" [ICCV 2025]

Python 98 10 Updated Jul 28, 2025

The official repository for ERNIE 4.5 and ERNIEKit – its industrial-grade development toolkit based on PaddlePaddle.

Python 7,634 1,453 Updated Dec 19, 2025

Enhancing LLMs with LoRA

Jupyter Notebook 198 13 Updated Oct 20, 2025

[NeurIPS 2025] LangSplatV2: High-dimensional 3D Language Gaussian Splatting with 450+ FPS

Python 165 9 Updated Oct 17, 2025

Multi-Granularity LLM Debugger [ICSE2026]

Python 94 10 Updated Jul 6, 2025

MOSS-TTSD is a spoken dialogue generation model that enables expressive dialogue speech synthesis in both Chinese and English, supporting zero-shot multi-speaker voice cloning, and long-form speech…

Python 1,061 95 Updated Dec 8, 2025

[ICML 2025] The Diffusion Duality

Python 180 23 Updated Oct 13, 2025

Public repo for HF blog posts

Jupyter Notebook 3,273 958 Updated Dec 22, 2025

OmniMamba: Efficient and Unified Multimodal Understanding and Generation via State Space Models

Python 142 5 Updated Apr 25, 2025

The official repo for "D-AR: Diffusion via Autoregressive Models"

Python 129 2 Updated Jun 21, 2025

HTML 3 Updated Nov 20, 2025

Advanced quantization toolkit for LLMs and VLMs. Support for WOQ, MXFP4, NVFP4, GGUF, Adaptive Schemes and seamless integration with Transformers, vLLM, SGLang, and llm-compressor

Python 777 65 Updated Dec 24, 2025

Python 442 30 Updated Mar 27, 2024

The simplest, fastest repository for training/finetuning small-sized VLMs.

Python 4,440 434 Updated Oct 27, 2025

Train transformer language models with reinforcement learning.

Python 16,762 2,374 Updated Dec 24, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 29 4 Updated May 12, 2025

Verilog Generation with Reinforcement Learning

Verilog 10 1 Updated Jul 15, 2025

Bamboo-7B Large Language Model

93 1 Updated Mar 28, 2024

High-speed and easy-to-use LLM serving framework for local deployment

C++ 139 20 Updated Aug 7, 2025

High-speed Large Language Model Serving for Local Deployment

C++ 8,501 464 Updated Aug 2, 2025

Python 2 Updated Apr 9, 2025

A Flexible Framework for Experiencing Heterogeneous LLM Inference/Fine-tune Optimizations

Python 16,247 1,190 Updated Dec 24, 2025

GSM-Plus: Data, Code, and Evaluation for Enhancing Robust Mathematical Reasoning in Math Word Problems.

Python 64 7 Updated Jul 8, 2024

GSM-Plus: Data, Code, and Evaluation for Enhancing Robust Mathematical Reasoning in Math Word Problems.

Python 4 Updated Jul 8, 2024