Skip to content
View mutiann's full-sized avatar

Block or report mutiann

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 5 Updated May 20, 2025

LibriSpeech-Long is a benchmark dataset for long-form speech generation and processing. Released as part of "Long-Form Speech Generation with Spoken Language Models" (arXiv 2024).

91 3 Updated Dec 28, 2024

Trainable H-Net Package

Python 25 5 Updated Sep 3, 2025

Official code for the NeurIPS25 paper "RAT: Bridging RNN Efficiencyand Attention Accuracy in Language Modeling" (https://arxiv.org/abs/2507.04416))

Python 22 1 Updated Dec 10, 2025

Alleviating Forgetfulness of Linear Attention by Hybrid Sparse Attention and Contextualized Learnable Token Eviction.

Python 2 Updated Nov 22, 2025

A Distributed Attention Towards Linear Scalability for Ultra-Long Context, Heterogeneous Data Training

Python 590 32 Updated Dec 20, 2025

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.

Python 6,442 705 Updated Dec 17, 2025

One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks

Python 3,394 461 Updated Dec 18, 2025

A streamlined and customizable framework for efficient large model (LLM, VLM, AIGC) evaluation and performance benchmarking.

Python 2,145 243 Updated Dec 18, 2025

Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends

Python 2,209 402 Updated Dec 15, 2025

Holistic Evaluation of Language Models (HELM) is an open source Python framework created by the Center for Research on Foundation Models (CRFM) at Stanford for holistic, reproducible and transparen…

Python 2,587 346 Updated Dec 20, 2025

🔥 A minimal training framework for scaling FLA models

Python 321 49 Updated Nov 15, 2025

OSUM & OSUM-EChat, open speech understanding model and empathetic spoken chatbot based on it, open-sourced by ASLP@NPU.

Python 461 30 Updated Nov 23, 2025

Joint Fine-tuning and Conversion of Pretrained Speech and Language Models towards Linear Complexity

Python 8 1 Updated Feb 5, 2025

[ICLR 2025 Oral] NeuralPlane: Structured 3D Reconstruction in Planar Primitives with Neural Fields

Python 55 Updated Jul 2, 2025

Helpful tools and examples for working with flex-attention

Python 1,091 67 Updated Dec 18, 2025

FlashInfer: Kernel Library for LLM Serving

Cuda 4,312 606 Updated Dec 20, 2025

[ICLR 2025] Official PyTorch Implementation of Gated Delta Networks: Improving Mamba2 with Delta Rule

Python 398 23 Updated Sep 15, 2025

🚀 Efficient implementations of state-of-the-art linear attention models

Python 4,087 333 Updated Dec 20, 2025

Fully open data curation for reasoning models

Python 2,171 182 Updated Dec 2, 2025

Official repository of our work "Finding Lottery Tickets in Vision Models via Data-driven Spectral Foresight Pruning" accepted at CVPR 2024

Python 26 6 Updated Feb 18, 2025

Publication-ready NN-architecture schematics.

JavaScript 5,644 749 Updated Jul 10, 2025
Python 14 1 Updated Mar 10, 2020

The Elements of Statistical Learning (ESL)的中文翻译、代码实现及其习题解答。

Jupyter Notebook 2,673 616 Updated Sep 23, 2025

The entmax mapping and its loss, a family of sparse softmax alternatives.

Python 456 47 Updated Jun 22, 2024

Tutorial: Graph Neural Networks for Natural Language Processing at EMNLP 2019 and CODS-COMAD 2020

Python 788 113 Updated Mar 24, 2023

Summer course on mathematical theory of deep learning

TeX 53 4 Updated Jul 31, 2019

Repo for counting stars and contributing. Press F to pay respect to glorious developers.

275,046 20,997 Updated Aug 22, 2025

MagPhase Vocoder: Speech analysis/synthesis system for TTS and related applications.

Python 80 31 Updated Oct 14, 2019

A tensorflow based implementation of DeepVoice3 https://arxiv.org/abs/1710.07654

Python 13 3 Updated Jun 5, 2018
Next