mutiann

Follow

Mutian He mutiann

Follow

PhD Student at Idiap/EPFL

41 followers · 5 following

12:14 (UTC +01:00)
mutiann.github.io

Achievements

Achievements

Stars

SamuelPfisterer / EuroSpeech

Python 5 Updated May 20, 2025

google-deepmind / librispeech-long

LibriSpeech-Long is a benchmark dataset for long-form speech generation and processing. Released as part of "Long-Form Speech Generation with Spoken Language Models" (arXiv 2024).

91 3 Updated Dec 28, 2024

main-horse / hnet-impl

Trainable H-Net Package

Python 25 5 Updated Sep 3, 2025

CLAIRE-Labo / RAT

Official code for the NeurIPS25 paper "RAT: Bridging RNN Efficiencyand Attention Accuracy in Language Modeling" (https://arxiv.org/abs/2507.04416))

Python 22 1 Updated Dec 10, 2025

idiap / hybrid-linear-sparse-attention

Alleviating Forgetfulness of Linear Attention by Hybrid Sparse Attention and Contextualized Learnable Token Eviction.

Python 2 Updated Nov 22, 2025

SandAI-org / MagiAttention

A Distributed Attention Towards Linear Scalability for Ultra-Long Context, Heterogeneous Data Training

Python 590 32 Updated Dec 20, 2025

open-compass / opencompass

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.

Python 6,442 705 Updated Dec 17, 2025

EvolvingLMMs-Lab / lmms-eval

One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks

Python 3,394 461 Updated Dec 18, 2025

modelscope / evalscope

A streamlined and customizable framework for efficient large model (LLM, VLM, AIGC) evaluation and performance benchmarking.

Python 2,145 243 Updated Dec 18, 2025

huggingface / lighteval

Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends

Python 2,209 402 Updated Dec 15, 2025

stanford-crfm / helm

Holistic Evaluation of Language Models (HELM) is an open source Python framework created by the Center for Research on Foundation Models (CRFM) at Stanford for holistic, reproducible and transparen…

Python 2,587 346 Updated Dec 20, 2025

fla-org / flame

🔥 A minimal training framework for scaling FLA models

Python 321 49 Updated Nov 15, 2025

ASLP-lab / OSUM

OSUM & OSUM-EChat, open speech understanding model and empathetic spoken chatbot based on it, open-sourced by ASLP@NPU.

Python 461 30 Updated Nov 23, 2025

idiap / linearize-distill-pretrained-transformers

Joint Fine-tuning and Conversion of Pretrained Speech and Language Models towards Linear Complexity

Python 8 1 Updated Feb 5, 2025

3dv-casia / NeuralPlane

[ICLR 2025 Oral] NeuralPlane: Structured 3D Reconstruction in Planar Primitives with Neural Fields

Python 55 Updated Jul 2, 2025

meta-pytorch / attention-gym

Helpful tools and examples for working with flex-attention

Python 1,091 67 Updated Dec 18, 2025

flashinfer-ai / flashinfer

FlashInfer: Kernel Library for LLM Serving

Cuda 4,312 606 Updated Dec 20, 2025

NVlabs / GatedDeltaNet

[ICLR 2025] Official PyTorch Implementation of Gated Delta Networks: Improving Mamba2 with Delta Rule

Python 398 23 Updated Sep 15, 2025

fla-org / flash-linear-attention

🚀 Efficient implementations of state-of-the-art linear attention models

Python 4,087 333 Updated Dec 20, 2025

open-thoughts / open-thoughts

Fully open data curation for reasoning models

Python 2,171 182 Updated Dec 2, 2025

iurada / px-ntk-pruning

Official repository of our work "Finding Lottery Tickets in Vision Models via Data-driven Spectral Foresight Pruning" accepted at CVPR 2024

Python 26 6 Updated Feb 18, 2025

alexlenail / NN-SVG

Publication-ready NN-architecture schematics.

JavaScript 5,644 749 Updated Jul 10, 2025

mutiann / ccc

Python 14 1 Updated Mar 10, 2020

szcf-weiya / ESL-CN

The Elements of Statistical Learning (ESL)的中文翻译、代码实现及其习题解答。

Jupyter Notebook 2,673 616 Updated Sep 23, 2025

deep-spin / entmax

The entmax mapping and its loss, a family of sparse softmax alternatives.

Python 456 47 Updated Jun 22, 2024

svjan5 / GNNs-for-NLP

Tutorial: Graph Neural Networks for Natural Language Processing at EMNLP 2019 and CODS-COMAD 2020

Python 788 113 Updated Mar 24, 2023

leiwu0 / course.math_theory_nn

Summer course on mathematical theory of deep learning

TeX 53 4 Updated Jul 31, 2019

996icu / 996.ICU

Repo for counting stars and contributing. Press F to pay respect to glorious developers.

275,046 20,997 Updated Aug 22, 2025

CSTR-Edinburgh / magphase

MagPhase Vocoder: Speech analysis/synthesis system for TTS and related applications.

Python 80 31 Updated Oct 14, 2019

TanUkkii007 / deepvoice3-tensorflow

A tensorflow based implementation of DeepVoice3 https://arxiv.org/abs/1710.07654

Python 13 3 Updated Jun 5, 2018