Skip to content
View shrango's full-sized avatar

Block or report shrango

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A curated collection of papers and resources on On-Policy Distillation for Large Language Models.

Python 301 6 Updated Jun 6, 2026

Image Manipulation Forensics via Segmentation

Python 3 Updated Aug 3, 2022
Python 6 1 Updated May 8, 2026

🚀 Automated & lossless LaTeX paper migration tool. Instantly convert your Overleaf source between top-tier AI conference templates (NeurIPS, ICLR, ACL等). 一键无损转换顶会论文格式!解决转投时的排版折磨,完美保留公式、图表与引用,让科研人员专…

Python 225 9 Updated Apr 8, 2026

Miles is an enterprise-facing reinforcement learning framework for LLM and VLM post-training, forked from and co-evolving with slime.

Python 1,551 256 Updated Jun 14, 2026

MiMo-Audio: Audio Language Models are Few-Shot Learners

Python 1,050 103 Updated Mar 3, 2026
Python 273 28 Updated May 19, 2025

A benchmark for LLMs on complicated tasks in the terminal

Python 2,353 541 Updated Jan 22, 2026

Autonomous Agents (LLMs) research papers. Updated Daily.

1,312 98 Updated Jun 12, 2026

An open-source AI agent that brings the power of Gemini directly into your terminal.

TypeScript 105,251 14,056 Updated Jun 13, 2026

Stream-Omni is a GPT-4o-like language-vision-speech chatbot that simultaneously supports interaction across various modality combinations.

Python 388 45 Updated Jun 17, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 82,779 18,030 Updated Jun 14, 2026

FlexRAG: A RAG Framework for Information Retrieval and Generation.

Python 237 26 Updated Jun 3, 2026

PosS is a speculative decoding method with position-specialized draft layers generating high-quality drafts.

Python 10 Updated Dec 20, 2025
Python 12 1 Updated Apr 18, 2025

Code for BLT research paper

Python 2,047 193 Updated Nov 3, 2025

This is the official repository for Auto-RAG.

Python 234 20 Updated Jul 18, 2025

[ICML 2024] Break the Sequential Dependency of LLM Inference Using Lookahead Decoding

Python 1,337 83 Updated Mar 6, 2025

code for paper: "MoCE: Adaptive Mixture of Contextualization Experts for Byte-based Neural Machine Translation"

Python 3 2 Updated Nov 7, 2024

Entropy Based Sampling and Parallel CoT Decoding

Python 3,435 321 Updated Nov 13, 2024

LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.

Python 3,141 223 Updated May 19, 2025

Train transformer language models with reinforcement learning.

Python 18,634 2,790 Updated Jun 13, 2026

StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.

Python 1,272 103 Updated Jun 29, 2025

A fast speech-to-speech & speech-to-text translation model that supports simultaneous decoding and offers 28× speedup.

Python 78 5 Updated Oct 22, 2024

ACL2024 Integrating Multi-scale Contextualized Information for Byte-based Neural Machine Translation

Python 7 1 Updated Aug 9, 2024
Jupyter Notebook 12 3 Updated Apr 24, 2025

Paper list of simultaneous translation / streaming translation, including text-to-text machine translation and speech-to-text translation.

579 9 Updated Jun 7, 2024

Code for EMNLP 2023 paper "Enhancing Neural Machine Translation with Semantic Units"

Python 8 Updated Jun 25, 2024
2 Updated Oct 25, 2023

Official implementation for EMNLP 2023 paper "Non-autoregressive Streaming Transformer for Simultaneous Translation"

Python 12 1 Updated Oct 19, 2023
Next