Stars
A curated collection of papers and resources on On-Policy Distillation for Large Language Models.
Image Manipulation Forensics via Segmentation
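🚀 Automated & lossless LaTeX paper migration tool. Instantly convert your Overleaf source between top-tier AI conference templates (NeurIPS, ICLR, ACL, etc.). One-click lossless conversion between top-conference paper formats! Ends the typesetting pain of resubmitting to another venue and fully preserves equations, figures, and citations, so that researchers can…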
Miles is an enterprise-facing reinforcement learning framework for LLM and VLM post-training, forked from and co-evolving with slime.
MiMo-Audio: Audio Language Models are Few-Shot Learners
A benchmark for LLMs on complicated tasks in the terminal
Autonomous Agents (LLMs) research papers. Updated Daily.
An open-source AI agent that brings the power of Gemini directly into your terminal.
Stream-Omni is a GPT-4o-like language-vision-speech chatbot that simultaneously supports interaction across various modality combinations.
A high-throughput and memory-efficient inference and serving engine for LLMs
FlexRAG: A RAG Framework for Information Retrieval and Generation.
PosS is a speculative decoding method with position-specialized draft layers generating high-quality drafts.
[ICML 2024] Break the Sequential Dependency of LLM Inference Using Lookahead Decoding
Code for the paper "MoCE: Adaptive Mixture of Contextualization Experts for Byte-based Neural Machine Translation"
Entropy Based Sampling and Parallel CoT Decoding
LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.
Train transformer language models with reinforcement learning.
StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.
A fast speech-to-speech & speech-to-text translation model that supports simultaneous decoding and offers 28× speedup.
[ACL 2024] Integrating Multi-scale Contextualized Information for Byte-based Neural Machine Translation
Paper list of simultaneous translation / streaming translation, including text-to-text machine translation and speech-to-text translation.
Code for EMNLP 2023 paper "Enhancing Neural Machine Translation with Semantic Units"
Official implementation for EMNLP 2023 paper "Non-autoregressive Streaming Transformer for Simultaneous Translation"