Skip to content
View tuanio's full-sized avatar
🎯
Focusing
🎯
Focusing

Organizations

@AI-CLUB-IUH

Block or report tuanio

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 250 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Foundation Architecture for (M)LLMs

Python 3,118 219 Updated Apr 11, 2024

AI-powered tool that transforms STEM concepts into narrated educational animations using Manim, LLMs, and multimodal AI

Python 59 19 Updated Oct 4, 2025

Suite of tools to discover new articles on the arXiv, filter them, and broadcast them as an RSS feed, for your own use or for others.

C++ 3 1 Updated Jul 12, 2018

pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding a…

Python 2,394 446 Updated Mar 14, 2022

Qwen3-omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, images, and video, as well as generating speech in real time.

Jupyter Notebook 2,532 130 Updated Oct 9, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 59,695 10,578 Updated Oct 9, 2025

An Open-Source Asynchronous Coding Agent

TypeScript 4,907 645 Updated Oct 1, 2025

Tips and resources to prepare for Behavioral interviews.

7,137 1,384 Updated Aug 19, 2025

The python library for real-time communication

JavaScript 4,338 397 Updated Sep 19, 2025

Memory efficient transducer loss computation

CMake 69 12 Updated Jun 10, 2022
Python 29 2 Updated Jan 9, 2024

Hierarchical Reasoning Model Official Release

Python 10,832 1,605 Updated Sep 9, 2025

A python package to analyze and compare voices with deep learning

Python 3,111 463 Updated Oct 12, 2023

Mamba SSM architecture

Python 16,019 1,461 Updated Oct 8, 2025

[ICLR 2025] Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling

Python 915 46 Updated Apr 30, 2025

Computation-Efficient Era: A Comprehensive Survey of State Space Models in Medical Image Analysis

257 19 Updated Jul 22, 2025
Python 378 60 Updated Sep 3, 2024
Python 6 Updated Jan 7, 2025

ViStreamASR - Real-Time Vietnamese Speech Recognition

Python 46 16 Updated Jul 12, 2025

Code for the paper "Instituto de Telecomunicações at IWSLT 2025: Aligning Small-Scale Speech and Language Models for Speech-to-Text Learning"

Python 1 Updated Sep 30, 2025

Update ASR paper everyday

Python 338 18 Updated Oct 9, 2025

Web interface for browsing, search and filtering recent arxiv submissions

Python 5,411 1,339 Updated Nov 27, 2021

PyTorch code and models for V-JEPA self-supervised learning from video.

Python 3,222 319 Updated Feb 27, 2025

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 51,413 5,653 Updated Sep 10, 2025

Video-based AI memory library. Store millions of text chunks in MP4 files with lightning-fast semantic search. No database needed.

Python 9,948 823 Updated Sep 29, 2025
Jupyter Notebook 1 Updated Oct 4, 2021

Add n-gram and large language model (LLM) support to Whisper models.

Jupyter Notebook 32 4 Updated May 6, 2025

KenLM: Faster and Smaller Language Model Queries

C++ 2,675 530 Updated Mar 30, 2025

A open-source guide that demystifies how U.S. universities evaluate and admit students into Computer Science PhD programs.

TeX 160 13 Updated Oct 9, 2025
Next