Skip to content
View linyueqian's full-sized avatar
🍊
🍊

Highlights

  • Pro

Block or report linyueqian

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Qwen-TTS offers a robust voice synthesis service using FastAPI, supporting bilingual and dialect options. Explore seamless audio generation on GitHub! 🚀🌟

Python 98 11 Updated Dec 19, 2025

Text-audio foundation model from Boson AI

Python 7,753 577 Updated Sep 15, 2025

This repository contains the code and tables from land use change and land occupation emissions

3 Updated Dec 4, 2025

[HPCA 2026] FractalCloud: A Fractal-Inspired Architecture for Efficient Large-Scale Point Cloud Processing

Python 8 Updated Dec 8, 2025

The baselines of ARC-Challenge-Interspeech2026

Python 49 3 Updated Dec 1, 2025

A framework for efficient model inference with omni-modality models

Python 1,001 136 Updated Dec 19, 2025

Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthr…

Python 32,662 5,074 Updated Dec 19, 2025

Qwen3-omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, images, and video, as well as generating speech in real time.

Jupyter Notebook 3,134 191 Updated Oct 9, 2025

🐫 CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org

Python 15,083 1,668 Updated Dec 19, 2025

BirdNET analyzer for scientific audio data processing.

Python 1,335 230 Updated Dec 18, 2025

Audio Jailbreak: An Open Comprehensive Benchmark for Jailbreaking Large Audio-Language Models

Python 27 2 Updated Oct 6, 2025

[NeurIPS'25] KVCOMM: Online Cross-context KV-cache Communication for Efficient LLM-based Multi-agent Systems

Python 11 3 Updated Nov 1, 2025

The best ChatGPT that $100 can buy.

Python 38,883 4,909 Updated Dec 9, 2025

A comprehensive framework to test audio comprehension of Large Audio Language Models.

Python 56 2 Updated Nov 25, 2025

The Source Code for OmniVideoBench

Python 39 2 Updated Nov 20, 2025

StreamingVLM: Real-Time Understanding for Infinite Video Streams

Python 772 51 Updated Oct 15, 2025

2026 AI/ML internship & new graduate job list updated daily

4,231 173 Updated Dec 19, 2025

On-device TTS model by Neuphonic

Python 4,274 448 Updated Dec 15, 2025

Post-training with Tinker

Python 2,578 246 Updated Dec 19, 2025
Python 118 1 Updated Nov 4, 2025

This is the official Python version of Vision-Zero: Scalable VLM Self-Improvement via Strategic Gamified Self-Play.

Python 104 2 Updated Oct 21, 2025

Lightweight coding agent that runs in your terminal

Rust 54,298 6,874 Updated Dec 19, 2025
Python 5 Updated Aug 10, 2025

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 16,327 3,240 Updated Dec 19, 2025

A Benchmark for Evaluating Turn-Taking and Overlap Handling in Full-Duplex Spoken Dialogue Models

Python 110 4 Updated Sep 21, 2025

Reference PyTorch implementation and models for DINOv3

Jupyter Notebook 8,858 653 Updated Nov 20, 2025

Kimi K2 is the large language model series developed by Moonshot AI team

9,733 703 Updated Nov 7, 2025

Elucidated Text-To-Audio (ETTA) is a SOTA text-to-audio model with a holistic understanding of the design space and trained with synthetic captions.

Python 90 5 Updated Oct 15, 2025

Use Claude Code as the foundation for coding infrastructure, allowing you to decide how to interact with the model while enjoying updates from Anthropic.

TypeScript 23,685 1,858 Updated Dec 18, 2025
Python 202 23 Updated Jul 25, 2025
Next