jlian2

Jiachen Lian jlian2

EECS PhD @ Berkeley

62 followers · 35 following

Berkeley

Achievements

Highlights

Lists (1)

Sort

🚀 My stack

Starred repositories

Auroraaa86 / LCS-CTC

For IEEE ASRU(2025)

Jupyter Notebook 12 3 Updated Jun 21, 2025

steventan0110 / AVLM

Official Implementation for our EMNLP 2025 paper: "Seeing is Believing: Emotion-Aware Audio-Visual Language Modeling for Expressive Speech Generation"

Python 3 Updated Aug 26, 2025

baichuan-inc / Baichuan-M2-32B

Beyond the Model: Scaling Medical Capability with a Large Verifier System

172 11 Updated Sep 3, 2025

Berkeley-Speech-Group / emo-reasoning

EMO-Reasoning: Benchmarking Emotional Reasoning Capabilities in Spoken Dialogue Systems

Python 6 Updated Aug 27, 2025

Berkeley-Speech-Group / Neural-LCS

For interspeech(2025)

Jupyter Notebook 3 Updated May 30, 2025

Berkeley-Speech-Group / Phonetic-Error-Detection

Interspeech 2025 [Project page]

Python 7 Updated Nov 4, 2025

Berkeley-Speech-Group / LLM-Dys

Jupyter Notebook 10 2 Updated Oct 11, 2025

Berkeley-Speech-Group / DysfluentWFST

DysfluentWFST

Jupyter Notebook 15 5 Updated Nov 13, 2025

Power-Agent / PowerFM

PowerFM is an open-source repository for foundation models in the power and energy domain. It both maintains original projects and collects community-contributed open-source projects, featuring fin…

31 1 Updated Nov 4, 2025

Power-Agent / PowerWF

PowerWorkflow is an open-source collection of agentic workflows for power system applications. These workflows enable intelligent automation and coordination of power system operations, facilitatin…

Python 24 Updated Jul 19, 2025

Berkeley-Speech-Group / RT-VC

Python 22 7 Updated Mar 29, 2025

DanielLin94144 / Full-Duplex-Bench

A Benchmark for Evaluating Turn-Taking and Overlap Handling in Full-Duplex Spoken Dialogue Models

Python 112 4 Updated Sep 21, 2025

rorizzz / YOLO-Stutter

YOLO-Stutter: End-to-end Region-Wise Speech Dysfluency Detection

Jupyter Notebook 20 2 Updated Mar 4, 2025

openai / openai-realtime-agents

This is a simple demonstration of more advanced, agentic patterns built on top of the Realtime API.

TypeScript 6,682 1,053 Updated Dec 15, 2025

nyrahealth / CrisperWhisper

Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection

Python 880 47 Updated Jun 3, 2025

VITA-MLLM / VITA

✨✨[NeurIPS 2025] VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction

Python 2,465 180 Updated Mar 28, 2025

openai / openai-realtime-console

React app for inspecting, building and debugging with the Realtime API

JavaScript 3,525 1,392 Updated Aug 28, 2025

Genesis-Embodied-AI / Genesis

A generative world for general-purpose robotics & embodied AI learning.

Python 27,850 2,571 Updated Dec 26, 2025

Berkeley-Speech-Group / Speech-Articulatory-Coding

Jupyter Notebook 53 12 Updated May 29, 2025

Berkeley-Speech-Group / sylber

Sylber: Syllabic Embedding Representation of Speech from Raw Audio

Jupyter Notebook 71 4 Updated Mar 17, 2025

cheoljun95 / sdhubert

Jupyter Notebook 26 2 Updated Dec 4, 2024

muzairkhattak / multimodal-prompt-learning

[CVPR 2023] Official repository of paper titled "MaPLe: Multi-modal Prompt Learning".

Python 791 65 Updated Jul 24, 2023

articulatory / articulatory

Deep Articulatory Synthesis and Inversion

Python 54 7 Updated Feb 14, 2024

m-bain / whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python 19,317 2,061 Updated Oct 21, 2025

Significant-Gravitas / AutoGPT

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Python 180,498 46,188 Updated Dec 25, 2025