Skip to content
Change the repository type filter

All

    Repositories list

    • kaiyu

      Public
      Kaiyu, Shanghai Jiao Tong University
      HTML
      0000Updated Dec 11, 2025Dec 11, 2025
    • Xmart

      Public
      Xmart青年论坛仓库,存放历史学生论坛和前沿讲座的视频回放和讲义,获取最新Xmart预告欢迎关注公众号【XLANCE Lab】
      03100Updated Dec 2, 2025Dec 2, 2025
    • SALMONN-AHAMask

      Public
      Python
      0700Updated Nov 21, 2025Nov 21, 2025
    • x-lance.github.io

      Public template
      Welcome to X-LANCE! Cross Media Language Intelligence Lab in Shanghai Jiao Tong University.
      HTML
      13k000Updated Oct 30, 2025Oct 30, 2025
    • SLAM-LLM

      Public
      A Framework for Speech, Language, Audio, Music Processing with Large Language Model
      Python
      100939270Updated Oct 24, 2025Oct 24, 2025
    • LSCodec-Inference

      Public
      Inference code for Interspeech 2025 paper, "LSCodec: Low-Bitrate and Speaker-Decoupled Discrete Speech Codec"
      Python
      43410Updated Oct 23, 2025Oct 23, 2025
    • 0000Updated Oct 21, 2025Oct 21, 2025
    • A Universal Platform for Training and Evaluation of Mobile Interaction
      Python
      65710Updated Sep 24, 2025Sep 24, 2025
    • CogBench

      Public
      Python
      0600Updated Jul 7, 2025Jul 7, 2025
    • Python
      57410Updated Jun 25, 2025Jun 25, 2025
    • [ICML 2024] Reducing Tool Hallucination via Reliability Alignment
      Python
      1810Updated Jun 17, 2025Jun 17, 2025
    • Codes for paper "Converging to a Lingua Franca: Evolution of Linguistic Regions and Semantics Alignment in Multilingual Large Language Models"
      Python
      0600Updated May 4, 2025May 4, 2025
    • RAGTalker

      Public
      HTML
      0000Updated Apr 3, 2025Apr 3, 2025
    • [EMNLP 2021] The baseline code for WebSRC dataset.
      HTML
      95000Updated Apr 2, 2025Apr 2, 2025
    • WebSRC

      Public
      [EMNLP 2021] WebSRC: A dataset for web based structural machine reading comprehension.
      CSS
      0500Updated Apr 2, 2025Apr 2, 2025
    • 0000Updated Dec 23, 2024Dec 23, 2024
    • VQTalker

      Public
      [AAAI 2025] VQTalker: Towards Multilingual Talking Avatars through Facial Motion Tokenization
      05340Updated Dec 16, 2024Dec 16, 2024
    • UniCATS-CTX-txt2vec

      Public
      [AAAI 2024] CTX-txt2vec, the acoustic model in UniCATS
      Python
      86450Updated Nov 18, 2024Nov 18, 2024
    • MBS

      Public
      [COLING 2024] Multilingual Brain Surgeon: Large Language Models Can be Compressed Leaving No Language Behind
      Python
      1300Updated Oct 12, 2024Oct 12, 2024
    • [ICASSP 2024] This is the official code for "VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching"
      Python
      2536480Updated Sep 3, 2024Sep 3, 2024
    • AniTalker

      Public
      [ACM MM 2024] This is the official code for "AniTalker: Animate Vivid and Diverse Talking Faces through Identity-Decoupled Facial Motion Encoding"
      Jupyter Notebook
      1431.6k90Updated Aug 15, 2024Aug 15, 2024
    • [EMNLP 2022] Leaderboard of META-GUI
      CSS
      0000Updated Jul 9, 2024Jul 9, 2024
    • [EMNLP 2022] The baseline code for META-GUI dataset
      Python
      31430Updated Jul 9, 2024Jul 9, 2024
    • Python
      0600Updated Jun 21, 2024Jun 21, 2024
    • [AAAI 2024] Code for CTX-vec2wav in UniCATS
      Python
      1612970Updated Jun 11, 2024Jun 11, 2024
    • [NAACL 2024] CoE-SQL: In-Context Learning for Multi-Turn Text-to-SQL with Chain-of-Editions
      Python
      21310Updated May 7, 2024May 7, 2024
    • StoryTTS

      Public
      [ICASSP 2024] StoryTTS: A Highly Expressive Text-to-Speech Dataset with Rich Textual Expressiveness Annotations
      HTML
      414020Updated Apr 27, 2024Apr 27, 2024
    • weblm

      Public
      [WSDM 2024] Hierarchical Multimodal Pre-training for Visually Rich Webpage Understanding
      11720Updated Mar 6, 2024Mar 6, 2024
    • MSDWILD

      Public
      [INTERSPEECH 2022] This dataset is designed for multi-modal speaker diarization and lip-speech synchronization in the wild.
      HTML
      15810Updated Jan 24, 2024Jan 24, 2024
    • [EMNLP 2023 Findings] ACT-SQL: In-Context Learning for Text-to-SQL with Automatically-Generated Chain-of-Thought
      Python
      72331Updated Jan 11, 2024Jan 11, 2024