@DigitalPhonetics

Speech and Language Technology (SaLT) at the University of Stuttgart

Research institute in the field of speech, natural language processing and machine learning

Pinned

  1. IMS-Toucan Public

    Controllable and fast Text-to-Speech for over 7000 languages!

    Python · 1.5k stars · 174 forks

  2. VoicePAT Public

    VoicePAT is a modular and efficient toolkit for voice privacy research, with main focus on speaker anonymization.

    Shell · 48 stars · 4 forks

  3. bloomzmms Public

    Materials for the publication "Teaching a Multilingual Large Language Model to Understand Multilingual Speech via Multi-Instructional Training"

    Python · 2 stars

  4. conversational-tree-search Public

    Code and Data for Conversational Tree Search: A new task that bridges the gap between FAQ-style information retrieval and task-oriented dialog.

    Python · 7 stars

Repositories

Showing 10 of 18 repositories
  • diagraph Public

    DIAGRAPH: An open-source graphic interface for dialog flow design

    Python · 4 stars · GPL-3.0 · 1 fork · 0 open issues · 0 open pull requests · Updated Feb 15, 2025
  • conversational-tree-search Public

    Code and Data for Conversational Tree Search: A new task that bridges the gap between FAQ-style information retrieval and task-oriented dialog.

    Python · 7 stars · 0 forks · 1 open issue · 0 open pull requests · Updated Feb 5, 2025
  • Jupyter Notebook · 4 stars · 1 fork · 0 open issues · 0 open pull requests · Updated Jan 30, 2025
  • Intrinsic-Subgraph-Generation-for-VQA Public

    Predicting a subgraph alongside the answer in a graph based VQA model

    Python · 8 stars · MIT · 1 fork · 1 open issue · 0 open pull requests · Updated Jan 21, 2025
  • IMS-Toucan Public

    Controllable and fast Text-to-Speech for over 7000 languages!

    Python · 1,541 stars · Apache-2.0 · 174 forks · 11 open issues · 0 open pull requests · Updated Nov 7, 2024
  • speaker-anonymization Public

    Speaker anonymization pipeline for hiding the identity of the speaker of a recording by changing the voice in it.

    Shell · 70 stars · GPL-3.0 · 6 forks · 2 open issues · 0 open pull requests · Updated Sep 13, 2024
  • Python · 4 stars · 0 forks · 0 open issues · 0 open pull requests · Updated Jul 3, 2024
  • bloomzmms Public

    Materials for the publication "Teaching a Multilingual Large Language Model to Understand Multilingual Speech via Multi-Instructional Training"

    Python · 2 stars · Apache-2.0 · 0 forks · 0 open issues · 0 open pull requests · Updated Jun 16, 2024
  • multilingual-seq2seq-slu Public

    Materials for the publication "Leveraging Multilingual Self-Supervised Pretrained Models for Sequence-to-Sequence End-to-End Spoken Language Understanding"

    Python · 2 stars · Apache-2.0 · 0 forks · 0 open issues · 0 open pull requests · Updated Jun 16, 2024
  • VoicePAT Public

    VoicePAT is a modular and efficient toolkit for voice privacy research, with main focus on speaker anonymization.

    Shell · 48 stars · Apache-2.0 · 4 forks · 2 open issues · 0 open pull requests · Updated May 14, 2024
