Skip to content
View soham97's full-sized avatar

Highlights

  • Pro

Block or report soham97

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Security hooks for AI coding agents : Block dangerous commands, prevent secret leaks, and enforce runtime policies across Claude, OpenClaw, Antigravity, Codex, Cursor and Windsurf

Python 40 2 Updated Apr 9, 2026
Python 7 Updated Oct 8, 2025

Make beautiful isometric infrastructure diagrams

TypeScript 19,558 1,288 Updated Apr 7, 2026

Collection of works for evaluating (and analyzing) large audio-language models (LALMs)

40 1 Updated Aug 11, 2025

Open-source unified multimodal model

Python 5,792 512 Updated Oct 27, 2025
Jupyter Notebook 52 1 Updated Mar 24, 2026

small audio language model for reasoning

Python 85 5 Updated Dec 4, 2025

A Conversational Speech Generation Model

Python 14,566 1,467 Updated May 27, 2025

Explaining audio differences using language

Python 16 Updated Feb 11, 2025

Unified automatic quality assessment for speech, music, and sound.

Python 706 51 Updated Jun 5, 2025

Code for the paper: MACE: Leveraging Audio for Evaluating Audio Captioning Systems

Python 13 1 Updated Jan 16, 2025

Audio Entailment: Deductive Reasoning for Audio Understanding

17 1 Updated Dec 10, 2024

Awesome speech/audio LLMs, representation learning, and codec models

1,219 73 Updated Apr 4, 2026

A simple library for Fréchet Audio Distance (FAD) calculation

Python 255 24 Updated Aug 22, 2025

PAM is a no-reference audio quality metric for audio generation tasks

Python 76 6 Updated Jul 19, 2024

Repository for "Training Audio Captioning Models without Audio"

10 2 Updated Sep 26, 2023

An Audio Language model for Audio Tasks

Python 319 17 Updated Apr 19, 2024

Tracking states of the arts and recent results (bibliography) on sound tasks.

32 2 Updated Jan 10, 2023

Web-crawl for "Audio Retrieval with WavText5K and CLAP Training"

Python 50 1 Updated Nov 10, 2022

Learning audio concepts from natural language supervision

Python 651 47 Updated Sep 18, 2024

Code repo for "Multi-Task Learning for Interpretable Weakly Labelled Sound Event Detection"

Python 17 4 Updated Nov 9, 2022

speech enhancement\speech seperation\sound source localization

1,231 224 Updated Nov 14, 2023

Self-Supervised Speech Pre-training and Representation Learning Toolkit

Python 2,546 532 Updated Mar 12, 2026

Reading list for research topics in Sound AI

198 8 Updated Aug 8, 2024