soham97

Soham soham97

Applied Scientist at Microsoft

63 followers · 3 following

Stars

PrismorSec / immunity-agent

Security for AI agents: block dangerous commands, prevent secret leaks, and enforce runtime policies across Claude, OpenClaw, Antigravity, Codex, Cursor, and Windsurf

Python 42 2 Updated Apr 11, 2026

satvik-dixit / aura

Python 7 Updated Oct 8, 2025

stan-smith / FossFLOW

Make beautiful isometric infrastructure diagrams

TypeScript 19,577 1,291 Updated Apr 11, 2026

ckyang1124 / LALM-Evaluation-Survey

Collection of works for evaluating (and analyzing) large audio-language models (LALMs)

40 1 Updated Aug 11, 2025

ByteDance-Seed / Bagel

Open-source unified multimodal model

Python 5,798 513 Updated Oct 27, 2025

snap-research / GenAU

Jupyter Notebook 53 1 Updated Mar 24, 2026

soham97 / mellow

small audio language model for reasoning

Python 85 5 Updated Dec 4, 2025

SesameAILabs / csm

A Conversational Speech Generation Model

Python 14,571 1,468 Updated May 27, 2025

soham97 / ADIFF

Explaining audio differences using language

Python 16 Updated Feb 11, 2025

facebookresearch / audiobox-aesthetics

Unified automatic quality assessment for speech, music, and sound.

Python 706 51 Updated Jun 5, 2025

satvik-dixit / mace

Code for the paper: MACE: Leveraging Audio for Evaluating Audio Captioning Systems

Python 13 1 Updated Jan 16, 2025

microsoft / AudioEntailment

Audio Entailment: Deductive Reasoning for Audio Understanding

17 1 Updated Dec 10, 2024

ga642381 / speech-trident

Awesome speech/audio LLMs, representation learning, and codec models

1,219 73 Updated Apr 4, 2026

microsoft / fadtk

A simple library for Fréchet Audio Distance (FAD) calculation

Python 255 24 Updated Aug 22, 2025

soham97 / PAM

PAM is a no-reference audio quality metric for audio generation tasks

Python 76 6 Updated Jul 19, 2024

microsoft / NoAudioCaptioning

Repository for "Training Audio Captioning Models without Audio"

10 2 Updated Sep 26, 2023

microsoft / Pengi

An Audio Language model for Audio Tasks

Python 319 17 Updated Apr 19, 2024

soham97 / sound_ai_progress

Tracking the state of the art and recent results (bibliography) on sound tasks.

32 2 Updated Jan 10, 2023

microsoft / WavText5K

Web-crawl for "Audio Retrieval with WavText5K and CLAP Training"

Python 50 1 Updated Nov 10, 2022

microsoft / CLAP

Learning audio concepts from natural language supervision

Python 652 47 Updated Sep 18, 2024

soham97 / MTL_Weakly_labelled_audio_data

Code repo for "Multi-Task Learning for Interpretable Weakly Labelled Sound Event Detection"

Python 17 4 Updated Nov 9, 2022

WenzheLiu-Speech / awesome-speech-enhancement

speech enhancement / speech separation / sound source localization

1,231 224 Updated Nov 14, 2023

s3prl / s3prl

Self-Supervised Speech Pre-training and Representation Learning Toolkit

Python 2,546 532 Updated Mar 12, 2026

soham97 / awesome-sound_event_detection

Reading list for research topics in Sound AI

198 8 Updated Aug 8, 2024
