Atotti

Ayuto Tsutsumi Atotti

Student of Computer Science at Tokyo Metropolitan University

42 followers · 73 following

Tokyo Metropolitan University
Tokyo
12:56 (UTC +09:00)
https://portfolio.ayutaso.com
@aya172957

Achievements

x2 x3

Achievements

x2 x3

Highlights

Organizations

Lists (1)

Sort

web

Stars

Wataru-Nakata / latentlm-tts

Python 7 Updated Jun 18, 2026

koacai / geneses

Python 6 Updated Jan 7, 2026

xingchensong / FlashCosyVoice

FlashCosyVoice: A lightweight vLLM implementation built from scratch for CosyVoice.

Python 250 25 Updated Feb 25, 2026

NousResearch / hermes-agent

The agent that grows with you

Python 197,074 34,822 Updated Jun 19, 2026

kotoba-tech / kotoba-whisper

Python 94 11 Updated Oct 23, 2024

thunlp / UltraChat

Large-scale, Informative, and Diverse Multi-round Chat Data (and Models)

Python 2,865 139 Updated Mar 13, 2024

NVIDIA-AI-Blueprints / nemotron-voice-agent

Reference implementation of an end-to-end voice agent built using the NVIDIA Nemotron models

TypeScript 56 25 Updated Apr 22, 2026

CyberAgentAILab / cheat-sheet-icl

[EMNLP 2025 Findings] Code for "Distilling Many-Shot In-Context Learning into a Cheat Sheet"

Python 5 Updated Nov 21, 2025

wildminder / awesome-ai-voice

List of open-source TTS, voice cloning, and music generation models

349 50 Updated Apr 17, 2026

SakanaAI / kame_finetune

Python 30 8 Updated Apr 27, 2026

SakanaAI / kame

Python 92 14 Updated May 14, 2026

EleutherAI / concept-erasure

Erasing concepts from neural representations with provable guarantees

Python 255 15 Updated Jan 27, 2025

zjunlp / LLMAgentPapers

Must-read Papers on LLM Agents.

3,051 182 Updated Jun 18, 2026

neuphonic / neucodec

A package for NeuCodec: a 50hz, 0.8kbps, 24kHz audio codec.

Python 160 26 Updated Jan 27, 2026

OpenMOSS / MOSS-TTS

MOSS‑TTS Family is an open‑source speech and sound generation model family from MOSI.AI and the OpenMOSS team. It is designed for high‑fidelity, high‑expressiveness, and complex real‑world scenario…

Python 3,403 297 Updated Jun 18, 2026

dimastatz / whisper-flow

Whisper-Flow is a framework designed to enable real-time transcription of audio content using OpenAI’s Whisper model. Rather than processing entire files after upload (“batch mode”), Whisper-Flow a…

Python 773 112 Updated Apr 20, 2026

k2-fsa / sherpa-onnx

Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Andr…

C++ 13,054 1,497 Updated Jun 18, 2026