Stars
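《开源大模型食用指南》 (A Practical Guide to Open-Source Large Models): a tutorial tailored for Chinese users on quickly fine-tuning (full-parameter/LoRA) and deploying Chinese and international open-source large language models (LLMs) and multimodal large language models (MLLMs) in a Linux environment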
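This project covers the topics and code implementations commonly tested in Machine Learning, Deep Learning, and NLP interviews, which are also the theoretical fundamentals every algorithm engineer is expected to know.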
Foundational Models for State-of-the-Art Speech and Text Translation
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
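Large-model knowledge sharing that everyone can understand; a must-read before LLM interviews in the spring/autumn recruitment seasons, so you can talk confidently with your interviewer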
Intelligent multilingual AI video dubbing/translation tool - Linly-Dubbing: "AI empowers, language without borders"
This repo contains the official implementation of "Fine-tune the pretrained ATST model for sound event detection".
This repository aims to collect Transformer-based sound event detection (SED) algorithms.
Source & evaluation code for ICAMCS 2024 paper "Emotional Vietnamese Speech-Based Depression Diagnosis Using Dynamic Attention Mechanism"