View rosrad's full-sized avatar

An awesome list for students who prepare for IELTS in public domains (on-going)

920 115 Updated Jul 11, 2023

Towards hot directions in industrial end to end speech recognition

331 40 Updated Nov 30, 2021

The GitHub repository for the paper "Informer" accepted by AAAI 2021.

Python 6,354 1,283 Updated Jun 20, 2025

Robot vision, mobile robots, VS-SLAM, ORB-SLAM2, deep learning object detection, yolov3, behavior detection, opencv, PCL, machine learning, autonomous driving

C++ 8,516 2,804 Updated Jul 9, 2024

Infographic about the inner computations of a transformer model, training and inference

85 6 Updated Apr 15, 2024

Implementation of LambdaNetworks, a new approach to image recognition that reaches SOTA with less compute

Python 1,529 156 Updated Nov 18, 2020

[ICLR 2020] Lite Transformer with Long-Short Range Attention

Python 610 82 Updated Jul 11, 2024

PyTorch end-to-end speech recognition

Python 49 8 Updated Dec 30, 2020

Speech Recognition using DeepSpeech2.

Python 2,136 628 Updated Dec 13, 2022

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 154,146 31,509 Updated Dec 22, 2025

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 32,040 6,637 Updated Sep 30, 2025

Language-Agnostic SEntence Representations

Jupyter Notebook 3,660 464 Updated May 2, 2024

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

1,828 237 Updated Jul 22, 2025

Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.

Python 30,607 3,626 Updated Dec 22, 2025

End-to-end ASR/LM implementation with PyTorch

Python 594 137 Updated Aug 30, 2021

CodeHub is an iOS application written using Xamarin

C# 22,682 611 Updated Jun 22, 2022

Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.

Python 14,645 2,256 Updated Dec 1, 2025

PyTorch implementation of OCR system using CRNN + CTCLoss

Python 220 55 Updated Jul 12, 2019

Visualizer for neural network, deep learning and machine learning models

JavaScript 32,067 3,051 Updated Dec 22, 2025

a language for fast, portable data-parallel computation

C++ 6,478 1,095 Updated Dec 19, 2025

Command-line program to download videos from YouTube.com and other video sites

Python 139,224 10,575 Updated Nov 26, 2025

This is now the official location of the Kaldi project.

Shell 726 237 Updated Jul 28, 2019

Open MPI main development repository

C 2,493 936 Updated Dec 21, 2025

Deep Learning for Speech Recognition based on Theano

Python 15 5 Updated Jul 28, 2017

An Open Source Machine Learning Framework for Everyone

C++ 193,002 75,146 Updated Dec 23, 2025

Microsoft Cognitive Toolkit (CNTK), an open source deep-learning toolkit

C++ 17,604 4,260 Updated Mar 11, 2023

Build cross-platform desktop apps with JavaScript, HTML, and CSS

C++ 119,488 16,844 Updated Dec 22, 2025

Recurrent Neural Network and Long Short Term Memory (LSTM) with Connectionist Temporal Classification implemented in Theano. Includes a Toy training example.

Python 220 81 Updated Jul 26, 2016

Play with neural networks!

TypeScript 12,699 2,674 Updated Sep 10, 2025

BeamformIt acoustic beamforming software

C++ 374 113 Updated May 19, 2020