Stars
An awesome list for students who prepare for IELTS in public domains (on-going)
Towards hot directions in industrial end to end speech recognition
The GitHub repository for the paper "Informer" accepted by AAAI 2021.
机器人视觉 移动机器人 VS-SLAM ORB-SLAM2 深度学习目标检测 yolov3 行为检测 opencv PCL 机器学习 无人驾驶
Infographic about the inner computations of a transformer model, training and inference
Implementation of LambdaNetworks, a new approach to image recognition that reaches SOTA with less compute
[ICLR 2020] Lite Transformer with Long-Short Range Attention
Speech Recognition using DeepSpeech2.
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Language-Agnostic SEntence Representations
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.
End-to-end ASR/LM implementation with PyTorch
CodeHub is an iOS application written using Xamarin
Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
Pytorch implementation of OCR system using CRNN + CTCLoss
Visualizer for neural network, deep learning and machine learning models
a language for fast, portable data-parallel computation
Command-line program to download videos from YouTube.com and other video sites
tramphero / kaldi
Forked from kaldi-asr/kaldiThis is now the official location of the Kaldi project.
Deep Learning for Speech Recogntion based on Theano
An Open Source Machine Learning Framework for Everyone
Microsoft Cognitive Toolkit (CNTK), an open source deep-learning toolkit
Build cross-platform desktop apps with JavaScript, HTML, and CSS
Recurrent Neural Network and Long Short Term Memory (LSTM) with Connectionist Temporal Classification implemented in Theano. Includes a Toy training example.