Stars
📚 Freely available programming books
Collection of Summer 2026 tech internships!
A library for efficient similarity search and clustering of dense vectors.
Github Pages template based upon HTML and Markdown for personal, portfolio-based websites.
A collection of full time roles in SWE, Quant, and PM for new grads.
Unsupervised text tokenizer for Neural Network-based text generation.
Speech recognition module for Python, supporting several engines and APIs, online and offline.
This repo is meant to serve as a guide for Machine Learning/AI technical interviews.
👋 Hey there new grad🎉! We've put together a collection of full-time job openings for SWE, Quant, PM and tech roles in 2024! 🚀
https://huyenchip.com/ml-interviews-book/
Machine Learning and Computer Vision Engineer - Technical Interview Questions
ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)
An opinionated, actionable guide for software engineering interviews.
Offline Text To Speech synthesis for python
Learning Correspondence from the Cycle-consistency of Time (CVPR 2019)
A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain
Extract video features from raw videos using multiple GPUs. We support RAFT flow frames as well as S3D, I3D, R(2+1)D, VGGish, CLIP, and TIMM models.
End-to-end ASR/LM implementation with PyTorch
pytorch implementation of video captioning
A Pytorch Implementation of Transducer Model for End-to-End Speech Recognition
Python implementation of Bayesian Knowledge Tracing and extensions
This repository contains the code for a video captioning system inspired by Sequence to Sequence -- Video to Text. This system takes as input a video and generates a caption in English describing t…
Inflate DenseNet and ResNet as per I3D with ImageNet weight transfer
这是一个基于Pytorch平台、Transformer框架实现的视频描述生成 (Video Captioning) 深度学习模型。 视频描述生成任务指的是:输入一个视频,输出一句描述整个视频内容的文字(前提是视频较短且可以用一句话来描述)。本repo主要目的是帮助视力障碍者欣赏网络视频、感知周围环境,促进“无障碍视频”的发展。
A PyTorch implementation of state of the art video captioning models from 2015-2019 on MSVD and MSRVTT datasets.
Use CLIP to represent video for Retrieval Task
Official Code Release for "Adapting Pre-trained Vision Transformers from 2D to 3D through Weight Inflation Improves Medical Image Segmentation" (ML4H 2022)