reginazhai, Stanford, CA

22 stars written in Python

📚 Freely available programming books

Python 376,215 65,319 Updated Nov 4, 2025

Speech recognition module for Python, supporting several engines and APIs, online and offline.

Python 8,888 2,439 Updated Oct 28, 2025

👋 Hey there new grad🎉! We've put together a collection of full-time job openings for SWE, Quant, PM and tech roles in 2024! 🚀

Python 6,286 563 Updated Nov 26, 2024

ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)

Python 3,686 456 Updated Oct 14, 2025

Offline text-to-speech synthesis for Python

Python 2,433 354 Updated Nov 6, 2025

Learning Correspondence from the Cycle-consistency of Time (CVPR 2019)

Python 726 114 Updated Jun 26, 2019

An implementation of SpecAugment, introduced by Google Brain, in TensorFlow and PyTorch

Python 655 136 Updated Apr 5, 2022

Extract video features from raw videos using multiple GPUs. We support RAFT flow frames as well as S3D, I3D, R(2+1)D, VGGish, CLIP, and TIMM models.

Python 624 102 Updated Jan 31, 2025

End-to-end ASR/LM implementation with PyTorch

Python 594 138 Updated Aug 30, 2021

PyTorch implementation of video captioning

Python 400 130 Updated Aug 19, 2019

A tool for measuring Python class cohesion.

Python 247 6 Updated Dec 9, 2024

A PyTorch implementation of a Transducer model for end-to-end speech recognition

Python 237 57 Updated May 12, 2020

This repository contains the code for a video captioning system inspired by Sequence to Sequence -- Video to Text. This system takes as input a video and generates a caption in English describing t…

Python 172 66 Updated Oct 12, 2019

Inflate DenseNet and ResNet as per I3D with ImageNet weight transfer

Python 152 22 Updated Apr 28, 2021

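A deep learning model for video captioning, built on PyTorch with a Transformer architecture. The video captioning task takes a video as input and outputs one sentence describing its overall content (assuming the video is short enough to be described in a single sentence). The main goal of this repo is to help visually impaired people enjoy online videos and perceive their surroundings, and to promote the development of "accessible video".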

Python 96 17 Updated Mar 12, 2022

A PyTorch implementation of state-of-the-art video captioning models from 2015-2019 on the MSVD and MSRVTT datasets.

Python 73 16 Updated Jul 30, 2023

Official Code Release for "Adapting Pre-trained Vision Transformers from 2D to 3D through Weight Inflation Improves Medical Image Segmentation" (ML4H 2022)

Python 52 11 Updated Jun 8, 2023
Python 42 6 Updated Jan 11, 2021

Python implementation for extracting several visual feature representations from videos

Python 23 5 Updated Jul 19, 2021
Python 21 18 Updated Feb 9, 2024
Python 2 Updated Sep 17, 2022
Python 2 Updated Jan 15, 2024