Stars
Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
FULL Augment Code, Claude Code, Cluely, CodeBuddy, Comet, Cursor, Devin AI, Junie, Kiro, Leap.new, Lovable, Manus, NotionAI, Orchids.app, Perplexity, Poke, Qoder, Replit, Same.dev, Trae, Traycer AI…
Automatically crawl arXiv papers daily and summarize them using AI. Illustrating them using GitHub Pages.
An extension that lets the AI take the wheel, allowing it to use the mouse and keyboard, recognize UI elements, and prompt itself :3...now also act as a research assistant
OCR, layout analysis, reading order, table recognition in 90+ languages
The official code for "One Fits All: Power General Time Series Analysis by Pretrained LM (NeurIPS 2023 Spotlight)"
A real world full-stack application using LlamaIndex
Easy training on custom dataset. Various backends (MobileNet and SqueezeNet) supported. A YOLO demo to detect raccoon run entirely in brower is accessible at https://git.io/vF7vI (not on Windows).
Deep neural networks for voice conversion (voice style transfer) in Tensorflow
The Unity Machine Learning Agents Toolkit (ML-Agents) is an open-source project that enables games and simulations to serve as environments for training intelligent agents using deep reinforcement …
Real-time face detection and emotion/gender classification using fer2013/imdb datasets with a keras CNN model and openCV.
Interactive Image Generation via Generative Adversarial Networks
The world's simplest facial recognition api for Python and the command line
2018 phm data challenge, ion mill machine RUL & fault diagnosis