- PlantumAI, VocalEyes
- www.neildeshmukh.com
- @NeilDeshmukh
Stars
Completely free, unbelievably stupid wi-fi on long-haul flights
The #1 open-source voice interface for desktop, mobile, and ESP32 chips.
[ICML'24] SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large multimodal models (LMMs) such as GPT-4V(ision).
The Frontend Stack for Agents & Generative UI. React, Angular, Mobile, Slack, and more. Makers of the AG-UI Protocol
Fast, collaborative live terminal sharing over the web
A React component to view a PDF document
Turn expensive prompts into cheap fine-tuned models
The most powerful and modular diffusion model GUI, API, and backend with a graph/nodes interface.
A semantic local search engine powered by AI models.
Easily migrate your codebase from one framework or language to another.
Unofficial implementation of [StyleDrop](https://arxiv.org/abs/2306.00983)
DuCanhGH / next-pwa
Forked from shadowwalker/next-pwa
PWA for Next.js, powered by Workbox.
General video interaction platform based on LLMs, including Video ChatGPT
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Mass-editing thousands of facts into a transformer memory (ICLR 2023)
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
A large-scale benchmark for co-optimizing the design and control of soft robots, as seen in NeurIPS 2021.
Code for analysis of ADNI data
AI@MIT Workshops (Updated for Fall 2021)
Open source code for AlphaFold 2.
Lip reading with machine learning
An evolving guide to learning Deep Learning effectively.
Welcome to the FIRST AI CrashCourse, presented by Neil Deshmukh at 2019 FIRST Robotics World Championship, Houston, TX! In this workshop, we'll develop Neural Networks to predict fuel efficiency, i…
These programs are finished and available for public use. There have been no contributions to this repository since 2019.
This is a repo for Part 2 of my 2019 Project: Symptom Identification from Spoken Descriptions: End-to-End Neural Network Feature Extraction
This is a repo for Part 1 of my 2019 Project: Skin Disease Detection from Images
This is a repo for Part 3 of my 2019 Project: Disease Identification from Waveform ECG Data: End-to-End Myocardial Anomaly Detection