-
Sunwater Institute | Reframe Data Services
- DMV Area
- MinShiMia.github.io
- https://orcid.org/0000-0001-6135-2673
- in/min-mia-shi
- @MiaShi19
Highlights
- Pro
Stars
Production-ready Claude subagents collection with 100+ specialized AI agents for full-stack development, DevOps, data science, and business operations.
This project is a beginner-friendly demonstration of a Retrieval-Augmented Generation (RAG) system built entirely on local, open-source technologies. The application allows a user to upload their r…
The official public repo for the coding session of the 2025 Congressional Hackathon!
code samples for the goodreads datasets
Robust Speech Recognition via Large-Scale Weak Supervision
AI enabled pair programmer for Claude, GPT, O Series, Grok, Deepseek, Gemini and 300+ models
Lightweight coding agent that runs in your terminal
Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-of-use, backed by research.
🦜🔗 The platform for reliable agents.
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials,…
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
Data Structure Algorithms, (GenAI/ML) System Design, Machine Learning, DevOps coding interview practices
An open-source AI agent that brings the power of Gemini directly into your terminal.
This is a python API which allows you to get the transcript/subtitles for a given YouTube video. It also works for automatically generated subtitles and it does not require an API key nor a headles…
The official Python library for the OpenAI API
A topic-centric list of HQ open datasets.
😎 A curated list of awesome GitHub Profile which updates in real time
code for Data Science From Scratch book
Python Data Science Handbook: full text in Jupyter Notebooks
✅ The programmer-friendly testing framework for Java and the JVM
the resources I use to learn computer science in my spare time