kinalmehta

31 stars written in Jupyter Notebook

In-depth tutorials on LLMs, RAGs and real-world AI agent applications.

Jupyter Notebook 20,677 3,438 Updated Nov 3, 2025

Official code repo for the O'Reilly Book - "Hands-On Large Language Models"

Jupyter Notebook 17,590 4,113 Updated Jul 21, 2025

This repository provides tutorials and implementations for various Generative AI Agent techniques, from basic to advanced. It serves as a comprehensive guide for building intelligent, interactive A…

Jupyter Notebook 17,566 2,880 Updated Oct 30, 2025

This repository delivers end-to-end, code-first tutorials covering every layer of production-grade GenAI agents, guiding you from spark to scale with proven patterns and reusable blueprints for re…

Jupyter Notebook 14,743 1,924 Updated Oct 30, 2025

This repository contains implementations and illustrative code to accompany DeepMind publications

Jupyter Notebook 14,431 2,778 Updated Aug 22, 2025

[WIP] Resources for AI engineers. Also contains supporting materials for the book AI Engineering (Chip Huyen, 2025)

Jupyter Notebook 11,038 1,582 Updated Feb 12, 2025

Learn Deep Reinforcement Learning in 60 days! Lectures & Code in Python. Reinforcement Learning + Deep Learning

Jupyter Notebook 4,387 628 Updated Jun 30, 2020

Fault-tolerant, highly scalable GPU orchestration, and a machine learning framework designed for training models with billions to trillions of parameters

Jupyter Notebook 3,494 580 Updated May 25, 2024

PyTorch implementation of DQN / DDQN / prioritized replay / noisy networks / distributional values / Rainbow / hierarchical RL

Jupyter Notebook 3,135 592 Updated Nov 4, 2021

Machine Learning Journal for Intermediate to Advanced Topics.

Jupyter Notebook 2,225 242 Updated Sep 8, 2025
Jupyter Notebook 1,465 201 Updated Sep 16, 2022

PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT-Opt, PointNet, and more.

Jupyter Notebook 1,302 140 Updated Mar 13, 2025

Implementation of all RL algorithms in a simpler way

Jupyter Notebook 1,237 217 Updated Aug 29, 2025

Contains high quality implementations of Deep Reinforcement Learning algorithms written in PyTorch

Jupyter Notebook 1,078 327 Updated May 19, 2021

JAX (Flax) implementation of algorithms for Deep Reinforcement Learning with continuous action spaces.

Jupyter Notebook 727 72 Updated Oct 26, 2022

This is a toolbox repository to help evaluate various methods that perform image matching from a pair of images.

Jupyter Notebook 584 83 Updated Apr 29, 2024

Repository with code and slides for a tutorial on causal inference.

Jupyter Notebook 583 113 Updated Sep 23, 2019

A walkthrough of transformer architecture code

Jupyter Notebook 371 63 Updated Feb 20, 2024

Minimal standalone example of diffusion model

Jupyter Notebook 161 16 Updated Jun 4, 2022

Code release for Learning with Opponent-Learning Awareness and variations.

Jupyter Notebook 151 37 Updated Apr 13, 2023

Reinforcement Learning research

Jupyter Notebook 117 67 Updated Sep 5, 2021

Deep Reinforcement Learning algorithms implemented with TensorFlow 2.3

Jupyter Notebook 102 36 Updated Jul 19, 2022

A Cookbook to start building with LLMs

Jupyter Notebook 96 12 Updated Apr 19, 2024
Jupyter Notebook 79 39 Updated Oct 1, 2019

Solving a Rubik's Cube and the 15 Puzzle using Deep Reinforcement Learning and search

Jupyter Notebook 48 7 Updated Jan 27, 2022
Jupyter Notebook 28 4 Updated Nov 22, 2019

A self-contained JAX implementation of DIScriminator DisAgreement INtrinsic Reward (DISDAIN).

Jupyter Notebook 10 4 Updated Jun 8, 2022

This repo is for members of the rl-implementation channel on MLC Discord to play with RL algorithms and learn.

Jupyter Notebook 10 4 Updated Jan 25, 2022

Contains code and documentation for the Qualcomm tutorial sessions

Jupyter Notebook 5 Updated Oct 14, 2023

cheat sheet

Jupyter Notebook 2 Updated Feb 8, 2025