Skip to content
View roshray's full-sized avatar
:octocat:
got to keep data obfuscated.
:octocat:
got to keep data obfuscated.

Organizations

@OpenMined @cudakernel @Wotline

Block or report roshray

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

A framework for serving and evaluating LLM routers - save LLM costs without compromising quality

Python 4,478 356 Updated Aug 10, 2024

Repository for "Introduction to Artificial Neural Networks and Deep Learning: A Practical Guide with Applications in Python"

Jupyter Notebook 2,814 744 Updated Oct 2, 2020

A faster implementation of OpenCV-CUDA that uses OpenCV objects, and more!

Cuda 54 7 Updated Nov 12, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 65,828 12,089 Updated Dec 20, 2025

Llama2 inference in one TypeScript file

JavaScript 19 2 Updated May 29, 2025

Vim plugin for LLM-assisted code/text completion

Vim Script 1,779 86 Updated Oct 28, 2025

Port of OpenAI's Whisper model in C/C++

C++ 45,197 5,027 Updated Dec 18, 2025

Codebase for Aria - an Open Multimodal Native MoE

Jupyter Notebook 1,085 91 Updated Jan 22, 2025

Learn how to design, develop, deploy and iterate on production-grade ML applications.

Jupyter Notebook 45,191 7,061 Updated Aug 18, 2024

LLM101n: Let's build a Storyteller

35,911 1,963 Updated Aug 1, 2024

⚡ Fastest way to serve open source ML models to millions

Python 836 85 Updated Dec 19, 2025

A demo application using fal.realtime and the lightning fast SDXL API provided by fal

JavaScript 579 147 Updated Sep 24, 2024

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Python 13,085 869 Updated Dec 17, 2024

LLM inference in C/C++

C++ 91,645 14,166 Updated Dec 20, 2025

Framework for building and maintaining self-updating prompts for LLMs

Python 65 4 Updated Jun 9, 2024

Open-source search and retrieval database for AI applications.

Rust 25,060 1,975 Updated Dec 20, 2025

The AgentForge project focuses on building general tooling to construct multicapability AI systems by composing skills and models together, and having intelligent processes use those skills/models.

18 Updated Oct 11, 2023

Easy to maintain open source documentation websites.

TypeScript 63,060 9,602 Updated Dec 17, 2025

Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of …

Go 15,265 1,161 Updated Dec 19, 2025

Recoil is an experimental state management library for React apps. It provides several capabilities that are difficult to achieve with React alone, while being compatible with the newest features o…

JavaScript 19,562 1,222 Updated Jan 1, 2025

A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

Python 23,174 3,036 Updated Aug 15, 2024

GPT3 Chrome Extension Starter Kit

TypeScript 17 6 Updated Jan 16, 2023

Query Engine for AI - The only MCP Server you'll ever need

Python 38,098 6,068 Updated Dec 20, 2025
Jupyter Notebook 3 2 Updated Nov 21, 2021

Robust Speech Recognition via Large-Scale Weak Supervision

Python 92,181 11,547 Updated Dec 15, 2025

Construct a modern data stack and orchestration the workflows to create high quality data for analytics and ML applications.

Jupyter Notebook 232 37 Updated Sep 12, 2022

A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API

Jupyter Notebook 14,096 2,110 Updated Aug 8, 2024
Jupyter Notebook 14 5 Updated Oct 15, 2017

Reproducing Yann LeCun 1989 paper "Backpropagation Applied to Handwritten Zip Code Recognition", to my knowledge the earliest real-world application of a neural net trained with backpropagation.

Jupyter Notebook 701 76 Updated Feb 3, 2024
Next