Skip to content
View sumukhashridhar's full-sized avatar

Block or report sumukhashridhar

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Technical Analysis Indicator Function Library in C

C 936 176 Updated Feb 2, 2024

Find your trading edge, using the fastest engine for backtesting, algorithmic trading, and research.

Python 6,340 836 Updated Dec 12, 2025

Python quantitative trading strategies including VIX Calculator, Pattern Recognition, Commodity Trading Advisor, Monte Carlo, Options Straddle, Shooting Star, London Breakout, Heikin-Ashi, Pair Tra…

Python 8,821 1,650 Updated Apr 14, 2024

Fast Multi-dimensional Sparse Attention

C++ 688 53 Updated Dec 24, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 66,109 12,166 Updated Dec 24, 2025

JAX-LOB: A GPU-Accelerated limit order book simulator to unlock large scale reinforcement learning for trading

Jupyter Notebook 132 24 Updated Dec 9, 2025

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.

Python 49,860 4,112 Updated Dec 23, 2025

Tiled Flash Linear Attention library for fast and efficient mLSTM Kernels.

Jupyter Notebook 79 6 Updated Nov 25, 2025
Jupyter Notebook 113 7 Updated Dec 9, 2025

UCSD ECE277 GPU Programming coursework: GPU-accelerated reinforcement learning on CUDA C with Nsight System

Cuda 11 7 Updated Aug 17, 2021
Jupyter Notebook 8,544 1,663 Updated Sep 22, 2024

Notes, material and various stuff collected while attended TUM Master's Degree

HTML 483 99 Updated Oct 4, 2022

Add a stalin sort algorithm in any language you like ❣️ if you like give us a ⭐️

Rocq Prover 1,646 190 Updated Oct 28, 2025

Bitonic sort algorithm for GPU

Cuda 7 Updated Aug 12, 2020

A comparison study between sequential sorting algorithms implemented in C++ and parallel sorting algorithms implemented in CUDA as part of the master's thesis.

C++ 65 5 Updated Oct 28, 2021

Blog posts

6 Updated Jun 24, 2025

Tracking RISC-V Actions on Education, Training, Courses, Monitorships, etc.

1,276 128 Updated Nov 6, 2025

Inline PTX Assembly in CUDA example

Cuda 13 3 Updated May 7, 2022

Training materials associated with NVIDIA's CUDA Training Series (www.olcf.ornl.gov/cuda-training-series/)

Cuda 925 339 Updated Aug 19, 2024

Parallelisation of standard algorithms using CUDA

C++ 1 Updated Oct 7, 2017

Yinghan's Code Sample

Cuda 361 61 Updated Jul 25, 2022

Kernels for attention and other diffusion specific tasks.

Cuda 9 Updated Apr 19, 2025

row-major matmul optimization

C++ 692 94 Updated Aug 20, 2025

FFmpeg Assembly Language Lessons

11,298 362 Updated Nov 7, 2025

How to be low-level programmer

12,468 870 Updated Mar 24, 2025

Playing around "Less Slow" coding practices in C++ 20, C, CUDA, PTX, & Assembly, from numerics & SIMD to coroutines, ranges, exception handling, networking and user-space IO

C++ 1,886 81 Updated Dec 23, 2025

This repository contains LLM (Large language model) interview question asked in top companies like Google, Nvidia , Meta , Microsoft & fortune 500 companies.

1,587 346 Updated Feb 12, 2025

CNN based autoencoder combined with kernel density estimation for colour image anomaly detection / novelty detection. Built using Tensforflow 2.0 and Keras

Jupyter Notebook 36 22 Updated Jan 29, 2020

Lists of company wise questions available on leetcode premium. Every csv file in the companies directory corresponds to a list of questions on leetcode for a specific company based on the leetcode …

Jupyter Notebook 10,960 2,292 Updated Jul 16, 2024
Next