KalyanKS-NLP

Starred repositories

Open source implementation and extension of Google Research’s PaperBanana for automated academic figures, diagrams, and research visuals, expanded to new domains like slide generation.

Python 1,605 244 Updated Apr 22, 2026

SQL-like query language and CLI for Qdrant vector search engine

Python 45 4 Updated May 16, 2026

This repo is meant to serve as a guide for Machine Learning/AI technical interviews.

Jupyter Notebook 8,270 1,473 Updated Nov 28, 2025

DFlash & TurboQuant in llama.cpp with up to 3x faster generation and 7.5x more KV cache in same VRAM

C++ 351 16 Updated May 17, 2026

Advanced prompt injection defense system for AI agents. Multi-language detection, severity scoring, and security auditing.

Python 157 29 Updated Apr 22, 2026

Jupyter Notebook 62 4 Updated May 14, 2026

HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal

Jupyter Notebook 950 141 Updated Aug 16, 2024

A fast type checker and language server for Python

Rust 6,083 358 Updated May 17, 2026

Fastino's LLM guardrail

27 3 Updated May 12, 2026

A curated list of resources for learning and exploring Triton, OpenAI's programming language for writing efficient GPU code.

Python 483 29 Updated Mar 10, 2025

Minimal keyword extraction with BERT

Python 4,175 383 Updated May 13, 2026

ML algorithms implemented and derived from first-principles in Jupyter Notebooks and NumPy

TeX 1,413 133 Updated Apr 2, 2026

Modal-style sandbox API on top of Hugging Face Jobs

Python 141 11 Updated May 11, 2026

DFlash: Block Diffusion for Flash Speculative Decoding

Python 4,617 330 Updated May 10, 2026

Codebase for LLM Textual Hallucination Benchmark

Python 80 18 Updated Apr 25, 2025

Development repository for the Triton language and compiler

MLIR 19,203 2,855 Updated May 17, 2026

Python 348 37 Updated May 4, 2026

TokenSpeed is a speed-of-light LLM inference engine.

Python 1,036 94 Updated May 17, 2026

Pure Rust Inference Engine

Rust 333 37 Updated May 17, 2026

MedSafetyBench: Evaluating and Improving the Medical Safety of LLMs, NeurIPS 2024

Python 47 5 Updated Dec 4, 2025

Hardware implementation of transformers running microgpt at 50k+ tokens per second

Verilog 651 70 Updated May 14, 2026

Domain-specific GLiNER fine-tuning pipeline for sports NER. GLiNER is a bidirectional transformer encoder (DeBERTa-v3 based) for zero-shot named entity recognition — I'm fine-tuning it on a custom …

Jupyter Notebook 3 Updated Apr 21, 2026

Python 23 5 Updated Apr 25, 2025

OpenAI Privacy Filter

Python 2,171 191 Updated Apr 22, 2026

A framework for few-shot evaluation of language models.

Python 12,596 3,275 Updated May 11, 2026

DuoGuard: A Two-Player RL-Driven Framework for Multilingual LLM Guardrails

Python 33 4 Updated Feb 26, 2025

Restore heading hierarchy in markdown documents using a fine-tuned 0.6B parameter LLM.

Python 4 Updated Apr 17, 2026

Claude skill for finding ML research papers.

208 19 Updated Apr 14, 2026

KV cache compression via block-diagonal rotation. Beats TurboQuant: better PPL (6.91 vs 7.07), 28% faster decode, 5.3x faster prefill, 44x fewer params. Drop-in llama.cpp integration.

Python 986 83 Updated Apr 23, 2026