gqa
Here are 18 public repositories matching this topic...
Predicting a subgraph alongside the answer in a graph-based VQA model
Updated Jan 21, 2025 - Python
Vision-language: solving the GQA (Visual Reasoning in the Real World) dataset.
Updated May 9, 2019 - Python
A RAG-based question-answering system that processes user queries using local documents. It extracts relevant information to answer questions, falling back to a large language model when local sources are insufficient, ensuring accurate and contextual responses.
Updated Dec 20, 2024 - Python
Reference Flash Attention implementation in PyTorch with V1/V2, GQA/MQA, Triton kernels, benchmark and docs.
Updated Mar 13, 2026 - Python
A code deep-dive on one of the key innovations from DeepSeek: Multi-head Latent Attention (MLA)
Updated Nov 9, 2025 - Python
Source code for my honours thesis: "Graph Attention Networks for Compositional Visual Question Answering"
Updated Jan 29, 2021 - Python
A from-scratch PyTorch LLM implementing Sparse Mixture-of-Experts (MoE) with Top-2 gating. Integrates modern Llama-3 components (RMSNorm, SwiGLU, RoPE, GQA) and a custom-coded Byte-Level BPE tokenizer. Pre-trained on a curated corpus of existential & dark philosophical literature.
Updated Jan 7, 2026 - Python
Modern LLM Attention from Scratch — MHA, GQA, MQA, RoPE, and KV-Cache implemented in pure PyTorch.
Updated Dec 7, 2025 - Python
This repository contains an implementation of Grouped-Query Attention (GQA), an efficient variant of multi-head attention used in modern transformer models such as LLaMA.
Updated Apr 11, 2026 - Python
Case study of multi-layer perceptron and random forest techniques as applied to a subset of the GQA dataset.
Updated Jun 27, 2022 - Python
Compare LLM architectures without downloading weights — structural fingerprint & proxy-test advisor for vLLM, TensorRT-LLM, SGLang, ONNX Runtime
Updated Mar 18, 2026 - Python
Korean 3B LLM (pure Transformer) pretrained from scratch on 8× NVIDIA B200 GPUs with SFT + ORPO alignment
Updated Mar 26, 2026 - Python
Implementing and benchmarking four attention variants from scratch in PyTorch, trained on Lichess chess games.
Updated Mar 25, 2026 - Python