Skip to content
View davda54's full-sized avatar

Block or report davda54

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Minimal and highly hackable implementation of Looped Transformers with GPT

Python 20 1 Updated Mar 8, 2026

Minimal Claude Code alternative. Single Python file, zero dependencies, ~250 lines.

Python 2,275 224 Updated Jan 14, 2026

A highly compressive and high-quality neural audio codec for speech models.

Python 262 25 Updated Jan 23, 2026

A Norwegian Language Understanding and Generation Evaluation Benchmark

Python 8 2 Updated Nov 25, 2025

The official repository for AdaMuon

Python 37 4 Updated Aug 27, 2025

Fast & memory efficient hashtable based on robin hood hashing for C++11/14/17/20

C++ 1,606 156 Updated May 1, 2023

Trully flash implementation of DeBERTa disentangled attention mechanism.

Python 85 6 Updated Feb 10, 2026

Efficient optimizers

Python 311 27 Updated Apr 4, 2026

supporting pytorch FSDP for optimizers

Python 84 4 Updated Dec 8, 2024

Teaching transformers to play chess

Python 153 12 Updated Dec 31, 2025

Official implementation of "GPT or BERT: why not both?"

Python 63 10 Updated Jul 28, 2025

Fast and accurate language identifier

Rust 6 Updated Apr 10, 2026

Truly independent web browser

C++ 62,335 2,939 Updated Apr 13, 2026

Official implementation of "BERTs are Generative In-Context Learners"

Python 32 Updated Mar 14, 2025

Implementation of the paper "Compositional Generalization with Grounded Language Models", ACL 2024 Findings

Python 3 Updated Jun 3, 2024
Python 4 Updated Jun 16, 2024

Highlight errors in a bib file: missing URLs, capitalization protection, etc

TypeScript 28 Updated May 12, 2024

Suplementary code for the NORA large language models

Python 8 Updated Feb 3, 2025
Python 22 7 Updated Apr 14, 2025

Scripts and documentation on scaling large language model training on the LUMI supercomputer

Shell 10 3 Updated Jun 30, 2023
Python 34 3 Updated Jan 25, 2024
Python 16 Updated May 14, 2024

Enabling easy statistical significance testing for deep neural networks.

Python 340 20 Updated Jul 1, 2024

LTG-Bert

Python 34 5 Updated Jan 8, 2024

PyTorch interface for TrueGrad Optimizers

Python 43 1 Updated Aug 8, 2023
Python 3 Updated Nov 23, 2022
Python 16 3 Updated Oct 31, 2022
Python 12 1 Updated Jan 2, 2024

Implementation of the Adan (ADAptive Nesterov momentum algorithm) Optimizer in Pytorch

Python 253 10 Updated Sep 1, 2022
Next