Skip to content
View davda54's full-sized avatar

Block or report davda54

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Minimal and highly hackable implementation of Looped Transformers with GPT

Python 20 1 Updated Mar 8, 2026

Minimal Claude Code alternative. Single Python file, zero dependencies, ~250 lines.

Python 2,195 206 Updated Jan 14, 2026

A highly compressive and high-quality neural audio codec for speech models.

Python 261 25 Updated Jan 23, 2026

A Norwegian Language Understanding and Generation Evaluation Benchmark

Python 8 2 Updated Nov 25, 2025

The official repository for AdaMuon

Python 36 4 Updated Aug 27, 2025

Fast & memory efficient hashtable based on robin hood hashing for C++11/14/17/20

C++ 1,606 156 Updated May 1, 2023

Trully flash implementation of DeBERTa disentangled attention mechanism.

Python 83 6 Updated Feb 10, 2026

Efficient optimizers

Python 297 26 Updated Mar 28, 2026

supporting pytorch FSDP for optimizers

Python 84 4 Updated Dec 8, 2024

Teaching transformers to play chess

Python 151 12 Updated Dec 31, 2025

Official implementation of "GPT or BERT: why not both?"

Python 63 10 Updated Jul 28, 2025

Fast and accurate language identifier

Rust 6 Updated Jan 7, 2026

Truly independent web browser

C++ 61,655 2,895 Updated Mar 29, 2026

Official implementation of "BERTs are Generative In-Context Learners"

Python 32 Updated Mar 14, 2025

Implementation of the paper "Compositional Generalization with Grounded Language Models", ACL 2024 Findings

Python 3 Updated Jun 3, 2024
Python 4 Updated Jun 16, 2024

Highlight errors in a bib file: missing URLs, capitalization protection, etc

TypeScript 28 Updated May 12, 2024

Suplementary code for the NORA large language models

Python 8 Updated Feb 3, 2025
Python 22 7 Updated Apr 14, 2025

Scripts and documentation on scaling large language model training on the LUMI supercomputer

Shell 10 3 Updated Jun 30, 2023
Python 34 3 Updated Jan 25, 2024
Python 16 Updated May 14, 2024

Enabling easy statistical significance testing for deep neural networks.

Python 340 20 Updated Jul 1, 2024

LTG-Bert

Python 34 5 Updated Jan 8, 2024

PyTorch interface for TrueGrad Optimizers

Python 43 1 Updated Aug 8, 2023
Python 3 Updated Nov 23, 2022
Python 16 3 Updated Oct 31, 2022
Python 12 1 Updated Jan 2, 2024

Implementation of the Adan (ADAptive Nesterov momentum algorithm) Optimizer in Pytorch

Python 253 10 Updated Sep 1, 2022
Next