Skip to content
View cakiki's full-sized avatar
🐈‍⬛
meow
🐈‍⬛
meow

Highlights

  • Pro

Organizations

@lichess-org @bigscience-workshop

Block or report cakiki

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A repository of pretty cool datasets that I collected for network science and machine learning research.

654 83 Updated Dec 20, 2025

advanced compilers

HTML 965 221 Updated Jan 10, 2026

Tokenization and analysis pipeline for full-text search

Rust 105 8 Updated Jun 18, 2026

Lore is a next-generation, open source version control system

Rust 5,573 220 Updated Jun 22, 2026

Repo accompanying our paper "Do Llamas Work in English? On the Latent Language of Multilingual Transformers".

Jupyter Notebook 85 19 Updated Mar 11, 2024

📰 Computing the information content of trained neural networks

Jupyter Notebook 24 4 Updated Oct 8, 2021

A compression based language model

Python 71 12 Updated Jun 16, 2026

A data set based on all arXiv publications, pre-processed for NLP, including structured full-text and citation network

Python 303 19 Updated Sep 28, 2024
Java 44 2 Updated Jul 25, 2020

Code for the analysis underlying the article "How localized are computational templates? A machine learning approach""

Jupyter Notebook 3 Updated Nov 12, 2024

Enhancing Chess Reinforcement Learning with Graph Representation

Jupyter Notebook 24 4 Updated Dec 16, 2025

Python package providing an Inverted Index implementation using dictionaries

Python 37 9 Updated May 10, 2021

open source interpretability platform 🧠

TypeScript 927 121 Updated Jun 17, 2026

Bringing ✨interactivity✨ to plotnine

Python 145 9 Updated Jun 20, 2026

Quickly and accurately render even the largest data.

Python 3,552 378 Updated Jun 17, 2026

Interactive visualizations of the geometric intuition behind diffusion models.

Svelte 1,148 59 Updated Jun 1, 2026

Noah Research

Python 972 180 Updated Mar 20, 2026

An engine for displaying slips, the next-gen version of slides

OCaml 823 20 Updated Jun 22, 2026
Julia 129 25 Updated Jun 5, 2026

Fast and memory-efficient classical machine learning operators

Python 536 42 Updated Jun 20, 2026

Chess-World-Model: a 10M-game benchmark for exact state tracking from chess move sequences.

Python 3 Updated May 28, 2026

A Cheater Detection System for Online Chess

Jupyter Notebook 2 Updated Jul 21, 2025

High-performance GPU kernels for Ads and Recsys model training, independently implemented and optimized for real-world workloads and model-specific input characteristics.

Python 23 6 Updated May 28, 2026

Maia-3 is the most accurate and efficient human chess move prediction engine.

Python 124 15 Updated May 25, 2026
Go 1 Updated Jul 7, 2025

A tiny BERT for low-resource monolingual models

HTML 32 6 Updated Dec 24, 2025

Data Migration for the Blaze Project

Python 1,006 130 Updated Jul 15, 2022

Generalization or Memorization? Brittleness Testing for Chess-Trained Language Models

Python 1 Updated May 19, 2026
Next