Skip to content
View bknyaz's full-sized avatar

Block or report bknyaz

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Regime-adaptive speculative weight prediction for accelerating neural network training

Python 2 Updated May 2, 2026

A collection of weight space learning including papers, codes, and datasets.

68 6 Updated May 12, 2026

dLLM: Simple Diffusion Language Modeling

Python 2,509 264 Updated Apr 15, 2026
Python 225 14 Updated Nov 26, 2025

A Large-Scale, High-Quality Math Dataset for Reinforcement Learning in Language Models

Python 72 5 Updated Feb 25, 2025

Visualize neural network with or without weights

Python 63 23 Updated Sep 7, 2023

incenτivised inτerneτ-wide τraining

Python 154 59 Updated Mar 31, 2026

Code for the paper Don't Pay Attention

Python 59 3 Updated Sep 25, 2025

[ICML 2025] Retraining-Free Merging of Sparse MoE via Hierarchical Clustering

Python 25 8 Updated Oct 26, 2025

Oscillatory State-Space Models

Python 121 14 Updated Mar 1, 2026

Code for Celo: Training Versatile Learned Optimizers on a Compute Diet

Python 4 Updated Mar 24, 2026

An efficient implementation of learned optimizers in PyTorch

Python 45 6 Updated Apr 21, 2026

[KDD'2024] "LLM4Graph: A Survey of Large Language Models for Graphs"

369 18 Updated Mar 15, 2025

[ICML 2024] LLM and Simulation as Bilevel Optimizers: A New Paradigm to Advance Physical Scientific Discovery

Python 83 11 Updated May 31, 2024
Python 13 4 Updated Feb 25, 2025

GraphLLM: Boosting Graph Reasoning Ability of Large Language Model (IEEE Transactions on Big Data)

Python 128 18 Updated Jan 29, 2026
Python 23 3 Updated Sep 29, 2024

[ICLR 2023] "Learning to Grow Pretrained Models for Efficient Transformer Training" by Peihao Wang, Rameswar Panda, Lucas Torroba Hennigen, Philip Greengard, Leonid Karlinsky, Rogerio Feris, David …

Python 92 11 Updated Feb 26, 2024

Official source code for "Graph Neural Networks for Learning Equivariant Representations of Neural Networks". In ICLR 2024 (oral).

Python 82 5 Updated Jul 23, 2024

[TMLR 2025] Meta-learning Optimizers for Communication-Efficient Learning

Python 4 2 Updated Mar 18, 2025

Automatic gradient descent

TeX 217 13 Updated Jun 26, 2023

Model Zoos published at the NeurIPS 2022 Dataset & Benchmark track: "Model Zoos: A Dataset of Diverse Populations of Neural Network Models"

Python 59 1 Updated Oct 2, 2025

Custom distributed implementation of our proposed DTP algorithm parallelizing feedback weight training across GPUs (ICML 2022)

Python 4 1 Updated Jul 13, 2022

Code Repository for the NeurIPS 2021 paper: "Self-Supervised Representation Learning on Neural Network Weights for Model Characteristic Prediction".

Python 22 3 Updated Jul 10, 2024

Framework for defining machine learning models, including feature generation and transformations, as directed acyclic graphs (DAGs).

Java 354 37 Updated Oct 23, 2023

Tutorial on amortized optimization for learning to optimize over continuous domains

TeX 254 15 Updated Oct 5, 2025

Code for Parameter Prediction for Unseen Deep Architectures (NeurIPS 2021)

Python 490 65 Updated Jul 11, 2023

Official repository for the paper "On Evaluation Metrics for Graph Generative Models"

Jupyter Notebook 25 3 Updated Feb 6, 2022

Official repository of Brick-by-Brick, presented at NeurIPS-2021

Python 15 2 Updated May 11, 2022
Next