Stars
The implementation of "Retraining-Free Merging of Sparse Mixture-of-Experts via Hierarchical Clustering", accepted by ICML 2025.
Code for Celo: Training Versatile Learned Optimizers on a Compute Diet
An efficient implementation of learned optimizers in PyTorch
[KDD'2024] "LLM4Graph: A Survey of Large Language Models for Graphs"
[ICML 2024] LLM and Simulation as Bilevel Optimizers: A New Paradigm to Advance Physical Scientific Discovery
GraphLLM: Boosting Graph Reasoning Ability of Large Language Model
[ICLR 2023] "Learning to Grow Pretrained Models for Efficient Transformer Training" by Peihao Wang, Rameswar Panda, Lucas Torroba Hennigen, Philip Greengard, Leonid Karlinsky, Rogerio Feris, David …
Official source code for "Graph Neural Networks for Learning Equivariant Representations of Neural Networks". In ICLR 2024 (oral).
[TMLR 2025] Meta-learning Optimizers for Communication-Efficient Learning
Model Zoos published at the NeurIPS 2022 Dataset & Benchmark track: "Model Zoos: A Dataset of Diverse Populations of Neural Network Models"
Custom distributed implementation of our proposed DTP algorithm parallelizing feedback weight training across GPUs (ICML 2022)
Code Repository for the NeurIPS 2021 paper: "Self-Supervised Representation Learning on Neural Network Weights for Model Characteristic Prediction".
Framework for defining machine learning models, including feature generation and transformations, as directed acyclic graphs (DAGs).
Tutorial on amortized optimization for learning to optimize over continuous domains
Code for Parameter Prediction for Unseen Deep Architectures (NeurIPS 2021)
Official repository for the paper "On Evaluation Metrics for Graph Generative Models"
Official repository of Brick-by-Brick, presented at NeurIPS-2021
Graphormer is a general-purpose deep learning backbone for molecular modeling.
Implementation of the Paper "Permutation-Invariant Variational Autoencoder for Graph-Level Representation Learning" by Robin Winter, Frank Noe and Djork-Arne Clevert.
VectorInstitute / ppuda
Forked from facebookresearch/ppudaCode for Parameter Prediction for Unseen Deep Architectures (NeurIPS 2021)
[ICCV 2021] Official code for "Learning to Generate Scene Graph from Natural Language Supervision"