Skip to content
View danchern97's full-sized avatar
  • Amsterdam

Block or report danchern97

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

This is an official repository for "RTD-Lite: Scalable Topological Analysis for Comparing Weighted Graphs in Learning Tasks" accepted for AISTATS 2025 conference.

Jupyter Notebook 3 Updated May 2, 2025

Open-source implementation of AlphaEvolve

Python 4,418 651 Updated Nov 1, 2025

Code for the paper "VAE with a VampPrior", J.M. Tomczak & M. Welling

Python 234 48 Updated May 6, 2018

This is the official code release for Bayesian Flow Networks.

Python 303 35 Updated Jul 18, 2024

Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vectors

Python 247 14 Updated Feb 17, 2025

Kernel Herding for probability density estimation

HTML 14 5 Updated Feb 23, 2016

Official codebase for the paper "Beyond A* Better Planning with Transformers via Search Dynamics Bootstrapping".

Jupyter Notebook 375 20 Updated Jun 11, 2024

Resources for the Enigmata Project.

Python 73 4 Updated Aug 13, 2025

Test-time scaling by sampling perturbations in the latent space.

Python 3 1 Updated Oct 2, 2025

Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision

Python 125 11 Updated Sep 9, 2024

A centralized place for deep thinking code and experiments

Python 87 20 Updated Aug 9, 2023

A generative world for general-purpose robotics & embodied AI learning.

Python 27,544 2,531 Updated Nov 5, 2025

Environments for LLM Reinforcement Learning

Python 3,456 423 Updated Nov 5, 2025

The repo for code, that hasn't been published yet

14 Updated May 14, 2025

[COLM 2025] Code for Paper: Learning Adaptive Parallel Reasoning with Language Models

Python 132 11 Updated Aug 15, 2025

Minimal hackable GRPO implementation

Python 299 41 Updated Jan 31, 2025

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Python 4,723 483 Updated Jan 8, 2024

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python 8,319 807 Updated Oct 31, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 15,140 2,428 Updated Nov 5, 2025

Reproduce R1 Zero on Logic Puzzle

Python 2,410 163 Updated Mar 20, 2025

An extensible benchmark for evaluating large language models on planning

PDDL 426 44 Updated Sep 17, 2025

The official implementation of Self-Play Preference Optimization (SPPO)

Python 582 47 Updated Jan 23, 2025

[NeurIPS 2023] Reflexion: Language Agents with Verbal Reinforcement Learning

Python 2,928 280 Updated Jan 14, 2025

a curated list of data for reasoning ai

140 5 Updated Aug 4, 2024

Reasoning Computers. Lambda Calculus, Fully Differentiable. Also Neural Stacks, Queues, Arrays, Lists, Trees, and Latches.

Python 280 14 Updated Nov 3, 2024

Implementation for our paper "How Far Can Transformers Reason? The Locality Barrier and Inductive Scratchpad"

Python 11 1 Updated Jun 11, 2024
Python 11 1 Updated Feb 28, 2025

A Database of Real Faults and an Experimental Infrastructure to Enable Controlled Experiments in Software Engineering Research

Perl 893 350 Updated Oct 11, 2025

Rigourous evaluation of LLM-synthesized code - NeurIPS 2023 & COLM 2024

Python 1,618 179 Updated Oct 2, 2025
Next