OpenJudge: A Unified Framework for Holistic Evaluation and Quality Rewards
Updated Apr 17, 2026 - Python
🤖 Automate Bing Searches 🔍, Quizzes 🧪, Polls 📝, & more for Bing Rewards. 💸
👤 Multi-Armed Bandit Algorithms Library (MAB) 👮
Scalable and extensible reinforcement learning for LM agents.
An advanced desktop automation tool for Microsoft Rewards. It performs Bing searches and collects Daily Sets using mathematically driven, human-like input simulation (W3C Actions, Bezier curves, and smart scrolling). Built with Python/Selenium and packaged as an executable Windows app for a seamless, plug-and-play experience.
Official implementation of the ICLR 2025 paper: Rethinking Bradley-Terry Models in Preference-based Reward Modeling: Foundations, Theory, and Alternatives
During our participation in the Internship Exchange Program, my friend and I collaborated with the guidance of our esteemed supervisor from NTHU.
Worksheet and utilities for AWS DeepRacer, one of the most exciting ways to build strong reinforcement learning skills through a hands-on approach. This repository offers: 1) A feature-rich and flexible reward function 2) Utilities with Jupyter notebooks for racing line calculation and track visualisation 3) Scripts to parse RoboMak…
Creating an environment to quickly train a variety of Deep Reinforcement Learning algorithms on Street Fighter 2 using tournaments between learning agents
Value and policy iteration for the FrozenLake environment of OpenAI Gym
Deep Reinforcement Learning with LTL goals.
Script to automagically add channel currency for users who redeem channel point rewards. Potentially deprecated due to Twitch API changes.