Skip to content
View pr-Mais's full-sized avatar
👾
👾

Organizations

@googlemaps @fluttercommunity @FlutterVikings @Thmanyah-LLC

Block or report pr-Mais

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Curated list of Go design patterns, recipes and idioms

Go 27,950 2,340 Updated May 14, 2024

Terminal UI for AWS (taws) - A terminal-based AWS resource viewer and manager

Rust 2,220 66 Updated May 17, 2026

🐹 Deep clean and optimize your Mac.

Shell 51,667 1,621 Updated May 17, 2026

ASU-sparkysundevil-resume-template

TeX 35 19 Updated Oct 3, 2024

The conventional commits specification

SCSS 8,857 667 Updated Mar 11, 2026

A First Look at Conventional Commits Classification

Python 13 1 Updated Nov 18, 2024

NPM library to splice HLS VOD

JavaScript 19 4 Updated Feb 28, 2026

💻 A fully functional local AWS cloud stack. Develop and test your cloud & Serverless apps offline

Python 64,939 4,737 Updated Mar 23, 2026

methods2test is a supervised dataset consisting of Test Cases and their corresponding Focal Methods from a set of Java software repositories

Python 173 45 Updated Dec 4, 2023

Analysis scripts for log data sets used in anomaly detection.

Python 84 18 Updated Oct 19, 2025

Firebase SDK for Cloud Functions

TypeScript 1,057 227 Updated May 15, 2026

Implementation of ChatGPT RLHF (Reinforcement Learning with Human Feedback) on any generation model in huggingface's transformer (blommz-176B/bloom/gpt/bart/T5/MetaICL)

Python 564 61 Updated Apr 23, 2026

Mastering Diverse Domains through World Models

Python 3,232 538 Updated Sep 23, 2025

Train transformer language models with reinforcement learning.

Python 18,398 2,723 Updated May 17, 2026

Schedule-Free Optimization in PyTorch

Python 2,278 76 Updated May 21, 2025

Fine-tune LLM agents with online reinforcement learning

Python 1,251 63 Updated Mar 19, 2024

A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).

Python 904 52 Updated Sep 30, 2025

Reference implementation for DPO (Direct Preference Optimization)

Python 2,890 234 Updated Aug 11, 2024

FastAPI framework, high performance, easy to learn, fast to code, ready for production

Python 98,289 9,308 Updated May 15, 2026

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

Python 13,278 2,131 Updated May 11, 2026

Powerful menu bar manager for macOS

Swift 27,987 716 Updated Sep 20, 2025
Python 2 Updated May 13, 2026

Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM

Python 7,867 679 Updated Oct 11, 2025

Implementation of the ICML 2024 paper "Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning" presented by Zhiheng Xi et al.

Python 116 10 Updated Feb 9, 2024

Comprehensive toolkit for Reinforcement Learning from Human Feedback (RLHF) training, featuring instruction fine-tuning, reward model training, and support for PPO and DPO algorithms with various c…

Python 190 19 Updated Feb 24, 2026

Monitoring recent cross-research on LLM & RL on arXiv for control. If there are good papers, PRs are welcome.

556 37 Updated Nov 17, 2025
Python 147 15 Updated May 2, 2024

Curated tutorials and resources for Large Language Models, Text2SQL, Text2DSL、Text2API、Text2Vis and more.

3,640 247 Updated Jan 26, 2026

Data for paper "Dr.Spider: A Diagnostic Evaluation Benchmark towards Text-to-SQL Robustness"

Python 34 6 Updated May 3, 2023
Python 163 44 Updated Nov 6, 2025
Next