Skip to content
View deligentfool's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report deligentfool

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

一个基于nano banana pro🍌的原生AI PPT生成应用,迈向真正的"Vibe PPT"; 支持上传任意模板图片;上传任意素材&智能解析;一句话/大纲/页面描述自动生成PPT;口头修改指定区域、一键导出可编辑ppt - An AI-native slides generator based on nano banana pro🍌

Python 14,573 1,714 Updated May 15, 2026

WorkArena: How Capable are Web Agents at Solving Common Knowledge Work Tasks?

Python 249 35 Updated Apr 25, 2026

[NeurIPS 2023] Efficient Diffusion Policy

Python 114 8 Updated Oct 31, 2023

Repo for Implicit Diffusion Q-Learning

Python 124 15 Updated Dec 5, 2023

Codes accompanying the paper "Offline Reinforcement Learning via High-Fidelity Generative Behavior Modeling" (ICLR 2023) https://arxiv.org/abs/2209.14548

Python 42 6 Updated Oct 11, 2023

High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC

Python 643 40 Updated Feb 10, 2024

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Python 9,775 1,079 Updated Apr 20, 2026

🦞 OpenClaw 核心架构的极简复现,涵盖 sessionKey 会话域、队列串行、工具化记忆检索、按需上下文加载、可扩展技能与主动心跳唤醒机制

TypeScript 685 90 Updated Apr 6, 2026

AI agents running research on single-GPU nanochat training automatically

Python 81,543 11,856 Updated Mar 26, 2026
JavaScript 2 Updated Mar 29, 2026

Awesome In-Context RL: A curated list of In-Context Reinforcement Learning - - —

299 14 Updated Sep 8, 2025
Python 4 1 Updated Sep 25, 2025

MLNLP社区用来帮助大家避免论文投稿小错误的整理仓库。 Paper Writing Tips

4,466 533 Updated May 29, 2022

Code for paper 'LLM-Explorer: A Plug-in Reinforcement Learning Policy Exploration Enhancement Driven by Large Language Models'

Python 8 1 Updated Jan 20, 2025

Contexts Optical Compression

Python 23,128 2,141 Updated Jan 27, 2026

[NeurIPS2025] "AI-Researcher: Autonomous Scientific Innovation" -- A production-ready version: https://novix.science/chat

Python 5,345 668 Updated Oct 16, 2025

An offline deep reinforcement learning library

Python 1,662 264 Updated Sep 10, 2025

Code for conservative Q-learning

Python 484 77 Updated Dec 7, 2021

Automatically crawl arXiv papers daily and summarize them using AI. Illustrating them using GitHub Pages.

JavaScript 2,715 979 Updated May 17, 2026

ICML'2024: Q-value Regularized Transformer for Offline Reinforcement Learning

Python 38 5 Updated Dec 30, 2024

Python interface for accessing the near real-world offline reinforcement learning (NeoRL) benchmark datasets

Python 135 14 Updated Nov 21, 2024
Python 19 4 Updated Oct 27, 2025

verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework

Python 21,347 3,883 Updated May 18, 2026
Jupyter Notebook 58 6 Updated Mar 12, 2026

"Your Fully-Automated Personal AI Assistant"

Python 1,529 215 Updated Oct 16, 2025

🌐 Make websites accessible for AI agents. Automate tasks online with ease.

Python 94,343 10,653 Updated May 15, 2026
Next