hbseong97

Haebin Seong hbseong97

47 followers · 35 following

maum.ai
Seoul, Korea
http://hbseong97.github.io/cv

Achievements

Highlights

Organizations

Starred repositories

worv-ai / CostNav

CostNav: A Navigation Benchmark for Cost-Aware Evaluation of Embodied Agents

Python 10 1 Updated Dec 22, 2025

worv-ai / D2E

D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI

63 2 Updated Dec 19, 2025

yt-dlp / yt-dlp

A feature-rich command-line audio/video downloader

Python 139,260 11,247 Updated Dec 20, 2025

moojink / rlds_dataset_mod

Forked from kpertsch/rlds_dataset_mod

Efficiently apply modification functions to RLDS/TFDS datasets.

Python 30 12 Updated Jun 19, 2024

lmgame-org / GamingAgent

LLM/VLM gaming agents and model evaluation through games.

Python 834 88 Updated Nov 16, 2025

dvanoni / notero

A Zotero plugin for syncing items and notes into Notion

TypeScript 2,989 125 Updated Dec 7, 2025

justinribeiro / zotero-google-scholar-citation-count

Zotero plugin for fetching number of citations from Google Scholar.

JavaScript 345 4 Updated Apr 25, 2025

open-world-agents / open-world-agents

Everything you need to build state-of-the-art foundation multimodal desktop agent, end-to-end.

Python 24 8 Updated Dec 19, 2025

vessl-ai / hyperpocket

Building AI agent with hyperpocket tool in a flash

Python 51 10 Updated Apr 11, 2025

mddunlap924 / PII-Detection

Personal Identifiable Information (PII) entity detection and performance enhancement with synthetic data generation

Python 33 4 Updated Sep 4, 2024

ebmoon / transformers-GAD

[NeurIPS'24] Grammar-Aligned Decoding: An algorithm to constrain LLMs' outputs without distorting its original distribution

Python 24 6 Updated Feb 10, 2025

AGI-Edgerunners / LLM-Agents-Papers

A repo lists papers related to LLM based agent

Python 2,160 133 Updated Jul 12, 2025

microsoft / autogen

A programming framework for agentic AI

Python 52,794 8,024 Updated Oct 8, 2025

hbseong97 / HarmAug

HarmAug: Effective Data Augmentation for Knowledge Distillation of Safety Guard Models

Python 13 2 Updated Mar 6, 2025

hbseong97 / tf-c-api

Using tensorflow c api, c++ api, tf lite, tf js, model conversion in Windows

Jupyter Notebook 5 1 Updated Nov 5, 2019

hbseong97 / KRFA

Deep learning in production with Keras, Redis, Flask, and Apache [windows ver.]

Python 6 2 Updated Dec 31, 2019

Nautilus-Institute / quals-2024

Source code for the DEF CON 32 CTF Qualifiers.

Elixir 76 5 Updated May 24, 2024

BSidesSF / ctf-2024-release

CSS 22 6 Updated May 7, 2024

google-deepmind / neural-processes

This repository contains notebook implementations of the following Neural Process variants: Conditional Neural Processes (CNPs), Neural Processes (NPs), Attentive Neural Processes (ANPs).

Jupyter Notebook 1,010 154 Updated Jan 19, 2021

etaoxing / multigame-dt

Implementation of Multi-Game Decision Transformers in PyTorch

Python 48 5 Updated Feb 11, 2023

Howuhh / faster-trajectory-transformer

Implementation of Trajectory Transformer with attention caching and batched beam search

Python 116 12 Updated Apr 27, 2023

jannerm / trajectory-transformer

Code for the paper "Offline Reinforcement Learning as One Big Sequence Modeling Problem"

Python 524 69 Updated Oct 6, 2022

huawei-noah / HEBO

Bayesian optimisation & Reinforcement Learning library developed by Huawei Noah's Ark Lab

Jupyter Notebook 2,693 455 Updated Nov 27, 2025

alex-petrenko / sample-factory

High throughput synchronous and asynchronous reinforcement learning

Python 961 143 Updated Nov 14, 2025

google-research / optformer

Python 233 37 Updated Dec 1, 2025

facebookresearch / online-dt

Online Decision Transformer

Python 274 42 Updated Jan 22, 2024

Farama-Foundation / Minari

A standard format for offline reinforcement learning datasets, with popular reference datasets and related utilities

Python 472 63 Updated Dec 1, 2025

Farama-Foundation / D4RL

A collection of reference environments for offline reinforcement learning

Python 1,625 303 Updated Nov 18, 2024

lucidrains / PaLM-rlhf-pytorch

Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM

Python 7,872 681 Updated Oct 11, 2025

openai / lm-human-preferences

Code for the paper Fine-Tuning Language Models from Human Preferences

Python 1,379 174 Updated Jul 25, 2023

Haebin Seong hbseong97

Highlights

Organizations

Starred repositories

meta-rl