-
JHU
- Baltimore, US
-
12:39
(UTC -04:00) - https://kfmei.page/
Lists (2)
Sort Name ascending (A-Z)
Stars
Official implementation of Inductive Moment Matching
Official repository for our work on micro-budget training of large-scale diffusion models.
Pocket Flow: 100-line LLM framework. Let Agents build Agents!
An unofficial LaTeX template for masters thesis and PhD dissertation at Johns Hopkins University.
OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340
[NeurIPS 2024] DreamClear: High-Capacity Real-World Image Restoration with Privacy-Safe Dataset Curation
Clarity: A Minimalist Website Template for AI Research
[ICLR 2025] Pyramidal Flow Matching for Efficient Video Generative Modeling
DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
My implementation of a reinforcement learning model using Stable-Baselines3 to play the NES Super Mario Bros.
📚 AIGC 求职面经、必备基础知识、提示词工程、ChatGPT、Stable Diffusion、Prompt、Embedding、Fintune 等 AIGC 求职你所需要知道的一切~
This project aims to utilize reinforcement learning (RL) techniques to train an artificial intelligence agent capable of playing the iconic Super Mario game.
[MICCAI 2024 (early accept)] ModelMix: A New Model-Mixup Strategy to Minimize Vicinal Risk across Tasks for Few-scribble based Cardiac Segmentation
A platform for managing machine learning experiments
Machine Learning and Computer Vision Engineer - Technical Interview Questions
A collection of resources on controllable generation with text-to-image diffusion models.
ControlNet++: All-in-one ControlNet for image generations and editing!
Code for the paper DisCo-Diff: Enhancing Continuous Diffusion Models with Discrete Latents, ICML 2024
Official Implementation of ICLR'24: Kosmos-G: Generating Images in Context with Multimodal Large Language Models
A big_vision inspired repo that implements a generic Auto-Encoder class capable in representation learning and generative modeling.
A collection of awesome text-to-image generation studies.
Refine high-quality datasets and visual AI models
[CVPR 2023] OneFormer: One Transformer to Rule Universal Image Segmentation
Segment Anything for Stable Diffusion WebUI
An open source implementation of CLIP.
Emu Series: Generative Multimodal Models from BAAI