-
Cognitive System Lab., Artificial Intelligence Department, Korea Univ.
- Seoul, Republic of Korea
-
12:30
(UTC +09:00) - hoesungryu.github.io
- https://orcid.org/0000-0002-9515-4402
- https://scholar.google.com/citations?user=mO5hGlkAAAAJ&hl=en
Highlights
- Pro
Lists (1)
Sort Name ascending (A-Z)
Stars
Model zoo for Gen AI models for Hailo products
Official inference framework for 1-bit LLMs
AI agents running research on single-GPU nanochat training automatically
Convert biological neuronal networks to artificial recurrent neuronal networks
[TVCG 2020] Official Implementation of "DGaze: CNN-Based Gaze Prediction in Dynamic Scenes"
codes for manuscript "Artificial intelligence driven definition of food preference endotypes in UK Biobank volunteers is associated with distinctive health outcomes and blood based metabolomic and …
On-device AI across mobile, embedded and edge for PyTorch
LabStreamingLayer super repository comprising submodules for LSL and associated apps.
Frontier Multimodal Foundation Models for Image and Video Understanding
Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
Spatial Information (SI) and Temporal Information (TI) calculation using MATLAB
This package aims at simplifying the download of the AudioSet dataset.
This repo includes the official implementations of "Fine-tune the pretrained ATST model for sound event detection".
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
🔊 Repository for our NAACL-HLT 2019 paper: AudioCaps
PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Supervision, and LLM Mix-up Augmentation"
Are you ready to FLIRT with your wearable data?
Adafruit code for the Nordic nRF52 BLE SoC on Arduino
[ICLR 2025] MLLM for On-Demand Spatial-Temporal Understanding at Arbitrary Resolution
A self-supervised learning framework for audio-visual speech
[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.
Official Repository of paper VideoGPT+: Integrating Image and Video Encoders for Enhanced Video Understanding
App for generating QR codes with GitHub logo and export to SVG/PNG/JPEG/WEBP format