- Seoul, South Korea
-
15:02
(UTC +09:00) - linktr.ee/mysticalnd
Stars
PyTorch code for "Fine-grained Image Captioning with CLIP Reward" (Findings of NAACL 2022)
Python tool for converting files and office documents to Markdown.
Download market data from Yahoo! Finance's API
A collection of AI Agents papers (Updated biweekly)
[ECCV 2020] Official code for "Comprehensive Image Captioning via Scene Graph Decomposition"
[CVPR 2023 & IJCV 2025] Positive-Augmented Contrastive Learning for Image and Video Captioning Evaluation
[CVPR'2022 Oral] The Devil is in the Labels: Noisy Label Correction for Robust Scene Graph Generation
[ACL 2023] Transforming Visual Scene Graphs to Image Captions
The implementation of "Prismer: A Vision-Language Model with Multi-Task Experts".
GIT: A Generative Image-to-text Transformer for Vision and Language
This repository contains code for the paper 'Dual-branch Hybrid Learning Network for Unbiased Scene Graph Generation'.
This is the code of ECCV 2022 (Oral) paper "Fine-Grained Scene Graph Generation with Data Transfer".
Attention Redirection Transformer with Semantic Oriented Learning for Unbiased Scene Graph Generation (24PR)
[ Research ] 취업 준비생을 위한 자기소개서 평가 및 개인화된 자기소개서 생성 프레임워크 개발
Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads
A new codebase for popular Scene Graph Generation methods (2020). Visualization & Scene Graph Extraction on custom images/datasets are provided. It's also a PyTorch implementation of paper “Unbiase…
A python3 version of coco-caption with spice.
SmallCap: Lightweight Image Captioning Prompted with Retrieval Augmentation
Generative Models by Stability AI
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
(CVPR2024) MeaCap: Memory-Augmented Zero-shot Image Captioning
[ACL 2023 Findings] FACTUAL dataset, the textual scene graph parser trained on FACTUAL.
A high-throughput and memory-efficient inference and serving engine for LLMs
maj34 / ETRI-Paper-Contest
Forked from jin-jae/ETRI-Paper-Contest[Contest] Human Understanding AI Paper Challenge 2024
Foundation Models for Video Understanding: A Survey