Stars
Hello, World! — A structured study repository for World Models papers (Dreamer, MuZero, JEPA, Genie, Cosmos and beyond).
QGuard:Question-based Zero-shot Guard for Multi-modal LLM Safety, ACL 2025 Workshop
Official implementation of "MeshDiffusion: Score-based Generative 3D Mesh Modeling" (ICLR 2023 Spotlight)
This is a repository for listing papers on scene graph generation and application.
Hierarchical Summarizer, HierarchiSummarizer is a document processing tool that uses LLMs and MistralOCR(pdf to markdown) to analyze document hierarchy, extract structured content, summarize, and t…
Convert your PDFs into Markdown files easily with Mistral OCR Software
Best Papers of Top Venues like CVPR, NeurIPS, ICLR, ICML, ICCV, ECCV, ...
Python SDK for AI agent monitoring, LLM cost tracking, benchmarking, and more. Integrates with most LLMs and agent frameworks including CrewAI, Agno, OpenAI Agents SDK, Langchain, Autogen, AG2, and…
Automate the process of making money online.
The 500 AI Agents Projects is a curated collection of AI agent use cases across various industries. It showcases practical applications and provides links to open-source projects for implementation…
코드 유사성 판단 시즌2 AI 경진대회, DACON (2024.03.04 ~ 2024.04.01)
반도체 소자 이상 탐지 AI 경진대회, DACON (2024.02.05 ~ 2024.03.04)
자율주행 센서의 안테나 성능 예측 AI 경진대회, LG AI Research (2022.08.01 ~ 2022.08.26)
2023 Samsung AI Challenge : Camera-Invariant Domain Adaptation, Samsung Advanced Institute of Technology (2023.08.21 ~ 2023.10.02)
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.
The pytorch implementation of our CVPR 2023 paper "Conditional Image-to-Video Generation with Latent Flow Diffusion Models"
collection of diffusion model papers categorized by their subareas
Implementation of Muse: Text-to-Image Generation via Masked Generative Transformers, in Pytorch
A curated list of awesome 3d generation papers
Curated list of papers and resources focused on 3D Gaussian Splatting, intended to keep pace with the anticipated surge of research in the coming months.
A collection of 3D reconstruction papers in the deep learning era.
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
A minimal, responsive, and feature-rich Jekyll theme for technical writing.
Code to accompany "A Method for Animating Children's Drawings of the Human Figure"
📝같은 글을 한 번에 포스팅하자! 블로그 포스트 관리 툴, 동글✍️
A collaboration friendly studio for NeRFs
Implementation of Lumiere, SOTA text-to-video generation from Google Deepmind, in Pytorch
[AAAI 2024] AnomalyDiffusion: Few-Shot Anomaly Image Generation with Diffusion Model