-
Microsoft
- Republic of, Korea
Highlights
Lists (1)
Sort Name ascending (A-Z)
Stars
We write your reusable computer vision tools. 💜
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
A TTS model capable of generating ultra-realistic dialogue in one pass.
The absolute trainer to light up AI agents.
[CVPR 2024] Official repository for "MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model"
PyTorch package for the discrete VAE used for DALL·E.
Hydra is a framework for elegantly configuring complex applications
Flexible and powerful framework for managing multiple AI agents and handling complex conversations
Official repository of "SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory"
Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch
Pre-trained Deep Learning models and demos (high quality and extremely fast)
Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.
The first level of Super Mario Bros made with Python and Pygame.
VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs
A principled instruction benchmark on formulating effective queries and prompts for large language models (LLMs). Our paper: https://arxiv.org/abs/2312.16171
Web / REST interface for downloading youtube videos onto a server.
Orion-14B is a family of models includes a 14B foundation LLM, and a series of models: a chat model, a long context model, a quantized model, a RAG fine-tuned model, and an Agent fine-tuned model. …
data-to-paper: Backward-traceable AI-driven scientific research
Machine Learning library for educational purpose.
This is the official implementation of "Blind Image Restoration via Fast Diffusion Inversion"
AWS Presentation script generator specialized for AWS Service Introduction Deck (pptx)