Skip to content
View nak966's full-sized avatar

Highlights

  • Pro

Block or report nak966

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 250 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Official repository of Evolutionary Optimization of Model Merging Recipes

Python 1,368 116 Updated Nov 29, 2024

CLIP-based aesthetics predictor inspired by the interface of šŸ¤— huggingface transformers.

Python 40 1 Updated Aug 8, 2025

tiny vision language model

Python 8,735 677 Updated Sep 24, 2025

šŸŽµ Is a free asynchronous library from reverse engineered Shazam API written in Python 3.10+ with asyncio and aiohttp.

Python 728 98 Updated Jun 11, 2025

A framework to enable multimodal models to operate a computer.

Python 9,934 1,390 Updated Sep 19, 2025

🐢 Open-Source Evaluation & Testing library for LLM Agents

Python 4,917 372 Updated Oct 1, 2025

Run Latent Consistency Models on your Mac

Python 196 13 Updated Nov 10, 2023

The official Python library for the OpenAI API

Python 28,878 4,319 Updated Oct 9, 2025

[CVPR 2023 Workshop] The code reproduce the results of our solutions on both tracks for Meta AI Video Similarity Challenge (CVPR 2023 Workshop). Our solutions got the first place on both tracks, in…

Python 53 11 Updated May 30, 2023

Implementation of the conditionally routed attention in the CoLT5 architecture, in Pytorch

Python 230 14 Updated Sep 6, 2024

An Open-source Toolkit for LLM Development

Python 2,789 177 Updated Jan 13, 2025

LAVIS - A One-stop Library for Language-Vision Intelligence

Jupyter Notebook 10,939 1,067 Updated Nov 18, 2024

[CVPR'24 Highlight] Official PyTorch implementation of CoDeF: Content Deformation Fields for Temporally Consistent Video Processing

Python 4,871 381 Updated Apr 7, 2024

[ICCV23] Official implementation of eP-ALM: Efficient Perceptual Augmentation of Language Models.

Jupyter Notebook 27 1 Updated Oct 27, 2023

🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming

Python 58,831 7,129 Updated Oct 4, 2025

An efficient implementation of the popular sequence models for text generation, summarization, and translation tasks. https://arxiv.org/pdf/2106.04718.pdf

Python 432 38 Updated Aug 17, 2022

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 59,701 10,577 Updated Oct 9, 2025

Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework

Python 2,536 248 Updated Apr 24, 2024

[TMLR23] Official implementation of UnIVAL: Unified Model for Image, Video, Audio and Language Tasks.

Jupyter Notebook 228 22 Updated Dec 22, 2023

A curated list of grounding natural language in video and related area. :-)

102 5 Updated Mar 31, 2022

Code release for "MERLOT Reserve: Neural Script Knowledge through Vision and Language and Sound"

Python 144 31 Updated Jun 1, 2022

MERLOT: Multimodal Neural Script Knowledge Models

Python 223 25 Updated Mar 15, 2022

Retrieval-Augmented Video Generation for Telling a Story

258 19 Updated Feb 5, 2024

Official code for VisProg (CVPR 2023 Best Paper!)

Python 747 69 Updated Aug 26, 2024

mPLUG-2: A Modularized Multi-modal Foundation Model Across Text, Image and Video (ICML 2023)

Python 229 20 Updated Jul 21, 2023

[CVPR2023] All in One: Exploring Unified Video-Language Pre-training

Python 281 19 Updated Mar 25, 2023

20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.

Python 12,824 1,338 Updated Oct 6, 2025

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…

Python 120 15 Updated Jun 20, 2025

A lightweight yet powerful audio-to-MIDI converter with pitch bend detection

Python 4,301 379 Updated Aug 20, 2025
Next