Skip to content
View mediacontentatlas's full-sized avatar
πŸ“±
πŸ“±

Block or report mediacontentatlas

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
mediacontentatlas/README.md

Media Content Atlas (MCA) πŸ“±πŸ—ΊοΈ

A Pipeline to Explore and Investigate Multidimensional Media Space using Multimodal LLMs

Media Content Atlas (MCA) is a first-of-its-kind pipeline that enables large-scale, AI-driven analysis of digital media experiences using multimodal LLMs. It combines recent advances in machine learning and visualization to support both open-ended and hypothesis-driven research into screen content and behavior.

πŸ”— Website & Demo: mediacontentatlas.github.io
πŸŽ₯ Quick Video Explanation: Watch on YouTube
πŸ“„ Paper: Preprint
⏩ See Quickstart Tutorial here

πŸ“Ž Citation: Cerit, M., Zelikman, E., Cho, M., Robinson, T. N., Reeves, B., Ram, N., & Haber, N. (2025). Media Content Atlas: A Pipeline to Explore and Investigate Multidimensional Media Space using Multimodal LLMs. In Extended Abstracts of the CHI Conference on Human Factors in Computing Systems (CHI EA ’25). ACM. https://doi.org/10.1145/3706599.3720055

πŸ” Overview

Built on 1.12 million smartphone screenshots collected from 112 adults over a month, MCA enables researchers to:

  • Perform content-based clustering and topic modeling using semantic and visual signals
  • Automatically generate descriptions of screen content
  • Search and retrieve content across individuals and moments
  • Visualize digital media behavior with an interactive dashboard

Expert reviewers rated MCA's clustering results 96% relevant and AI-generated descriptions 83% accurate.

MCA Pipeline

πŸ—‚οΈ Code Structure

The pipeline is fully modular, with standalone scripts and notebooks for each stage:

2. πŸ“¦ mca_pipeline/ – Core Components

Stage Script Description
πŸ–ΌοΈ Embedding anonymized_clip_embedding_generation.py Generate visual embeddings using CLIP
πŸ“ Captioning anonymized_description_generation.py Generate descriptions using LLaVA-OneVision
πŸ”  Embedding anonymized_description_embedding_generation.py Generate sentence embeddings using GTE-Large
🧡 Clustering anonymized_clustering_topicmodeling_example.py Cluster and label screenshots using BERTopic + LLaMA2
πŸ“Š Visualization anonymized_create_interactive_visualizations.ipynb Create an interactive dashboard using DataMapPlot
πŸ” Retrieval anonymized_image_retrieval_app.py Retrieve screenshots using visual or textual similarity

3. πŸ§ͺ expert_surveys/ – Evaluation Instruments

File Description
anonymized_survey1.py Survey for cluster label relevance
anonymized_survey2.py Survey for description accuracy
anonymized_survey3.py Survey for retrieval performance

πŸ™‹β€β™€οΈ Questions or Feedback?

We’d love to hear from you! Feel free to:

πŸ› οΈ Roadmap

Here’s what’s next for MCA, let us know if you'd like collaborate:

  • πŸ” Reproducibility updates for easier setup
  • 🧩 Customization utilities (label editing, filters, user tagging)
  • πŸ“ˆ Longitudinal visualizations to explore media patterns over time Stay tuned! ⭐ Star this repo to keep up with updates.

πŸ“š Citation

If you use MCA in your research, please cite the CHI 2025 paper:

@inproceedings{cerit2025mca,
  author = {Merve Cerit and Eric Zelikman and Mu-Jung Cho and Thomas N. Robinson and Byron Reeves and Nilam Ram and Nick Haber},
  title = {Media Content Atlas: A Pipeline to Explore and Investigate Multidimensional Media Space using Multimodal LLMs},
  booktitle = {Extended Abstracts of the CHI Conference on Human Factors in Computing Systems (CHI EA '25)},
  year = {2025},
  month = {April},
  location = {Yokohama, Japan},
  publisher = {ACM},
  address = {New York, NY, USA},
  pages = {19},
  doi = {10.1145/3706599.3720055}
}

Popular repositories Loading

  1. mediacontentatlas mediacontentatlas Public

    Code for Media Content Atlas

    Python 4

  2. mediacontentatlas.github.io mediacontentatlas.github.io Public

    HTML