Stars
Research Agent service for the Agentic Workflow course
📚 《从零开始构建智能体》——从零开始的智能体原理与实践教程
[AAAI-2025 Oral] Official implementation of Multi-Teacher Knowledge Distillation with Reinforcement Learning for Visual Recognition
[CVPR'25] AIM-Fair: Advancing Algorithmic Fairness via Selectively Fine-Tuning Biased Models with Contextual Synthetic Data
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
✨✨Latest Advances on Multimodal Large Language Models
《开源大模型食用指南》针对中国宝宝量身打造的基于Linux环境快速微调(全参数/Lora)、部署国内外开源大模型(LLM)/多模态大模型(MLLM)教程
本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)
The official code for the paper "MMIDR: Teaching Large Language Model to Interpret Multimodal Misinformation via Knowledge Distillation".
《代码随想录》LeetCode 刷题攻略:200道经典题目刷题顺序,共60w字的详细图解,视频难点剖析,50余张思维导图,支持C++,Java,Python,Go,JavaScript等多语言版本,从此算法学习不再迷茫!🔥🔥 来看看,你会发现相见恨晚!🚀
主要记录大语言大模型(LLMs) 算法(应用)工程师相关的知识及面试题
Official Code for the WWW'24 Paper: "Towards Explainable Harmful Meme Detection through Multimodal Debate between Large Language Models"
A PyTorch toolbox for domain generalization, domain adaptation and semi-supervised learning.
Task Residual for Tuning Vision-Language Models (CVPR 2023)
Leveraging BERT and c-TF-IDF to create easily interpretable topics.
The code of the paper "Cross-Modal Graph Matching Network for Image-Text Retrieval" in ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM) .
[AAAI2021] The code of “Similarity Reasoning and Filtration for Image-Text Matching”
List of Computer Science courses with video lectures.
VisualBERT implementation using Huggingface and PyTorch-Lightning for memes classification with the use of both text and images
Official source code repository for the ICML 2021 paper "Hierarchical VAEs Know What They Don't Know"
Text perturbation methods to evaluate the robustness of NLP models
Reading list for research topics in multimodal machine learning
TextAttack 🐙 is a Python framework for adversarial attacks, data augmentation, and model training in NLP https://textattack.readthedocs.io/en/master/
Software in C and data files for the popular GloVe model for distributed word representations, a.k.a. word vectors or embeddings
EANN: event-adversarial neural networks for multi-modal fake news detection
Learning Vis Tools: Tutorial materials for Data Visualization course at HKUST