Skip to content
View wzk1015's full-sized avatar
😎
😎

Highlights

  • Pro

Block or report wzk1015

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
wzk1015/README.md

  • I am currently a third-year Ph.D. candidate at Shanghai Jiao Tong University and Shanghai AI Laboratory.
  • My research interests include computer vision and music generation, especially for vision language models.
  • You can contact me via wangzhaokai [at] sjtu [dot] edu [dot] cn.
  • Homepage

一些仓库介绍

  • 发表论文

    • CNMT:Confidence-aware Non-repetitive Multimodal Transformers for TextCaps (AAAI 2021)
    • CMT:Video Background Music Generation with Controllable Music Transformer (ACM MM 2021 Best Paper Award)
    • SymMV:Video Background Music Generation: Dataset, Method and Evaluation (ICCV 2023)
    • PIIP:Parameter-Inverted Image Pyramid Networks
    • ITINERA:Integrating Spatial Optimization with Large Language Models for Open-domain Urban Itinerary Planning
  • 研究笔记

  • 有趣的游戏和工具

    • Sanguosha:文字版三国杀

    • GPT-turtlesoup:ChatGPT实现AI海龟汤,GPT出题、当玩家、当裁判

    • Pokemon-Types-PageRank:宝可梦属性排名,使用PageRank算法

    • wordle-solver:wordle游戏求解器

    • HRM-architecture:基于人力资源机器游戏的CPU、编译器等架构设计

    • wzk-Game-Collection:python小游戏全集,飞行棋、扫雷、德州扑克、2048、五子棋等

    • Arxiv-Assistant: 自动获取每日的arxiv新论文列表、使用GPT筛选、发邮件提醒

    • Scraper:小红书、微信公众号、马蜂窝爬虫

    • luna:简单的版本管理系统

    • hahaha:自动生成表情包

    • wzk-pypi-package:自己的python包,小游戏、爬虫等娱乐性质代码合集

  • 大学课程相关

Pinned Loading

  1. video-bgm-generation video-bgm-generation Public

    [ACM MM 2021 Best Paper Award] Video Background Music Generation with Controllable Music Transformer

    Python 289 34

  2. OpenGVLab/PIIP OpenGVLab/PIIP Public

    [NeurIPS 2024 Spotlight ⭐️] Parameter-Inverted Image Pyramid Networks (PIIP)

    Python 55 2

  3. YihongT/ITINERA YihongT/ITINERA Public

    [EMNLP 2024 Industry Track & KDD UrbComp 2024 Best Paper Award] ITINERA: Integrating Spatial Optimization with Large Language Models for Open-domain Urban Itinerary Planning

    6

  4. zhuole1025/SymMV zhuole1025/SymMV Public

    [ICCV 2023] Video Background Music Generation: Dataset, Method and Evaluation

    66 1

  5. sanguosha sanguosha Public

    文字版三国杀,10000+行java实现

    Java 73 16

  6. CNMT CNMT Public

    [AAAI 2021] Confidence-aware Non-repetitive Multimodal Transformers for TextCaps

    Python 24 5