Ph.D. student from Fudan University, working on multimodal intelligence.
-
Fudan University
- Shanghai
- https://wdrink.github.io/
Pinned Loading
-
FoundationVision/OmniTokenizer
FoundationVision/OmniTokenizer Public[NeurIPS 2024]OmniTokenizer: one model and one weight for image-video joint tokenization.
-
-
M2TR-Multi-modal-Multi-scale-Transformers-for-Deepfake-Detection
M2TR-Multi-modal-Multi-scale-Transformers-for-Deepfake-Detection Public
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.