Follow
Po-Yao (Bernie) Huang
Po-Yao (Bernie) Huang
Other namesBernie Huang, Poyao Huang
FAIR, Meta
Verified email at fb.com - Homepage
Title
Cited by
Cited by
Year
The llama 3 herd of models
A Dubey, A Jauhri, A Pandey, A Kadian, A Al-Dahle, A Letman, A Mathur, ...
arXiv e-prints, arXiv: 2407.21783, 2024
7685*2024
Dinov2: Learning robust visual features without supervision
M Oquab, T Darcet, T Moutakanni, H Vo, M Szafraniec, V Khalidov, ...
arXiv preprint arXiv:2304.07193, 2023
5059*2023
A survey of deep active learning
P Ren, Y Xiao, X Chang, PY Huang, Z Li, BB Gupta, X Chen, X Wang
ACM computing surveys (CSUR) 54 (9), 1-40, 2021
16762021
A comprehensive survey of neural architecture search: Challenges and solutions
P Ren, Y Xiao, X Chang, PY Huang, Z Li, X Chen, X Wang
ACM Computing Surveys (CSUR) 54 (4), 1-34, 2021
9352021
Videoclip: Contrastive pre-training for zero-shot video-text understanding
H Xu, G Ghosh, PY Huang, D Okhonko, A Aghajanyan, F Metze, ...
EMNLP 2021, 2021
6712021
Introducing meta llama 3: The most capable openly available llm to date
AI Meta
Meta AI 2 (5), 6, 2024
492*2024
Chameleon: Mixed-modal early-fusion foundation models
C Team
arXiv preprint arXiv:2405.09818, 2024
4042024
Masked autoencoders that listen
PY Huang, H Xu, J Li, A Baevski, M Auli, W Galuba, F Metze, ...
NeurIPS 2022, 2022
393*2022
Support-set bottlenecks for video-text representation learning
M Patrick*, PY Huang*, Y Asano*, F Metze, A Hauptmann, J Henriques, ...
ICLR 2021, 2020
3082020
Self-Supervised Deep Correlation Tracking
D Yuan, X Chang, PY Huang, Q Liu, Z He
IEEE Transactions on Image Processing (TIP), 2020
2772020
Hiera: A Hierarchical Vision Transformer without the Bells-and-Whistles
C Ryali, YT Hu, D Bolya, C Wei, H Fan, PY Huang, V Aggarwal, ...
ICML 2023, 2023
2462023
Attention-based multimodal neural machine translation
PY Huang, F Liu, SR Shiang, J Oh, C Dyer
First Conference on Machine Translation (WMT16), 2016
2342016
Demystifying clip data
H Xu, S Xie, XE Tan, PY Huang, R Howes, V Sharma, SW Li, G Ghosh, ...
ICLR 2024, 2023
2212023
Cm3: A causal masked multimodal model of the internet
A Aghajanyan, B Huang, C Ross, V Karpukhin, H Xu, N Goyal, D Okhonko, ...
arXiv preprint arXiv:2201.07520, 2022
1752022
SeamlessM4T: massively multilingual & multimodal machine translation
L Barrault, YA Chung, MC Meglioli, D Dale, N Dong, PA Duquenne, ...
arXiv preprint arXiv:2308.11596, 2023
1712023
VLM: Task-agnostic Video-Language Model Pre-training for Video Understanding
H Xu, G Ghosh, PY Huang, P Arora, M Aminzadeh, C Feichtenhofer, ...
ACL-Findings 2021, 2021
1582021
Structural analysis and optimization of convolutional neural networks with a small sample size
RN D’souza, PY Huang, FC Yeh
Scientific reports 10 (1), 834, 2020
1502020
Video pivoting unsupervised multi-modal machine translation
M Li, PY Huang, X Chang, J Hu, Y Yang, A Hauptmann
IEEE Transactions on Pattern Analysis and Machine Intelligence 45 (3), 3918-3932, 2022
1452022
Rcaa: Relational context-aware agents for person search
X Chang, PY Huang, YD Shen, X Liang, Y Yang, AG Hauptmann
ECCV 2018, 2018
1332018
VoiceCraft: Zero-Shot Speech Editing and Text-to-Speech in the Wild
P Peng, PY Huang, D Li, A Mohamed, D Harwath
ACL 2024, 2024
1092024
The system can't perform the operation now. Try again later.
Articles 1–20