Po-Yao (Bernie) Huang

Cited by

	All	Since 2020
Citations	20614	20407
h-index	29	29
i10-index	51	47

10000

5000

2500

7500

201920202021202220232024202595 155 425 985 1997 7087 9714

Public access

View all

17 articles

1 article

available

not available

Based on funding mandates

Co-authors

Xiaojun ChangDirector of The ReLER Lab and Professor in Artificial Intelligence, University of Technology SydneyVerified email at uts.edu.au
Alex HauptmannCarnegie Mellon UniversityVerified email at cs.cmu.edu
Christoph FeichtenhoferMeta, FAIRVerified email at fb.com
Hu XuMeta AI (FAIR Labs)Verified email at meta.com
Florian MetzeCarnegie Mellon University; Meta AIVerified email at andrew.cmu.edu
Luke ZettlemoyerUniversity of Washington; MetaVerified email at cs.washington.edu
Junwei LiangAssistant Professor, HKUST (Guangzhou) | CSE, HKUST | Ph.D. @CMUVerified email at hkust-gz.edu.cn
Armen AghajanyanFacebook AI ResearchVerified email at fb.com
Billy li (Juncheng)Carnegie Mellon UniversityVerified email at cs.cmu.edu
Mandela PatrickPhD Student, University of OxfordVerified email at robots.ox.ac.uk
Yanghao LiAppleVerified email at fb.com
Jitendra MALIKProfessor of EECS, UC BerkeleyVerified email at eecs.berkeley.edu
Haoqi FanFacebook AI ResearchVerified email at fb.com
Junjie HuAssistant Professor, University of Wisconsin-MadisonVerified email at wisc.edu
Yuki M. AsanoFull Professor, Head of FunAI Lab, University of Technology NurembergVerified email at utn.de
Puyuan PengPhD student, The University of Texas at AustinVerified email at utexas.edu
Chris DyerDeepMind, Carnegie MellonVerified email at google.com
Graham NeubigCarnegie Mellon University, All Hands AIVerified email at cs.cmu.edu

Po-Yao (Bernie) Huang

Other namesBernie Huang, Poyao Huang

FAIR, Meta

Verified email at fb.com - Homepage

Multi-modal learning computer vision natural language processing


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
The llama 3 herd of models A Dubey, A Jauhri, A Pandey, A Kadian, A Al-Dahle, A Letman, A Mathur, ... arXiv e-prints, arXiv: 2407.21783, 2024	7685*	2024
Dinov2: Learning robust visual features without supervision M Oquab, T Darcet, T Moutakanni, H Vo, M Szafraniec, V Khalidov, ... arXiv preprint arXiv:2304.07193, 2023	5059*	2023
A survey of deep active learning P Ren, Y Xiao, X Chang, PY Huang, Z Li, BB Gupta, X Chen, X Wang ACM computing surveys (CSUR) 54 (9), 1-40, 2021	1676	2021
A comprehensive survey of neural architecture search: Challenges and solutions P Ren, Y Xiao, X Chang, PY Huang, Z Li, X Chen, X Wang ACM Computing Surveys (CSUR) 54 (4), 1-34, 2021	935	2021
Videoclip: Contrastive pre-training for zero-shot video-text understanding H Xu, G Ghosh, PY Huang, D Okhonko, A Aghajanyan, F Metze, ... EMNLP 2021, 2021	671	2021
Introducing meta llama 3: The most capable openly available llm to date AI Meta Meta AI 2 (5), 6, 2024	492*	2024
Chameleon: Mixed-modal early-fusion foundation models C Team arXiv preprint arXiv:2405.09818, 2024	404	2024
Masked autoencoders that listen PY Huang, H Xu, J Li, A Baevski, M Auli, W Galuba, F Metze, ... NeurIPS 2022, 2022	393*	2022
Support-set bottlenecks for video-text representation learning M Patrick, PY Huang, Y Asano*, F Metze, A Hauptmann, J Henriques, ... ICLR 2021, 2020	308	2020
Self-Supervised Deep Correlation Tracking D Yuan, X Chang, PY Huang, Q Liu, Z He IEEE Transactions on Image Processing (TIP), 2020	277	2020
Hiera: A Hierarchical Vision Transformer without the Bells-and-Whistles C Ryali, YT Hu, D Bolya, C Wei, H Fan, PY Huang, V Aggarwal, ... ICML 2023, 2023	246	2023
Attention-based multimodal neural machine translation PY Huang, F Liu, SR Shiang, J Oh, C Dyer First Conference on Machine Translation (WMT16), 2016	234	2016
Demystifying clip data H Xu, S Xie, XE Tan, PY Huang, R Howes, V Sharma, SW Li, G Ghosh, ... ICLR 2024, 2023	221	2023
Cm3: A causal masked multimodal model of the internet A Aghajanyan, B Huang, C Ross, V Karpukhin, H Xu, N Goyal, D Okhonko, ... arXiv preprint arXiv:2201.07520, 2022	175	2022
SeamlessM4T: massively multilingual & multimodal machine translation L Barrault, YA Chung, MC Meglioli, D Dale, N Dong, PA Duquenne, ... arXiv preprint arXiv:2308.11596, 2023	171	2023
VLM: Task-agnostic Video-Language Model Pre-training for Video Understanding H Xu, G Ghosh, PY Huang, P Arora, M Aminzadeh, C Feichtenhofer, ... ACL-Findings 2021, 2021	158	2021
Structural analysis and optimization of convolutional neural networks with a small sample size RN D’souza, PY Huang, FC Yeh Scientific reports 10 (1), 834, 2020	150	2020
Video pivoting unsupervised multi-modal machine translation M Li, PY Huang, X Chang, J Hu, Y Yang, A Hauptmann IEEE Transactions on Pattern Analysis and Machine Intelligence 45 (3), 3918-3932, 2022	145	2022
Rcaa: Relational context-aware agents for person search X Chang, PY Huang, YD Shen, X Liang, Y Yang, AG Hauptmann ECCV 2018, 2018	133	2018
VoiceCraft: Zero-Shot Speech Editing and Text-to-Speech in the Wild P Peng, PY Huang, D Li, A Mohamed, D Harwath ACL 2024, 2024	109	2024

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors