Supplementary Material for Large Pose 3D Face Reconstruction from a Single
Image via Direct Volumetric CNN Regression
Aaron S. Jackson¹    Adrian Bulat¹    Vasileios Argyriou²    Georgios Tzimiropoulos¹
¹The University of Nottingham, UK    ²Kingston University, UK
¹{aaron.jackson, adrian.bulat, yorgos.tzimiropoulos}@nottingham.ac.uk
²vasileios.argyriou@kingston.ac.uk
[Figures 1 and 2: cumulative error curves plotting % of images (0% to 100%) against NME normalised by outer interocular distance (0 to 0.1), for VRN - Guided (ours), VRN - Multitask (ours), VRN (ours), 3DDFA and EOS λ = 5000.]

Figure 1: NME-based performance on the in-the-wild AFLW2000-3D dataset, where ICP has been used to remove the rigid transformation. The proposed Volumetric Regression Networks are compared against EOS and 3DDFA.

Figure 2: NME-based performance on our large pose and expression renderings of the BU4DFE dataset, where ICP has been used to remove the rigid transformation. The proposed Volumetric Regression Networks are compared against EOS and 3DDFA.
1. Results with ICP Registration

   We present results where ICP has been used not only to find the correspondence between the groundtruth and predicted vertices, but also to remove the rigid transformation between them. We find that this offers a marginal improvement to all methods; however, the relative ranking of the methods remains mostly the same. Results on AFLW2000 [5], BU4DFE [3] and Florence [1] can be seen in Figs. 1, 2 and 3 respectively. Numeric results can be found in Table 1.

Table 1: Reconstruction accuracy on AFLW2000-3D, BU4DFE and Florence in terms of NME, where ICP has been used to remove the rigid transformation. Lower is better.

Method             AFLW2000    BU4DFE    Florence
VRN                  0.0605     0.0514     0.0470
VRN - Multitask      0.0625     0.0533     0.0439
VRN - Guided         0.0543     0.0471     0.0429
3DDFA [5]            0.1012     0.1144     0.0784
EOS [2]              0.0890     0.1456     0.1200
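For concreteness, the sketch below (Python with NumPy/SciPy; not the authors' released evaluation code) shows one way this protocol could be implemented: a simple point-to-point ICP alternating nearest-neighbour correspondence with a Kabsch rigid fit, followed by the NME normalised by the outer interocular distance. The function names and the outer_interocular_dist input are illustrative assumptions.

import numpy as np
from scipy.spatial import cKDTree

def best_rigid_transform(src, dst):
    # Kabsch: least-squares R, t such that dst ≈ src @ R.T + t.
    mu_s, mu_d = src.mean(axis=0), dst.mean(axis=0)
    H = (src - mu_s).T @ (dst - mu_d)
    U, _, Vt = np.linalg.svd(H)
    d = np.sign(np.linalg.det(Vt.T @ U.T))    # guard against reflections
    R = Vt.T @ np.diag([1.0, 1.0, d]) @ U.T
    return R, mu_d - R @ mu_s

def icp_align(pred, gt, iters=50, tol=1e-7):
    """Rigidly align predicted vertices (N x 3) to groundtruth vertices
    (M x 3) by iterating nearest-neighbour matching and a rigid fit."""
    tree = cKDTree(gt)
    aligned, prev_err = pred.copy(), np.inf
    for _ in range(iters):
        dist, idx = tree.query(aligned)       # closest groundtruth vertex
        R, t = best_rigid_transform(aligned, gt[idx])
        aligned = aligned @ R.T + t
        err = dist.mean()
        if prev_err - err < tol:              # converged
            break
        prev_err = err
    return aligned

def nme(pred, gt, outer_interocular_dist):
    """Mean distance to the closest groundtruth vertex after ICP,
    normalised by the outer interocular distance (x-axis of Figs. 1-3)."""
    aligned = icp_align(pred, gt)
    dist, _ = cKDTree(gt).query(aligned)
    return dist.mean() / outer_interocular_dist

Note that only rotation and translation are estimated here, since it is only the rigid transformation that is removed before computing the error.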
2. Results on 300VW

   To demonstrate that our method can work in unconstrained environments and on video, we ran our VRN - Guided method on some of the more challenging Category C footage from the 300VW [4] dataset. These videos are usually challenging for at least one of the following reasons: large pose, low quality video, heavy motion blur and occlusion. We produce these results on a frame-by-frame basis; each frame is regressed individually, without tracking. Videos will be made available on our project website and can also be found in the supplementary material.
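As a hypothetical sketch of this per-frame protocol (Python with OpenCV; the vrn_guided callable standing in for the trained network is an assumption, not part of any released code):

import cv2

def reconstruct_video(video_path, vrn_guided):
    """Run the network on every frame of a clip independently."""
    cap = cv2.VideoCapture(video_path)
    volumes = []
    while True:
        ok, frame = cap.read()
        if not ok:                  # end of video
            break
        # No state is carried between frames: no tracking and no
        # initialisation from the previous reconstruction.
        volumes.append(vrn_guided(frame))
    cap.release()
    return volumes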
[Figure 3: cumulative error curve plotting % of images (0% to 100%) against NME normalised by outer interocular distance (0 to 0.1), for VRN - Guided (ours), VRN - Multitask (ours), VRN (ours), 3DDFA and EOS λ = 5000.]

Figure 3: NME-based performance on our large pose renderings of the Florence dataset, where ICP has been used to remove the rigid transformation. The proposed Volumetric Regression Networks are compared against EOS and 3DDFA.

Figure 4: Some failure cases on AFLW2000-3D from our VRN - Guided network. In general, these images are difficult poses not seen during training.
3. Additional qualitative results

   This section provides additional visual results and comparisons. Failure cases are shown in Fig. 4. These are mostly unusual poses which cannot be found in the training set, or are not covered by the augmentation described in Section 3.4 of our paper. In Fig. 5 we show a visual comparison between VRN and VRN - Guided; the differences are quite minor. Finally, in Fig. 6 we show some typical examples from our renderings of BU-4DFE [3] and Florence [1], taken from their respective testing sets.

Figure 5: A visual comparison between VRN and VRN - Guided. The main difference is that the projection of the volume has a better fit around the shape of the face.

Figure 6: Examples of rendered images from (a) BU4DFE (containing large poses and expressions), and (b) Florence (containing large poses) datasets.
References
[1] A. D. Bagdanov, I. Masi, and A. Del Bimbo. The Florence 2D/3D hybrid face dataset. In Proc. of ACM Multimedia Int'l Workshop on Multimedia Access to 3D Human Objects (MA3HO'11). ACM Press, December 2011.
[2] P. Huber, G. Hu, R. Tena, P. Mortazavian, W. P. Koppen, W. Christmas, M. Rätsch, and J. Kittler. A multiresolution 3D morphable face model and fitting framework. In VISAPP, 2016.
[3] L. Yin, X. Chen, Y. Sun, T. Worm, and M. Reale. A high-resolution 3D dynamic facial expression database. In 8th IEEE International Conference on Automatic Face & Gesture Recognition (FG'08), pages 1–6. IEEE, 2008.
[4] S. Zafeiriou, G. Tzimiropoulos, and M. Pantic. The 300 videos in the wild (300-VW) facial landmark tracking in-the-wild challenge. In ICCV Workshop, 2015.
[5] X. Zhu, Z. Lei, X. Liu, H. Shi, and S. Z. Li. Face alignment across large poses: A 3D solution. In CVPR, 2016.