Skip to content

Unable to perceive improvement. #17

@pocketvision

Description

@pocketvision

Thanks for sharing this work, good insight and inspiring.
But I'm unable to perceive improvement of the pretrained model.
My inference of E_expression:
For images, inputs: just concat same image (1, 5, 224, 224), output: Flame parameters (5, 53), chose the center one's param ouput[2, :]
For videos, inputs: just concat same frame (1, 5, 224, 224), output: Flame parameters (5, 53), chose the center one's param ouput[2, :]. Or maybe I should try to concat continous 5 frames for E_expression input ?
My mean_shape for alignment is consist with author's.
Comparison of the results (talkinghead videos and single image reconstruction) between E_flame_without_E_expression and E_flame_with_expression:

E_flame_without_E_expression:

talkinghead_E_flame_without_E_expression.mp4

msk_E_flame_without_E_expression

obm_E_flame_without_E_expression

E_flame_with_expression:

talkinghead_E_flame_with_E_expression.mp4

msk_E_flame_with_E_expression

obm_E_flame_with_E_expression

Sorry, my test maybe not sufficient,and my preprocess maybe not accurate.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions