Publications

FAM Diffusion: Frequency and Attention Modulation for High-Resolution Image Generation with Stable Diffusion
H. Yang, A. Bulat, I. Hadji, H. X. Pham, X. Zhu, G. Tzimiropoulos and B. Martinez
to appear in CVPR 2025 [paper]

Graph Guided Question Answer Generation for Procedural Question-Answering
H. X. Pham, I. Hadji, X. Xu, Z. Degutyte, J. Rainey, E. Kazakos, A. Fazly, G. Tzimiropoulos and B. Martinez
appeared in EACL 2024 (oral presentation) [paper]

Flow Graph to Video Grounding for Weakly-Supervised Multi-step Localization
N. Dvornik, I. Hadji, H. X. Pham, D. Bhatt, B. Martinez, A. Fazly and Allan D. Jepson
appeared in ECCV 2022 (oral presentation) [paper] [code]

Variational Continual Proxy-Anchor for Deep Metric Learning
M. Kim, R. Guerrero*, H. X. Pham* and V. Pavlovic
appeared in AISTATS 2022 [paper]

Cross-modal Retrieval and Synthesis (X-MRS): Closing the Modality Gap in Shared Subspace Learning
R. Guerrero*, H. X. Pham* and V. Pavlovic
appeared in ACM Multimedia 2021 [paper] [code]

CHEF: Cross-modal Hierarchical Embeddings for Food Domain Retrieval
H. X. Pham, R. Guerrero, J. Li and V. Pavlovic
appeared in AAAI 2021 [paper] [code]

Learning Continuous Facial Actions From Speech for Real-Time Animation
H. X. Pham, Y. Wang and V. Pavlovic
in IEEE Transactions on Affective Computing, vol. 13, no. 3, pp. 1567-1580, 1 July-Sept. 2022. Published online in Sept. 2020. [paper] [code]

Before 2019

PhD Thesis: Learning human facial performance: Analysis and Synthesis. [thesis URL]

Generative Adversarial Talking Head: Bringing Portrait To Life with a Weakly Supervised Neural Network
H. X. Pham. Y. Wang and V. Pavlovic
in arXiv, 2018 [paper]

End-to-end Learning for 3D Facial Animation from Speech
H. X. Pham, Y. Wang and V. Pavlovic
appeared in International Conference in Multimodal Interaction (ICMI), Oct 2018. [paper] [code]

Using 3D Face Priors for Depth Recovery
C. Chen*, H. X. Pham*, V. Pavlovic, J. Cai, G. Shi, Y. Gao and H. Cheng in Journal of Visual Communication and Image Representation, Vol 48, Oct 2017. [paper]

Speech-driven 3D Facial Animation with Implicit Emotional Awareness: A Deep Learning Approach
H. X. Pham, S. Cheung and V. Pavlovic
in CVPRW 2017 [paper]

Robust Real-Time 3D Face Tracking from RGBD Videos under Extreme Pose, Depth, and Expression Variations
H. X. Pham and V. Pavlovic
in International Conference in 3D Vision (3DV), 2016 [paper]

Robust Real-time Performance-driven 3D Face Tracking
H. X. Pham, V. Pavlovic, J. Cai and T. Cham
in ICPR, 2016 [paper]

Depth Recover with Face Priors
C. Chen*, H. X. Pham*, V. Pavlovic, J. Cai and G. Shi
in ACCV, 2014 (oral presentation) [paper]

Hybrid On-line 3D Face and Facial Actions Tracking in RGBD Video Sequences
H. X. Pham and V. Pavlovic
in ICPR, 2014. [paper]