Publications

  • FAM Diffusion: Frequency and Attention Modulation for High-Resolution Image Generation with Stable Diffusion
    H. Yang, A. Bulat, I. Hadji, H. X. Pham, X. Zhu, G. Tzimiropoulos and B. Martinez
    to appear in CVPR 2025 [paper]

  • Graph Guided Question Answer Generation for Procedural Question-Answering
    H. X. Pham, I. Hadji, X. Xu, Z. Degutyte, J. Rainey, E. Kazakos, A. Fazly, G. Tzimiropoulos and B. Martinez
    appeared in EACL 2024 (oral presentation) [paper]

  • Flow Graph to Video Grounding for Weakly-Supervised Multi-step Localization
    N. Dvornik, I. Hadji, H. X. Pham, D. Bhatt, B. Martinez, A. Fazly and Allan D. Jepson
    appeared in ECCV 2022 (oral presentation) [paper] [code]

  • Variational Continual Proxy-Anchor for Deep Metric Learning
    M. Kim, R. Guerrero*, H. X. Pham* and V. Pavlovic
    appeared in AISTATS 2022 [paper]

  • Cross-modal Retrieval and Synthesis (X-MRS): Closing the Modality Gap in Shared Subspace Learning
    R. Guerrero*, H. X. Pham* and V. Pavlovic
    appeared in ACM Multimedia 2021 [paper] [code]

  • CHEF: Cross-modal Hierarchical Embeddings for Food Domain Retrieval
    H. X. Pham, R. Guerrero, J. Li and V. Pavlovic
    appeared in AAAI 2021 [paper] [code]

  • Learning Continuous Facial Actions From Speech for Real-Time Animation
    H. X. Pham, Y. Wang and V. Pavlovic
    in IEEE Transactions on Affective Computing, vol. 13, no. 3, pp. 1567-1580, 1 July-Sept. 2022. Published online in Sept. 2020. [paper] [code]

Before 2019

  • PhD Thesis: Learning human facial performance: Analysis and Synthesis. [thesis URL]

  • Generative Adversarial Talking Head: Bringing Portrait To Life with a Weakly Supervised Neural Network
    H. X. Pham. Y. Wang and V. Pavlovic
    in arXiv, 2018 [paper]

  • End-to-end Learning for 3D Facial Animation from Speech
    H. X. Pham, Y. Wang and V. Pavlovic
    appeared in International Conference in Multimodal Interaction (ICMI), Oct 2018. [paper] [code]

  • Using 3D Face Priors for Depth Recovery
    C. Chen*, H. X. Pham*, V. Pavlovic, J. Cai, G. Shi, Y. Gao and H. Cheng in Journal of Visual Communication and Image Representation, Vol 48, Oct 2017. [paper]

  • Speech-driven 3D Facial Animation with Implicit Emotional Awareness: A Deep Learning Approach
    H. X. Pham, S. Cheung and V. Pavlovic
    in CVPRW 2017 [paper]

  • Robust Real-Time 3D Face Tracking from RGBD Videos under Extreme Pose, Depth, and Expression Variations
    H. X. Pham and V. Pavlovic
    in International Conference in 3D Vision (3DV), 2016 [paper]

  • Robust Real-time Performance-driven 3D Face Tracking
    H. X. Pham, V. Pavlovic, J. Cai and T. Cham
    in ICPR, 2016 [paper]

  • Depth Recover with Face Priors
    C. Chen*, H. X. Pham*, V. Pavlovic, J. Cai and G. Shi
    in ACCV, 2014 (oral presentation) [paper]

  • Hybrid On-line 3D Face and Facial Actions Tracking in RGBD Video Sequences
    H. X. Pham and V. Pavlovic
    in ICPR, 2014. [paper]