Publications

(*) denotes equal contribution

2026

  1. An Optimal Transport-driven Approach for Cultivating Latent Space in Online Incremental Learning
    Quyen Tran*, Hai Nguyen*, Quan Dao, Hoang Phan*, Linh Van, Khoat Than, Dinh Phung, Dimitris Metaxas, and Trung Le
    In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Jun 2026
  2. Multi-Patch Global-to-Local Transformer Architecture For Efficient Flow Matching and Diffusion Model
    Quan Dao and Dimitris Metaxas
    In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Jun 2026
  3. Discrete Noise Inversion for Next-scale Autoregressive Text-based Image Editing
    Quan Dao*, Xiaoxiao He*, Ligong Han, Ngan Hoai Nguyen, Amin Heyrani Nobar, Faez Ahmed, Han Zhang, Viet Anh Nguyen, and Dimitris Metaxas
    In European Conference on Computer Vision, Jun 2026

2025

  1. AutoEdit: Automatic Hyperparameter Tuning for Image Editing
    Chau Pham, Quan Dao, Mahesh Bhosale, Yunjie Tian, Dimitris Metaxas, and David Doermann
    In The Thirty-ninth Annual Conference on Neural Information Processing Systems, Jun 2025
  2. Improved Training Technique for Latent Consistency Models
    Quan Dao*, Khanh Doan*, Di Liu, Trung Le, and Dimitris Metaxas
    In International Conference on Learning Representations, Jun 2025
  3. Self-Corrected Flow Distillation for Consistent One-Step and Few-Step Text-to-Image Generation
    Quan Dao*, Hao Phung*, Trung Dao, Dimitris Metaxas, and Anh Tran
    In Proceedings of the AAAI Conference on Artificial Intelligence, Jun 2025

2024

  1. DICE: Discrete Inversion Enabling Controllable Editing for Multinomial Diffusion and Masked Generative Models
    Xiaoxiao He*, Quan Dao*, Ligong Han, Song Wen, Minhao Bai, Di Liu, Han Zhang, Martin Renqiang Min, Felix Juefei-Xu, Chaowei Tan, and 1 more author
    arXiv preprint arXiv:2410.08207, Jun 2024
  2. DiMSUM: Diffusion Mamba - A Scalable and Unified Spatial-Frequency Method for Image Generation
    Hao Phung*, Quan Dao*, Trung Dao, Hoang Phan, Dimitris Metaxas, and Anh Tran
    In The Thirty-eighth Annual Conference on Neural Information Processing Systems, Jun 2024
  3. A High-Quality Robust Diffusion Framework for Corrupted Dataset
    Quan Dao*, Binh Ta*, Tung Pham, and Anh Tran
    In European Conference on Computer Vision, Jun 2024

2023

  1. Flow Matching in Latent Space
    Quan Dao*, Hao Phung*, Binh Nguyen, and Anh Tran
    arXiv preprint arXiv:2307.08698, Jun 2023
  2. Anti-DreamBooth: Protecting users from personalized text-to-image synthesis
    Thanh Van Le*, Hao Phung*, Thuan Hoang Nguyen*, Quan Dao*, Ngoc Tran, and Anh Tran
    In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Oct 2023
  3. Wavelet Diffusion Models Are Fast and Scalable Image Generators
    Hao Phung*, Quan Dao*, and Anh Tran
    In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Jun 2023