Quan Dao

CS PhD student

mypro.jpg

I am 2nd PhD student at Rutgers University under supervision of Distinguished Prof. Dimitris Metaxas. My research focuses on generative models, specifically diffusion models and visual autoregressive models, with a primary emphasis on fundamental research. For diffusion models, I concentrate on developing efficient and robust training methodologies. During my PhD, I was very lucky to do internship in Apple MLR. Previously, I was a Research Resident under the supervision of Dr. Tuan Anh Tran at QualcommAI Research, Vietnam (which was VinAI research) and spent 2 wonderful years there. I received a bachelor degree in computer science from Monash University in 2020.


news

Feb 28, 2025 :zap: AutoEdit got accepted at NeurIPS 2025. This paper proposes RL-based method to select hyperparameters for diffusion editing technique
Feb 28, 2025 :zap: DICE got accepted at CVPR 2025. This paper proposes editing technique for discrete diffusion model)
Jan 22, 2025 :zap: Improved Latent Consistency Model got accepted at ICLR 2025. This paper proposes series of novel techniques like Cauchy loss, OT coupling, adaptive robust scale scheduler and diff loss at early timestep to efficiently train latent consistency model from scatch. Our technique bridges the performance gap between LDM and LCM training. (this is the first work discovering the unstability of consistency model on latent space due to impulsive outlier.)
Dec 10, 2024 :zap: SCFlow got accepted at AAAI 2025. This is the first work attempting to distill flow matching model into one and few step generation. With SCFlow, we could achieve consistent one and few step generation, which means starting from a noise, no matter how many NFEs is used for sampling, the final generated image is indentical.
Sep 23, 2024 :zap: Yummy DimSUM got accepted at NeurIPS 2024. DimSUM proposes novel hybrid transformer-mamba architecture allowing faster convergence training of diffusion/flow matching model and also achieve SoTA image generation.
Jul 21, 2024 :zap: RDUOT got accepted at ECCV 2024. This paper combines UOT generative framework with diffusion noising to allow train fast-converged and robust generative framework.
Jul 13, 2023 :zap: Antidreambooth got accepted at ICCV 2023. AntiDreambooth adds small undistinguished noise to your images to break the malicous explotation of Dreambooth on your images.
Feb 26, 2023 :zap: My first paper Wavediff got accepted at CVPR 2023. Wavediff proposes the frequency-aware Unet architecture allowing fast converence training for DiffusionGAN framework.

selected publications

  1. autoedit.png
    AutoEdit: Automatic Hyperparameter Tuning for Image Editing
    Chau Pham, Quan Dao, Mahesh Bhosale, Yunjie Tian, Dimitris Metaxas, and David Doermann
    In The Thirty-nine Annual Conference on Neural Information Processing Systems, 2025
  2. varin.png
    Discrete Noise Inversion for Next-scale Autoregressive Text-based Image Editing
    Quan Dao, Xiaoxiao He, Ligong Han, Ngan Hoai Nguyen, Amin Heyrani Nobar, Faez Ahmed, Han Zhang, Viet Anh Nguyen, and Dimitris Metaxas
    arXiv preprint arXiv:2509.01984, 2025
  3. sLCT.png
    Improved Training Technique for Latent Consistency Models
    Quan Dao*, Khanh Doan*, Di Liu, Trung Le, and Dimitris Metaxas
    In International Conference on Learning Representations, 2025
  4. dice.png
    DICE: Discrete Inversion Enabling Controllable Editing for Multinomial Diffusion and Masked Generative Models
    Xiaoxiao He, Ligong Han, Quan Dao, Song Wen, Minhao Bai, Di Liu, Han Zhang, Martin Renqiang Min, Felix Juefei-Xu, Chaowei Tan, and 1 more author
    arXiv preprint arXiv:2410.08207, 2024
  5. dimsum.png
    DiMSUM: Diffusion Mamba - A Scalable and Unified Spatial-Frequency Method for Image Generation
    Hao Phung*, Quan Dao*, Trung Dao, Hoang Phan, Dimitris Metaxas, and Anh Tran
    In The Thirty-eighth Annual Conference on Neural Information Processing Systems, 2024
  6. SCflow.png
    Self-Corrected Flow Distillation for Consistent One-Step and Few-Step Text-to-Image Generation
    Quan Dao*, Hao Phung*, Trung Dao, Dimitris Metaxas, and Anh Tran
    In Association for the Advancement of Artificial Intelligence, 2025
  7. rduot.png
    A High-Quality Robust Diffusion Framework for Corrupted Dataset
    Quan Dao*, Binh Ta*, Tung Pham, and Anh Tran
    In European Conference on Computer Vision, 2024
  8. flow.png
    Flow Matching in Latent Space
    Quan Dao*, Hao Phung*, Binh Nguyen, and Anh Tran
    arXiv preprint arXiv:2307.08698, 2023
  9. anti.png
    Anti-DreamBooth: Protecting users from personalized text-to-image synthesis
    Thanh Van Le*, Hao Phung*, Thuan Hoang Nguyen*, Quan Dao*, Ngoc Tran, and Anh Tran
    In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Oct 2023
  10. single_wavelet.png
    Wavelet Diffusion Models Are Fast and Scalable Image Generators
    Hao Phung*, Quan Dao*, and Anh Tran
    In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Jun 2023