About Me!

I am a third-year PhD student in the Kahlert School of Computing at the University of Utah, working with Professor Neal Patwari.

My research centers on computer vision and machine learning, with an emphasis on multimodal perception and audio-visual learning. I am particularly interested in developing intelligent systems that learn from and fuse multiple sensory modalities, especially visual and auditory signals, to better understand complex real-world environments. My work also extends to wireless communication systems, where I use real-world Channel Impulse Response (CIR) data to estimate spectrum occupancy and develop efficient spectrum reuse strategies for next-generation communication networks.

News

  • June 2025 - ICCV 2025 paper on Material-Controlled RIR Generation
  • Jan 2025 - Teaching Assistant for CS6960 Multimodal LLM Agents
  • Jan 2026 - Teaching Assistant for CS6962 Automatic Speech Recognition and Accent
  • Feb 2026 - Conducted tutorials for CS6962 on getting started with PyTorch, creating custom ASR models, and observing biases in current state-of-the-art ASR models
  • March 2026 - CVPR 2026 Findings Paper on Realistic RIR Estimation
  • April 2026 - Awarded the Kahlert Impact Award at the Kahlert School of Computing

Publications

  • M-CAPA
    How Would It Sound? Material-Controlled Multimodal Acoustic Profile Generation for Indoor Scenes
    Mahnoor Fatima Saad, Ziad Al-Halah
    International Conference on Computer Vision (ICCV), 2025
    Project Page | Code | Data | PDF | BibTeX | arXiv
  • MatRIR
    Materialistic RIR: Material Conditioned Realistic RIR Generation
    Mahnoor Fatima Saad, Sagnik Majumder, Kristen Grauman, Ziad Al-Halah
    IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Findings, 2026
    Project Page | Code | Data | PDF | BibTeX | arXiv