AIR Lab

News

2025.08 - Congrats on Yujia's successful PhD defense! Well done, Dr. Yujia Yan!

2025.06 - Congrats on Ge's impressive PhD defense! Well deserved, Dr. Ge Zhu!

2025.06 - Xingjian, Huiran, Christos and Melissa are out for summer internships at MERL, Bosch, Smule, and Meta!

2025.06 - Congrats on Yujia's full-time researcher position at Cartesia!

2025.06 - Congrats on Neil's full-time senior researcher position at Dolby!

2025.05 - Congrats to Kyungbok Lee (BS in ECE), Paul Berggren (BS in CS), and Brynn Lee (MS in Data Science) on their graduation!

2025.05 - Zhiyao will be promoted to Full Professor on July 1, 2025! A Big Thank You to all collaborators!

2025.03 - Congrats on Moji's full-time research engineer position at Apple!

2025.02 - Zhiyao is invited to give a talk at AAAI 2025 Workshop on Artificial Intelligence for Music.

2024.11 - Congratulations to Yujia for his paper on piano transcription receiving a best paper nomination at ISMIR 2024! Try his awesome model on any piano recording you can find!

2024.11 - Congrats to Zhiyao and Stella on playing a violin-piano duet at the conference jam session!

2024.11 - Congratulations to Neil on receiving a 2024 IEEE SPS Scholarship among 45 recipients worldwide!

2024.10 - Zhiyao is invited to give a talk at SANE 2025 on speech synthesis and another talk at Boston Music AI Meetup on AI-Powered Music Production.

2024.10 - We are proud to receive an unrestricted gift and the Audiobox license from Meta. Thank you Meta Audiobox!

2024.09 - We are grateful to be on a team with Honeywell, UT Dallas, and Texas A&M to work on the IARPA Anonymous Real-Time Speech (ARTS) program. UR had a news release.

2024.09 - Congratulations to our undergraduate student, Kyungbok Lee (CS'25), on his first first-authored peer-reviewed paper at IEEE MMSP! He also won a student travel award from this workshop and the University.

2024.09 - Welcome two new PhD students, Stella Wong and Baotong Tian! Welcome back current PhD students, Moji, Neil and Melissa, from internships and visits at Apple, Meta and Trinity College Dublin!

2024.08 - Congratulations to our undergraduate student Paul Berggren (ECE'25) on receiving the Tau Beta Pi engineering honor society award (see news here)!

2024.05 - Congratulations to our new graduate, Yutong Wen (AME'24), who will pursue a PhD at UIUC this fall! Congratulations also to last year's MS graduate, Qiaoyu Yang, who will pursue a PhD at Georgia Tech this fall!

2024.03 - Neil, Yongyi and I are proud to co-organize the Singing Voice Deepfake Detection (SVDD) challenge at the IEEE Spoken Language Technology Workshop (SLT) 2024, together with Jiatong Shi from CMU and Ryuichi Yamamoto and Prof. Tomoki Toda from Nagoya University.

2024.01 - Zhiyao became the President of the International Society for Music Information Retrieval (ISMIR) for the term of 2024-2025.

2023.11 - Christos, Prof. Philippe Pasquier from Simon Fraser University, and I delivered a tutorial on Computer-Assisted Music-Making Systems: Taxonomy, Review, and Live Coding. In addition to a comprehensive review, it also features a live coding session on building a real-time musical agent using Euterpe, a prototyping framework for creating music interactions on the web.

2023.11 - Neil's NIJ fellowship and research were covered by News10NBC (Rochester local TV station) and in a nice article!

2023.10 - Congratulations to Melissa for winning an NSF NRT travel award for a three-month visit to Trinity College Dublin in Summer 2024!

2023.10 - Neil Zhang received a 2023 National Institute of Justice (NIJ) Graduate Research Fellowship among a total of 24 awardees nationwide. Congratulations, Neil!

2023.09 - Yutong Wen received a 2023 University of Rochester Undergraduate Research Presentation Award and a WASPAA 2023 travel grant! These awards will support his travel to WASPAA 2023 to present his first-authored paper.

2023.07 - Undergrads in AIR lab are doing awesome research! Yongyi and Yutong had their first first-authored publications at Interspeech 2023 and WASPAA 2023, respectively. They were both co-mentored by Neil!

2023.06 - Our paper co-authored by You (Neil) Zhang, Yuxiang Wang, and Zhiyao Duan was recognized as among the top 3% of all accepted papers at ICASSP 2023! Neil was also selected as one of the 24 presenters at the inaugural "Rising Star Program in Signal Processing". Zhiyao received an outstanding reviewer award.

2023.06 - Several AIR lab PhD students will go for summer internships at Adobe, Meta, Microsoft, and Sony.

2023.05 - Congratulations to our MS graduates, Qiaoyu Yang (ECE) and Zehua Li (TEAM), and undergraduate graduates, Yongyi Zang (AME) and Enting Zhou (CS)! All the Best!!

2023.05 - Yongyi Zang received a 2023 University of Rochester Undergraduate Research Presentation Award! This award will support his travel to Interspeech 2023 to present his first-authored paper.

2023.04 - Yongyi Zang gave an oral presentation and presented a poster about "Euterpe" at the Undergraduate Research Exposition. Yiyang Wang and Neil Zhang presented at the Graduate Research Symposium.

2023.03 - Zhiyao Duan and our collaboration with industry partner IngenID are featured in this article.

2023.02 - Four papers from AIR lab are accepted by ICASSP 2023. Congrats to all students and collaborators!

2023.01 - Zhiyao Duan was on the WXXI Connections radio program together with Raffaella Borasi and Blaire Koerner, discussing how artificial intelligence may affect the music industry. This is an interview with host Mona Seghatoleslami about our NSF project on "Toward an Ecosystem of Artificial Intelligence-Powered Music Production (TEAMuP)". Listen to the recording here.

2022.12 - AIR lab is awarded a New York State Center of Excellence in Data Science grant to develop and deploy spoofing aware speaker verification systems with IngenID. Thank you, NYS!

2022.11 - AIR lab is awarded seed funding from the University of Rochester Goergen Institute for Data Science to investigate Personalized Immersive Spatial Audio with Physics Informed Neural Field.

2022.09 - NSF grants $1.8M to a fantastic and diverse team of researchers between UofR and Northwestern to build foundations for AI-powered music production ecosystems! AIR lab and Interactive Audio Lab at Northwestern will co-lead the technical component of this project. Thank you, NSF!

2022.04 - Several AIR lab members will go for summer internships at ByteDance, Yousician and Tencent.

2022.04 - We are happy to release the web-based version of BachDuet. Everyone with a web browser can now improvise with AI in real time in the style of Bach counterpoint! Great work, Yongyi, Christos and Tianyu!

2021.10 - AIR lab is awarded a New York State Center of Excellence in Data Science grant and a University of Rochester Goergen Institute for Data Science grant. Thank you, NYS and UR!

2021.05 - Congratulations to Bochen Li on winning a 2021 Outstanding PhD Dissertation Award at the University of Rochester! Well deserved!

2021.05 - Several AIR lab members are going for summer internships this year at Adobe, ByteDance, Chordify, Tencent, and Pandora.

2020.08 - Check out AIR lab's YouTube channel!

2019.12 - Our Vroom! search engine for sounds using vocal imitation as queries is online!

2019.10 - Check out our demo video of BachDuet, a system for real-time interactive duet counterpoint improvisation between human and machine in the Bach chorale style. A brief description of the system is here.

2019.10 - Check out the AIR lab production for the ISMIR2019 Call for Music - Variations on ISMIR: some funny reflections on AI.

Welcome to AIR!

At the AIR lab, we conduct research in the emerging field of computer audition, i.e., designing computational systems that are able to analyze and understand sounds including music, speech, and environmental sounds. We address fundamental issues such as parsing polyphonic auditory scenes (the cocktail party effect), as well as designing novel applications such as sound retrieval and music information retrieval. We also combine sound analysis with the analysis of other signal modalities such as text and video towards multi-modal scene analysis. Various projects that we have been working on include audio source separation, automatic music transcription, audio-score alignment, speech enhancement, speech diarization and emotion recognition, sound retrieval, sound event detection, and audio-visual scene understanding.

Position Openings

We are looking for highly motivated students to join the AIR lab. Students are expected to have a solid background in mathematics, programming, and academic writing. Experience in music activities is a plus. Most importantly, students should be passionate about doing research in the exciting fields of computer audition, music information retrieval, and multimodal learning. If you are interested, please apply to the ECE Ph.D. program, and mention Prof. Zhiyao Duan in your application. If you are a master's or undergrad student at UR and want to do a project/thesis in the AIR lab, please send Dr. Duan an email or stop by his office at Room 720 in the Computer Studies Building.

Funding Sources

Our work is funded by the National Science Foundation under grants No. 1617107, titled "III: Small: Collaborative Research: Algorithms for Query by Example of Audio Databases" (project website), No. 1741472, titled "BIGDATA: F: Audio-Visual Scene Understanding" (project website), No. 1846184, titled "CAREER: Human-Computer Collaborative Music Making" (project website), and No. 2222129, titled "Collaborative Research: FW-HTF-R: Toward an Ecosystem of Artificial Intelligence-Powered Music Production (TEAMuP)". Our work is also funded by the National Institutes of Health (NIH), the National Institute of Justice (NIJ), the National Artificial Intelligence Research Resource (NAIRR) Pilot, the New York State Center of Excellence (CoE) in Data Science, University of Rochester internal awards on AR/VR, health analytics, data science, and integrated research computing, as well as gifts from Adobe, ByteDance, Kwai, IngenID, and Microsoft. We are very grateful for their support!