Zhiyao Duan

Cited by

	All	Since 2019
Citations	4424	3528
h-index	31	26
i10-index	76	61

Citations per year
Year	Citations
2009	17
2010	21
2011	42
2012	48
2013	77
2014	84
2015	96
2016	108
2017	137
2018	237
2019	324
2020	460
2021	625
2022	820
2023	1014
2024	285

Public access (based on funding mandates)

49 articles available
0 articles not available

Co-authors

Chenliang Xu	Associate Professor, University of Rochester	Verified email at rochester.edu
Bryan Pardo	Computer Science, Northwestern University	Verified email at northwestern.edu
You Zhang	PhD Candidate, University of Rochester	Verified email at rochester.edu
Changshui Zhang	Dept. Automation, Tsinghua University, Beijing, China	Verified email at mail.tsinghua.edu.cn
Ross K Maddox	University of Michigan	Verified email at umich.edu
Sefik Emre Eskimez	Microsoft	Verified email at microsoft.com
Lele Chen	Research Scientist @ Sony AI	Verified email at sony.com
Ge Zhu	PhD Candidate at University of Rochester	Verified email at rochester.edu
Yichi Zhang	Apple	Verified email at apple.com
Andrea Cogliati, PhD	LighTopTech Corp.	Verified email at lightoptech.com
Fei Jiang	Tencent Technology	Verified email at tencent.com
Gaurav Sharma	University of Rochester	Verified email at rochester.edu
Wendi B Heinzelman	University of Rochester	Verified email at rochester.edu
Karthik Dinesh	Student of University of Rochester	Verified email at ur.rochester.edu
Frank Cwitkowitz	PhD Student, University of Rochester	Verified email at ur.rochester.edu
Yapeng Tian	Assistant Professor, University of Texas at Dallas	Verified email at utdallas.edu
Jing Shi	Research Scientist, Adobe	Verified email at adobe.com
Brendt Wohlberg	Los Alamos National Laboratory	Verified email at lanl.gov
Mojtaba (Moji) Heydari	University of Rochester	Verified email at ur.rochester.edu
Jinyu Han	Meta	Verified email at u.northwestern.edu

Zhiyao Duan

Electrical and Computer Engineering, University of Rochester

Verified email at rochester.edu

Computer Audition, Music Information Retrieval, Audio-Visual Processing, Machine Learning


Title	Cited by	Year
Audio-visual event localization in unconstrained videos. Y Tian, J Shi, B Li, Z Duan, C Xu. Proceedings of the European Conference on Computer Vision (ECCV), 247-263, 2018	421	2018
Hierarchical cross-modal talking face generation with dynamic pixel-wise loss. L Chen, RK Maddox, Z Duan, C Xu. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2019	364	2019
Automatic Music Transcription: An Overview. E Benetos, S Dixon, Z Duan, S Ewert. IEEE Signal Processing Magazine 36 (1), 20-30, 2018	292	2018
Lip movements generation at a glance. L Chen, Z Li, RK Maddox, Z Duan, C Xu. Proceedings of the European Conference on Computer Vision (ECCV), 520-535, 2018	226	2018
Deep Cross-Modal Audio-Visual Generation. L Chen, S Srivastava, Z Duan, C Xu. Proceedings of the on Thematic Workshops of ACM Multimedia 2017, 349-357, 2017	219	2017
Multiple fundamental frequency estimation by modeling spectral peaks and non-peak regions. Z Duan, B Pardo, C Zhang. IEEE Transactions on Audio, Speech, and Language Processing 18 (8), 2121-2133, 2010	215	2010
One-class learning towards synthetic voice spoofing detection. Y Zhang, F Jiang, Z Duan. IEEE Signal Processing Letters 28, 937-941, 2021	176	2021
Creating a Multitrack Classical Music Performance Dataset for Multimodal Music Analysis: Challenges, Insights, and Applications. B Li, X Liu, K Dinesh, Z Duan, G Sharma. IEEE Transactions on Multimedia 21 (2), 522-535, 2018	160	2018
Soundprism: An online system for score-informed source separation of music audio. Z Duan, B Pardo. IEEE Journal of Selected Topics in Signal Processing 5 (6), 1205-1215, 2011	129	2011
Unsupervised single-channel music source separation by average harmonic structure modeling. Z Duan, Y Zhang, C Zhang, Z Shi. IEEE Transactions on Audio, Speech, and Language Processing 16 (4), 766-778, 2008	121	2008
Bidirectional GRU for sound event detection. R Lu, Z Duan. Detection and Classification of Acoustic Scenes and Events, 1-3, 2017	78	2017
Unsupervised Learning Approach to Feature Analysis for Automatic Speech Emotion Recognition. SE Eskimez, Z Duan, W Heinzelman. 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018	74	2018
Multi-pitch streaming of harmonic sound mixtures. Z Duan, J Han, B Pardo. IEEE/ACM Transactions on Audio, Speech, and Language Processing 22 (1), 138-150, 2014	68	2014
Online PLCA for Real-Time Semi-supervised Source Separation. Z Duan, G Mysore, P Smaragdis. International Conference on Latent Variable Analysis and Signal Separation …, 2012	67	2012
A state space model for online polyphonic audio-score alignment. Z Duan, B Pardo. IEEE International Conference on Acoustics, Speech and Signal Processing …, 2011	65	2011
Speech enhancement by online non-negative spectrogram decomposition in nonstationary noise environments. Z Duan, GJ Mysore, P Smaragdis. Thirteenth Annual Conference of the International Speech Communication …, 2012	64	2012
Generating talking face landmarks from speech. SE Eskimez, RK Maddox, C Xu, Z Duan. Latent Variable Analysis and Signal Separation: 14th International …, 2018	58	2018
Siamese Style Convolutional Neural Networks for Sound Search by Vocal Imitation. Y Zhang, B Pardo, Z Duan. IEEE/ACM Transactions on Audio, Speech, and Language Processing 27 (2), 429-441, 2018	57	2018
RL-Duet: Online Music Accompaniment Generation Using Deep Reinforcement Learning. N Jiang, S Jin, Z Duan, C Zhang. arXiv preprint arXiv:2002.03082, 2020	56	2020
