|
|
|
|
Publications |
|
Journal Articles | |
1. |
D. A. M. G. Wisnu, S. Rini, R. E. Zezario, H.-M. Wang, and Y. Tsao, "HAAQI-Net: A Non-intrusive Neural Music Audio Quality Assessment Model for Hearing Aids," to appear in IEEE/ACM Transactions on Audio, Speech, and Language Processing. ::: |
2. |
X. Gao, Y. Chen, X. Yue, Y. Tsao, and N. F. Chen, "TTSlow: Slow Down Text-to-Speech with Efficiency Robustness Evaluations," to appear in IEEE/ACM Transactions on Audio, Speech, and Language Processing. ::: |
3. |
C.-H. Hsin, C.-Y. Lee, and Y. Tsao, "Exploring N400 Predictability Effects during Sustained Speech Comprehension: From Listening-Related Fatigue to Speech Enhancement Evaluation," to appear in Ear and Hearing. ::: |
4. |
A. Hashmi, S. A. Shahzad, C.-W. Lin, Y. Tsao, and H.-M. Wang, "AVTENet: A Human-Cognition-Inspired Audio-Visual Transformer-Based Ensemble Network for Video Deepfake Detection," to appear in IEEE Transactions on Cognitive and Developmental Systems. ::: |
5. |
M.-C. Yen, C.-H. Wu, S.-W. Tsai, J.-S. R. Jang, Y. Tsao, A. Hussain, and H.-M. Wang, "Mandarin Electrolaryngeal Speech Voice Conversion with Speech Encoder Loss Learning and Seq2seq Modeling," to appear in IEEE Internet of Things Magazine. ::: |
6. |
Y.-R. Chien, P.-H. Chou, Y.-J. Peng, C.-Y. Huang, H.-W. Tsao, and Y. Tsao, "NGGAN: Noise Generation GAN Based on the Practical Measurement Dataset for Narrowband Powerline Communications," IEEE Transactions on Instrumentation and Measurement, volume 26, pages 1-16, December 2024. ::: |
7. |
W.‑L. Mao, C.‑C. Wang, P.‑H. Chou, K.‑C. Liu, and Y. Tsao, "MECKD: Deep Learning-Based Fall Detection in Multi-Layer Mobile Edge Computing With Knowledge Distillation," IEEE Sensors Journal, volume 24, pages 42195-42209, December 2024. ::: |
8. |
K.-C. Liu, S.-Y. Peng, Y. Tsao, C.-Y. Liu, Z.-A. Chen, Z.-H. Han, W.-C. Chen, P.-Q. Hsieh, and Y.-J. Hsu, "A Cross-Modal Autoencoder for Contactless Electrocardiography Monitoring Using Frequency-Modulated Continuous Wave Radar," IEEE Sensors Journal, volume 24, pages 41462-41473, December 2024. ::: |
9. |
H.-T. Chiang, S.-W. Fu, H.-M. Wang, Y. Tsao, and J. H. L. Hansen, "Multi-objective Non-intrusive Hearing-aid Speech Assessment Model," Journal of the Acoustical Society of America (JASA), volume 156, pages 3574-3587, November 2024. ::: |
10. |
K.-C. Wang, K.-C. Liu, P.-C. Yeh, S.-Y. Peng, and Y. Tsao, "TrustEMG-Net: Using Representation-Masking Transformer with U-Net for Surface Electromyography Enhancement," IEEE Journal of Biomedical and Health Informatics, volume 1, pages 1-14, October 2024, Dataset_Model: http://surl.li/qejfig ::: |
11. |
S.-S. Wang, J.-Y. Chen, B.-R. Bai, S.-H. Fang, and Y. Tsao, "Unsupervised Face-Masked Speech Enhancement Using Generative Adversarial Networks with Human-in-the-Loop Assessment Metrics," IEEE/ACM Transactions on Audio, Speech, and Language Processing, volume 32, pages 3826-3837, July 2024. ::: |
12. |
E. Cooper, W.-C. Huang, Y. Tsao, H.-M. Wang, T. Toda, and J. Yamagishi, "A Review on Subjective and Objective Evaluation of Synthetic Speech," Acoustical Science and Technology, volume 45, number 4, pages 161-183, April 2024, (Invited Review Paper) ::: |
13. |
S.-Y. Peng, I-C. Liu, Y.-H. Wu, T.-J. Lin, C.-J. Chen, X.-Z. Li, Y.-Q. Cheng, P.-H. Lin, K.-H. Hung, and Y. Tsao, "An SRAM-Based Reconfigurable Cognitive Computation Matrix for Sensor Edge Applications," IEEE Journal of Solid-State Circuits, volume 59, number 2, pages 636-648, February 2024. ::: |
14. |
E. H.-H. Huang, R. Chao, Y. Tsao, and C.-M. Wu, "ElectrodeNet – A deep-learning-based sound coding strategy for cochlear implants," IEEE Transactions on Cognitive and Developmental Systems, volume 16, number 1, pages 346-357, February 2024. ::: |
15. |
K.-C. Ting, Y.-C. Lin, C.-T. Chan, T.-Y. Tu, Y. Tsao, K.-C. Liu, and C.-C. Shih, "Inertial Measurement Unit-based Romberg Test in Assessing Adults with Vestibular Hypofunction," IEEE Journal of Translational Engineering in Health and Medicine, volume 12, pages 245-255, December 2023. ::: |
16. |
K.-C. Ting, S.-S. Wang, Y.-J. Li, C.-Y. Huang, T.-Y. Tu, C.-C. Shih, K.-C. Liu and Y. Tsao, "Detection of Otitis Media with Effusion Using In-Ear Microphones and Machine Learning," IEEE Sensors Journal, volume 23, pages 28411-28420, October 2023. ::: |
17. |
H.-C. Kuo, Y.-P. Hsieh, H.-H. Tseng, C.-T. Wang, S.-H. Fang, and Y, Tsao, "Toward Real-World Voice Disorder Classification," IEEE Transactions on Biomedical Engineering, volume 70, number 10, pages 2922-2932, October 2023. ::: |
18. |
L.-C. Chen, K.-H. Hung, Y.-J. Tseng, H.-Y. Wang, T.-M. Lu, W.-C. Huang, and Y. Tsao, "Self-supervised Learning Based General Laboratory Progress Pretrained Model for Cardiovascular Event Detection," IEEE Journal of Translational Engineering in Health and Medicine, volume 12, pages 43-55, August 2023. ::: |
19. |
Y.-J. Lu, C.-Y. Chang, C. Yu, C.-F. Liu, J.-w. Hung, S. Watanabe, and Y. Tsao, "Improving Speech Enhancement Performance by Leveraging Contextual Broad Phonetic Class Information," IEEE/ACM Transactions on Audio, Speech, and Language Processing, volume 31, pages 2738-2750, June 2023. ::: |
20. |
C.-Y. Cheng, H.-S. Lee, Y. Tsao, and H.-M. Wang, "Multi-target Filter and Detector for Unknown-number Speaker Diarization," IEEE Signal Processing Letters, volume 30, pages 638-642, May 2023. ::: |
21. |
T.-M. Chen, Y.-H. Tsai, H.-H. Tseng, K.-C. Liu, J.-Y. Chen, C.-H. Huang, G.-Y. Li, C.-Y. Shen, and Y. Tsao, "SRECG: ECG Signal Super-resolution Framework for Portable/Wearable Devices in Cardiac Arrhythmias Classification," IEEE Transactions on Consumer Electronics, volume 1, pages 1, January 2023, (This paper recieved 2025 IEEE Chester W. Sall Memorial Awards Best Paper Award) ::: |
22. |
K.-C. Liu, K.-H. Hung, C.-Y. Hsieh, H.-Y. Huang, C.-T. Chan, and Y. Tsao, "Deep Learning Based Signal Enhancement of Low-Resolution Accelerometer for Fall Detection Systems," IEEE Transactions on Cognitive and Developmental Systems, volume 14, number 3, pages 1270-1281, September 2022. ::: |
23. |
S.-Y. Niu, L.-Z. Guo, Y. Li, Z. Zhang, T.-D. Wang, K.-C. Liu, Y. Tsao, T.-M. Liu, "Boundary-Preserved Deep Denoising of the Stochastic Resonance Enhanced Multiphoton Images," IEEE Journal of Translational Engineering in Health and Medicine, volume 10, pages 1-12, September 2022. |
24. |
R. E. Zezario, S.-W. Fu, F. Chen, C.-S. Fuh, H.-M. Wang, and Y. Tsao, "Deep Learning-based Non-Intrusive Multi-Objective Speech Assessment Model with Cross-Domain Features," IEEE/ACM Transactions on Audio, Speech, and Language Processing, volume 31, pages 54-70, September 2022. ::: |
25. |
L.-C. Chen, P.-H. Chen, R. T.-H. Tsai, and Y. Tsao,, "EPG2S: Speech Generation and Speech Enhancement based on Electropalatography and Audio Signals using Multimodal Learning," IEEE Signal Processing Letters, volume 29, pages 2582-2586, June 2022. ::: |
26. |
T. Hussain, W.-C. Wang, M. Gogate, K. Dashtipour, Y. Tsao, X. Lu, A. Ahsan, and A. Hussain, "A Novel Temporal Attentive-Pooling based Convolutional Recurrent Architecture for Acoustic Signal Enhancement," IEEE Transactions on Artificial Intelligence, volume 1, number 1, pages 1-12, April 2022. ::: |
27. |
Y.-W. Chen, K.-H. Hung, Y.-J. Li, A. C.-F. Kang, Y.-S. Lai, K.-C. Liu, S.-W. Fu, S.-S. Wang, Y. Tsao, "CITISEN: A Deep Learning-Based Speech Signal-Processing Mobile Application," IEEE Access, volume 10, pages 46082-46099, February 2022. ::: |
28. |
L.-C. Chen, J.-T. Sheu, Y.-J. Chuang, and Y. Tsao, "Predicting the Travel Distance of Patients while Accessing Healthcare using Deep Neural Network," IEEE Journal of Translational Engineering in Health and Medicine, volume 10, pages 1-11, February 2022. ::: |
29. |
S.-Y. Chuang, H.-M. Wang, and Y. Tsao, "Improved Lite Audio-Visual Speech Enhancement," IEEE/ACM Transactions on Audio, Speech, and Language Processing, volume 30, pages 1345-1359, February 2022. ::: |
30. |
C.-H. Hu, Y.-H. Peng, J.Yamagishi, Y. Tsao, and H.-M. Wang, "SVSNet: An End-to-end Speaker Voice Similarity Assessment Model," IEEE Signal Processing Letters, volume 29, pages 767-771, February 2022. ::: |
31. |
S.-S. Wang, C.-C. Lai, C.-T. Wang, Y. Tsao, S.-H. Fang, "Continuous Speech for Improved Learning Pathological Voice Disorders," IEEE Open Journal of Engineering in Medicine and Biology, volume 3, pages 2644-1276, February 2022. ::: |
32. |
Y.-C. Lin, C. Yu, Y.-T. Hsu, S.-W. Fu, Y. Tsao, T.-W. Kuo, "SEOFP-NET: Compression and Acceleration of Deep Neural Networks for Speech Enhancement Using Sign-Exponent-Only Floating-Points," IEEE/ACM Transactions on Audio, Speech, and Language Processing, volume 30, pages 1016-1031, December 2021. ::: |
33. |
R.-Y. Tseng, T.-W. Wang, S.-W. Fu, C.-Y. Lee, and Y. Tsao, "A Study of Joint Effect on Denoising Techniques and Visual Cues to Improve Speech Intelligibility in Cochlear Implant Simulation," IEEE Transactions on Cognitive and Developmental Systems, volume 13, pages 984-994, December 2021. ::: |
34. |
X. Lu, P. Shen, Y. Tsao, and H. Kawai, "Coupling A Generative Model With A Discriminative Learning Framework for Speaker Verification," IEEE/ACM Transactions on Audio, Speech, and Language Processing, volume 29, pages 3631-3641, November 2021. ::: |
35. |
F. S. Abousaleh, W.-H. Cheng, N.-H. Yu, and Y. Tsao, "Multimodal Deep Learning Framework for Image Popularity Prediction on Social Media," IEEE Transactions on Cognitive and Developmental Systems, volume 13, number 3, pages 679-692, September 2021. ::: |
36. |
K.-C. Liu, M. Chan, C.-Y. Hsieh, H.-Y. Huang, C.-T. Chan, Y. Tsao, "Domain-adaptive Fall Detection Using Deep Adversarial Training," IEEE Transactions on Neural Systems & Rehabilitation Engineering, volume 29, pages 1243-1251, June 2021. ::: |
37. |
T. Hussain, S. M. Siniscalchi, H.-L. S. Wang, Y. Tsao, S. V. Mario, and W.-H. Liao, "Ensemble Hierarchical Extreme Learning Machine for Speech Dereverberation," IEEE Transactions on Cognitive and Developmental Systems, volume 12, number 4, pages 744-758, December 2020. ::: |
38. |
N. Y.-H. Wang, H.-L. S. Wang, T.-W. Wang, S.-W. Fu, X. Lu, H.-M. Wang, and Y. Tsao, "Improving the Intelligibility of Speech for Simulated Electric and Acoustic Stimulation Using Fully Convolutional Neural Networks," IEEE Transactions on Neural Systems & Rehabilitation Engineering, volume 29, pages 184-195, December 2020. ::: |
39. |
H.-S. Lee, Y. Tsao, S.-K. Jeng, and H.-M. Wang, "Subspace-based Representation and Learning for Phonotactic Spoken Language Recognition," IEEE/ACM Transactions on Audio, Speech, and Language Processing, volume 28, pages 3065-3079, November 2020. ::: |
40. |
T.-A. Hsieh, H.-M. Wang, X. Lu, and Y. Tsao, "WaveCRN: An Efficient Convolutional Recurrent Neural Network for End-to-end Speech Enhancement," IEEE Signal Processing Letters, volume 27, pages 2149-2153, November 2020. ::: |
41. |
X. Wang et al.,, "ASVspoof 2019: A Large-scale Public Database of Synthetized, Converted and Replayed Speech," Computer Speech and Language, volume 64, pages 1-27, November 2020. ::: |
42. |
K.-H. Tsai, W.-C. Wang, C.-H. Cheng, C.-Y. Tsai, J.-K. Wang, T.-H. Lin, S.-H. Fang, L.-C. Chen, and Y. Tsao, "Blind Monaural Source Separation on Heart and Lung Sounds Based on Periodic-Coded Deep Autoencoder," IEEE Journal of Biomedical and Health Informatics, volume 24, number 11, pages 3203-3214, November 2020. ::: |
43. |
C. Yu*, R. E. Zezario*, S.-S. Wang, J. Sherman, Y.-Y. Hsieh, X. Lu, H.-M. Wang, and Y. Tsao, "Speech Enhancement based on Denoising Autoencoder with Multi-branched Encoders," IEEE/ACM Transactions on Audio, Speech, and Language Processing, volume 28, pages 2756-2769, October 2020, (*equal contributions) ::: |
44. |
W.-C. Huang, H. Luo, H.-T. Hwang, C.-C. Lo, Y.-H. Peng, Y. Tsao, and H.-M. Wang, "Unsupervised Representation Disentanglement using Cross Domain Features and Adversarial Learning in Variational Autoencoder based Voice Conversion," IEEE Transactions on Emerging Topics in Computational Intelligence, volume 4, number 4, pages 468-479, August 2020. ::: |
45. |
C. Yu, K.-H. Hung, S.-S. Wang, Y. Tsao, and J.-w. Hung, "Time-Domain Multi-modal Bone/air Conducted Speech Enhancement," IEEE Signal Processing Letters, volume 27, pages 1035-1039, June 2020. ::: |
46. |
S. C. Hidayati, T. W. Goh, Ji.-S. G. Chan, C.-C. Hsu, J. See, L.-K. Wong, K.-L. Hua, Y. Tsao, and W.-H. Cheng, "Dress With Style: Learning Style from Joint Deep Embedding of Clothing Styles and Body Shapes," IEEE Transactions on Multimedia, volume 23, pages 365-377, March 2020. |
47. |
C.-L. Liu, S.-W. Fu, Y.-J. Li, J.-W. Huang, H.-M. Wang, and Y. Tsao, "Multichannel Speech Enhancement by Raw Waveform-mapping using Fully Convolutional Networks," IEEE/ACM Transactions on Audio, Speech, and Language Processing, volume 28, pages 1888-1900, February 2020. ::: |
48. |
J.-Y. Wu, C. Yu, S.-W. Fu, C.-T. Liu, S.-Y. Chien, Y. Tsao, "Increasing Compactness of Deep Learning based Speech Enhancement Models with Parameter Pruning and Quantization Techniques," IEEE Signal Processing Letters, volume 26, number 12, pages 1887-1891, December 2019. ::: |
49. |
S.-W. Fu, C.-F. Liao, Y. Tsao, "Learning with Learned Loss Function: Speech Enhancement with Quality-Net to Improve Perceptual Evaluation of Speech Quality," IEEE Signal Processing Letters, volume 27, pages 26-30, December 2019. ::: |
50. |
C.-T. Wang, F.-C. Lin, J.-Y. Chen, M.-J. Hsiao, S.-H. Fang, Y.-H. Lai, Y. Tsao, "Detection of Pathological Voice Using Cepstrum Vectors: A Deep Learning Approach," Journal of Voice, volume 33, number 5, pages pp. 634-641, September 2019. ::: |
51. |
H.-T. Chiang, Y.-Y. Hsieh, S.-W. Fu, K.-H. Hung, Y. Tsao, S.-Y. Chien, "Noise Reduction in ECG Signals Using Fully Convolutional Denoising Autoencoders," IEEE Access, volume 7, pages 60806-60813, April 2019. ::: |
52. |
C.-T. Liu, T.-W. Lin, Y.-H. Wu, Y.-S. Lin, H. Lee, Y. Tsao, and S.-Y. Chien, "Computation-Performance Optimization of Convolutional Neural Networks with Redundant Filter Removal," IEEE Transactions on Circuits and Systems I, volume 66, pages 1908-1921, December 2018. ::: |
53. |
H.-P. Liu, Y. Tsao, and C.-S. Fuh, "Bone Conducted Speech Enhancement Using Deep Denoising Autoencoder," Speech Communication, volume 104, pages 106-112, November 2018. ::: ::: |
54. |
S.-W. Fu, T.-W. Wang, Y. Tsao, X. Lu, and H. Kawai, "End-to-End Waveform Utterance Enhancement for Direct Evaluation Metrics Optimization by Fully Convolutional Neural Networks," IEEE/ACM Transactions on Audio, Speech, and Language Processing, volume 26, number 9, pages 1570-1584, September 2018. ::: |
55. |
Y.-H. Lai, Y. Tsao, X. Lu, F. Chen, Y.-T. Su, K.-C. Chen, Y.-H. Chen, L.-C. Chen, P.-H. Li, and C.-H. Lee, "Deep Learning based Noise Reduction Approach to Improve Speech Intelligibility for Cochlear Implant Recipients," Ear and Hearing, volume 39(4), number 4, pages 795-809, July 2018, This work receives the National Innovation Award 2018 ::: ::: |
56. |
T.-H. Lin, T. Akamatsu, and Y, Tsao, "Comparison of passive acoustic soniferous fish monitoring with supervised and unsupervised approaches," Journal of the Acoustical Society of America (JASA), volume 143, number 4, pages published onlione, April 2018. ::: |
57. |
Y. Tsao, H.-C. Chu, S.-H. Fang, J. Lee, and C.-M. Lin, "Adaptive Noise Cancellation using Deep Cerebellar Model Articulation Controller," IEEE Access, volume 6, pages 37395-37402, April 2018. ::: ::: |
58. |
J.-C. Hou, S.-S. Wang, Y.-H. Lai, Y. Tsao, H.-W. Chang, and H.-M. Wang, "Audio-visual Speech Enhancement using Multimodal Deep Convolutional Neural Networks," IEEE Transactions on Emerging Topics in Computational Intelligence, volume 2, number 2, pages 117-128, April 2018. ::: |
59. |
S.-S. Wang, P. Lin, Y. Tsao, J.-W. Hung, and B. Su, "Suppression by Selecting Wavelets for Feature Compression in Distributed Speech Recognition," IEEE/ACM Transactions on Audio, Speech, and Language Processing, volume 26, number 3, pages 564-579, March 2018. ::: |
60. |
S.-W. Fu, P.-C. Li, Y.-H. Lai, C.-C. Yang, L.-C. Hsieh, and Y. Tsao, "Joint Dictionary Learning-based Non-Negative Matrix Factorization for Voice Conversion to Improve Speech Intelligibility After Oral Surgery," IEEE Transactions on Biomedical Engineering, volume 64, number 11, pages 2584 - 2594, November 2017. ::: |
61. |
T. Hussain, S. M. Siniscalchi, C.-C. Lee, S.-S. Wang, Y. Tsao and W.-H. Liao, "Experimental Study on Extreme Learning Machine Applications for Speech Enhancement," IEEE Access, volume 99, number 99, pages 1-1, October 2017. ::: |
62. |
S.-H. Fang, Y.-X. Fei, Z. Xu, and Y. Tsao, "Learning Transportation Modes from Smartphone Sensors Based on Deep Neural Network," IEEE Sensors Journal, volume 17, pages 6111 - 6118, September 2017. ::: |
63. |
S.-W. Hsiao, H.-C. Sun, M.-C. Hsieh, M.-H. Tsai, Y. Tsao, and C.-C. Lee, "Toward Automating Oral Presentation Scoring during Principal Certification Program using Audio-Video Low-level Behavior Profiles," IEEE Transactions on Affective Computing, volume PP, number PP, pages PP, September 2017. ::: |
64. |
F. Chen, D. Zheng, Y. Tsao, "Effects of Noise Suppression and Envelope Dynamic Range Compression on the Intelligibility of Vocoded Sentences for a Tonal Language," Journal of the Acoustical Society of America (JASA), volume 142, number 3, pages 1157-1166, September 2017. ::: |
65. |
Y.-H. Lai, F. Chen, S.-S. Wang, X. Lu, Y. Tsao, and C.-H. Lee, "A Deep Denoising Autoencoder Approach to Improving the Intelligibility of Vocoded Speech in Cochlear Implant Simulation," IEEE Transactions on Biomedical Engineering, volume 64, number 7, pages 1568 - 1578, July 2017. ::: |
66. |
A. Chern, Y.-H. Lai, Y.-p. Chang, Y. Tsao, R. Y. Chang, and H.-W. Chang, "A Smartphone-Based Multi-Functional Hearing Assistive System to Facilitate Speech Recognition in the Classroom," IEEE Access, volume 5, pages 10339 - 10351, June 2017, This paper has been selected as a Featured Article (http://ieeeaccess.ieee.org/special-sections/featured-articles/smartphone-based-multi-functional-hearing-assistive-system-facilitate-speech-recognition-classroom/) ::: |
67. |
T.-E. Chen, S.-I Yang, L.-T. Ho, K.-H. Tsai, Y.-H. Chen, Y.-F. Chang, Y.-H. Lai, S.-S. Wang, Y. Tsao*, and C.-C. Wu, "S1 and S2 Heart Sound Recognition using Deep Neural Networks," IEEE Transactions on Biomedical Engineering, volume 64, number 2, pages 372 - 380, February 2017. ::: |
68. |
H.-y. Lee, B.-H. Tseng, T.-H. Wen, and Y. Tsao, "Personalizing Recurrent Neural Network based Language Model by Social Network," IEEE/ACM Transactions on Audio, Speech, and Language Processing, volume 25, number 3, pages 519 - 530, December 2016. ::: |
69. |
T. Guan, G.-x. Chu, Y. Tsao, F. Chen, "Assessing the Perceptual Contributions of Level-dependent Segments to Sentence Intelligibility," Journal of the Acoustical Society of America (JASA), volume 140, number 5, pages 3745-3754, November 2016. ::: |
70. |
S.-H. Fang, W.-H. Chang, Y. Tsao, H.-C. Shih, and C. Wang, "Channel State Reconstruction Using Multilevel Discrete Wavelet Transform for Improved Fingerprinting-Based Indoor Localization," IEEE Sensors Journal, volume 16, number 21, pages 7784 - 7791, November 2016. ::: |
71. |
S.-S. Wang, A. Chern, Y. Tsao, J.-w. Hung, X. Lu, Y.-H. Lai, B. Su, "Wavelet Speech Enhancement based on Nonnegative Matrix Factorization," IEEE Signal Processing Letters, volume 23, number 8, pages 1101-1105, August 2016. ::: |
72. |
Y. Tsao and Y.-H. Lai, "Generalized Maximum a Posteriori Spectral Amplitude Estimation for Speech Enhancement," Speech Communication, volume 76, pages 112–126, February 2016. ::: ::: |
73. |
S.-H. Fang, C.-H. Wang, and Y. Tsao, "Compensating for Orientation Mismatch in Robust WiFi Localization Using Histogram Equalization," IEEE Transactions on Vehicular Technology, volume 64, number 11, pages 5210-5220, November 2015. ::: |
74. |
Y.-C. Lin, Y.-H. Lai, H.-W. Chang, Y. Tsao, Y.-p. Chang, and R. Y. Chang,, "A Smartphone-Based Remote Microphone Hearing Assistive System Using Wireless Technologies," IEEE Systems Journal, volume PP, pages 1-10, October 2015, Smarthear Demo: https://www.youtube.com/watch?v=e9HqIj09QJs ::: |
75. |
Y. Tsao, S.-H. Fang, and Y. Hsiao, "Acoustic Echo Cancellation Using a Vector-Space-Based Adaptive Filtering Algorithm," IEEE Signal Processing Letters, volume 22, pages 351-355, March 2015. ::: ::: |
76. |
Y. Tsao, S. Matsuda, C. Hori, H. Kashioka, and C.-H. Lee, "A MAP-based Online Estimation Approach to Ensemble Speaker and Speaking Environment," IEEE/ACM Transactions on Audio, Speech, and Language Processing, volume 22, number 2, pages 403-416, February 2014. ::: |
77. |
Y. Tsao and C.-H. Lee, "An Ensemble Speaker and Speaking Environment Modeling Approach to Robust Speech Recognition," IEEE/ACM Transactions on Audio, Speech, and Language Processing, volume 17, pages 1025 - 1037, June 2009. ::: |
78. |
Y. Tsao, S.-M. Lee, and L.-S. Lee, "Segmental Eigenvoice with Delicate Eigenspace for Improved Speaker Adaptation," IEEE Transactions on Speech and Audio Processing, volume 13, pages 399 - 411, April 2005. ::: |
|
|
Conference Papers | |
1. |
Y.-J. Li, R. Chao, B. Su, and Y. Tsao, "Speech Enhancement with MAP-based Training for Robust ASR," to appear in IEEE ICASSP 2025,. ::: |
2. |
J. Lin, I Chiu, K.-C. Wang, K.-C. Liu, H.-M. Wang, P.-C. Yeh, and Y. Tsao, "MSECG: Incorporating Mamba for Robust and Efficient ECG Super-Resolution," to appear in IEEE ICASSP 2025,. ::: |
3. |
D.-Y. Lu, J.-J. Ding, and Y. Tsao, "Neural Variational Mode Decomposition and Its Application for ECG Denoising," to appear in IEEE ICASSP 2025,. ::: |
4. |
R. E. Zezario, S. M. Siniscalchi, H.-M. Wang, and Y. Tsao, "A Study on Zero-shot Non-intrusive Speech Assessment using Large Language Models," to appear in IEEE ICASSP 2025,. ::: |
5. |
Y.-T. Liu, K.-C. Wang, R. Chao, S. M. Siniscalchi, P.-C. Yeh, and Y. Tsao, "MSEMG: Surface Electromyography Denoising with a Mamba-based Efficient Network," to appear in IEEE ICASSP 2025,. ::: |
6. |
W. Ren, H. Wu, Y.-C. Lin, X. Chen, R. Chao, K.-H. Hung, Y.-J. Li, W.-Y. Ting, H.-M. Wang, and Y. Tsao, "Leveraging Joint Spectral and Spatial Learning with MAMBA for Multichannel Speech Enhancement," to appear in IEEE ICASSP 2025,. ::: |
7. |
P.-Y. Huang, S.-W. Fu, and Y. Tsao, "RankUp: Boosting Semi-Supervised Regression with an Auxiliary Ranking Classifier," to appear in NeurIPS 2024,. ::: |
8. |
R. Chao, W.-H. Cheng, M. L. Quatra, S. M. Siniscalchi, C.-H. H. Yang, S.-W. Fu, and Y. Tsao, "An Investigation of Incorporating Mamba for Speech Enhancement," IEEE SLT 2024, December 2024. ::: |
9. |
C.-H. H. Yang et al., "Large Language Model Based Generative Error Correction: A Challenge and Baselines for Speech Recognition, Speaker Tagging, and Emotion Recognition," IEEE SLT 2024, December 2024. ::: |
10. |
M. L. Quatra, V. M. Salerno, Y. Tsao, S. M. Siniscalchi, "FLANEC: Exploring Flan-T5 for Post-ASR Error Correction," IEEE SLT 2024, December 2024. ::: |
11. |
W.-C. Huang, S.-W. Fu, E. Cooper, R. E. Zezario, T. Toda, H.-M. Wang, J. Yamagishi, and Y. Tsao, "The VoiceMOS Challenge 2024: Beyond Speech Quality Prediction," IEEE SLT2024, December 2024. ::: |
12. |
X. Lu, P. Shen, Y. Tsao, and H. Kawai, "Temporal Order Preserved Optimal Transport-based Cross-modal Knowledge Transfer Learning for ASR," IEEE SLT 2024, December 2024. ::: |
13. |
S.-F. Huang, H.-C. Kuo, Z. Chen, X. Yang, C.-H. H. Yang, Y. Tsao, Y.-C. F. Wang, H.-y. Lee, and S.-W. Fu, "Detecting the Undetectable: Assessing the Efficacy of Current Spoof Detection Methods Against Seamless Speech Edits," IEEE SLT 2024, December 2024. ::: |
14. |
J. Du, I-M. Lin, I-H. Chiu, X. Chen, H. Wu, W. Ren, Y. Tsao, H.-y. Lee, and J.-S. R. Jang, "DFADD: The Diffusion and Flow-matching based Audio Deepfake Dataset," IEEE SLT 2024, December 2024. ::: |
15. |
K.-H. Hung, K.-C. Wang, K.-C. Liu, W.-L. Chen, X. Lu, Y. Tsao, and C.-W. Lin, "MECG-E: Mamba-based ECG Enhancer for Baseline Wander Removal," IEEE BigData 2024, November 2024. ::: |
16. |
S.-C. Chiu, C.-H. Wu, J.-K. Hsieh, Y. Tsao, and H.-M. Wang, "Learnable Layer Selection and Model Fusion for Speech Self-Supervised Learning Models," Interspeech 2024, September 2024. ::: |
17. |
R. E. Zezario, F. Chen, C.-S.Fuh, H.-M. Wang, and Y. Tsao, "Non-Intrusive Speech Intelligibility Prediction for Hearing Aids using Whisper and Metadata," Interspeech 2024, September 2024. ::: |
18. |
C. Yin, T.-S. Chi, Y. Tsao, and H.-M. Wang, "SVSNet+: Enhancing Speaker Voice Similarity Assessment Models with Representations from Speech Foundation Models," Interspeech 2024, September 2024. ::: |
19. |
K.-C. Wang, Y.-J. Li, W.-L. Chen, Y.-W. Chen, Y.-C. Wang, P.-C. Yeh, C. Zhang, and Y. Tsao, "Bridging the Gap: Integrating Pre-trained Speech Enhancement and Recognition Models for Robust Speech Recognition," IEEE EUSIPCO 2024, August 2024. ::: |
20. |
R. E. Zezario, Y.-W. Chen, S.-W. Fu, Y. Tsao, H.-M. Wang, C.-S. Fuh, "A Study on Incorporating Whisper for Robust Speech Assessment," IEEE ICME 2024, July 2024, (Top Performance on the Track 3 - VoiceMOS Challenge 2023) ::: |
21. |
S.-W. Fu, K.-H. Hung, Y. Tsao, and Y.-C. F. Wang, "Self-Supervised Speech Quality Estimation and Enhancement Using Only Clean Speech," ICLR 2024, May 2024. ::: |
22. |
R. E. Zezario, B.-R. B. Bai, C.-S. Fuh, H.-M. Wang, and Y. Tsao, "Multi-task Pseudo-label Learning For Non-intrusive Speech Quality Assessment Model," IEEE ICASSP 2024, April 2024. ::: |
23. |
Y.-T. Liu, K.-C. Wang, K.-C. Liu, S.-Y. Peng, and Y. Tsao, "SDEMG: Score-based Diffusion Model For Surface Electromyographic Signal Denoising," IEEE ICASSP 2024, April 2024. ::: |
24. |
Y. Tseng, L. Berry, and Y.-T. Chen et al.,, "A Multi-task Evaluation Benchmark For Audio-visual Representation Models," IEEE ICASSP 2024, April 2024. ::: |
25. |
H. Wu, H.-C. Kuo, Y. Tsao, H.-y. Lee, "Scalable Ensemble-based Detection Method Against Adversarial Attacks For Speaker Verification," IEEE ICASSP 2024, April 2024. ::: |
26. |
X. Lu, P. Shen, Y. Tsao, and H. Kawai, "Hierarchical Cross-modality Knowledge Transfer With Sinkhorn Attention For Ctc-based ASR," IEEE ICASSP 2024, April 2024. ::: |
27. |
X. Lu, P. Shen, Y. Tsao, and H. Kawa, "Cross-modal alignment with optimal transport for CTC-based ASR," IEEE ASRU 2023, December 2023. ::: |
28. |
H.-T. Chiang, K.-H. Hung, S.-W. Fu, H.-C. Kuo, M.-H. Tsai, and Y. Tsao, "Study on the Correlation between Objective Evaluations and Subjective Speech Quality and Intelligibility," IEEE ASRU 2023, December 2023. ::: |
29. |
C.-C. Lee, H.-W. Chen, C.-S. Chen, H.-M. Wang, T.-T. Liu, and Y. Tsao, "LC4SV: A Denoising Framework Learning to Compensate for Unseen Speaker Verification Models," IEEE ASRU 2023, December 2023. ::: |
30. |
E. Cooper, W.-C. Huang, Y.Tsao, H.-M. Wang, T. Toda, and J. Yamagishi, "The VoiceMOS Challenge 2023: Zero-shot Subjective Speech Quality Prediction for Multiple Domains," IEEE ASRU 2023, December 2023. ::: |
31. |
W.-Y. Ting, S.-S. Wang, Y. Tsao, and B. Su, "IANS: Intelligibility-aware Null-steering Beamforming for Dual-Microphone Arrays," IEEE MLSP 2023, September 2023. ::: |
32. |
I-C. Chern, S. Chern, H.-C. Kuo, H.-H. Tseng, K.-H. Hung, and Y. Tsao, "Voice Direction-of-Arrival Conversion," IEEE MLSP 2023, September 2023. ::: |
33. |
T.-A. Hsieh, C.-H. Huck Y., P.-Y. Chen, S. M. Siniscalchi, Y. Tsao, "Inference and Denoise: Causal Inference-based Neural Speech Enhancement," IEEE MLSP 2023, September 2023. ::: |
34. |
Y.-L. Chien, H.-H. Chen, M.-C. Yen, S.-W. Tsai, H.-M. Wang, Y. Tsao, T.-S. Chi, "Audio-Visual Mandarin Electrolaryngeal Speech Voice Conversion," Interspeech 2023, August 2023. ::: |
35. |
H. Yen, P.-J. Ku, C.-H. H. Yang, H. Hu, S. M. Siniscalchi, P.-Y. Chen, and Y. Tsao, "Neural Model Reprogramming with Similarity Based Mapping for Low-Resource Spoken Command Recognition," Interspeech 2023, August 2023. ::: |
36. |
H.-H. Chen, Y.-L. Chien, M.-C. Yen, S.-W. Tsai, T.-S. Chi, Y. Tsao, and H.-M. Wang, "Mandarin Electrolaryngeal Speech Voice Conversion using Cross-domain Features," Interspeech 2023, August 2023. ::: |
37. |
L.-W. Chen, Y.-F. Cheng, H.-S. Lee, Y. Tsao, and H.-M. Wang, "A Training and Inference Strategy Using Noisy and Enhanced Speech as Target for Speech Enhancement without Clean Speech," Interspeech 2023, August 2023. ::: |
38. |
E.-P. Chu, K.-C. Liu, C.-Y. Hsieh, C.-Y. Chang, Y. Tsao, and C.-T. Chan, "Multi-Task Learning U-Net for Functional Shoulder Sub-Task Segmentation," IEEE EMBC 2023, July 2023. ::: |
39. |
H.-Y. Lin, H.-H. Tseng, and Y. Tsao, "On the Robustness of Non-intrusive Speech Quality Model by Adversarial Examples," IEEE ICASSP 2023, June 2023. ::: |
40. |
C.-P. Liu, J.-H. Li, E.-P. Chu, C.-Y. Hsieh, K.-C. Liu, C.-T. Chan, and Y. Tsao:, "Deep Learning-based Fall Detection Algorithm Using Ensemble Model of Coarse-fine CNN and GRU Networks," IEEE MeMeA 2023, June 2023. ::: |
41. |
J. Kirton-Wingate, S. Ahmed, M. Gogate, Y. Tsao, A. Hussain, "Towards Individualised Speech Enhancement: An SNR Preference learning System For Multi-modal Hearing Aids," IEEE ICASSP 2023 (AMHAT 2023 Workshop), June 2023. ::: |
42. |
C.-J. Hsu, H.-L. Chung, H.-y. Lee, amd Y. Tsao, "T5lephone: Bridging Speech and Text Self-supervised Models for Spoken Language Understanding via Phoneme level T5," IEEE ICASSP 2023, June 2023. ::: |
43. |
I-C. Chern, K.-H. Hung, Y.-T. Chen, T. Hussain, M. Gogate, A. Hussain, Y. Tsao, and J.-C. Hou, "Audio-visual Speech Enhancement And Separation By Utilizing Multi-modal Self-supervised Embeddings," IEEE ICASSP 2023 (AMHAT 2023 Workshop), June 2023. ::: |
44. |
K.-C. Wang, K.-C. Liu, S.-Y. Peng, Y. Tsao, "ECG Artifact Removal from Single-Channel Surface EMG Using Fully Convolutional Networks," IEEE ICASSP 2023, June 2023. ::: |
45. |
T.-H. Chi, K.-C. Liu, C.-Y. Hsieh, Y. Tsao, and C.-T. Chan, "Pre-Impact Fall Detection via CNN-ViT Knowledge Distillation," IEEE ICASSP 2023, June 2023. ::: |
46. |
C.-C. Lee, Y. Tsao, H.-M. Wang, C.-S. Chen, "D4AM: A General Denoising Framework for Downstream Acoustic Models," ICLR 2023, May 2023. ::: |
47. |
H.-H. Tseng, H.-Y. Lin, H.-K. Hsuan, and Y. Tsao, "Interpretations of Domain Adaptations via Layer Variational Analysis," ICLR 2023, May 2023. ::: |
48. |
C.-H. Chen, K.-C. Liu, T.-Y. Lu, C.-Y. Chang, C.-T. Chan, and Y. Tsao, "Wearable-based Pain Assessment in Patients with Adhesive Capsulitis Using Machine Learning," IEEE NER 2023, April 2023. |
49. |
R. E. Zezario, F. Chen, C.-S. Fuh, H.-M. Wang and Y. Tsao, "MBI-Net: A Non-Intrusive Multi-Branched Speech Intelligibility Prediction Model for Hearing Aids," Interspeech 2022, pages 3944-3948, September 2022, 1st Place, Machine Learning Challenges for Hearing Aids Challenge; 1st Place, The Hearing Industry Research Consortium Student Prize ::: |
50. |
R. E. Zezario, S.-W. Fu, F. Chen, C.-S. Fuh, H.-M. Wang and Y. Tsao, "MTI-Net: A Multi-Target Speech Intelligibility Prediction Model," Interspeech 2022, pages 5463-5467, September 2022. ::: |
51. |
Y.-W. Chen and Y. Tsao, "InQSS: a speech intelligibility and quality assessment model using a multi-task learning network," Interspeech 2022, September 2022. ::: |
52. |
K.-H. Hung, S.-W. Fu, H.-H. Tseng, H.-T. Chiang, Y. Tsao, C.-W. Lin, "Boosting Self-Supervised Embeddings for Speech Enhancement," Interspeech 2022, September 2022. ::: |
53. |
F.-L. Wang, H.-S. Lee, Y. Tsao and H.-M. Wang, "Disentangling the Impacts of Language and Channel Variability on Speech Separation Networks Authors," Interspeech 2022, September 2022. ::: |
54. |
C.-C. Lee, C.-H. Hu, Y.-C. Lin, C.-S. Chen, H.-M. Wang and Y. Tsao, "NASTAR: Noise Adaptive Speech Enhancement with Target-Conditional Resampling," Interspeech 2022, September 2022. ::: |
55. |
C. Yu, S.-W. Fu, T.-An Hsieh, Y. Tsao and M. Ravanelli, "OSSEM: one-shot speaker adaptive speech enhancement using meta learning," Interspeech 2022, September 2022. ::: |
56. |
C.-J. Peng, Y.-J. Chan, Y.-L.Shen, C. Yu, Y. Tsao and T.-S. Chi, "Perceptual Characteristics Based Multi-objective Model for Speech Enhancement," Interspeech 2022, September 2022. ::: |
57. |
R. Chao, C. Yu, S.-W. Fu, X. Lu and Y. Tsao, "Perceptual Contrast Stretching on Target Feature for Speech Enhancement," Interspeech 2022, September 2022. ::: |
58. |
Y.-J. Lu et al.,, "ESPnet-SE++: Speech Enhancement for Robust Speech Recognition, Translation, and Understanding," Interspeech 2022, September 2022. |
59. |
W.-C. Huang, E. C., Y. Tsao, H.-M. Wang, T. Toda and J. Yamagishi, "The VoiceMOS Challenge 2022," Interspeech 2022, September 2022. ::: |
60. |
T. Hussain, M. Diyan, M. Gogate, K. Dashtipour, A. Adeel, Y. Tsao, A. Hussain, "A Novel Speech Intelligibility Enhancement Model based on Canonical Correlation and Deep Learning," IEEE EMBC 2022, July 2022. ::: |
61. |
S.–S. Wang, Y. Tsao, W.–Z. Zheng, H.–W. Yeh, P.–C. Li, S.–H. Fang, Y.–H. Lai, "Dysarthric Speech Enhancement Based on Convolution Neural Network," IEEE EMBC 2022, July 2022. ::: |
62. |
Y.-C. Lin,T.-A. Hsieh, K.-H. Hung, C. Yu, H. Garudadri, Y. Tsao, and T.-W. Kuo, "Speech Recovery For Real-world Self-powered Intermittent Devices," ICASSP 2022, May 2022. ::: |
63. |
C.-J. Hsu, H.-y. Lee, Y. Tsao, "XDBERT: Distilling Visual Information to BERT via Cross-Modal Encoders to Improve Language Understanding," ACL 2022, May 2022, (Short Paper) |
64. |
S.-W. Fu, C. Yu, K.-H. Hung, M. Ravanelli, and Y. Tsao, "MetricGAN-U: Unsupervised Speech Enhancement/ Dereverberation based Only On Noisy/ Reverberated Speech," ICASSP 2022, May 2022. ::: |
65. |
Y.-J. Lu, Z.-Q. Wang, S. Watanabe, A. Richard, C. Yu, and Y. Tsao, "Conditional Diffusion Probabilistic Model For Speech Enhancement," ICASSP 2022, May 2022. ::: |
66. |
G.-T. Lin, C.-J. Hsu, D.-R. Liu, H.-Y. Lee, and Y. Tsao, "Analyzing The Robustness Of Unsupervised Speech Recognition," ICASSP 2022, May 2022. ::: |
67. |
C.-H. H. Yang, J. Qi, S. Y.-C. Chen, Y. Tsao, P.-Y. Chen, "When Bert Meets Quantum Temporal Convolution Learning for Text Classification In Heterogeneous Computing," ICASSP 2022, May 2022. ::: |
68. |
H. Wu, H.-C. Kuo, N. Zheng, K.-H. Hung, H.-Y. Lee, Y. Tsao, H.-M. Wang, H. Meng, "Partially Fake Audio Detection by Self-attention-based Fake Span Discovery," ICASSP 2022, May 2022. ::: |
69. |
K.-C. Wang, K.-C. Liu, H.-M. Wang, and Y. Tsao, "EMGSE: Acoustic/emg Fusion For Multimodal Speech Enhancement," ICASSP 2022, May 2022. ::: |
70. |
H.-Y. Lin, H.-H. Tseng, X. Lu, Yu Tsao, "Unsupervised Noise Adaptive Speech Enhancement by Discriminator-Constrained Optimal Transport," NeurIPS 2021, December 2021. ::: |
71. |
Y.-J. Li, S.-S. Wang, Y. Tsao, and B. Su, "MIMO Speech Compression and Enhancement Based on Convolutional Denoising Autoencoder," APSIPA ASC 2021, December 2021. ::: |
72. |
Z. Feng, Yu Tsao, and F. Chen, "Estimation and Correction of Relative Transfer Function for Binaural Speech Separation Networks to Preserve Spatial Cues," APSIPA ASC 2021, December 2021. ::: |
73. |
H.-T. Chiang, Y.-C. Wu, C. Yu, T. Toda, H.-M. Wang, Y.-C. Hu, and Y. Tsao, "HASA-NET: A Non-Intrusive Hearing-Aid Speech Assessment Network," ASRU 2021, December 2021. ::: |
74. |
X. Lu, P. Shen, Y. Tsao, and H. Kawai, "Siamese Neural Network with Joint Bayesian Model Structure for Speaker Verification," APSIPA ASC 2021, December 2021. ::: |
75. |
Y.-J. Lu, Y. Tsao, and S. Watanabe, "A Study on Speech Enhancement Based on Diffusion Probabilistic Model," APSIPA ASC 2021, December 2021. ::: |
76. |
Y.-S. Liou, W.-C. Huang, M.-C. Yen, S.-W. Tsai, Y.-H. Peng, T. Toda, Y. Tsao, and H.-M. Wang,, "Time Alignment Using Lip Images for Frame-Based Electrolaryngeal Voice Conversion," APSIPA ASC 2021, December 2021. ::: |
77. |
X. Chang, T. Maekaku, P. Guo, J. Shi,Y.-J. Lu, A. S. Subramanian, T. Wang, S.-w. Yang, Y. Tsao, H.-y. Lee, S. Watanabe, "An Exploration of Self-Supervised Pretrained Representations for End-to-End Speech Recognition," ASRU 2021, December 2021. ::: |
78. |
M.-C. Yen, W.-C. Huang, K. Kobayashi, Y.-H. Peng, S.-W. Tsai, Y. Tsao, T. Toda, J.-S. Jang, and H.-M. Wang, "Mandarin Electrolaryngeal Speech Voice Conversion with Sequence-to-Sequence Model-ing," ASRU 2021, December 2021. ::: |
79. |
M. E Noor, Y.-J. Lu, S.-Si. Wang, S. Ghose, C.-Y. Chang, R. E. Zezario, S. Ahmed, W.-H. Chung, Y. Tsao and H.-M. Wang, "Investigation of A Single-Channel Frequency-Domain Speech Enhancement Network to Improve End-To-End Bengali Automatic Speech Recogni-tion Under Unseen Noisy Conditions," Oriental COCOSDA 2021, November 2021. ::: |
80. |
W.-C. Huang, K. Kobayashi, Y.-H. Peng, C.-F. Liu, Y. Tsao, H.-M. Wang, T. Toda, "A Preliminary Study of a Two-Stage Paradigm for Preserving SpeakerIdentity in Dysarthric Voice Conversion," Interspeech 2021, September 2021. ::: |
81. |
S.-W. Fu, C. Yu, T.-A. Hsieh, P. Plantinga, M. Ravanelli, X. Lu, Y. Tsao, "MetricGAN +: An Improved Version of MetricGAN for Speech Enhancement," Interspeech 2021, September 2021. ::: |
82. |
Y,-C. Wu, C.-H. Hu, H.-S. Lee, Y.-H. Peng, W.-C. Huang, Y. Tsao, H.-M. Wang and T. Toda, "Relational Data Selection for Data Augmentation of Speaker-dependent Multi-band MelGAN Vocoder," Interspeech 2021, September 2021. ::: |
83. |
G.-X. Lin, S.-W. Hu, Y.-J. Lu, Y. Tsao, and C.-S. Lu, "QISTA-Net-Audio: Audio Super-resolution via Non-Convex Lq-normMinimization," Interspeech 2021, September 2021. |
84. |
T.-A. Hsieh, C. Yu, S.-W. Fu, X. Lu, and Y. Tsao, "Improving Perceptual Quality by Phone-Fortified Perceptual Loss using Wasserstein Distance for Speech Enhancement," Interspeech 2021, September 2021. ::: |
85. |
Y.-W. Chen, K.-H. Hung, S.-Y. Chuang, J. Sherman, X. Lu, Y. Tsao, "A Study of Incorporating Articulatory Movement Information in Speech Enhancement," EUSIPCO 2021, August 2021. ::: |
86. |
R. E Zezario, C.-S. Fuh, H.-M. Wang, Y. Tsao, "Speech Enhancement with Zero-Shot Model Selection," EUSIPCO 2021, August 2021. ::: |
87. |
T.-Y. Lu, K.-C. Liu, C.-Y. Hsieh, C.-Y. Chang, Y. Tsao, C.-T. Chan, "Instrumented Shoulder Functional Assessment using Inertial Measurement Units for Frozen Shoulder," IEEE BHI 2021, pages 1-4, July 2021. ::: |
88. |
X. Lu, P. Shen, Y. Tsao, H. Kawai, "Unsupervised neural adaptation model based on optimal transport for spoken language identification," ICASSP 2021, June 2021. ::: |
89. |
Y.-K. Wu, K.-P. Huang, Y. Tsao, H.-y. Lee, "One shot learning for speech separation," ICASSP 2021, June 2021. ::: |
90. |
Y.-W. Chen, K.-H. Hung, S.-Y. Chuang, J. Sherman, W.-C. Huang, X. Lu, Y. Tsao, "EMA2S: An End-to-End Multimodal Articulatory-to-Speech System," ISCAS 2021, May 2021. ::: |
91. |
C.-J. Peng, Y.-J. Chan, C. Yu, S.-S. Wang, Y. Tsao, T.-S. Chi, "Attention-based multi-task learning for speech-enhancement and speaker-identification in multi-speaker dialogue scenario," ISCAS 2021, May 2021. ::: |
92. |
Y.-T. Chang, Y.-H. Yang, Y.-H. Peng, S.-S. Wang, T.-S. Chi, Y. Tsao and H.-M. Wang, "MoEVC: A Mixture of Experts Voice Conversion System With Sparse Gating Mechanism for Online Computation Acceleration," ISCSLP 2021, January 2021. ::: |
93. |
S.-W. Fu et al., "Boosting Objective Scores of Speech Enhancement Model through MetricGAN Post-Processing," APSIPA 2020, December 2020. ::: |
94. |
R. E. Zezario, S.-W. Fu, C.-S. Fuh, Y. Tsao, and H.-M. Wang, "STOI-Net: A Deep Learning based Non-Intrusive Speech Intelligibility Assessment Model," APSIPA 2020, December 2020. ::: |
95. |
C.-Y. Chen, W.-Z. Zheng, S.-S. Wang, Y. Tsao, P.-C. Li and Y.-H. Lai, "Enhancing Intelligibility of Dysarthric Speech Using Gated Convolutional-based Voice Conversion System," Interspeech 2020, October 2020. ::: |
96. |
H. Li, S.-W. Fu, Y. Tsao, J. Yamagishi, "iMetricGAN: Intelligibility Enhancement for Speech-in-Noise using Generative Adversarial Network-based Metric Learning," Interspeech 2020, October 2020. ::: |
97. |
C.-C. Lee, Y.-C. Lin, H.-T. Lin, H.-M. Wang, Y. Tsao, "SERIL: Noise Adaptive Speech Enhancement using Regularization-based Incremental Learning," Interspeech 2020, October 2020. ::: |
98. |
Y.-J. Lu, C.-F. Liao, X. Lu, J.-w. Hung, Y. Tsao, "Incorporating Broad Phonetic Information for Speech Enhancement," Interspeech 2020, October 2020. ::: |
99. |
S.-Y. Chuang, Y. Tsao, C.-C. Lo, H.-M. Wang, "Lite Audio-Visual Speech Enhancement," Interspeech 2020, October 2020. ::: |
100. |
R. E. Zezario, T. Hussain, X. Lu, H.-M. Wang, and Y. Tsao, "Self-supervised Denoising Autoencoder with Linear Regression Decoder for Speech Enhancement," ICASSP 2020, May 2020. ::: |
101. |
W.-C. Lin, Y. Tsao, F. Chen, and H.-M. Wang, "Investigation of Neural Network Approaches for Unified Spectral and Prosodic Feature Enhancement," APSIPA 2019, pages 1179-1184, November 2019. ::: |
102. |
F. Ye, Y. Tsao, and F. Chen, "Subjective Feedback-based Neural Network Pruning for Speech Enhancement," APSIPA 2019, November 2019. ::: |
103. |
T. Hussaink, Y. Tsao, H.-M. Wang, J.-C. Wang, S. M. Siniscalchi, and W.-H. Liao, "Compressed Multimodal Hierarchical Extreme Learning Machine for Speech Enhancement," APSIPA 2019, November 2019. ::: |
104. |
K.-Y. Liu, S.-S. Wang, Y. Tsao, J.-w. Hung, "Speech Enhancement Based on the Integration of Fully Convolutional Network, Temporal Lowpass Filtering and Spectrogram Masking," ROCLING 2019, October 2019. ::: |
105. |
P.-T. Huang, H.-S. Lee, S.-S. Wang, K.-Y. Chen, Y. Tsao and H.-M. Wang, "Exploring the Encoder Layers of Discriminative Autoencoders for LVCSR," Interspeech 2019, September 2019, (with ISCA Travel Grant) ::: |
106. |
W.-C. Huang et al.,, "Generalization of Spectrum Differential based Direct Waveform Modification for Voice Conversion," ISCA SSW 10, September 2019. ::: |
107. |
R. E. Zezario, S.-W. Fu, X. Lu, H.-M. Wang, and Y. Tsao, "Specialized Speech Enhancement Model Selection Based on Learned Non-Intrusive Quality Assessment Metric," Interspeech 2019, September 2019. ::: |
108. |
C.-F. Liao, Y. Tsao, X. Lu and H. Kawai, "Incorporating Symbolic Sequential Modeling for Speech Enhancement," Interspeech 2019, September 2019, (with ISCA Travel Grant) ::: |
109. |
X. Lu, P. Shen, S. Li, Y. Tsao, and H. Kawai, "Class-wise Centroid Distance Metric Learning for Acoustic Event Detection," Interspeech 2019, September 2019. ::: |
110. |
Y.-C. Lin, Y.-T. Hsu, S.-W. Fu, Y. Tsao, and T.-W. Kuo, "IA-NET: Acceleration and Compression of Speech Enhancement using Integer-adder Deep Neural Network," Interspeech 2019, September 2019. ::: |
111. |
C.-C. Lo, S.-w. Fu, W. C. Huang, X. Wang, J. Yamagishi, Y. Tsao and H.-M. Wang, "MOSNet: Deep Learning based Objective Assessment for Voice Conversion," Interspeech 2019, September 2019. ::: |
112. |
C.-F. Liao, Y. Tsao, H.-y. Lee and H.-M. Wang, "Noise Adaptive Speech Enhancement using Domain Adversarial Training," Interspeech 2019, September 2019, (with ISCA Travel Grant) ::: |
113. |
W.-C. Huang, Y.-C. Wu, H.-T. Hwang, P. L. Tobing, T. Hayashiy, K. Kobayashi, T. Toda, Y. Tsao, H.-M. Wang, "Refined WaveNet Vocoder for Variational Autoencoder Based Voice Conversion," EUSIPCO 2019, September 2019. ::: |
114. |
F.-K. Chuang, S.-S. Wang, J.-w. Hung, Y. Tsao, and S.-H. Fang, "Speaker-aware Deep Denoising Autoencoder with Embedded Speaker Identity for Speech Enhancement," Interspeech 2019, September 2019. ::: |
115. |
T. Hussain, Y. Tsao, H.-M. Wang, J.-C. Wang, S. M. Siniscalchi, W.-H. Liao, "Audio-Visual Speech Enhancement Using Hierarchical Extreme Learning Machine," EUSIPCO 2019, September 2019. ::: |
116. |
W.-C. Huang, Y.-C. Wu, C.-C. Lo, P. L. Tobing, T. Hayashi, K. Kobayashi, T. Toda, Y. Tsao and H.-M. Wang, "Investigation of F0 conditioning and Fully Convolutional Networks in Variational Autoencoder based Voice Conversion," Interspeech 2019, September 2019, (with ISCA Travel Grant) ::: |
117. |
L.-W. Chen, H.-Y. Lee, and Y. Tsao, "Generative Adversarial Networks for Unpaired Voice Transformation on Impaired Speech," Interspeech 2019, September 2019. ::: |
118. |
S.-W. Fu, C.-F. Liao, Y. Tsao, S.-D. Lin, "MetricGAN: Generative Adversarial Networks based Black-box Metric Scores Optimization for Speech Enhancement," ICML 2019, June 2019, Long Oral with ICML (top 3%) Travel Grant; Codes: https://github.com/JasonSWFu/MetricGAN ::: |
119. |
Y.-L. Shen, C.-Y. Huang, S.-S. Wang, Y. Tsao, H.-M. Wang, and T.-S. Chi, "Reinforcement Learning Based Speech Enhancement for Robust Speech Recognition," ICASSP 2019, May 2019. ::: |
120. |
K.-Y. Liu, S.-k. Lee, S.-S. Wang, Y. Tsao, J.-w. Hung, "Reducing noise and reverberation in speech signals via the integration of denoising autoencoder and temporal lowpass filtering," ICASI 2019, April 2019. ::: |
121. |
T. Hussain, Y. Tsao, S. M. Sinicalchi, J.-C. Wang, H.-M. Wang, and W.-H. Liao, "Bone-conducted Speech Enhancement using Hierarchical Extreme Learning Machine," IWSDS 2019, April 2019. ::: |
122. |
R. E. Zezario, J.-W. Huang, X. Lu, Y. Tsao, H.-T. Hwang, H.-M. Wang, "Deep Denoising Autoencoder Based Post Filtering for Speech Enhancement," APSIPA 2018, December 2018. ::: |
123. |
Shang-Chih Lin*, Yu Tsao, Shun-Feng Su, Yennun Huang, and Zi-Qing Zhong, "An Abnormal Detection Strategy of Rotating Electric Machine based on Frequency Distribution," The 39th Symposium on Electrical Power Engineering, December 2018. |
124. |
S.-k. Lee, S.-S. Wang, Y. Tsao, J.-w. Hung, "Speech Enhancement based on Reducing the Detail Portion of Speech Spectrograms in Modulation Domain via Discrete Wavelet Transform," ISCSLP 2018, November 2018. ::: |
125. |
W.-C. Huang, H.-T. Hwang, Y.-H. Peng, Y. Tsao, H.-M. Wang, "Voice Conversion Based on Cross-Domain Features Using Variational Auto Encoders," ISCSLP 2018, November 2018, Best Student Paper Award ::: |
126. |
Shang-Chih Lin*, Chuan-Hsiang Su, Yu Tsao, Shun-Feng Su, Hong-Yuan Mark Liao, and Yennun Huang, "FIS-based Domestic Milling Machine PHM System Considering Multi-speed Frequency Variation," IEEE International Conference on Advanced Manufacturing, November 2018, (Best Paper Award) (獲推薦轉投SCI期刊, 擴充研究修改中) |
127. |
Y.-T. Hsu, Z. Zhu, C.-T. Wang, S.-H. Fang, F. Rudzicz, and Y. Tsao, "Robustness against the channel effect in pathological voice detection," NeurIPS 2018, Machine Learning for Health (ML4H) Workshop, November 2018. ::: |
128. |
Shang-Chih Lin*, Yu Tsao, Shun-Feng Su, and Yennun Huang, "An Industrial IoT Analysis System Based on Machining Data of Metal Materials," International Conference on Fuzzy Theory and Its Applications, November 2018. |
129. |
Hung-Chung Li, Shang-Chih Lin, Yu Tsao, Shun-Feng Su, Pei-Li Sun and Yennun Huang, "A Supervised Learning Algorithm Considering Light Conditions for Visual Inspection of Metal Objects," The 54th Annual Conference of Chinese Society for Quality 2018 International Symposium of Quality Management, November 2018, (Makalot Industry-Academic Collaboration Award) (獲推薦轉投EI期刊, 擴充研究修改中) |
130. |
Y.-T. Hsu, Y.-C. Lin, S.-W. Fu, Y. Tsao, T.-W. Kuo, "A study on speech enhancement using exponent-only floating point quantized neural network (EOFP-QNN)," SLT 2018, November 2018. ::: |
131. |
Y.-Y. Kao, H.-P. Hsu, C.-F. Liao, Y. Tsao, H.-C. Yang, J.-L. Li, C.-C. Lee, H.-S. Lee, and H.-M. Wang, "Automatic Detection of Speech Under Cold Using Discriminative Autoencoders and Strength Modeling with Multiple Sub-Dictionary Generation," IEEE IWAENC, September 2018. ::: |
132. |
X. Lu, P. Shen, S. Li, Y. Tsao, H. Kawai, "Temporal Attentive Pooling for Acoustic Event Detection," Interspeech 2018, September 2018. ::: |
133. |
S.-W. Fu, Y. Tsao, H.-T. Hwang, H.-M. Wang, "Quality-Net: An End-to-End Non-intrusive Speech Quality Assessment Model based on BLSTM," Interspeech 2018, September 2018. ::: |
134. |
Y.-H. Peng, H.-T. Hwang, Y.-C. Wu, Y. Tsao, H.-M. Wang, "Exemplar-Based Spectral Detail Compensation for Voice Conversion," Interspeech 2018, September 2018. ::: |
135. |
B.-S. Yu, Y. Tsao, S.-W. Yang, Y.-K. Chen, and S.Y. Chien, "Architecture Design of Convolutional Neural Networks for Face Detection on an FPGA Platform," IEEE SiPS 2018, September 2018. |
136. |
N. Ryant et al., "Enhancement and Analysis of Conversational Speech: JSALT 2017," ICASSP, April 2018. ::: |
137. |
Y.-H. Lai, W.-Z. Zheng, S.-T. Tang, S.-H. Fang, W.-H. Liao, and Y. Tsao, "Improving the Performance of Hearing Aids in Noisy Environments based on Deep Learning Technology," EMBC 2018, April 2018. ::: |
138. |
W.-J. Lee, S.-S. Wang, F. Chen, X. Lu, S.-Y. Chien, and Y. Tsao,, "Speech Dereverberation Based on Integrated Deep and Ensemble Learning Algorithm," ICASSP, April 2018. ::: |
139. |
L. Sun, J. Du, T. Gao, Y.-D. Lu, Y. Tsao, C.-H. Lee, N. Ryant, "A Novel LSTM-based Speech Preprocessor For Speaker Diarization in Realistic Mismatch Conditions," ICASSP, April 2018. ::: |
140. |
S.-W. Fu, Y. Tsao, X. Lu, and H. Kawai, "Raw Waveform-based Speech Enhancement by Fully Convolutional Networks," APSIPA 2017, November 2017. ::: |
141. |
Y.-H. Peng, C.-C. Hsu, Y.-C. Wu, H.-T. Hwang, Y.-W. Liu, Y. Tsao, and H.-M. Wang, "Fast Locally Linear Embedding Algorithm for Exemplar-based Voice Conversion," APSIPA 2017, November 2017, (Poster Presentation Award) ::: |
142. |
S.-S. Wang, Y. Tsao, H.-L. S. Wang, Y.-H. Lai*, and L. P.-H. Li, "A Deep Learning based Noise Reduction Approach to Improve Speech Intelligibility for Cochlear Implant Recipients in the Presence of Competing Speech Noise," APSIPA 2017, November 2017. ::: |
143. |
T.-H. Lin, Y.-H. Wang, S.-S. Lu, H.-W. Yen, and Y. Tsao, "Computing Biodiversity Change via a Soundscape Monitoring Network," PNC 2017 Annual Conference and Joint Meetings, November 2017. ::: |
144. |
S.-W. Fu, T.-y. Hu, Y. Tsao, X. Lu, "Complex Spectrogram Enhancement by Convolutional Neural Network with Multi-metrics Learning," IEEE MLSP 2017, September 2017. ::: |
145. |
T.-H. Lin and Y. Tsao, "Deblending of Simultaneous-source Seismic Data via Periodicity-coded Nonnegative Matrix Factorization," IEEE Dataport, September 2017. ::: |
146. |
M.-H. Yang, H.-S. Lee, Y.-D. Lu, K.-Y. Chen, Y. Tsao, B. Chen, and H.-M. Wang, "Discriminative Autoencoders for Acoustic Modeling," Interspeech2017, August 2017. ::: |
147. |
Y.-C. Wu, H.-T. Hwang, S.-S. Wang, C.-C. Hsu, Y. Tsao, and H.-M. Wang, "A Post-filtering Approach Based on Locally Linear Embedding Difference Compensation for Speech Enhancement," Interspeech2017, August 2017. ::: |
148. |
C.-C. Hsu, H.-T. Hwang, Y.-C. Wu, Y. Tsao, and H.-M. Wang, "Voice Conversion from Unaligned Corpora Using Variational Autoencoding Wasserstein Generative Adversarial Networks," Interspeech2017, August 2017. ::: |
149. |
C.-L. Wu, H.-P. Hsu, S.-S. Wang, J.-W. Hung, Y.-H. Lai, H.-M. Wang, and Y. Tsao, "Wavelet Speech Enhancement Based on Robust Principal Component Analysis," Interspeech2017, August 2017. ::: |
150. |
S.-T. Lin, Y.-H. Liao, Y. Tsao, and S.-Y. Chien,, "Object-based on-line video summarization for internet of video things," EEE ISCAS, May 2017. ::: |
151. |
Y.-C. Wu, H.-T. Hwang, S.-S. Wang, C.-C. Hsu, Y.-H. Lai, Y. Tsao, and H.-M. Wang, "A Locally Linear Embbeding Based Postfiltering Approach for Speech Enhancement," IEEE ICASSP, March 2017. ::: |
152. |
H.-S. Lee, Y.-D. Lu, C.-C. Hsu, Y. Tsao, H.-M. Wang, and S.-K. Jeng, "Discriminative Autoencoders for Speaker Verification," IEEE ICASSP, March 2017. ::: |
153. |
J.-C. Hou, S.-S. Wang, Y.-H. Lai, J.-C. Lin, Y. Tsao, H.-W. Chang, and H.-M. Wang, "Audio-Visual Speech Enhancement using Deep Neural Networks," APSIPA 2016, December 2016. ::: |
154. |
C.-C. Hsu, H.-T. Hwang, Y.-C. Wu, Y. Tsao and H.-M. Wang, "Voice Conversion from Non-parallel Corpora Using Variational Auto-encoder," APSIPA ASC, December 2016. ::: |
155. |
C.-C. Hsu, H.-T. Hwang, Y.-C. Wu, Y. Tsao, and H.-M. Wang, "Dictionary Update for NMF-based Voice Conversion Using an Encoder-Decoder Network," ISCSLP, November 2016. ::: |
156. |
Y.-Y. Hsieh, C.-D. Wu, Y. Tsao, and S.-S. Lu, "A Linear Regression Model with Dynamic Pulse Transit Time Features for Noninvasive Blood Pressure Prediction," BioCAS, October 2016. ::: |
157. |
Y.-H. Lai, S.-S. Wang, Y.-T. Su, H.-C. Cheng, F. K. Fu, and Y. Tsao, "Improving the Performance of Speech Perception in Noisy Environment based on a FAME Strategy," ISCSLP 2016, October 2016. ::: |
158. |
C.-Y. Hsu, R. E. Zezario, J.-C. Wang, X. Lu, and Y. Tsao, "Incorporating Local Environment Information with Ensemble Neural Networks to Robust Automatic Speech Recognition," ISCSLP 2016, October 2016. ::: |
159. |
H.-S. Lee, Y. Tsao, C.-C. Lee, H.-M. Wang, W.-C. Lin, W.-C. Chen, S.-W. Hsiao, S.-K. Jeng, "Minimization of Regression and Ranking Losses with Shallow Neural Networks on Automatic Sincerity Evaluation," Interspeech, September 2016. ::: |
160. |
S.-W. Fu, Y. Tsao, X. Lu, "SNR-Aware Convolutional Neural Network Modeling for Speech Enhancement," Interspeech, September 2016. ::: |
161. |
Y.-C. Wu, H.-T. Hwang, C.-C. Hsu, Y. Tsao, H.-M. Wang, "Locally Linear Embedding for Exemplar-Based Spectral Conversion," Interspeech, September 2016. ::: |
162. |
X. Lu, P. Shen, Y. Tsao, H. Kawai, "Pair-wise Distance Metric Learning of Neural Network Model for Spoken Language Identification," Interspeech, September 2016. ::: |
163. |
Y.-H. Lai, C.-H. Wang, S.-Y. Hou, B.-Y. Chen, Y. Tsao, and Y.-W. Liu, "DCASE Report for Task 3: Sound Event Detection in Real Life Audio," DCASE 2016 workshop, September 2016. ::: |
164. |
C.-W. Wu, M.-T. Zhong, Y. Tsao, S.-W. Yang, Y.-K. Chen, and S.-Y. Chien, "Track-clustering Error Evaluation for Track-based Multi-camera Tracking System Employing Human Re-identification," CVPR workshop, August 2016, Codes: https://github.com/cw1204772/ClustTMCT ::: |
165. |
Syu-Siang Wang, Jeremy Chiaming Yang, Yu Tsao, and Jeih-weih Hung, "Leveraging Nonnegative Matrix Factorization in Processing the Temporal Modulation Spectrum for Speech Enhancement," IEEE ICCE-Taiwan 2016, May 2016. ::: |
166. |
Y.-T. Liu, Y. Tsao, R. Y. Chang:, "Nonnegative Matrix Factorization-based Frequency Lowering Technology for Mandarin-speaking Hearing Aid Users," IEEE ICASSP2016, pages 5905-5909, May 2016. ::: |
167. |
Jeremy Chiaming Yang, Syu-Siang Wang, Yu Tsao, and Jeih-weih Hung, "Speech Enhancement via Ensemble Modeling NMF Adaptation," IEEE ICCE-Taiwan 2016, May 2016. ::: |
168. |
S.-S. Wang and Y. Tsao, "Temporal Modulation Spectral Restoration for Robust Speech Recognition," IEEE International Conference on Multimedia Big Data, April 2016. ::: |
169. |
Ying-Hui Lai, Chien-Hsun Chen, Shih-Tsang Tang, Zong-Mu Yeh, and Yu Tsao, "Improving the Performance of Noise Reduction in Hearing Aids Based on the Genetic Algorithm," IFMBE Proceedings 57, March 2016. |
170. |
S.-S. Wang, H.-T. Hwang, Y.-H. Lai, Y. Tsao, X. Lu, H.-M. Wang, and B. Su, "Improving Denoising Auto-encoder Based Speech Enhancement With the Speech Parameter Generation Algorithm," APSIPA 2015, December 2015. ::: |
171. |
P. Lin, D.-C. Lyu, Y.-F. Chang, and Y. Tsao, "Temporal Alignment for Deep Neural Networks," GlobalSIP 2015, December 2015. ::: |
172. |
H.-T. Hwang, Y. Tsao, H.-M. Wang, Y.-R. Wang, and S.-H. Chen, "A Probabilistic Interpretation for Artificial Neural Network-based Voice Conversion," APSIPA 2015, December 2015. ::: |
173. |
Y.-T. Liu, R. Y. Chang, Y. Tsao, and Y.-p. Chang, "A New Frequency Lowering Technique for Mandarin-Speaking Hearing Aid Users," GlobalSIP 2015, December 2015. ::: |
174. |
P. Lin, D.-C. Lyu, Y.-F. Chang, and Y. Tsao, "Speech Recognition with Temporal Neural Networks," Interspeech 2015, ISCA, editor, pages 21–25, September 2015. ::: |
175. |
X. Lu, P. Shen, Y. Tsao, C. Hori, H. Kawai, "Sparse Representation with Temporal Max-Smoothing for Acoustic Event Detection," Interspeech 2015, ISCA, editor, pages 1176-1180, September 2015. ::: |
176. |
P. Lin, S.-S. Wang, and Y. Tsao, "Temporal Information in Tone Recognition," IEEE ICCE 2015, June 2015. ::: |
177. |
W.-C. Chen, P.-T. Lai, Y. Tsao, and C.-C. Lee, "Multimodal Arousal Rating using Unsupervised Fusion Technique," ICASSP 2015, April 2015. ::: |
178. |
Y.-H. Lai, S.-S. Wang, P.-C. Li, and Yu Tsao, "A Discriminative Post-filter for Speech Enhancement in Hearing Aids," ICASSP 2015, April 2015. ::: |
179. |
Y.-F. Chang, P. Lin, S.-H. Cheng, K.-H. Chan, Y.-C. Zeng, C.-W. Liao, W.-T. Chang, Y.-C. Wang, Y. Tsao, "Robust Anchorperson Detection Based on Audio Streams using a Hybrid I-vector and DNN System," APSIPA 2014, December 2014. ::: |
180. |
Y.-H. Lai, F. Chen, and Y. Tsao, "Effect of Adaptive Envelope Compression in Simulated Electric Hearing in Reverberation," ISIC 2014, December 2014. ::: |
181. |
H. Jing, A.-C. Liang, S.-D. Lin, and Y. Tsao, "A Transfer Probabilistic Collective Factorization Model to Handle Sparse Data in Collaborative Filtering," ICDM 2014, December 2014, accepted as a regular paper (acceptance rate=9.5%) ::: |
182. |
X. Lu, Y. Tsao, S. Matsuda, and C. Hori, "Ensemble Modeling of Denoising Autoencoder for Speech Spectrum Restoration," Interspeech 2014, September 2014. ::: |
183. |
H.-S. Lee, Y. Tsao, H.-M. Wang and S.-K. Jen, "Clustering-Based I-Vector Formulation for Speaker Recognition," Interspeech 2014, September 2014. ::: |
184. |
H. Jing, T.-Y. Hu, H.-S. Lee, W.-C. Chen, C.-C. Lee, Y. Tsao and H.-M. Wang, "Ensemble of Machine Learning Algorithms for Cognitive and Physical Speaker Load Detection," Interspeech 2014, September 2014. ::: |
185. |
P. Lin, F. Chen, S.-S. Wang, Y. Tsao and Y. H. Lai, "Automatic Speech Recognition with Primarily Temporal Envelope Information," Interspeech 2014, September 2014. ::: |
186. |
X. Lu, Y. Tsao, P. Shen, and C. Hori, "Spectral Patch Based Sparse Coding for Acoustic Event Detection," ISCSLP 2014, September 2014. ::: |
187. |
Y. H. Lai, F. Chen, and Y. Tsao, "An Adaptive Envelope Compression Strategy for Speech Processing in Cochlear Implants," Interspeech 2014, September 2014. ::: |
188. |
S.-S. Wang, P. Lin, D.-C. Lyu, Y. Tsao, H.-T. Hwang, B. Su and H.-M. Wang, "Acoustic Feature Conversion using a Polynomial based Feature Transferring Algorithm," ISCSLP 2014, September 2014. ::: |
189. |
X. Lu, Yu Tsao, S. Matsuda, and C. Hori, "Sparse Representation Based on a Bag of Spectral Exemplars for Acoustic Event Detection," ICASSP 2014, May 2014. ::: |
190. |
H.-S. Lee, Y. Tsao, Y.-F. Chang, H.-M. Wang, and S.-K. Jeng, "Speaker Verification Using Kernel-Based Binary Classifiers with Binary Operation Derived Features," ICASSP 2014, May 2014. ::: |
191. |
H.-t. Fan, J.-w. Hung, X. Lu, S.-S. Wang, Yu Tsao, "Speech Enhancement using Segmental Nonnegative Matrix Factorization," ICASSP 2014, May 2014. ::: |
192. |
H. Jing, Y. Tsao, K.-Y. Chen and H.-M. Wang, "Semantic Naïve Bayes Classifier for Document Classification," IJCNLP, December 2013. ::: |
193. |
H.-T. Hwang, Y. Tsao, H.-M. Wang, Y.-R. Wang, S.-H. Chen, "Incorporating Global Variance in the Training Phase of GMM-based Voice Conversion," APSIPA 2013, October 2013. ::: |
194. |
C.-H. Wang, T.-W. Kao, S.-H. Fang, Y. Tsao, L.-C. Kuo, S.-W. Kao, and N.-C. Lin, "Robust Wi-Fi Location Fingerprinting Against Device Diversity based on Spatial Mean Normalization," APSIPA 2013, October 2013. ::: |
195. |
Hsin-Te Hwang, Yu Tsao, Hsin-Min Wang, Yih-Ru Wang and Sin-Horng Chen, "Alleviating the Over-Smoothing Problem in GMM-Based Voice Conversion with Discriminative Training," Interspeech 2013, August 2013. ::: |
196. |
Bo Li, Yu Tsao and Khe Chai Sim, "An Investigation of Spectral Restoration Algorithms for Deep Neural Networks based Noise Robust Speech Recognition," Interspeech 2013, August 2013. ::: |
197. |
Tsung-Hsien Wen, Aaron Heidel, Hung-yi Lee, Yu Tsao and Lin-Shan Lee, "Recurrent Neural Network Based Language Model Personalization by Social Network Crowdsourcing," Interspeech 2013, August 2013, (Best Student Paper Award Nomination) ::: |
198. |
Xugang Lu, Yu Tsao, Shigeki Matsuda and Chiori Hori, "Speech Enhancement Based on Deep Denoising Autoencoder," Interspeech 2013, August 2013, Codes: Tensor Flow: https://github.com/jonlu0602/DeepDenoisingAutoencoder; Keras: https://github.com/jerrygood0703/DDAE; Matlab: https://drive.google.com/open?id=0B8ZEsMh6ITIlNVZ1VmROdTdQNUU ::: ::: |
199. |
Hung-yi Lee, Ting-yao Hu, How Jing, Yun-Fan Chang, Yu Tsao, Yu-Cheng Kao and Tsang-Long Pao, "Ensemble of Machine Learning and Acoustic Segment Model Techniques for Speech Emotion and Autism Spectrum Disorders Recognition," Interspeech 2013, August 2013, (Second Place In the Autism Sub-Challenge) ::: |
200. |
Ying-Hui Lai, Yu-Cheng Su, Yu Tsao, Shuenn-Tsong Young, "Evaluation of Generalized Maximum a Posteriori Spectral Amplitude (GMAPA) Speech Enhancement Algorithm in Hearing Aids," ISCE 2013, June 2013. ::: |
201. |
Syu-Siang Wang, Yu Tsao, Jeih-weih Hung, "Filtering on the Temporal Probability Sequence in Histogram Equalization for Robust Speech Recognition," ICASSP 2013, IEEE, May 2013. ::: |
202. |
Yu-Cheng Su, Yu Tsao, Jung-En Wu, Fu-Rong Jean, "Speech Enhancement using Generalized Maximum a Posteriori Spectral Amplitude Estimator," ICASSP 2013, IEEE, May 2013. ::: |
203. |
How Jing and Yu Tsao, "Sparse Maximum Entropy Deep Belief Nets," IJCNN 2013, IEEE, April 2013. ::: |
204. |
H.-T. Hwang, Yu Tsao, H.-M. Wang, Y.-R. Wang, and S.-H. Chen, "Exploring Mutual Information for GMM-Based Spectral Conversion," ISCSLP 2012, IEEE, December 2012. ::: |
205. |
X. Lu, Yu Tsao, S. Matsuda, C. Hori, and H. Kashioka, "Acoustic Space Partition based on Broad Phonetic Class for Ensemble Acoustic Modeling," ISCSLP 2012, IEEE, December 2012. ::: |
206. |
S.-S. Wang, J.-W. Hung, and Yu Tsao, "A Study on Cepstral Subband Normalization for Robust ASR," ISCSLP 2012, IEEE, December 2012. ::: |
207. |
T.-Y. Hu, Yu Tsao, and L.-S. Lee, "Discriminative Fuzzy Clustering Maximum a Posterior Linear Regression for Speaker Adaptation," Interspeech 2012, ISCA, September 2012. ::: |
208. |
H.-T. Hwang, Yu Tsao, H.-M. Wang, Y.-R. Wang, and S.-H. Chen, "A Study of Mutual Information for GMM-Based Spectral Conversion," Interspeech 2012, ISCA, September 2012. ::: |
209. |
Yu Tsao, C.-L. Huang, S. Matsuda, C. Hori, and H. Kashioka, "A Linear Projection Approach to Environment Modeling for Robust Speech Recognition," ICASSP 2012, IEEE, April 2012. ::: |
210. |
C.-L. Huang, Yu Tsao, and C. Hori, "Feature Normalization and Selection for Robust Speaker State Recognition," COCOSDA 2011, IEEE, October 2011. ::: |
211. |
Yu Tsao, P. R. Dixon, C. Hori, and H. Kawai, "Incorporating Regional Information to Enhance MAP-based Stochastic Feature Compensation for Robust Speech Recognition," Interspeech, ISCA, August 2011. ::: |
212. |
Yu Tsao, R. Isotani, H. Kawai, and S. Nakamura, "Increasing Discriminative Capability on Map-based Mapping Function Estimation for Acoustic Model Adaptation," ICASSP, IEEE, May 2011. ::: |
213. |
Y. Tsao, S. Matsuda, S. Sakai, R. Isotani, H. Kawai, and S. Nakamura, "A Sampling-based Environment Population Projection Approach for Rapid Acoustic Model Adaptation," ICASSP, IEEE, May 2011. ::: |
214. |
J. Li, Y. Tsao, and C.-H. Lee, "Shrinkage Model Adaptation in Automatic Speech Recognition," Interspeech, ISCA, September 2010. ::: |
215. |
A. Mushtaq, Y. Tsao, and C.-H. Lee, "A Particle Filter Feature Compensation Approach to Robust Speech Recognition," Interspeech, ISCA, September 2010. ::: |
216. |
Yu Tsao, H. Sun, H. Li, and C.-H. Lee, "An Acoustic Segment Model Approach to Incorporating Temporal Information into Speaker Modeling for Text-Independent Speaker Recognition," ICASSP, IEEE, May 2010. ::: |
217. |
Y. Tsao, S. Matsuda, S. Nakamura, and C.-H. Lee, "MAP Estimation of Online Mapping Parameters in Ensemble Speaker and Speaking Environment Modeling," ASRU, IEEE, December 2009. ::: |
218. |
S. Matsuda, Y. Tsao, J. Li, S. Nakamura, and C.-H. Lee, "A Study on Soft Margin Estimation of Linear Regression Parameters for Speaker Adaptation," Interspeech, ISCA, December 2009. ::: |
219. |
Y. Tsao, J. Li, C.-H. Lee, and S. Nakamura, "Soft Margin Estimation on Improving Environment Structures for Ensemble Speaker and Speaking Environment Modeling," IUCS, ACM, December 2009. ::: |
220. |
Y. Tsao, J. Li, and C.-H. Lee, "Ensemble Speaker and Speaking Environment Modeling Approach with Advanced Online Estimation Process," ICASSP, IEEE, May 2009. ::: |
221. |
S.-Y. Peng, Y. Tsao, P. E. Hasler, and D. V. Anderson, "A Programmable Analog Radial-Basis-Function Based Classifier," ICASSP, IEEE, December 2008. ::: |
222. |
Y. Tsao and C.-H. Lee, "Improving the Ensemble Speaker and Speaking Environment Modeling Approach by Enhancing the Precision of the Online Estimation Process," Interspeech, ISCA, September 2008. ::: |
223. |
Y. Tsao and C.-H. Lee, "Two Extensions to Ensemble Speaker and Speaking Environment Modeling for Robust Automatic Speech Recognition," ASRU, IEEE, December 2007. ::: |
224. |
I. Bromberg, Q. Fu, J. Hou, J. Li, C. Ma, B. Mattews, A. Moreno-Daniel, J. Morris, S. M. Siniscalchi, Y. Tsao, and Y. Wang, "Detection-based ASR In the Automatic Speech Attribute Transcription Project," Interspeech, ISCA, September 2007. ::: |
225. |
Y. Tsao and C.-H. Lee, "An Ensemble Modeling Approach to Joint Characterization of Speaker and Speaking Environments," Interspeech, ISCA, September 2007. ::: |
226. |
Y. Tsao and C.-H. Lee, "A Vector Space Approach to Environment Modeling for Robust Speech Recognition," Interspeech, ISCA, September 2006. ::: |
227. |
C. Ma, Y. Tsao, and C.-H. Lee, "A Study on Detection Based Automatic Speech Recognition," Interspeech, ISCA, September 2006. ::: |
228. |
Y. Tsao, J. Li, and C.-H. Lee, "A Study on Separation between Acoustic Models and Its Applications," Eurospeech, ISCA, September 2005. ::: |
229. |
J. Li, Y. Tsao, and C.-H. Lee, "A Study on Knowledge Source Integration for Candidate Rescoring in Automatic Speech Recognition," ICASSP, IEEE, April 2005. ::: |
230. |
Y. Tsao, S.-M. Lee, and L.-S. Lee, "Segmental Eigenvoice for Rapid Speaker Adaptation," Eurospeech, ISCA, September 2001. ::: |
|
|
Technical Reports | |
1. |
王豫煌、林誠謙、嚴漢偉、林子皓、陸聲山、曹昱、端木茂甯、黃俊嘉、莊庭瑞, "亞洲聲景長期監測網," number 3, 臺灣生態學會、中央研究院、日本國立研究開發法人海洋研究開發機構、林業試驗所森林保護組, August 2019. ::: |
2. |
張佑榕、曹昱, "研之有物(智慧聽)," 中央研究院, 2019. ::: |
3. |
曹昱, "基於人工智慧之語音溝通輔具," 中研院 | 數理科學, 漫步科研, 科普專欄 2019-06-20, 2019. ::: |
4. |
端木茂甯, "研之有物(蝙蝠的超音波,藏了什麼訊息?)," 中央研究院, 2018. ::: |
|
|
Book & Book Chapters | |
1. |
P. Lin, Y. Tsao, and L.-W. Kuo,, chapter "Controlling the Biocompatibility and Mechanical Effects of Implantable Microelectrodes to Improve Chronic Neural Recordings in the Auditory Nervous System," "An Excursus into Hearing Loss," S. Hatzopoulos and A. Ciorba, editor, pages 173-195, IntechOpen, May 2018. ::: |
2. |
Y.-H. Lai, Fe. Chen, and Y. Tsao,, chapter "Adaptive Dynamic Range Compression for Improving Envelope-Based Speech Perception: Implications for Cochlear Implants," "Emerging Technology and Architecture for Big-data Analytics," A. Chattopadhyay and Y. Hao, editor, pages 191-214, Springer, April 2017. ::: |
|
|
Others | |
1. |
"Yu Tsao's CV,". ::: |
2. |
Yu Tsao, "Interspeech 2024 Survey Talk: Neural Speech Assessment," September 2024. ::: |
3. |
X. Lu and Y. Tsao, "Optimal Transport (OT) in Speech: OT Meets Speech,", Tutorial in INTERSPEECH 2024 September 2024. ::: |
4. |
Fei Chen and Yu Tsao, "Advances in Objective Speech Intelligibility and Quality Assessment: From Psychoacoustics to Machine Learning,", Tutorial in IEEE ICASSP 2024 April 2024. ::: ::: |
5. |
Yu Tsao, "Utilizing Deep Learning for Speech Enhancement in Assistive Oral Communication Technologies,", Keynote Speech in M3Oriental Workshop, ACM Multimedia Asia 2023 December 2023. ::: |
6. |
Yu Tsao, "Wearable Devices and Machine Learning Algorithms for Augmented Oral Communication Assistance,", CTSoc Technical Talk November 2023. ::: |
7. |
Fei Chen and Yu Tsao, "Advances in Psychoacoustics and Machine Learning towards Objective Speech Intelligibility Evaluation," October 2023. ::: ::: |
8. |
Fei Chen and Yu Tsao, "Speech Assessment Metrics: From Psychoacoustics to Machine Learning,", Tutorial in Interspeech 2023 August 2023. ::: ::: |
9. |
Yu Tsao, "聽說 AI," November 2022, 國科會工程處記者會 ::: ::: |
10. |
Fei Chen and Yu Tsao, "Speech enhancement for cochlear implants: From psychoacoustics to machine learning,", Tutorial in Interspeech 2022 September 2022. ::: |
11. |
Fei Chen and Yu Tsao, "Advances in Cochlear Implants: From Speech Perception, Enhancement to Evaluation,", Tutorial in EUSIPCO 2022 September 2022. ::: |
12. |
Yu Tsao, "基於深度學習之語音增強技術及其應用,", 2020大數據人工智能 March 2022. ::: ::: |
13. |
Fei Chen and Yu Tsao, "Speech Perception and Enhancement in Cochlear Implants," December 2021, Tutorial in APSIPA 2021 ::: ::: |
14. |
Berrak Sisman, Yu Tsao, Haizhou Li, "Theory and Practice of Voice Conversion,", Tutorial in APSIPA 2020 December 2020. ::: |
15. |
Fei Chen and Yu Tsao, "Intelligibility Evaluation and Speech Enhancement based on Deep Learning,", Tutorial in Interspeech 2020 October 2020, Video: https://www.youtube.com/watch?v=89S4CgfPWG0 ::: ::: |
16. |
Yu Tsao, "Speech Enhancement based on Deep Learning and Intelligibility Evaluation,", Tutorial in APSIPA 2019 November 2019. ::: |
17. |
H.-Y. Lee and Y. Tsao, "Generative Adversarial Network and its Applications to Speech Signal Processing and Natural Language Processing,", Tutorial in Interspeech 2019 September 2019. ::: |
18. |
"Improving biodiversity monitoring through soundscape information retrieval," May 2018. ::: |
19. |
Hung-iy Lee and Yu Tsao, "Generative Adversarial Network and its Applications to Speech Signal Processing and Natural Language Processing,", Tutorial in ICASSP 2018 April 2018. ::: |
20. |
Y.-C. Lin, Y.-H. Lai, H.-W. Chang, Y. Tsao, Y.-p. Chang, and R. Y. Chang, "PAD-MMRT," August 2014, Original corpus is prepared by K.-S. Tsai, L.-H. Tseng, C.-J.Wu, and S.-T. Young: “Development of a Mandarin monosyllable recognition test,” Ear and Hearing, vol. 30, no. 1, pp. 90–99, 2009. ::: ::: |
21. |
曹昱,蘇煜程,王緒翔, "線性映射轉換函數於聲學模型調適之強健式語音辨識,", 計算語言學學會通訊 第 23 卷第 2 期 (2012 年 6 月 ) June 2012. ::: |
|
|
|
|
|
|
|
 |
|
|
|
|