Assistant Professor Xingfeng Li - Faculty of Data Science

Dr. Xingfeng Li

Assistant Professor Master Supervisor

Email: xfli@cityu.edu.mo

Tel: (853)8590 2338

Office address: Room S502, Stanley Ho Building, City University of Macau (Taipa)

Website: https://sites.google.com/view/xingfeng/home

Educational qualifications

2019 Doctor of Philosophy in Information Science, Japan Advanced Institute of Science and Technology

2016 Master of Science in Information Science, Japan Advanced Institute of Science and Technology

2016 Master of Engineering in Software Engineering, Tianjin University, China

2013 Bachelor of Software Engineering, Changchun University of Science and Technology, China

Incumbent

Assistant Professor, Faculty of Data Science, City University of Macau

Courses taught

Data mining

Algorithm Analysis and Design

Data structure

Introduction to Artificial Intelligence

Digital image processing

Computer Basics and Artificial Intelligence

Research direction

Intelligent voice interaction

Affective computing

Psychoacoustics

Digital health

Research and publishing

2026

Journal Articles

Shi X, He J, Li X, Toda T. A Comprehensive Study on the Effectiveness of ASR Representations for Noise-Robust Speech Emotion Recognition. IEEE Transactions on Audio, Speech and Language Processing. vol. 34, pp. 707-722, 2026, doi: 10.1109/TASLPRO.2026.3654273.
Shenzhi Li, Xingfeng Li, Hao Zhu, Chao Li, Peng Wang, PFDBooster: A Unified Post-Image Fusion Dual-Domain Boosting Paradigm, Knowledge-Based Systems, 2026, 115577, ISSN 0950-7051, https://doi.org/10.1016/j.knosys.2026.115577.
X. Li, N. Luo, F. Yu, X. Shi, J. Li and Y. Liu, "Multi-Task Deep Learning with Over-Sampling and Style Randomization for Improved Cross-Regional Bird Vocalization Recognition," in IEEE Transactions on Audio, Speech and Language Processing, doi: 10.1109/TASLPRO.2026.3675794. https://ieeexplore.ieee.org/document/11447412

2025

Journal Articles

Xingfeng Li, Ningfeng Luo, Feifei Yu, Junjie Li, Kai Li, Yongwei Li, Zhen Zhao, Yang Liu, Xiaohan Shi, "Human Auditory Representation Learning for cross-dialect bird species recognition" in Ecological Informatics, Volume 93, February 2026, 103554. https://doi.org/10.1016/j.ecoinf.2025.103554
J. He, X. Shi, C. H. Hu, J. Mi, X. Li and T. Toda, "M4SER: Multimodal, Multirepresentation, Multitask, and Multistrategy Learning for Speech Emotion Recognition," in IEEE Transactions on Audio, Speech and Language Processing, vol. 33, pp. 4055-4070, 2025, doi: 10.1109/TASLPRO.2025.3614428.
Yang Liu, Xin Chen, Yarong Li, Jie Ma, Xiaoqi Yang, Yuan Song, Xiaolei Meng, Yongwei Li, Xingfeng Li, Zhen Zhao, "Enhanced Speech Emotion Recognition in Noisy Environments: Adaptive Emotion Denoising Diffusion Approach With Iterative Confidence Learning Strategy," in IEEE Internet of Things Journal, vol. 12, no. 20, pp. 43241-43254, 15 Oct.15, 2025, doi: 10.1109/JIOT.2025.3595096.
K. Li, K. Zaman, X. Li, M. Akagi, J. Dang, and M. Unoki, "Machine Anomalous Sound Detection Using Spectral-Temporal Modulation Representations Derived From Machine-Specific Filterbanks," in IEEE Transactions on Audio, Speech and Language Processing, vol. 33, pp. 2059-2073, 2025, doi: 10.1109/TASLPRO.2025.3570956.
Y. Liu, X. Chen, Z. Peng, Y. Li, X. Li, P. Song, M. Unoki, and Z. Zhao, "Enhancing Speech Emotion Recognition With Conditional Emotion Feature Diffusion and Progressive Interleaved Learning Strategy," in IEEE Transactions on Audio, Speech and Language Processing, vol. 33, pp. 1787-1800, 2025, doi: 10.1109/TASLPRO.2025.3561606.
Gao, Shun and Xia, Yan and Li, Xingfeng and Cui, Feifei and Zhang, Qingchen and Zou, Quan and Zhang, Zilong, "ACP-ESM2: Enhancing Anticancer Peptide Prediction With Pre-Trained Protein Language Models," in IEEE Transactions on Computational Biology and Bioinformatics, vol. 22, no. 3, pp. 1041-1051, May-June 2025, doi: 10.1109/TCBBIO.2025.3547952.

Conference Proceedings

Liu, Xiaokang, Xingfeng Li, Yudong Yang, Lan Wang, and Nan Yan. "Addressing Task Conflicts in Stuttering Detection via MMoE-Based Multi-Task Learning." In Proc. Interspeech 2025, pp. 798-802. 2025.
Shi, Xiaohan, Xingfeng Li, and Tomoki Toda. "Who, When, and What: Leveraging the “Three Ws” Concept for Emotion Recognition in Conversation." In Proc. Interspeech 2025, pp. 1763-1767. 2025.
Shi, Xiaohan, Xingfeng Li, and Tomoki Toda. "Speaker-Aware Multi-Task Learning for Speech Emotion Recognition." In Proc. Interspeech 2025, pp. 4333-4337. 2025.
Shi, Xiaohan, Jinyi Mi, Xingfeng Li, and Tomoki Toda. "Advancing emotion recognition via ensemble learning: Integrating speech, context, and text representations." In Proc. Interspeech 2025, pp. 4693-4697. 2025.
X. Li and J. Li, "Valence-Arousal Emotion Recognition Using a Deep Three-Layer Model with Aural Perceptual Representations," 2025 IEEE International Conference on Big Data (BigData), Macau, China, 2025, pp. 1964-1973, doi: 10.1109/BigData66926.2025.11401009.
X. Li and F. Yu, "Phase-Aware Spectrogram Fusion with Dual-Stream Residual Networks for Underwater Acoustic Recognition," 2025 IEEE International Conference on Big Data (BigData), Macau, China, 2025, pp. 1954-1963, doi: 10.1109/BigData66926.2025.11402377.
X. Li and J. Kang, "Musically-Inspired Colored Pitch Features for Emotion and Speaker Recognition in Speech," 2025 IEEE International Conference on Big Data (BigData), Macau, China, 2025, pp. 7493-7502, doi: 10.1109/BigData66926.2025.11401704.

2024

Journal Articles

Yidi Sun, Lingling Kong, Jiayi Huang, Hongyan Deng, Xinling Bian, Xingfeng Li, Feifei Cui, Lijun Dou, Chen Cao, Quan Zou, Zilong Zhang, A comprehensive survey of dimensionality reduction and clustering methods for single-cell and spatial transcriptomics data, Briefings in Functional Genomics, Volume 23, Issue 6, November 2024, Pages 733–744, https://doi.org/10.1093/bfgp/elae023
Xiangrun LI, Qiyu SHENG, Guangda ZHOU, Jialong WEI, Yanmin SHI, Zhen ZHAO, Yongwei LI, Xingfeng LI, Yang LIU, "Pool-Unet: A Novel Tongue Image Segmentation Method Based on Pool-Former and Multi-Task Mask Learning" in IEICE TRANSACTIONS on Fundamentals, vol. E107-A, no. 10, pp. 1609-1620, October 2024, doi: 10.1587/transfun.2024EAP1015.
Fu, Xiuhao and Duan, Hao and Zang, Xiaofeng and Liu, Chunling and Li, Xingfeng and Zhang, Qingchen and Zhang, Zilong and Zou, Quan and Cui, Feifei, "Hyb_SEnc: An Antituberculosis Peptide Predictor Based on a Hybrid Feature Vector and Stacked Ensemble Learning," in IEEE/ACM Transactions on Computational Biology and Bioinformatics, vol. 21, no. 6, pp. 1897-1910, Nov.-Dec. 2024, doi: 10.1109/TCBB.2024.3425644.
Yidi Sun , Lingling Kong , Jiayi Huang , Hongyan Deng , Xinling Bian , Xingfeng Li , Feifei Cui , Lijun Dou , Chen Cao , Quan Zou , Zilong Zhang, msBERT-Promoter: a multi-scale ensemble predictor based on BERT pre-trained model for the two-stage prediction of DNA promoters and their strengths. BMC Biol 22, 126 (2024). https://doi.org/10.1186/s12915-024-01923-z
Duan H, Zhang Y, Qiu H, Fu X, Liu C, Zang X, Xu A, Wu Z, Li X, Zhang Q, Zhang Z. Machine learning-based prediction model for distant metastasis of breast cancer. Computers in Biology and Medicine. 2024 Feb 1;169:107943.

Conference Proceedings

J. Yuan, X. Li, Z. Zhang, Q. Zhang, Q. Zou and F. Cui, "RNASite: A one-stop tool website that integrates multiple RNA modification site databases and servers," 2024 IEEE International Conference on Bioinformatics and Biomedicine (BIBM), Lisbon, Portugal, 2024, pp. 2816-2820, doi: 10.1109/BIBM62325.2024.10822116.
X. Shi, Y. Gao, J. He, J. Mi, X. Li and T. Toda, "A Study on Multimodal Fusion and Layer Adapter in Emotion Recognition," 2024 Asia Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), Macau, Macao, 2024, pp. 1-6, doi: 10.1109/APSIPAASC63619.2025.10848773.
Li, Xingfeng and Shi, Xiaohan and Si, Yuke and Zhang, Zilong and Cui, Feifei and Li, Yongwei and Liu, Yang and Unoki, Masashi and Akagi, Masato, "BEES: A New Acoustic Task for Blended Emotion Estimation in Speech," 2024 Asia Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), Macau, Macao, 2024, pp. 1-6, doi: 10.1109/APSIPAASC63619.2025.10848842.
Shi X, Li X, Toda T. Multimodal Fusion of Music Theory-Inspired and Self-Supervised Representations for Improved Emotion Recognition. InAnnual Conference of the International Speech Communication Association 2024 (pp. 2024-2350). ISCA.
He J, Shi X, Li X, Toda T. Mf-aed-aec: Speech emotion recognition by leveraging multimodal fusion, asr error detection, and asr error correction. InICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2024 Apr 14 (pp. 11066-11070). IEEE.

Academic Awards

2022 IEEE Outstanding Leadership Award, IEEE Smart World Congress

2016 BEST PAPER AWARD, IEEE OCOCOSDA