Publications
-
REBORN: Reinforcement-Learned Boundary Segmentation with Iterative Training for Unsupervised ASR
Liang-Hsuan Tseng, En-Pei Hu, Cheng-Han Chiang, Yuan Tseng, Hung-yi Lee, Lin-shan Lee, Shao-Hua SunNeurIPS 2024
-
Meta-DiffuB: A Contextualized Sequence-to-Sequence Text Diffusion Model with Meta-Exploration
Yunyen Chuang, Hung-Min Hsu, Kevin Lin, Chen-Sheng Gu, Ling Zhen Li, Ray-I Chang, Hung-yi LeeNeurIPS 2024
-
StreamBench: Towards Benchmarking Continuous Improvement of Language Agents
Cheng-Kuang Wu, Zhi Rui Tam, Chieh-Yen Lin, Yun-Nung Chen, Hung-yi LeeNeurIPS 2024
-
Large Language Model as an Assignment Evaluator: Insights, Feedback, and Challenges in a 1000+ Student Course
Cheng-Han Chiang, Wei-Chih Chen, Chun-Yi Kuan, Chienchou Yang, Hung-yi LeeEMNLP 2024
-
Task Arithmetic can Mitigate Synthetic-to-Real Gap in Automatic Speech Recognition
Hsuan Su, Hua Farn, Fan-Yun Sun, Shang-Tse Chen, Hung-yi LeeEMNLP 2024
-
Continual Test-time Adaptation for End-to-end Speech Recognition on Noisy Speech
Guan-Ting Lin, Wei Ping Huang, Hung-yi LeeEMNLP 2024
-
DogeRM: Equipping Reward Models with Domain Knowledge through Model Merging
Tzu-Han Lin, Chen-An Li, Hung-yi Lee, Yun-Nung ChenEMNLP 2024
-
I Need Help! Evaluating LLM's Ability to Ask for Users' Support: A Case Study on Text-to-SQL Generation
Cheng-Kuang Wu, Zhi Rui Tam, Chao-Chung Wu, Chieh-Yen Lin, Hung-yi Lee, Yun-Nung ChenEMNLP 2024
-
Let Me Speak Freely? A Study On The Impact Of Format Restrictions On Large Language Model Performance
Zhi Rui Tam, Cheng-Kuang Wu, Yi-Lin Tsai, Chieh-Yen Lin, Hung-yi Lee, Yun-Nung ChenEMNLP 2024
-
Can LLMs Understand the Implication of Emphasized Sentences in Dialogue?
Guan-Ting Lin, Hung-yi LeeEMNLP findings 2024
-
Unveiling Narrative Reasoning Limits of Large Language Models with Trope in Movie Synopses
Hung-Ting Su, Ya-Ching Hsu, Xudong Lin, Xiang-Qian Shi, Yulei Niu, Han-Yuan Hsu, Hung-yi Lee, Winston H. HsuEMNLP findings 2024
-
Speech-Copilot: Leveraging Large Language Models for Speech Processing via Task Decomposition, Modularization, and Program Generation
Chun-Yi Kuan, Chih-Kai Yang, Wei-Ping Huang, Ke-Han Lu, Hung-yi LeeSLT 2024
-
Efficient Training of Self-Supervised Speech Foundation Models on a Compute Budget
Andy T. Liu, Yi-Cheng Lin, Haibin Wu, Stefan Winkler, Hung-yi LeeSLT 2024
-
Do Prompts Really Prompt? Exploring the Prompt Understanding Capability of Whisper
Chih-Kai Yang, Kuan-Po Huang, Hung-yi LeeSLT 2024
-
Codec-SUPERB @ SLT 2024: A lightweight benchmark for neural codec models
Haibin Wu, Xuanjun Chen, Yi-Cheng Lin, Jiawei Du, Kai-Wei Chang, Ke-Han Lu, Alexander Liu, Ho Lam Chung, Yuan-Kuei Wu, Dongchao Yang, Songxiang Liu, Yi-Chiao Wu, Xu Tan, James Glass, Shinji Watanabe, Hung-yi LeeSLT 2024
-
Leave No Knowledge Behind during Knowledge Distillation: Towards Practical and Effective Knowledge Distillation for Code-Switching ASR Using Realistic Data
Liang-Hsuan Tseng, Zih-Ching Chen, Weishun Chang, Cheng-Kuang Lee, Tsung-Ren Huang, Hung-yi LeeSLT 2024
-
Detecting the Undetectable: Assessing the Efficacy of Current Spoof Detection Methods Against Seamless Speech Edits
Sung-Feng Huang, Heng-Cheng Kuo, Zhehuai Chen, Xuesong Yang, Chao-Han Huck Yang, Yu Tsao, Yu-Chiang Frank Wang, Hung-yi Lee, Szu-Wei FuSLT 2024
-
Property Neurons in Self-Supervised Speech Transformers
Tzu-Quan Lin, Guan-Ting Lin, Hung-yi Lee, Hao TangSLT 2024
-
Embracing Ambiguity And Subjectivity Using The All-inclusive Aggregation Rule For Evaluating Multi-label Speech Emotion Recognition Systems
Huang-Cheng Chou, Haibin Wu, Lucas Goncalves, Seong-Gyun Leem, Ali Salman, Carlos Busso, Hung-yi Lee, Chi-Chun LeeSLT 2024
-
Stimulus Modality Matters: Impact of Perceptual Evaluations Elicited by Different Modalities on Performances of Speech Emotion Recognition Systems
Huang-Cheng Chou, Haibin Wu, Hung-yi Lee, Chi-Chun LeeSLT 2024
-
Open-Emotion: A Reproducible Emo-Superb for Speech Emotion Recognition Systems
Haibin Wu, Huang-Cheng Chou, Kai-Wei Chang, Lucas Goncalves, Jiawei Du, Jyh-Shing Roger Jang, Chi-Chun Lee, Hung-yi LeeSLT 2024
-
A Preliminary Study: Large Language Model-Based Data Automation for Multi-Label Speech Emotion Recognition with Human Subjective Typed Descriptions
Haibin Wu, Huang-Cheng Chou, Kai-Wei Chang, Lucas Goncalves, Jiawei Du, Jyh-Shing Roger Jang, Chi-Chun Lee, Hung-yi LeeSLT 2024
-
Fusion of Discrete Representations and Self-Augmented Representations for Multilingual Automatic Speech Recognition
Shih-Heng Wang, Jiatong Shi, Chien-yu Huang, Shinji Watanabe, Hung-yi LeeSLT 2024
-
EMO-Codec: An In-Depth Look at Emotion Preservation Capability of Legacy and Neural Codec Models With Subjective and Objective Evaluations
Wenze Ren, Yi-Cheng Lin, Haibin Wu, Huang-Cheng Chou, Chi-Chun Lee, Yu Tsao, Hung-yi LeeSLT 2024
-
Listen and Speak Fairly: A Study on Semantic Gender Bias in Speech Integrated Large Language Models
Yi-Cheng Lin, Tzu-Quan Lin, Chih-Kai Yang, Ke-Han Lu, Wei-Chih Chen, Chun-Yi Kuan, Hung-yi LeeSLT 2024
-
Spoken Stereoset: On Evaluating Social Bias Toward Speaker in Speech Large Language Models
Yi-Cheng Lin, Wei-Chih Chen, Hung-yi LeeSLT 2024
-
DFADD: The Diffusion and Flow-Matching Based Audio Deepfake Dataset
Jiawei Du, I-Ming Lin, I-Hsiang Chiu, Xuanjun Chen, Haibin Wu, Wenze Ren, Yu Tsao, Hung-yi Lee, Roger JangSLT 2024
-
Ensemble Knowledge Distillation from Speech SSL Models Considering Inter-teacher Differences
Pei Jun Liao, Hung-yi Lee, Hsin-Min WangISCSLP 2024
-
LLM Discussion: Enhancing the Creativity of Large Language Models via Discussion Framework and Role-Play
Li-Chun Lu, Shou-Jen Chen, Tsung-Min Pai, Chan-Hung Yu, Hung-yi Lee, Shao-Hua SunCOLM, 2024
-
DeSTA: Enhancing Speech Language Models through Descriptive Speech-Text Alignment
Ke-Han Lu, Zhehuai Chen, Szu-Wei Fu, He Huang, Boris Ginsburg, Yu-Chiang Frank Wang, Hung-yi LeeINTERSPEECH 2024
-
ML-SUPERB 2.0: Benchmarking Multilingual Speech Models Across Modeling Constraints
Jiatong Shi, Shih-Heng Wang, William Chen, Martijn Bartelds, Vanya Bannihatti Kumar, Jinchuan Tian, Xuankai Chang, Dan Jurafsky, Karen Livescu, Hung-yi Lee, Shinji WatanabeINTERSPEECH 2024
-
CodecFake: Enhancing Anti-Spoofing Models Against Deepfake Audios from Codec-Based Speech Synthesis Systems
Haibin Wu, Yuan Tseng, Hung-yi LeeINTERSPEECH 2024
-
Exploring In-Context Learning of Textless Speech Language Model for Speech Classification Tasks
Ming-Hao Hsu, Kai-Wei Chang, Shang-Wen Li, Hung-yi LeeINTERSPEECH 2024
-
GSQA: An End-to-End Model for Generative Spoken Question Answering
Min-Han Shih, Ho-Lam Chung, Yu-Chi Pai, Ming-Hao Hsu, Guan-Ting Lin, Shang-Wen Li, Hung-yi LeeINTERSPEECH 2024
-
Dataset-Distillation Generative Model for Speech Emotion Recognition
Fabian Ritter-Gutierrez, Kuan-Po Huang, Jeremy H.M Wong, Dianwen Ng, Hung-yi Lee, Nancy F. Chen, Eng Siong ChngINTERSPEECH 2024
-
Neural Codec-based Adversarial Sample Detection for Speaker Verification
Xuanjun Chen, Jiawei Du, Haibin Wu, Jyh-Shing Roger Jang, Hung-yi LeeINTERSPEECH 2024
-
Singing Voice Graph Modeling for SingFake Detection
Xuanjun Chen, Haibin Wu, Jyh-Shing Roger Jang, Hung-yi LeeINTERSPEECH 2024
-
Emo-bias: A Large Scale Evaluation of Social Bias on Speech Emotion Recognition
Yi-Cheng Lin, Haibin Wu, Huang-Cheng Chou, Chi-Chun Lee, Hung-yi LeeINTERSPEECH 2024
-
Understanding Sounds, Missing the Questions: The Challenge of Object Hallucination in Large Audio-Language Models
Chun-Yi Kuan, Wei-Ping Huang, Hung-yi LeeINTERSPEECH 2024
-
DAISY: Data Adaptive Self-Supervised Early Exit for Speech Representation Models
Tzu-Quan Lin, Hung-yi Lee, Hao TangINTERSPEECH 2024
-
On the Social Bias of Speech Self-Supervised Models
Yi-Cheng Lin, Tzu-Quan Lin, Hsi-Che Lin, Andy T. Liu, Hung-yi LeeINTERSPEECH 2024
-
Parameter-Efficient Fine-Tuning of Speaker-Aware Dynamic Prompts for Speaker Verification
Zhe Li, Man-wai Mak, Hung-yi Lee, Helen MengINTERSPEECH 2024
-
Chat Vector: A Simple Approach to Equip LLMs with Instruction Following and Model Alignment in New Languages
Shih-Cheng Huang, Pin-Zu Li, Yu-Chi Hsu, Kuang-Ming Chen, Yu Tung Lin, Shih-Kai Hsiao, Richard Tzong-Han Tsai, Hung-yi LeeACL 2024
-
Advancing Large Language Models to Capture Varied Speaking Styles and Respond Properly in Spoken Conversations
Guan-Ting Lin, Cheng-Han Chiang, Hung-yi LeeACL 2024
-
Merging Facts, Crafting Fallacies: Evaluating the Contradictory Nature of Aggregated Factual Claims in Long-Form Generations
Cheng-Han Chiang, Hung-yi LeeACL findings 2024
-
Codec-SUPERB: An In-Depth Analysis of Sound Codec Models
Haibin Wu, Ho-Lam Chung, Yi-Cheng Lin, Yuan-Kuei Wu, Xuanjun Chen, Yu-Chi Pai, Hsiu-Hsuan Wang, Kai-Wei Chang, Alexander H. Liu, Hung-yi LeeACL findings 2024
-
On the Evaluation of Speech Foundation Models for Spoken Language Understanding
Siddhant Arora, Ankita Pasad, Chung-Ming Chien, Jionghao Han, Roshan Sharma, Jee-weon Jung, Hira Dhamyal, William Chen, Suwon Shon, Hung-yi Lee, Karen Livescu, Shinji WatanabeACL findings 2024
-
Dynamic-SUPERB: Towards a Dynamic, Collaborative, and Comprehensive Instruction-Tuning Benchmark for Speech
Chien-yu Huang, Ke-Han Lu, Shih-Heng Wang, Chun-Yi Kuan, Chi-Yuan Hsiao, Haibin Wu, Siddhant Arora, Kai-Wei Chang, Jiatong Shi, Yifan Peng, Roshan Sharma, Shinji Watanabe, Bhiksha Ramakrishnan, Shady Shehata, Hung-yi LeeICASSP 2024
-
Zero Resource Code-Switched Speech Benchmark Using Speech Utterance Pairs for Multiple Spoken Languages
Kuan-Po Huang, Chih-Kai Yang, Yu-Kuan Fu, Ewan Dunbar, Hung-yi LeeICASSP 2024
-
Towards ASR Robust Spoken Language Understanding Through In-Context Learning With Word Confusion Networks
Kevin Everson, Yile Gu, Huck Yang, Prashanth Gurunath Shivakumar, Guan-Ting Lin, Jari Kolehmainen, Ivan Bulyko, Ankur Gandhe, Shalini Ghosh, Wael Hamza, Hung-yi Lee, Ariya Rastrow, Andreas StolckeICASSP 2024
-
Paralinguistics-Enhanced Large Language Modeling of Spoken Dialogue
Guan-Ting Lin, Prashanth Gurunath Shivakumar, Ankur Gandhe, Chao-Han Huck Yang, Yile Gu, Shalini Ghosh, Andreas Stolcke, Hung-yi Lee, Ivan BulykoICASSP 2024
-
Scalable Ensemble-Based Detection Method Against Adversarial Attacks for Speaker Verification
Haibin Wu, Heng-Cheng Kuo, Yu Tsao, Hung-yi LeeICASSP 2024
-
AV-SUPERB: A Multi-Task Evaluation Benchmark for Audio-Visual Representation Models
Yuan Tseng, Layne Berry, Yi-Ting Chen, I-Hsiang Chiu, Hsuan-Hao Lin, Max Liu, Puyuan Peng, Yi-Jen Shih, Hung-Yu Wang, Haibin Wu, Po-Yao Huang, Chun-Mao Lai, Shang-Wen Li, David Harwath, Yu Tsao, Shinji Watanabe, Abdelrahman Mohamed, Chi Luen Feng, Hung-yi LeeICASSP 2024
-
Multimodal Transformer Distillation for Audio-Visual Synchronization
Xuanjun Chen, Haibin Wu, Chung-Che Wang, Hung-yi Lee, Jyh-Shing Roger JangICASSP 2024
-
SpeechDPR: End-To-End Spoken Passage Retrieval for Open-Domain Spoken Question Answering
Chyi-Jiunn Lin, Guan-Ting Lin, Yung-Sung Chuang, Wei-Lun Wu, Shang-Wen Li, Abdelrahman Mohamed, Hung-yi Lee, Lin-shan LeeICASSP 2024
-
MiniSUPERB: Lightweight Benchmark for Self-Supervised Speech Models
Yu-Hsiang Wang, Huang-Yu Chen, Kai-Wei Chang, Winston Hsu, Hung-yi LeeASRU 2023
-
Zero-shot singing voice synthesis from musical score
Jun-You Wang, Hung-yi Lee, Roger Jang, Li SuASRU 2023
-
Findings of the 2023 ML-SUPERB Challenge: Pre-Training and Evaluation over More Languages and Beyond
Jiatong Shi, William Chen, Dan Berrebbi, Hsiu-Hsuan Wang, Wei-Ping Huang, En-Pei Hu, Ho-Lam Chung, Xuankai Chang, Yuxun Tang, Shang-Wen Li, Abdelrahman Mohamed, Hung-yi Lee, Shinji WatanabeASRU 2023
-
MelHuBERT: A simplified HuBERT on Mel spectrograms
Tzu-Quan Lin, Hung-yi Lee, Hao TangASRU 2023
-
Towards General-Purpose Text-Instruction-Guided Voice Conversion
Chun-Yi Kuan, Chen An Li, Tsu-Yuan Hsu, Tse-Yang Lin, Ho-Lam Chung, Kai-Wei Chang, Shuo-Yiin Chang, Hung-Yi LeeASRU 2023
-
Prompting and Adapter Tuning for Self-supervised Encoder-Decoder Speech Model
Kai-Wei Chang, Ming-Hsin Chen, Yun-Ping Lin, Jing Neng Hsu, Paul Kuo-Ming Huang, Chien-yu Huang, Shang-Wen Li, Hung-yi LeeASRU 2023
-
Maximizing Data Efficiency for Cross-Lingual TTS Adaptation by Self-Supervised Representation Mixing and Embedding Initialization
Wei-Ping Huang, Sung-Feng Huang, Hung-yi LeeASRU 2023
-
Cascading and Direct Approaches to Unsupervised Constituency Parsing on Spoken Sentences
Yuan Tseng, Cheng-I Lai, Hung-yi LeeICASSP, 2023
-
Bridging Speech and Text Pre-trained Models with Unsupervised ASR
Jiatong Shi, Chan-Jan Hsu, Ho Lam Chung, Dongji Gao, Paola Garcia, Shinji Watanabe, Ann Lee, Hung-yi LeeICASSP, 2023
-
T5lephone: Bridging Speech and Text Self-supervised Models for Spoken Language Understanding via Phoneme level T5
Chan-Jan Hsu, Ho Lam Chung, Hung-yi Lee, Yu TsaoICASSP, 2023
-
Once-for-All Sequence Compression for Self-Supervised Speech Models
Hsuan-Jui Chen, Yen Meng, Hung-yi LeeICASSP, 2023
-
M-SpeechCLIP: Leveraging Large-Scale, Pre-Trained Models for Multilingual Speech to Image Retrieval
Layne Berry, Yi-Jen Shih, Hsuan-Fu Wang, Heng-Jui Chang, Hung-yi Lee, David HarwathICASSP, 2023
-
EURO: ESPnet Unsupervised ASR Open-source Toolkit
Dongji Gao, Jiatong Shi, Shun-Po Chuang, Paola Garcia, Hung-yi Lee, Shinji Watanabe, Sanjeev KhudanpurICASSP, 2023
-
Personalized Lightweight Text-to-Speech: Voice Cloning with Adaptive Structured Pruning
Sung-Feng Huang, Chia-ping Chen, Zhi-Sheng Chen, Yu-Pao Tsai, Hung-yi LeeICASSP, 2023
-
Ensemble knowledge distillation of self-supervised speech models
Kuan-Po Huang, Tzu-hsun Feng, Yu-Kuan Fu, Tsu-Yuan Hsu, Po-Chieh Yen, Wei-Cheng Tseng, Kai-Wei Chang, Hung-yi LeeICASSP, 2023
-
On the Utility of Self-supervised Models for Prosody-related Tasks
Guan-Ting Lin, Chi-Luen Feng, Wei-Ping Huang, Yuan Tseng, Tzu-Han Lin, Chen-An Li, Hung-yi Lee, Nigel G. WardSLT, 2022
-
SUPERB @ SLT 2022: Challenge on Generalization and Efficiency of Self-Supervised Speech Representation Learning
Tzu-hsun Feng, Annie Dong, Ching-Feng Yeh, Shu-wen Yang, Tzu-Quan Lin, Jiatong Shi, Kai-Wei Chang, Zili Huang, Haibin Wu, Xuankai Chang, Shinji Watanabe, Abdelrahman Mohamed, Shang-Wen Li, Hung-yi LeeSLT, 2022
-
On Compressing Sequences for Self-Supervised Speech Models
Yen Meng, Hsuan-Jui Chen, Jiatong Shi, Shinji Watanabe, Paola Garcia, Hung-yi Lee, Hao TangSLT, 2022
-
On the Efficiency of Integrating Self-supervised Learning and Meta-learning for User-defined Few-shot Keyword Spotting
Wei-Tsung Kao, Yuan-Kuei Wu, Chia-Ping Chen, Zhi-Sheng Chen, Yu-Pao Tsai, Hung-Yi LeeSLT, 2022
-
SpeechCLIP: Integrating Speech with Pre-trained Vision and Language Model
Yi-Jen Shih, Hsuan-Fu Wang, Heng-Jui Chang, Layne Berry, Hung-yi Lee, David HarwathSLT, 2022
-
Exploring Efficient-tuning Methods in Self-supervised Speech Models
Zih-Ching Chen, Chin-Lun Fu, Chih-Ying Liu, Shang-Wen Li, Hung-yi LeeSLT, 2022
-
Improving generalizability of distilled self-supervised speech processing models under distorted settings
Kuan-Po Huang, Yu-Kuan Fu, Tsu-Yuan Hsu, Fabian Ritter Gutierrez, Fan-Lin Wang, Liang-Hsuan Tseng, Yu Zhang, Hung-yi LeeSLT, 2022
-
Push-Pull: Characterizing the Adversarial Robustness for Audio-Visual Active Speaker Detection
Xuanjun Chen, Haibin Wu, Helen Meng, Hung-yi Lee, Jyh-Shing Roger JangSLT, 2022
-
Few-Shot Cross-Lingual TTS Using Transferable Phoneme Embedding
Wei-Ping Huang, Po-Chun Chen, Sung-Feng Huang, Hung-yi LeeINTERSPEECH, 2022
-
SUPERB-SG: Enhanced Speech processing Universal PERformance Benchmark for Semantic and Generative Capabilities
Hsiang-Sheng Tsai, Heng-Jui Chang, Wen-Chin Huang, Zili Huang, Kushal Lakhotia, Shu-wen Yang, Shuyan Dong, Andy T. Liu, Cheng-I Jeff Lai, Jiatong Shi, Xuankai Chang, Phil Hall, Hsuan-Jui Chen, Shang-Wen Li, Shinji Watanabe, Abdelrahman Mohamed, Hung-yi LeeACL, 2022 
-
SUPERB: Speech processing Universal PERformance Benchmark
Shu-wen Yang, Po-Han Chi, Yung-Sung Chuang, Cheng-I Lai, Kushal Lakhotia, Yist Y. Lin, Andy T. Liu, Jiatong Shi, Xuankai Chang, Guan-Ting Lin, Tzu-Hsien Huang, Wei-Cheng Tseng, Ko-tik Lee, Da-Rong Liu, Zili Huang, Shuyan Dong, Shang-Wen Li, Shinji Watanabe, Abdelrahman Mohamed, Hung-yi LeeINTERSPEECH, 2021 
-
Voting for the right answer: Adversarial defense for speaker verification
Haibin Wu, Yang Zhang, Zhiyong Wu, Dong Wang and Hung-yi LeeINTERSPEECH, 2021
-
Auto-KWS 2021 Challenge: Task, Datasets, and Baselines
Jingsong Wang, Yuxuan He, Chunyu Zhao, Qijie Shao, Wei-Wei Tu, Tom Ko, Hung-yi Lee, Lei XieINTERSPEECH, 2021
-
Investigating the Reordering Capability in CTC-based Non-Autoregressive End-to-End Speech Translation
Shun-Po Chuang, Yung-Sung Chuang, Chih-Chiang Chang, Hung-yi LeeACL Findings, 2021
-
Hierarchical Prosody Modeling For Non-Autoregressive Speech Synthesis
Chung-Ming Chien, Hung-yi LeeSLT, 2021
-
Audio Albert: A Lite Bert For Self-Supervised Learning Of Audio Representation
Po-Han Chi, Pei-Hung Chung, Tsung-Han Wu, Chun-Cheng Hsieh, Yen-Hao Chen, Shang-Wen Li, Hung-yi LeeSLT, 2021
-
How Far Are We From Robust Voice Conversion: A Survey
Tzu-hsien Huang, Jheng-hao Lin, Hung-yi LeeSLT, 2021
-
Defending Your Voice: Adversarial Attack On Voice Conversion
Chien-yu Huang, Yist Y. Lin, Hung-yi Lee, Lin-shan LeeSLT, 2021
-
End-To-End Whispered Speech Recognition With Frequency-Weighted Approaches And Pseudo Whisper Pre-Training
Heng-Jui Chang, Alexander H. Liu, Hung-yi Lee, Lin-shan LeeSLT, 2021
-
TaylorGAN: Neighbor-Augmented Policy Update for Sample-Efficient Natural Language Generation
Chun-Hsing Lin, Siang-Ruei Wu, Hung-Yi Lee, Yun-Nung ChenNeurIPS, 2020
-
Understanding Self-Attention of Self-Supervised Audio Transformers
Shu-wen Yang, Andy T. Liu, Hung-yi LeeINTERSPEECH, 2020
-
Defense for Black-box Attacks on Anti-spoofing Models by Self-Supervised Learning
Haibin Wu, Andy T. Liu, Hung-yi LeeINTERSPEECH, 2020
-
WG-WaveNet: Real-Time High-Fidelity Speech Synthesis without GPU
Po-chun Hsu, Hung-yi LeeINTERSPEECH, 2020
-
DARTS-ASR: Differentiable Architecture Search for Multilingual Speech Recognition and Adaptation
Yi-Chen Chen, Jui-Yang Hsu, Cheng-Kuang Lee, Hung-yi LeeINTERSPEECH, 2020
-
VQVC+: One-Shot Voice Conversion by Vector Quantization and U-Net architecture
Da-Yi Wu, Yen-Hao Chen, Hung-Yi LeeINTERSPEECH, 2020
-
Semi-supervised Learning for Multi-speaker Text-to-speech Synthesis Using Discrete Speech Representation
Tao Tu, Yuan-Jui Chen, Alexander H. Liu, Hung-yi LeeINTERSPEECH, 2020
-
SpeechBERT: An Audio-and-text Jointly Learned Language Model for End-to-end Spoken Question Answering
Yung-Sung Chuang, Chi-Liang Liu, Hung-Yi Lee, Lin-shan LeeINTERSPEECH, 2020
-
Worse WER, but Better BLEU? Leveraging Word Embedding as Intermediate in Multitask End-to-End Speech Translation
Shun-Po Chuang, Tzu-Wei Sung, Alexander H Liu, Hung-yi LeeACL, 2020
-
Mockingjay: Unsupervised Speech Representation Learning with Deep Bidirectional Transformer Encoders
Andy T. Liu, Shu-wen Yang, Po-Han Chi, Po-chun Hsu, Hung-yi LeeICASSP, 2020
-
What Does a Network Layer Hear? Analyzing Hidden Representations of End-to-End ASR through Speech Synthesis
Chung-Yi Li, Pei-Chieh Yuan, Hung-Yi LeeICASSP, 2020
-
Interrupted and Cascaded Permutation Invariant Training for Speech Separation
Gene-Ping Yang, Szu-Lin Wu, Yao-Wen Mao, Hung-yi Lee, Lin-shan LeeICASSP, 2020
-
Sequence-to-Sequence Automatic Speech Recognition with Word Embedding Regularization and Fused Decoding
Alexander H. Liu, Tzu-Wei Sung, Shun-Po Chuang, Hung-yi Lee, Lin-shan LeeICASSP, 2020
-
Training a Code-Switching Language Model with Monolingual Data
Shun-Po Chuang, Tzu-Wei Sung, Hung-Yi LeeICASSP, 2020
-
Towards Unsupervised Speech Recognition and Synthesis with Quantized Speech Representation Learning
Alexander H. Liu, Tao Tu, Hung-yi Lee, Lin-shan LeeICASSP, 2020
-
One-Shot Voice Conversion by Vector Quantization
Da-Yi Wu, Hung-yi LeeICASSP, 2020
-
Defense against adversarial attacks on spoofing countermeasures of ASV
Haibin Wu, Songxiang Liu, Helen Meng, Hung-yi LeeICASSP, 2020
-
Meta Learning for End-to-End Low-Resource Speech Recognition
Jui-Yang Hsu, Yuan-Jui Chen, Hung-yi LeeICASSP, 2020
-
Self-Supervised Deep Learning for Fisheye Image Rectification
Chun-Hao Chao, Pin-Lun Hsu, Hung-Yi Lee, Yu-Chiang Frank WangICASSP, 2020
-
LAMOL: LAnguage MOdeling for Lifelong Language Learning
Fan-Keng Sun, Cheng-Hao Ho, Hung-Yi LeeICLR, 2020
-
Order-free Learning Alleviating Exposure Bias in Multi-label Classification
Che-Ping Tsai, Hung-Yi LeeAAAI, 2020
-
Adversarial attacks on spoofing countermeasures of automatic speaker verification
Songxiang Liu, Haibin Wu, Hung-yi Lee, Helen MengASRU, 2019
-
Zero-shot Reading Comprehension by Cross-lingual Transfer Learning with Multi-lingual Language Representation Model
Tsung-Yuan Hsu, Chi-Liang Liu and Hung-yi LeeEMNLP, 2019
-
Polly Want a Cracker: Analyzing Performance of Parroting on Paraphrase Generation Datasets
Hong-Ren Mao and Hung-Yi LeeEMNLP, 2019
-
Tree Transformer: Integrating Tree Structures into Self-Attention
Yaushian Wang, Hung-Yi Lee and Yun-Nung ChenEMNLP, 2019
-
DyKgChat: Benchmarking Dialogue Generation Grounding on Dynamic Knowledge Graphs
Yi-Lin Tuan, Yun-Nung Chen and Hung-yi LeeEMNLP, 2019
-
One-shot Voice Conversion by Separating Speaker and Content Representations with Instance Normalization
Ju-chieh Chou, Cheng-chieh Yeh, Hung-yi LeeINTERSPEECH, 2019
-
Unsupervised End-to-End Learning of Discrete Linguistic Units for Voice Conversion
Andy T. Liu, Po-chun Hsu and Hung-yi LeeINTERSPEECH, 2019
-
Personalized Dialogue Response Generation Learned from Monologues
Feng-Guang Su, Aliyah Hsu, Yi-Lin Tuan and Hung-yi LeeINTERSPEECH, 2019
-
End-to-end Text-to-speech for Low-resource Languages by Cross-Lingual Transfer Learning
Yuan-Jui Chen, Tao Tu, Cheng-chieh Yeh, Hung-yi LeeINTERSPEECH, 2019
-
Code-switching Sentence Generation by Generative Adversarial Networks and its Application to Data Augmentation
Ching-Ting Chang, Shun-Po Chuang, Hung-Yi LeeINTERSPEECH, 2019
-
Completely Unsupervised Phoneme Recognition By A Generative Adversarial Network Harmonized With Iteratively Refined Hidden Markov Models
Kuan-yu Chen, Che-ping Tsai, Da-Rong Liu, Hung-yi Lee and Lin-shan LeeINTERSPEECH, 2019
-
Noise Adaptive Speech Enhancement using Domain Adversarial Training
Chien-Feng Liao, Yu Tsao, Hung-yi Lee and Hsin-Min WangINTERSPEECH, 2019
-
Improved Speech Separation with Time-and-Frequency Cross-domain Joint Embedding and Clustering
Gene-Ping Yang, ChaoI Tuan, Hung-yi Lee and Lin-shan LeeINTERSPEECH, 2019
-
Generative Adversarial Networks for Unpaired Voice Transformation on Impaired Speech
Li-Wei Chen, Hung-Yi Lee, Yu TsaoINTERSPEECH, 2019
-
Adversarial Learning of Label Dependency: A Novel Framework for Multi-class Classification
Che-Ping Tsai, Hung-Yi LeeICASSP, 2019
-
Towards Audio to Scene Image Synthesis using Generative Adversarial Network
Chia-Hung Wan, Shun-Po Chuang, Hung-Yi LeeICASSP, 2019
-
Mitigating the Impact of Speech Recognition Errors on Spoken Question Answering by Adversarial Domain Adaptation
Chia-Hsuan Lee, Yun-Nung Chen, Hung-Yi LeeICASSP, 2019
-
Towards End-to-end Speech-to-text Translation with Two-pass Decoding
Tzu-Wei Sung, Jun-You Liu, Hung-yi Lee, Lin-shan LeeICASSP, 2019
-
Adversarial Training of End-to-end Speech Recognition Using a Criticizing Language Model
Alexander H. Liu, Hung-yi Lee, Lin-shan LeeICASSP, 2019
-
Using Deep-Q Network to Select Candidates from N-best Speech Recognition Hypotheses for Enhancing Dialogue State Tracking
Richard Tzong-Han Tsai, Chia-Hao Chen, Chun-Kai Wu, Yu-Cheng Hsiao, Hung-Yi LeeICASSP, 2019
-
Learning to Encode Text as Human-Readable Summaries using Generative Adversarial Networks
Yau-Shian Wang, Hung-Yi LeeEMNLP, 2018
-
Improving Unsupervised Style Transfer in End-to-End Speech Synthesis with End-to-End Speech Recognition
Da-Rong Liu, Chi-Yu Yang, Szu-Lin Wu, Hung-Yi LeeSLT, 2018
-
ODSQA: Open-domain Spoken Question Answering Dataset
Chia-Hsuan Lee, Shang-Ming Wang, Huan-Cheng Chang, Hung-Yi LeeSLT, 2018
-
Rhythm-Flexible Voice Conversion without Parallel Data Using Cycle-GAN over Phoneme Posteriorgram Sequences
Cheng-chieh Yeh, Po-chun Hsu, Ju-chieh Chou, Hung-yi Lee, Lin-shan LeeSLT, 2018
-
Phonetic-and-Semantic Embedding of Spoken Words with Applications in Spoken Content Retrieval
Yi-Chen Chen, Sung-Feng Huang, Chia-Hao Shen, Hung-yi Lee, Lin-shan LeeSLT, 2018
-
Spoken SQuAD: A Study of Mitigating the Impact of Speech Recognition Errors on Listening Comprehension
Chia-Hsuan Li, Szu-Lin Wu, Chi-Liang Liu, Hung-yi LeeINTERSPEECH, 2018
-
Joint Learning of Interactive Spoken Content Retrieval and Trainable User Simulator
Pei-Hung Chung, Kuan Tung, Ching-Lun Tai, Hung-Yi LeeINTERSPEECH, 2018
-
Multi-target Voice Conversion without Parallel Data by Adversarially Learning Disentangled Audio Representations
Ju-chieh Chou, Cheng-chieh Yeh, Hung-yi Lee, Lin-shan LeeINTERSPEECH, 2018
-
Completely Unsupervised Phoneme Recognition by Adversarially Learning Mapping Relationships from Audio Embeddings
Da-Rong Liu, Kuan-Yu Chen, Hung-Yi Lee, Lin-shan LeeINTERSPEECH, 2018
-
Language Transfer of Audio Word2Vec: Learning Audio Segment Representations without Target Language Data
Chia-Hao Shen, Janet Y. Sung, Hung-Yi LeeICASSP, 2018
-
Query-by-example Spoken Term Detection using Attention-based Multi-hop Networks
Chia-Wei Ao, Hung-yi LeeICASSP, 2018
-
Domain Independent Key Term Extraction from Spoken Content based on Context and Term Location Information
Hsien-Chin Lin, Chi-Yu Yang, Hung-Yi Lee, Lin-Shan LeeICASSP, 2018
-
Scalable Sentiment for Sequence-to-sequence Chatbot Response with Performance Analysis
Chih-Wei Lee, Yau-Shian Wang, Tsung-Yuan Hsu, Kuan-Yu Chen, Hung-Yi Lee, Lin-Shan LeeICASSP, 2018
-
Segmental Audio Word2vec: Representing Utterances as Sequences of Vectors with Applications in Spoken Term Detection
Yu-Hsuan Wang, Hung-Yi Lee, Lin-Shan LeeICASSP, 2018
-
Supervised and Unsupervised Transfer Learning for Question Answering
Yu-An Chung, Hung-Yi Lee, James GlassNAACL, 2018
-
Query-based Attention CNN for Text Similarity Map
Tzu-Chien Liu, Yu-Hsueh Wu, Hung-Yi LeeICCV, 2018
-
Mitigating the Impact of Speech Recognition Errors on Chatbot using Sequence-to-sequence Model
Pin-Jung Chen, I-Hung Hsu, Yi Yao Huang, Hung-Yi LeeASRU, 2017
-
Seeing and Hearing Too: Audio Representation for Video Captioning
Shun Po Chuang, Chia-Hung Wan, Pang-Chi Huang, Chi-Yu Yang, Hung-Yi LeeASRU, 2017
-
Personalized Word Representations Carrying Personalized Semantics Learned from Social Network Posts
Zih-Wei Lin, Tzu-Wei Sung, Hung-Yi Lee, Lin-Shan LeeASRU, 2017
-
Learning Chinese Word Representations From Glyphs Of Characters
Tzu-Ray Su, Hung-Yi LeeEMNLP, 2017
-
Gate Activation Signal Analysis for Gated Recurrent Neural Networks and Its Correlation with Phoneme Boundaries
Yu-Hsuan Wang, Cheng-Tao Chung, Hung-yi LeeINTERSPEECH, 2017
-
Order-Preserving Abstractive Summarization for Spoken Content based on Connectionist Temporal Classification
Bo-Ru Lu, Frank Shyu, Yun-Nung Chen, Hung-Yi Lee, Lin-Shan LeeINTERSPEECH, 2017
-
Recurrent Neural Network based Language Modeling with Controllable External Memory
Wei-Jen Ko, Bo-Hsiang Tseng, Hung-yi LeeICASSP, 2017
-
Personalized Acoustic Modeling by Weakly Supervised Multi-task Deep Learning using Acoustic Tokens Discovered from Unlabeled Data
Cheng-Kuan Wei, Cheng-Tao Chung, Hung-yi Lee, Lin-Shan LeeICASSP, 2017
-
Abstractive Headline Generation for Spoken Content by Attentive Recurrent Neural Networks with ASR Error Modeling
Lang-Chi Yu, Hung-yi Lee, Lin-Shan LeeSLT, 2016
-
Hierarchical Attention Model for Improved Machine Comprehension of Spoken Content
Wei Fang, Juei-Yang Hsu, Hung-yi Lee, Lin-Shan LeeSLT, 2016
-
Towards Machine Comprehension of Spoken Content: Initial TOEFL Listening Comprehension Test by Machine
Bo-Hsiang Tseng, Sheng-syun Shen, Hung-Yi Lee, Lin-Shan LeeINTERSPEECH, 2016
-
Interactive Spoken Content Retrieval by Deep Reinforcement Learning
Yen-Chen Wu, Tzu-Hsiang Lin, Yang-De Chen, Hung-Yi Lee, Lin-Shan LeeINTERSPEECH, 2016
-
Audio Word2Vec: Unsupervised Learning of Audio Segment Representations Using Sequence-to-Sequence Autoencoder
Yu-An Chung, Chao-Chung Wu, Chia-Hao Shen, Hung-Yi Lee, Lin-Shan LeeINTERSPEECH, 2016
-
Neural Attention Models for Sequence Classification: Analysis and Application to Key Term Extraction and Dialogue Act Detection
Sheng-syun Shen, Hung-Yi LeeINTERSPEECH, 2016
-
Towards Structured Deep Neural Network for Automatic Speech Recognition
Yi-Hsiu Liao, Hung-yi Lee, Lin-shan LeeASRU, 2015
-
Personalizing Universal Recurrent Neural Network Language Model with User Characteristic Features by Social Network Crowdsourcing
Bo-Hsiang Tseng, Hung-yi Lee, Lin-Shan LeeASRU, 2015
-
An Iterative Deep Learning Framework for Unsupervised Discovery of Speech Features and Linguistic Units with Applications on Spoken Term Detection
Cheng-Tao Chung, Cheng-Yu Tsai, Hsiang-Hung Lu, Chia-Hsiang Liu, Hung-yi Lee, Lin-shan LeeASRU, 2015
-
Structuring Lectures in Massive Open Online Courses (MOOCs) for Efficient Learning by Linking Similar Sections and Predicting Prerequisites
Sheng-syun Shen, Hung-yi Lee, Shang-wen Li, Victor Zue and Lin-shan LeeINTERSPEECH, 2015
-
Semantic Retrieval of Personal Photos using a Deep Autoencoder Fusing Visual Features with Speech Annotations Represented as Word/Paragraph Vectors
Hung-tsung Lu, Yuan-ming Liou, Hung-yi Lee and Lin-shan LeeINTERSPEECH, 2015
-
Personalized Speech Recognizer with Keyword-based Personalized Lexicon and Language Model using Word Vector Representations
Ching-Feng Yeh, Yuan-ming Liou, Hung-yi Lee and Lin-shan LeeINTERSPEECH, 2015
-
Graph-based Re-ranking using Acoustic Feature Similarity between Search Results for Spoken Term Detection on Low-resource Languages
Hung-yi Lee, Yu Zhang, Ekapol Chuangsuwanich, James GlassINTERSPEECH, 2014
-
Alignment of Spoken Utterances with Slide Content for Easier Learning with Recorded Lectures using Structured Support Vector Machine (SVM)
Han Lu, Sheng-syun Shen, Sz-Rung Shiang, Hung-yi Lee and Lin-shan LeeINTERSPEECH, 2014
-
Spoken Question Answering Using Tree-structured Conditional Random Fields and Two-layer Random Walk
Sz-Rung Shiang, Hung-yi Lee and Lin-shan LeeINTERSPEECH, 2014
-
Semantic Retrieval of Personal Photos using Matrix Factorization and Two-layer Random Walk Fusing Sparse Speech Annotations with Visual Features
Yuan-ming Liou, Yi-sheng Fu, Hung-yi Lee and Lin-shan LeeINTERSPEECH, 2014
-
Ensemble of Machine Learning and Acoustic Segment Model Techniques for Speech Emotion and Autism Spectrum Disorders Recognition
Hung-yi Lee, Ting-yao Hu, How Jing, Yun-Fan Chang, Yu Tsao, Yu-Cheng Kao, Tsang-Long PaoINTERSPEECH, 2013
-
Unsupervised Domain Adaptation for Spoken Document Summarization with Structured Support Vector Machine
Hung-yi Lee, Yu-yu Chou, Yow-Bang Wang, Lin-shan LeeICASSP, 2013
-
Enhancing Query Expansion for Semantic Retrieval of Spoken Content with Automatically Discovered Acoustic Patterns
Hung-yi Lee, Yun-Chiao Li, Cheng-Tao Chung, Lin-shan LeeICASSP, 2013
-
Towards Unsupervised Semantic Retrieval of Spoken Content with Query Expansion based on Automatically Discovered Acoustic Patterns
Yun-Chiao Li, Hung-yi Lee, Cheng-Tao Chung, Chun-an Chan, and Lin-shan LeeASRU, 2013
-
Supervised Spoken Document Summarization Based on Structured Support Vector Machine with Utterance Clusters as Hidden Variables
Sz-Rung Shiang, Hung-yi Lee, Lin-shan LeeINTERSPEECH, 2013
-
Recurrent Neural Network Based Language Model Personalization by Social Network Crowdsourcing
Tsung-Hsien Wen, Aaron Heidel, Hung-yi Lee, Yu Tsao, Lin-shan LeeINTERSPEECH, 2013
-
Speaking Rate Normalization with Lattice-based Context-dependent Phoneme Duration Modeling for Personalized Speech Recognizers on Mobile Devices
Ching-Feng Yeh, Hung-yi Lee and Lin-shan LeeINTERSPEECH, 2013
-
Interactive Spoken Content Retrieval by Extended Query Model and Continuous State Space Markov Decision Process
Tsung-Hsien Wen, Hung-yi Lee, Pei-Hao Su, Lin-shan LeeICASSP, 2013
-
Improved Semantic Retrieval of Spoken Content by Language Models Enhanced with Acoustic Similari
Hung-yi Lee, Tsung-Hsien Wen, Lin-shan LeeSLT, 2012
-
Personalized Language Modeling by Crowd Sourcing with Social Network Data for Voice Access of Cloud Applications
Tsung-Hsien Wen, Hung-yi Lee, Lin-shan LeeSLT, 2012
-
Supervised Spoken Document Summarization Jointly Considering Utterance Importance and Redundancy by Structured Support Vector Machine
Hung-yi Lee, Yu-yu Chou, Yow-Bang Wang, Lin-shan LeeINTERSPEECH, 2012
-
Open-Vocabulary Retrieval of Spoken Content with Shorter/Longer Queries Considering Word/Subword-based Acoustic Feature Similarity
Hung-yi Lee, Po-wei Chou, Lin-shan LeeINTERSPEECH, 2012
-
Utterance-level Latent Topic Transition Modeling for Spoken Documents and its Application in Automatic Summarization
Hung-yi Lee, Yun-nung Chen, Lin-shan LeeICASSP, 2012
-
Interactive Spoken Content Retrieval with Different Types of Actions Optimized by a Markov Decision Process
Tsung-Hsien Wen, Hung-yi Lee, Lin-shan LeeINTERSPEECH, 2012
-
Semantic Query Expansion and Context-based Discriminative Term Modeling for Spoken Document Retrieval
Tsung-wei Tu, Hung-yi Lee, Lin-shan LeeICASSP, 2012
-
Unsupervised Two-Stage Keyword Extraction from Spoken Documents by Topic Coherence and Support Vector Machine
Yun-Nung Chen, Yu Huang, Hung-yi Lee, Lin-shan LeeICASSP, 2012
-
Recognition of Highly Imbalanced Code-mixed Bilingual Speech with Frame-level Language Detection based on Blurred Posteriorgram
Ching-Feng Yeh, Aaron Heidel, Hung-yi Lee, Lin-shan LeeICASSP, 2012
-
Improved Speech Summarization and Spoken Term Detection with Graphical Analysis of Utterance Similarities
Hung-yi Lee, Yun-nung Chen, Lin-shan LeeAPSIPA ASC, 2011
-
Improved Spoken Term Detection Using Support Vector Machines based on Lattice Context Consistency
Hung-yi Lee, Tsung-wei Tu, Chia-ping Chen, Chao-yu Huang, Lin-shan LeeICASSP, 2011
-
Improved Spoken Term Detection using Support Vector Machines with Acoustic and Context Features from Pseudo-relevance Feedback
Tsung-wei Tu, Hung-yi Lee, Lin-shan LeeASRU, 2011
-
Improved Spoken Term Detection with Graph-based Re-ranking in Feature Space
Yun-nung Chen, Chia-ping Chen, Hung-yi Lee, Chun-an Chan, Lin-shan LeeICASSP, 2011
-
A Framework Integrating Different Relevance Feedback Scenarios and Approaches for Spoken Term Detection
Hung-yi Lee, Chia-ping Chen, Ching-feng Yeh, Lin-shan LeeSLT, 2010
-
Improved Spoken Term Detection by Discriminative Training of Acoustic Models based on User Relevance Feedback
Hung-yi Lee, Chia-ping Chen, Ching-feng Yeh, Lin-shan LeeINTERSPEECH, 2010
-
Integrating Recognition and Retrieval with User Feedback: A New Framework for Spoken Term Detection
Hung-yi Lee and Lin-shan LeeICASSP, 2010
-
Improved Spoken Term Detection by Feature Space Pseudo-Relevance Feedback
Chia-ping Chen, Hung-yi Lee, Ching-feng Yeh, Lin-shan LeeINTERSPEECH, 2010
-
An Initial Attempt to Improve Spoken Term Detection by Learning Optimal Weights for Different Indexing Features
Yu-Hui Chen, Chia-Chen Chou, Hung-yi Lee, Lin-shan LeeICASSP, 2010
-
Spoken Term Detection from Bilingual Spontaneous Speech Using Code-switched Lattice-based Structures for Words and Subword Units
Hung-yi Lee, Yueh-Lien Tang, Hao Tang, Lin-shan LeeASRU, 2009
-
Improved Lattice-based Spoken Document Retrieval by Directly Learning from the Evaluation Measures
Chao-hong Meng, Hung-yi Lee, Lin-shan LeeICASSP, 2009