Publication
-
Minisuperb: Lightweight Benchmark for Self-Supervised Speech Models
Yu-Hsiang Wang, Huang-Yu Chen, Kai-Wei Chang, Winston Hsu, Hung-yi LeeASRU 2023
-
Zero-shot singing voice synthesis from musical score
Jun-You Wang, Hung-yi Lee, Roger Jang, Li SuASRU 2023
-
Findings of the 2023 ML-SUPERB Challenge: Pre-Training and Evaluation over More Languages and Beyond
Jiatong Shi, William Chen, Dan Berrebbi, Hsiu-Hsuan Wang, Wei-Ping Huang, En-Pei Hu, Ho-Lam Chuang, Xuankai Chang, Yuxun Tang, Shang-Wen Li, Abdelrahman Mohamed, Hung-yi Lee, Shinji WatanabeASRU 2023
-
MelHuBERT: A simplified HuBERT on Mel spectrograms
Tzu-Quan Lin, Hung-yi Lee, Hao TangASRU 2023
-
Towards General-Purpose Text-Instruction-Guided Voice Conversion
Chun-Yi Kuan, Chen An Li, Tsu-Yuan Hsu, Tse-Yang Lin, Ho-Lam Chung, Kai-Wei Chang, Shuo-Yiin Chang, Hung-Yi LeeASRU 2023
-
Prompting and Adapter Tuning for Self-supervised Encoder-Decoder Speech Model
Kai-Wei Chang, Ming-Hsin Chen, Yun-Ping Lin, Jing Neng Hsu, Paul Kuo-Ming Huang, Chien-yu Huang, Shang-Wen Li, Hung-yi LeeASRU 2023
-
Maximizing Data Efficiency for Cross-Lingual TTS Adaptation by Self-Supervised Representation Mixing and Embedding Initialization
Wei-Ping Huang, Sung-Feng Huang, Hung-yi LeeASRU 2023
-
Cascading and Direct Approaches to Unsupervised Constituency Parsing on Spoken Sentences
Yuan Tseng, Cheng-I Lai, Hung-yi LeeICASSP, 2023
-
Bridging Speech and Text Pre-trained Models with Unsupervised ASR
Jiatong Shi, Chan-Jan Hsu, Ho Lam Chung, Dongji Gao, Paola Garcia, Shinji Watanabe, Ann Lee, Hung-yi LeeICASSP, 2023
-
T5lephone: Bridging Speech and Text Self-supervised Models for Spoken Language Understanding via Phoneme level T5
Chan-Jan Hsu, Ho Lam Chung, Hung-yi Lee, Yu TsaoICASSP, 2023
-
Once-for-All Sequence Compression for Self-Supervised Speech Models
Hsuan-Jui Chen, Yen Meng, Hung-yi LeeICASSP, 2023
-
M-SpeechCLIP: Leveraging Large-Scale, Pre-Trained Models for Multilingual Speech to Image Retrieval
Layne Berry, Yi-Jen Shih, Hsuan-Fu Wang, Heng-Jui Chang, Hung-yi Lee, David HarwathICASSP, 2023
-
EURO: ESPnet Unsupervised ASR Open-source Toolkit
Dongji Gao, Jiatong Shi, Shun-Po Chuang, Paola Garcia, Hung-yi Lee, Shinji Watanabe, Sanjeev KhudanpurICASSP, 2023
-
Personalized Lightweight Text-to-Speech: Voice Cloning with Adaptive Structured Pruning
Sung-Feng Huang, Chia-ping Chen, Zhi-Sheng Chen, Yu-Pao Tsai, Hung-yi LeeICASSP, 2023
-
Ensemble knowledge distillation of self-supervised speech models
Kuan-Po Huang, Tzu-hsun Feng, Yu-Kuan Fu, Tsu-Yuan Hsu, Po-Chieh Yen, Wei-Cheng Tseng, Kai-Wei Chang, Hung-yi LeeICASSP, 2023
-
On the Utility of Self-supervised Models for Prosody-related Tasks
Guan-Ting Lin, Chi-Luen Feng, Wei-Ping Huang, Yuan Tseng, Tzu-Han Lin, Chen-An Li, Hung-yi Lee, Nigel G. WardSLT, 2022
-
SUPERB @ SLT 2022: Challenge on Generalization and Efficiency of Self-Supervised Speech Representation Learning
Tzu-hsun Feng, Annie Dong, Ching-Feng Yeh, Shu-wen Yang, Tzu-Quan Lin, Jiatong Shi, Kai-Wei Chang, Zili Huang, Haibin Wu, Xuankai Chang, Shinji Watanabe, Abdelrahman Mohamed, Shang-Wen Li, Hung-yi LeeSLT, 2022
-
On Compressing Sequences for Self-Supervised Speech Models
Yen Meng, Hsuan-Jui Chen, Jiatong Shi, Shinji Watanabe, Paola Garcia, Hung-yi Lee, Hao TangSLT, 2022
-
On the Efficiency of Integrating Self-supervised Learning and Meta-learning for User-defined Few-shot Keyword Spotting
Wei-Tsung Kao, Yuan-Kuei Wu, Chia-Ping Chen, Zhi-Sheng Chen, Yu-Pao Tsai, Hung-Yi LeeSLT, 2022
-
SpeechCLIP: Integrating Speech with Pre-trained Vision and Language Model
Yi-Jen Shih, Hsuan-Fu Wang, Heng-Jui Chang, Layne Berry, Hung-yi Lee, David HarwathSLT, 2022
-
Exploring Efficient-tuning Methods in Self-supervised Speech Models
Zih-Ching Chen, Chin-Lun Fu, Chih-Ying Liu, Shang-Wen Li, Hung-yi LeeSLT, 2022
-
Improving generalizability of distilled self-supervised speech processing models under distorted settings
Kuan-Po Huang, Yu-Kuan Fu, Tsu-Yuan Hsu, Fabian Ritter Gutierrez, Fan-Lin Wang, Liang-Hsuan Tseng, Yu Zhang, Hung-yi LeeSLT, 2022
-
Push-Pull: Characterizing the Adversarial Robustness for Audio-Visual Active Speaker Detection
Xuanjun Chen, Haibin Wu, Helen Meng, Hung-yi Lee, Jyh-Shing Roger JangSLT, 2022
-
Few-Shot Cross-Lingual TTS Using Transferable Phoneme Embedding
Wei-Ping Huang, Po-Chun Chen, Sung-Feng Huang, Hung-yi LeeINTERSPEECH, 2022
-
SUPERB-SG: Enhanced Speech processing Universal PERformance Benchmark for Semantic and Generative Capabilities
Hsiang-Sheng Tsai, Heng-Jui Chang, Wen-Chin Huang, Zili Huang, Kushal Lakhotia, Shu-wen Yang, Shuyan Dong, Andy T. Liu, Cheng-I Jeff Lai, Jiatong Shi, Xuankai Chang, Phil Hall, Hsuan-Jui Chen, Shang-Wen Li, Shinji Watanabe, Abdelrahman Mohamed, Hung-yi LeeACL, 2022 
-
SUPERB: Speech processing Universal PERformance Benchmark
Shu-wen Yang, Po-Han Chi, Yung-Sung Chuang, Cheng-I Lai, Kushal Lakhotia, Yist Y. Lin, Andy T. Liu, Jiatong Shi, Xuankai Chang, Guan-Ting Lin, Tzu-Hsien Huang, Wei-Cheng Tseng, Ko-tik Lee, Da-Rong Liu, Zili Huang, Shuyan Dong, Shang-Wen Li, Shinji Watanabe, Abdelrahman Mohamed, Hung-yi LeeINTERSPEECH, 2021 
-
Voting for the right answer: Adversarial defense for speaker verification
Haibin Wu, Yang Zhang, Zhiyong Wu, Dong Wang and Hung-yi LeeINTERSPEECH, 2021
-
Auto-KWS 2021 Challenge: Task, Datasets, and Baselines
Jingsong Wang, Yuxuan He, Chunyu Zhao, Qijie Shao, Wei-Wei Tu, Tom Ko, Hung-yi Lee, lei xieINTERSPEECH, 2021
-
Investigating the Reordering Capability in CTC-based Non-Autoregressive End-to-End Speech Translation
Shun-Po Chuang, Yung-Sung Chuang, Chih-Chiang Chang, Hung-yi LeeACL Findings, 2021
-
Hierarchical Prosody Modeling For Non-Autoregressive Speech Synthesis
Chung-Ming Chien, Hung-yi LeeSLT, 2021
-
Audio Albert: A Lite Bert For Self-Supervised Learning Of Audio Representation
Po-Han Chi, Pei-Hung Chung, Tsung-Han Wu, Chun-Cheng Hsieh, Yen-Hao Chen, Shang-Wen Li, Hung-yi LeeSLT, 2021
-
How Far Are We From Robust Voice Conversion: A Survey
Tzu-hsien Huang, Jheng-hao Lin, Hung-yi LeeSLT, 2021
-
Defending Your Voice: Adversarial Attack On Voice Conversion
Chien-yu Huang, Yist Y. Lin, Hung-yi Lee, Lin-shan LeeSLT, 2021
-
End-To-End Whispered Speech Recognition With Frequency-Weighted Approaches And Pseudo Whisper Pre-Training
Heng-Jui Chang, Alexander H. Liu, Hung-yi Lee, Lin-shan LeeSLT, 2021
-
TaylorGAN: Neighbor-Augmented Policy Update for Sample-Efficient Natural Language Generation
Chun-Hsing Lin, Siang-Ruei Wu, Hung-Yi Lee, Yun-Nung ChenNeurIPS, 2020
-
Understanding Self-Attention of Self-Supervised Audio Transformers
Shu-wen Yang, Andy T. Liu, Hung-yi LeeINTERSPEECH, 2020
-
Defense for Black-box Attacks on Anti-spoofing Models by Self-Supervised Learning
Haibin Wu, Andy T. Liu, Hung-yi LeeINTERSPEECH, 2020
-
WG-WaveNet: Real-Time High-Fidelity Speech Synthesis without GPU
Po-chun Hsu, Hung-yi LeeINTERSPEECH, 2020
-
DARTS-ASR: Differentiable Architecture Search for Multilingual Speech Recognition and Adaptation
Yi-Chen Chen, Jui-Yang Hsu, Cheng-Kuang Lee, Hung-yi LeeINTERSPEECH, 2020
-
VQVC+: One-Shot Voice Conversion by Vector Quantization and U-Net architecture
Da-Yi Wu, Yen-Hao Chen, Hung-Yi LeeINTERSPEECH, 2020
-
Semi-supervised Learning for Multi-speaker Text-to-speech Synthesis Using Discrete Speech Representation
Tao Tu, Yuan-Jui Chen, Alexander H. Liu, Hung-yi LeeINTERSPEECH, 2020
-
SpeechBERT: An Audio-and-text Jointly Learned Language Model for End-to-end Spoken Question Answering
Yung-Sung Chuang, Chi-Liang Liu, Hung-Yi Lee, Lin-shan LeeINTERSPEECH, 2020
-
Worse WER, but Better BLEU? Leveraging Word Embedding as Intermediate in Multitask End-to-End Speech Translation
Shun-Po Chuang, Tzu-Wei Sung, Alexander H Liu, Hung-yi LeeACL, 2020
-
MOCKINGJAY: UNSUPERVISED SPEECH REPRESENTATION LEARNING WITH DEEP BIDIRECTIONAL TRANSFORMER ENCODERS
Andy T. Liu, Shu-wen Yang, Po-Han Chi, Po-chun Hsu, Hung-yi LeeICASSP, 2020
-
WHAT DOES A NETWORK LAYER HEAR? ANALYZING HIDDEN REPRESENTATIONS OF END-TO-END ASR THROUGH SPEECH SYNTHESIS
Chung-Yi Li, Pei-Chieh Yuan, Hung-Yi LeeICASSP, 2020
-
INTERRUPTED AND CASCADED PERMUTATION INVARIANT TRAINING FOR SPEECH SEPARATION
Gene-Ping Yang, Szu-Lin Wu, Yao-Wen Mao, Hung-yi Lee, Lin-shan LeeICASSP, 2020
-
SEQUENCE-TO-SEQUENCE AUTOMATIC SPEECH RECOGNITION WITH WORD EMBEDDING REGULARIZATION AND FUSED DECODING
Alexander H. Liu, Tzu-Wei Sung, Shun-Po Chuang, Hung-yi Lee, Lin-shan LeeICASSP, 2020
-
TRAINING A CODE-SWITCHING LANGUAGE MODEL WITH MONOLINGUAL DATA
Shun-Po Chuang, Tzu-Wei Sung, Hung-Yi LeeICASSP, 2020
-
TOWARDS UNSUPERVISED SPEECH RECOGNITION AND SYNTHESIS WITH QUANTIZED SPEECH REPRESENTATION LEARNING
Alexander H. Liu, Tao Tu, Hung-yi Lee, Lin-shan LeeICASSP, 2020
-
ONE-SHOT VOICE CONVERSION BY VECTOR QUANTIZATION
Da-Yi Wu, Hung-yi LeeICASSP, 2020
-
Defense against adversarial attacks on spoofing countermeasures of ASV
Haibin Wu, Songxiang Liu, Helen Meng, Hung-yi LeeICASSP, 2020
-
META LEARNING FOR END-TO-END LOW-RESOURCE SPEECH RECOGNITION
Jui-Yang Hsu, Yuan-Jui Chen, Hung-yi LeeICASSP, 2020
-
SELF-SUPERVISED DEEP LEARNING FOR FISHEYE IMAGE RECTIFICATION
Chun-Hao Chao, Pin-Lun Hsu, Hung-Yi Lee, Yu-Chiang Frank WangICASSP, 2020
-
LAMOL: LAnguage MOdeling for Lifelong Language Learning
Fan-Keng Sun, Cheng-Hao Ho, Hung-Yi LeeICLR, 2020
-
Order-free Learning Alleviating Exposure Bias in Multi-label Classification
Che-Ping Tsai, Hung-Yi LeeAAAI, 2020
-
Adversarial attacks on spoofing countermeasures of automatic speaker verification
Songxiang Liu, Haibin Wu, Hung-yi Lee, Helen MengASRU, 2019
-
Zero-shot Reading Comprehension by Cross-lingual Transfer Learning with Multi-lingual Language Representation Model
Tsung-Yuan Hsu, Chi-Liang Liu and Hung-yi LeeEMNLP, 2019
-
Polly Want a Cracker: Analyzing Performance of Parroting on Paraphrase Generation Datasets
Hong-Ren Mao and Hung-Yi LeeEMNLP, 2019
-
Tree Transformer: Integrating Tree Structures into Self-Attention
Yaushian Wang, Hung-Yi Lee and Yun-Nung ChenEMNLP, 2019
-
DyKgChat: Benchmarking Dialogue Generation Grounding on Dynamic Knowledge Graphs
Yi-Lin Tuan, Yun-Nung Chen and Hung-yi LeeEMNLP, 2019
-
One-shot Voice Conversion by Separating Speaker and Content Representations with Instance Normalization
Ju-chieh Chou, Cheng-chieh Yeh, Hung-yi LeeINTERSPEECH, 2019
-
Unsupervised End-to-End Learning of Discrete Linguistic Units for Voice Conversion
Andy T. Liu, Po-chun Hsu and Hung-yi LeeINTERSPEECH, 2019
-
Personalized Dialogue Response Generation Learned from Monologues
Feng-Guang Su, Aliyah Hsu, Yi-Lin Tuan and Hung-yi LeeINTERSPEECH, 2019
-
End-to-end Text-to-speech for Low-resource Languages by Cross-Lingual Transfer Learning
Yuan-Jui Chen, Tao Tu, Cheng-chieh Yeh, Hung-yi LeeINTERSPEECH, 2019
-
Code-switching Sentence Generation by Generative Adversarial Networks and its Application to Data Augmentation
Ching-Ting Chang, Shun-Po Chuang, Hung-Yi LeeINTERSPEECH, 2019
-
Completely Unsupervised Phoneme Recognition By A Generative Adversarial Network Harmonized With Iteratively Refined Hidden Markov Models
Kuan-yu Chen, Che-ping Tsai, Da-Rong Liu, Hung-yi Lee and Lin-shan LeeINTERSPEECH, 2019
-
Noise Adaptive Speech Enhancement using Domain Adversarial Training
Chien-Feng Liao, Yu Tsao, Hung-yi Lee and Hsin-Min WangINTERSPEECH, 2019
-
Improved Speech Separation with Time-and-Frequency Cross-domain Joint Embedding and Clustering
Gene-Ping Yang, ChaoI Tuan, Hung-yi Lee and Lin-shan LeeINTERSPEECH, 2019
-
Generative Adversarial Networks for Unpaired Voice Transformation on Impaired Speech
Li-Wei Chen, Hung-Yi Lee, Yu TsaoINTERSPEECH, 2019
-
, ICASSP, 2019
Che-Ping Tsai, Hung-Yi Lee, Adversarial Learning of Label Dependency: A Novel Framework for Multi-class ClassificationICASSP, 2019
-
Towards Audio to Scene Image Synthesis using Generative Adversarial Network
Chia-Hung Wan, Shun-Po Chuang, Hung-Yi LeeICASSP, 2019
-
Mitigating the Impact of Speech Recognition Errors on Spoken Question Answering by Adversarial Domain Adaptation
Chia-Hsuan Lee, Yun-Nung Chen, Hung-Yi LeeICASSP, 2019
-
Towards End-to-end Speech-to-text Translation with Two-pass Decoding
Tzu-Wei Sung, Jun-You Liu, Hung-yi Lee, Lin-shan LeeICASSP, 2019
-
Adversarial Training of End-to-end Speech Recognition Using a Criticizing Language Model
Alexander H. Liu, Hung-yi Lee, Lin-shan LeeICASSP, 2019
-
Using Deep-Q Network to Select Candidates from N-best Speech Recognition Hypotheses for Enhancing Dialogue State Tracking
Richard Tzong-Han Tsai, Chia-Hao Chen, Chun-Kai Wu, Yu-Cheng Hsiao, Hung-Yi LeeICASSP, 2019
-
Learning to Encode Text as Human-Readable Summaries using Generative Adversarial Networks
Yau-Shian Wang, Hung-Yi LeeEMNLP, 2018
-
Improving Unsupervised Style Transfer in End-to-End Speech Synthesis with End-to-End Speech Recognition
Da-Rong Liu, Chi-Yu Yang, Szu-Lin Wu, Hung-Yi LeeSLT, 2018
-
ODSQA: Open-domain Spoken Question Answering Dataset
Chia-Hsuan Lee, Shang-Ming Wang, Huan-Cheng Chang, Hung-Yi LeeSLT, 2018
-
Rhythm-Flexible Voice Conversion without Parallel Data Using Cycle-GAN over Phoneme Posteriorgram Sequences
Cheng-chieh Yeh, Po-chun Hsu, Ju-chieh Chou, Hung-yi Lee, Lin-shan LeeSLT, 2018
-
Phonetic-and-Semantic Embedding of Spoken Words with Applications in Spoken Content Retrieval
Yi-Chen Chen, Sung-Feng Huang, Chia-Hao Shen, Hung-yi Lee, Lin-shan LeeSLT, 2018
-
Spoken SQuAD: A Study of Mitigating the Impact of Speech Recognition Errors on Listening Comprehension
Chia-Hsuan Li, Szu-Lin Wu, Chi-Liang Liu, Hung-yi LeeINTERSPEECH, 2018
-
Joint Learning of Interactive Spoken Content Retrieval and Trainable User Simulator
Pei-Hung Chung, Kuan Tung, Ching-Lun Tai, Hung-Yi LeeINTERSPEECH, 2018
-
Multi-target Voice Conversion without Parallel Data by Adversarially Learning Disentangled Audio Representations
Ju-chieh Chou, Cheng-chieh Yeh, Hung-yi Lee, Lin-shan LeeINTERSPEECH, 2018
-
Completely Unsupervised Phoneme Recognition by Adversarially Learning Mapping Relationships from Audio Embeddings
Da-Rong Liu, Kuan-Yu Chen, Hung-Yi Lee, Lin-shan LeeINTERSPEECH, 2018
-
Language Transfer of Audio Word2Vec: Learning Audio Segment Representations without Target Language Data
Chia-Hao Shen, Janet Y. Sung, Hung-Yi LeeICASSP, 2018
-
Query-by-example Spoken Term Detection using Attention-based Multi-hop Networks
Chia-Wei Ao, Hung-yi LeeICASSP, 2018
-
Domain Independent Key Term Extraction from Spoken Content based on Context and Term Location Information
Hsien-Chin Lin, Chi-Yu Yang, Hung-Yi Lee, Lin-Shan LeeICASSP, 2018
-
Scalable Sentiment for Sequence-to-sequence Chatbot Response with Performance Analysis
Chih-Wei Lee, Yau-Shian Wang, Tsung-Yuan Hsu, Kuan-Yu Chen, Hung-Yi Lee, Lin-Shan LeeICASSP, 2018
-
Segmental Audio Word2vec: Representing Utterances as Sequences of Vectors with Applications in Spoken Term Detection
Yu-Hsuan Wang, Hung-Yi Lee, Lin-Shan LeeICASSP, 2018
-
Supervised and Unsupervised Transfer Learning for Question Answering
Yu-An Chung, Hung-Yi Lee, James GlassNAACL, 2018
-
Query-based Attention CNN for Text Similarity Map
Tzu-Chien Liu, Yu-Hsueh Wu, Hung-Yi LeeICCV, 2018
-
Mitigating the Impact of Speech Recognition Errors on Chatbot using Sequence-to-sequence Model
Pin-Jung Chen, I-Hung Hsu, Yi Yao Huang, Hung-Yi LeeASRU, 2017
-
Seeing and Hearing Too: Audio Representation for Video Captioning
Shun Po Chuang, Chia-Hung Wan, Pang-Chi Huang, Chi-Yu Yang, Hung-Yi LeeASRU, 2017
-
Personalized Word Representations Carrying Personalized Semantics Learned from Social Network Posts
Zih-Wei Lin, Tzu-Wei Sung, Hung-Yi Lee, Lin-Shan LeeASRU, 2017
-
Learning Chinese Word Representations From Glyphs Of Characters
Tzu-Ray Su, Hung-Yi LeeEMNLP, 2017
-
Gate Activation Signal Analysis for Gated Recurrent Neural Networks and Its Correlation with Phoneme Boundaries
Yu-Hsuan Wang, Cheng-Tao Chung, Hung-yi LeeINTERSPEECH, 2017
-
Order-Preserving Abstractive Summarization for Spoken Content based on Connectionist Temporal Classification
Bo-Ru Lu, Frank Shyu, Yun-Nung Chen, Hung-Yi Lee, Lin-Shan LeeINTERSPEECH, 2017
-
Recurrent Neural Network based Language Modeling with Controllable External Memory
Wei-Jen Ko, Bo-Hsiang Tseng, Hung-yi LeeICASSP, 2017
-
Personalized Acoustic Modeling by Weakly Supervised Multi-task Deep Learning using Acoustic Tokens Discovered from Unlabeled Data
Cheng-Kuan Wei, Cheng-Tao Chung, Hung-yi Lee, Lin-Shan LeeICASSP, 2017
-
Abstractive Headline Generation for Spoken Content by Attentive Recurrent Neural Networks with ASR Error Modeling�
Lang-Chi Yu, Hung-yi Lee, Lin-Shan LeeSLT, 2016
-
Hierarchical Attention Model for Improved Machine Comprehension of Spoken Content
Wei Fang, Juei-Yang Hsu, Hung-yi Lee, Lin-Shan LeeSLT, 2016
-
Towards Machine Comprehension of Spoken Content: Initial TOEFL Listening Comprehension Test by Machine
Bo-Hsiang Tseng, Sheng-syun Shen, Hung-Yi Lee, Lin-Shan LeeINTERSPEECH, 2016
-
Interactive Spoken Content Retrieval by Deep Reinforcement Learning
Yen-Chen Wu, Tzu-Hsiang Lin, Yang-De Chen, Hung-Yi Lee, Lin-Shan LeeINTERSPEECH, 2016
-
Audio Word2Vec: Unsupervised Learning of Audio Segment Representations Using Sequence-to-Sequence Autoencoder
Yu-An Chung, Chao-Chung Wu, Chia-Hao Shen, Hung-Yi Lee, Lin-Shan LeeINTERSPEECH, 2016
-
Neural Attention Models for Sequence Classification: Analysis and Application to Key Term Extraction and Dialogue Act Detection
Sheng-syun Shen, Hung-Yi LeeINTERSPEECH, 2016
-
Towards Structured Deep Neural Network for Automatic Speech Recognition
Yi-Hsiu Liao, Hung-yi Lee, Lin-shan LeeASRU, 2015
-
Personalizing Universal Recurrent Neural Network Language Model with User Characteristic Features by Social Network Crowdsourcing
Bo-Hsiang Tseng, Hung-yi Lee, Lin-Shan LeeASRU, 2015
-
An Iterative Deep Learning Framework for Unsupervised Discovery of Speech Features and Linguistic Units with Applications on Spoken Term Detection
Cheng-Tao Chung, Cheng-Yu Tsai, Hsiang-Hung Lu, Chia-Hsiang Liu, Hung-yi Lee, Lin-shan LeeASRU, 2015
-
Structuring Lectures in Massive Open Online Courses (MOOCs) for Efficient Learning by Linking Similar Sections and Predicting Prerequisites
Sheng-syun Shen, Hung-yi Lee, Shang-wen Li, Victor Zue and Lin-shan LeeINTERSPEECH, 2015
-
Semantic Retrieval of Personal Photos using a Deep Autoencoder Fusing Visual Features with Speech Annotations Represented as Word/Paragraph Vectors
Hung-tsung Lu, Yuan-ming Liou, Hung-yi Lee and Lin-shan LeeINTERSPEECH, 2015
-
Personalized Speech Recognizer with Keyword-based Personalized Lexicon and Language Model using Word Vector Representations
Ching-Feng Yeh, Yuan-ming Liou, Hung-yi Lee and Lin-shan LeeINTERSPEECH, 2015
-
Graph-based Re-ranking using Acoustic Feature Similarity between Search Results for Spoken Term Detection on Low-resource Languages
Hung-yi Lee, Yu Zhang, Ekapol Chuangsuwanich, James GlassINTERSPEECH, 2014
-
Alignment of Spoken Utterances with Slide Content for Easier Learning with Recorded Lectures using Structured Support Vector Machine (SVM)
Han Lu, Sheng-syun Shen, Sz-Rung Shiang, Hung-yi Lee and Lin-shan LeeINTERSPEECH, 2014
-
Spoken Question Answering Using Tree-structured Conditional Random Fields and Two-layer Random Walk
Sz-Rung Shiang, Hung-yi Lee and Lin-shan LeeINTERSPEECH, 2014
-
Semantic Retrieval of Personal Photos using Matrix Factorization and Two-layer Random Walk Fusing Sparse Speech Annotations with Visual Features
Yuan-ming Liou, Yi-sheng Fu, Hung-yi Lee and Lin-shan LeeINTERSPEECH, 2014
-
Ensemble of Machine Learning and Acoustic Segment Model Techniques for Speech Emotion and Autism Spectrum Disorders Recognition
Hung-yi Lee, Ting-yao Hu, How Jing, Yun-Fan Chang, Yu Tsao, Yu-Cheng Kao, Tsang-Long PaoINTERSPEECH, 2013
-
Unsupervised Domain Adaptation for Spoken Document Summarization with Structured Support Vector Machine
Hung-yi Lee, Yu-yu Chou, Yow-Bang Wang, Lin-shan LeeICASSP, 2013
-
Enhancing Query Expansion for Semantic Retrieval of Spoken Content with Automatically Discovered Acoustic Patterns
Hung-yi Lee, Yun-Chiao Li, Cheng-Tao Chung, Lin-shan LeeICASSP, 2013
-
Towards Unsupervised Semantic Retrieval of Spoken Content with Query Expansion based on Automatically Discovered Acoustic Patterns
Yun-Chiao Li, Hung-yi Lee, Cheng-Tao Chung, Chun-an Chan, and Lin-shan LeeASRU, 2013
-
Supervised Spoken Document Summarization Based on Structured Support Vector Machine with Utterance Clusters as Hidden Variables
Sz-Rung Shiang, Hung-yi Lee, Lin-shan LeeINTERSPEECH, 2013
-
Recurrent Neural Network Based Language Model Personalization by Social Network Crowdsourcing
Tsung-Hsien Wen, Aaron Heidel, Hung-yi Lee, Yu Tsao, Lin-shan LeeINTERSPEECH, 2013
-
Speaking Rate Normalization with Lattice-based Context-dependent Phoneme Duration Modeling for Personalized Speech Recognizers on Mobile Devices
Ching-Feng Yeh, Hung-yi Lee and Lin-shan LeeINTERSPEECH, 2013
-
Interactive Spoken Content Retrieval by Extended Query Model and Continuous State Space Markov Decision Process
Tsung-Hsien Wen, Hung-yi Lee, Pei-Hao Su, Lin-shan LeeICASSP, 2013
-
Improved Semantic Retrieval of Spoken Content by Language models Enhanced with Acoustic Similari"
Hung-yi Lee, Tsung-Hsien Wen, Lin-shan LeeSLT, 2012
-
Personalized Language Modeling by Crowd Sourcing with Social Network Data for Voice Access of Cloud Applications
Tsung-Hsien Wen, Hung-yi Lee, Lin-shan LeeSLT, 2012
-
Supervised Spoken Document Summarization Jointly Considering Utterance Importance and Redundancy by Structured Support Vector Machine
Hung-yi Lee, Yu-yu Chou, Yow-Bang Wang, Lin-shan LeeINTERSPEECH, 2012
-
Open-Vocabulary Retrieval of Spoken Content with Shorter/Longer Queries Considering Word/Subword-based Acoustic Feature Similarity
Hung-yi Lee, Po-wei Chou, Lin-shan LeeINTERSPEECH, 2012
-
Utterance-level Latent Topic Transition Modeling for Spoken Documents and its Application in Automatic Summarization
Hung-yi Lee, Yun-nung Chen, Lin-shan LeeICASSP, 2012
-
Interactive Spoken Content Retrieval with Different Types of Actions Optimized by a Markov Decision Process
Tsung-Hsien Wen, Hung-yi Lee, Lin-shan LeeINTERSPEECH, 2012
-
Semantic Query Expansion and Context-based Discriminative Term Modeling for Spoken Document Retrieval
Tsung-wei Tu, Hung-yi Lee, Lin-shan LeeICASSP, 2012
-
Unsupervised Two-Stage Keyword Extraction from Spoken Documents by Topic Coherence and Support Vector Machine
Yun-Nung Chen, Yu Huang, Hung-yi Lee, Lin-shan LeeICASSP, 2012
-
Recognition of Highly Imbalanced Code-mixed Bilingual Speech with Frame-level Language Detection based on Blurred Posteriorgram
Ching-Feng Yeh, Aaron Heidel, Hung-yi Lee, Lin-shan LeeICASSP, 2012
-
Improved Speech Summarization and Spoken Term Detection with Graphical Analysis of Utterance Similarities
Hung-yi Lee, Yun-nung Chen, Lin-shan LeeAPSIPA ASC, 2011
-
Improved Spoken Term Detection Using Support Vector Machines based on Lattice Context Consistency
Hung-yi Lee, Tsung-wei Tu, Chia-ping Chen, Chao-yu Huang, Lin-shan LeeICASSP, 2011
-
Improved Spoken Term Detection using Support Vector Machines with Acoustic and Context Features from Pseudo-relevance Feedback
Tsung-wei Tu, Hung-yi Lee, Lin-shan LeeASRU, 2011
-
Improved Spoken Term Detection with Graph-based Re-ranking in Feature Space
Yun-nung Chen, Chia-ping Chen, Hung-yi Lee, Chun-an Chan, Lin-shan LeeICASSP, 2011
-
A Framework Integrating Different Relevance Feedback Scenarios and Approaches for Spoken Term Detection
Hung-yi Lee, Chia-ping Chen, Ching-feng Yeh, Lin-shan LeeSLT, 2010
-
Improved Spoken Term Detection by Discriminative Training of Acoustic Models based on User Relevance Feedback
Hung-yi Lee, Chia-ping Chen, Ching-feng Yeh, Lin-shan LeeINTERSPEECH, 2010
-
Integrating Recognition and Retrieval with User Feedback: A New Framework for Spoken Term Detection
Hung-yi Lee and Lin-shan LeeICASSP, 2010
-
Improved Spoken Term Detection by Feature Space Pseudo-Relevance Feedback
Chia-ping Chen, Hung-yi Lee, Ching-feng Yeh, Lin-shan LeeINTERSPEECH, 2010
-
An Initial Attempt to Improve Spoken Term Detection by Learning Optimal Weights for Different Indexing Features
Yu-Hui Chen, Chia-Chen Chou, Hung-yi Lee, Lin-shan LeeICASSP, 2010
-
Spoken Term Detection from Bilingual Spontaneous Speech Using Code-switched Lattice-based Structures for Words and Subword Units
Hung-yi Lee, Yueh-Lien Tang, Hao Tang, Lin-shan LeeASRU, 2009
-
Improved Lattice-based Spoken Document Retrieval by Directly Learning from the evaluation Measures
Chao-hong Meng, Hung-yi Lee, Lin-shan LeeICASSP, 2009