Conference

  1. Yau-Shian Wang, Hung-Yi Lee, "Learning to Encode Text as Human-Readable Summaries using Generative Adversarial Networks", EMNLP, 2018 icon
  2. Chia-Hung Wan, Shun-Po Chuang, Hung-Yi Lee, "Towards Audio to Scene Image Synthesis using Generative Adversarial Network", arXiv, 2018 icon demo
  3. Da-Rong Liu, Chi-Yu Yang, Szu-Lin Wu, Hung-Yi Lee, "Improving Unsupervised Style Transfer in End-to-End Speech Synthesis with End-to-End Speech Recognition", arXiv, 2018 icon
  4. Cheng-chieh Yeh, Po-chun Hsu, Ju-chieh Chou, Hung-yi Lee, Lin-shan Lee, "Rhythm-Flexible Voice Conversion without Parallel Data Using Cycle-GAN over Phoneme Posteriorgram Sequences", arXiv, 2018 icon
  5. Chia-Hsuan Lee, Shang-Ming Wang, Huan-Cheng Chang, Hung-Yi Lee, "ODSQA: Open-domain Spoken Question Answering Dataset", arXiv, 2018 icon dataset
  6. Chia-Hsuan Li, Szu-Lin Wu, Chi-Liang Liu, Hung-yi Lee, "Spoken SQuAD: A Study of Mitigating the Impact of Speech Recognition Errors on Listening Comprehension", INTERSPEECH, 2018 icon dataset
  7. Pei-Hung Chung, Kuan Tung, Ching-Lun Tai, Hung-Yi Lee, "Joint Learning of Interactive Spoken Content Retrieval and Trainable User Simulator", INTERSPEECH, 2018 icon(one of the 12 finalists for the best student paper award)
  8. Yi-Chen Chen, Chia-Hao Shen, Sung-Feng Huang, Hung-yi Lee, "Phonetic-and-Semantic Embedding of Spoken Words with Applications in Spoken Content Retrieval", arXiv, 2018 icon
  9. Yi-Chen Chen, Chia-Hao Shen, Sung-Feng Huang, Hung-yi Lee, "Towards Unsupervised Automatic Speech Recognition Trained by Unaligned Speech and Text only", arXiv, 2018 icon
  10. Ju-chieh Chou, Cheng-chieh Yeh, Hung-yi Lee, Lin-shan Lee, "Multi-target Voice Conversion without Parallel Data by Adversarially Learning Disentangled Audio Representations", INTERSPEECH, 2018 icon(one of the 12 finalists for the best student paper award)
  11. Da-Rong Liu, Kuan-Yu Chen, Hung-Yi Lee, Lin-shan Lee, "Completely Unsupervised Phoneme Recognition by Adversarially Learning Mapping Relationships from Audio Embeddings", INTERSPEECH, 2018 icon
  12. Chia-Hao Shen, Janet Y. Sung, Hung-Yi Lee, "Language Transfer of Audio Word2Vec: Learning Audio Segment Representations without Target Language Data", ICASSP, 2018 icon
  13. Chia-Wei Ao, Hung-yi Lee, "Query-by-example Spoken Term Detection using Attention-based Multi-hop Networks", ICASSP, 2018 icon
  14. Hsien-Chin Lin, Chi-Yu Yang, Hung-Yi Lee, Lin-Shan Lee, "Domain Independent Key Term Extraction from Spoken Content based on Context and Term Location Information", ICASSP, 2018 icon
  15. Chih-Wei Lee, Yau-Shian Wang, Tsung-Yuan Hsu, Kuan-Yu Chen, Hung-Yi Lee, Lin-Shan Lee, "Scalable Sentiment for Sequence-to-sequence Chatbot Response with Performance Analysis", ICASSP, 2018 icon
  16. Yu-Hsuan Wang, Hung-Yi Lee, Lin-Shan Lee, "Segmental Audio Word2vec: Representing Utterances as Sequences of Vectors with Applications in Spoken Term Detection", ICASSP, 2018 icon
  17. Yu-An Chung, Hung-Yi Lee, James Glass, "Supervised and Unsupervised Transfer Learning for Question Answering", NAACL, 2018 icon code
  18. Tzu-Chien Liu, Yu-Hsueh Wu, Hung-Yi Lee, "Query-based Attention CNN for Text Similarity Map", ICCV workshop, 2018 icon code
  19. Pin-Jung Chen, I-Hung Hsu, Yi Yao Huang, Hung-Yi Lee, "Mitigating the Impact of Speech Recognition Errors on Chatbot using Sequence-to-sequence Model", the 12th biannual IEEE workshop on Automatic Speech Recognition and Understanding (ASRU'17), Okinawa, Japan, December 2017
  20. Shun Po Chuang, Chia-Hung Wan, Pang-Chi Huang, Chi-Yu Yang, Hung-Yi Lee, "Seeing and Hearing Too: Audio Representation for Video Captioning" the 12th biannual IEEE workshop on Automatic Speech Recognition and Understanding (ASRU'17), Okinawa, Japan, December 2017 icon
  21. Zih-Wei Lin, Tzu-Wei Sung, Hung-Yi Lee, Lin-Shan Lee, "Personalized Word Representations Carrying Personalized Semantics Learned from Social Network Posts", the 12th biannual IEEE workshop on Automatic Speech Recognition and Understanding (ASRU'17), Okinawa, Japan, December 2017 icon
  22. Tzu-Ray Su, Hung-Yi Lee, "Learning Chinese Word Representations From Glyphs Of Characters", Conference on Empirical Methods in Natural Language Processing (EMNLP), Copenhagen, Denmark, Sept. 2017 icon
  23. Yu-Hsuan Wang, Cheng-Tao Chung, Hung-yi Lee, "Gate Activation Signal Analysis for Gated Recurrent Neural Networks and Its Correlation with Phoneme Boundaries", the 18th Annual Conference of the International Speech Communication Association (INTERSPEECH'17), Stockholm, Sweden, August 2017 icon
  24. Bo-Ru Lu, Frank Shyu, Yun-Nung Chen, Hung-Yi Lee, Lin-Shan Lee, "Order-Preserving Abstractive Summarization for Spoken Content based on Connectionist Temporal Classification", the 18th Annual Conference of the International Speech Communication Association (INTERSPEECH'17), Stockholm, Sweden, August 2017 icon
  25. Wei-Jen Ko, Bo-Hsiang Tseng, Hung-yi Lee, "Recurrent Neural Network based Language Modeling with Controllable External Memory", the 42th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP'17), New Orleans, March 2017 icon
  26. Cheng-Kuan Wei, Cheng-Tao Chung, Hung-yi Lee, Lin-Shan Lee, "Personalized Acoustic Modeling by Weakly Supervised Multi-task Deep Learning using Acoustic Tokens Discovered from Unlabeled Data", the 42th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP'17), New Orleans, March 2017
  27. Lang-Chi Yu, Hung-yi Lee, Lin-Shan Lee, “Abstractive Headline Generation for Spoken Content by Attentive Recurrent Neural Networks with ASR Error Modeling”, the 6th IEEE Workshop on Spoken Language Technology (SLT'16), San Diego, Dec. 2016 icon
  28. Wei Fang, Juei-Yang Hsu, Hung-yi Lee, Lin-Shan Lee, "Hierarchical Attention Model for Improved Machine Comprehension of Spoken Content", the 6th IEEE Workshop on Spoken Language Technology (SLT'16), San Diego, Dec. 2016 code icon
  29. Da-Rong Liu, Shun-Po Chuang, Hung-yi Lee, "Attention-based Memory Selection Recurrent Network for Language Modeling", arXiv preprint, November 2016 icon
  30. Bo-Hsiang Tseng, Sheng-syun Shen, Hung-Yi Lee, Lin-Shan Lee, "Towards Machine Comprehension of Spoken Content: Initial TOEFL Listening Comprehension Test by Machine", the 17th Annual Conference of the International Speech Communication Association (INTERSPEECH'16), San Francisco, Sept. 2016 (one of the 12 finalists for the best student paper award) icon
  31. Yen-Chen Wu, Tzu-Hsiang Lin, Yang-De Chen, Hung-Yi Lee, Lin-Shan Lee, "Interactive Spoken Content Retrieval by Deep Reinforcement Learning", the 17th Annual Conference of the International Speech Communication Association (INTERSPEECH'16), San Francisco, Sept. 2016 icon
  32. Yu-An Chung, Chao-Chung Wu, Chia-Hao Shen, Hung-Yi Lee, Lin-Shan Lee, "Audio Word2Vec: Unsupervised Learning of Audio Segment Representations Using Sequence-to-Sequence Autoencoder", the 17th Annual Conference of the International Speech Communication Association (INTERSPEECH'16), San Francisco, Sept. 2016 icon
  33. Sheng-syun Shen, Hung-Yi Lee, "Neural Attention Models for Sequence Classification: Analysis and Application to Key Term Extraction and Dialogue Act Detection", the 17th Annual Conference of the International Speech Communication Association (INTERSPEECH'16), San Francisco, Sept. 2016 icon
  34. Yi-Hsiu Liao, Hung-yi Lee, Lin-shan Lee, "Towards Structured Deep Neural Network for Automatic Speech Recognition", the 11th biannual IEEE workshop on Automatic Speech Recognition and Understanding (ASRU'15), Arizona, December 2015 icon
  35. Bo-Hsiang Tseng, Hung-yi Lee, Lin-Shan Lee, "Personalizing Universal Recurrent Neural Network Language Model with User Characteristic Features by Social Network Crowdsourcing", the 11th biannual IEEE workshop on Automatic Speech Recognition and Understanding (ASRU'15), Arizona, December 2015 icon
  36. Cheng-Tao Chung, Cheng-Yu Tsai, Hsiang-Hung Lu, Chia-Hsiang Liu, Hung-yi Lee, Lin-shan Lee, "An Iterative Deep Learning Framework for Unsupervised Discovery of Speech Features and Linguistic Units with Applications on Spoken Term Detection", the 11th biannual IEEE workshop on Automatic Speech Recognition and Understanding (ASRU'15), Arizona, December 2015icon
  37. Sheng-syun Shen, Hung-yi Lee, Shang-wen Li, Victor Zue and Lin-shan Lee, "Structuring Lectures in Massive Open Online Courses (MOOCs) for Efficient Learning by Linking Similar Sections and Predicting Prerequisites", the 16th Annual Conference of the International Speech Communication Association (INTERSPEECH'15), Dresden, Germany, Sept. 2015 icon
  38. Hung-tsung Lu, Yuan-ming Liou, Hung-yi Lee and Lin-shan Lee, "Semantic Retrieval of Personal Photos using a Deep Autoencoder Fusing Visual Features with Speech Annotations Represented as Word/Paragraph Vectors", the 16th Annual Conference of the International Speech Communication Association (INTERSPEECH'15), Dresden, Germany, Sept. 2015icon
  39. Ching-Feng Yeh, Yuan-ming Liou, Hung-yi Lee and Lin-shan Lee, "Personalized Speech Recognizer with Keyword-based Personalized Lexicon and Language Model using Word Vector Representations", the 16th Annual Conference of the International Speech Communication Association (INTERSPEECH'15), Dresden, Germany, Sept. 2015
  40. Hung-yi Lee, Yu Zhang, Ekapol Chuangsuwanich, James Glass, "Graph-based Re-ranking using Acoustic Feature Similarity between Search Results for Spoken Term Detection on Low-resource Languages", the 15th Annual Conference of the International Speech Communication Association (INTERSPEECH'14), Singapore, Sept. 2014 icon
  41. Han Lu, Sheng-syun Shen, Sz-Rung Shiang, Hung-yi Lee and Lin-shan Lee, "Alignment of Spoken Utterances with Slide Content for Easier Learning with Recorded Lectures using Structured Support Vector Machine (SVM)", the 15th Annual Conference of the International Speech Communication Association (INTERSPEECH'14), Singapore, Sept. 2014
  42. Sz-Rung Shiang, Hung-yi Lee and Lin-shan Lee, "Spoken Question Answering Using Tree-structured Conditional Random Fields and Two-layer Random Walk", the 15th Annual Conference of the International Speech Communication Association (INTERSPEECH'14), Singapore, Sept. 2014
  43. Yuan-ming Liou, Yi-sheng Fu, Hung-yi Lee and Lin-shan Lee, "Semantic Retrieval of Personal Photos using Matrix Factorization and Two-layer Random Walk Fusing Sparse Speech Annotations with Visual Features", the 15th Annual Conference of the International Speech Communication Association (INTERSPEECH'14), Singapore, Sept. 2014
  44. Hung-yi Lee, Ting-yao Hu, How Jing, Yun-Fan Chang, Yu Tsao, Yu-Cheng Kao, Tsang-Long Pao, "Ensemble of Machine Learning and Acoustic Segment Model Techniques for Speech Emotion and Autism Spectrum Disorders Recognition", the 14th Annual Conference of the International Speech Communication Association (INTERSPEECH'13), Lyon, France, August 2013
  45. Hung-yi Lee, Yu-yu Chou, Yow-Bang Wang, Lin-shan Lee, "Unsupervised Domain Adaptation for Spoken Document Summarization with Structured Support Vector Machine", the 38th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP'13), Vancouver, Canada, May 2013
  46. Hung-yi Lee, Yun-Chiao Li, Cheng-Tao Chung, Lin-shan Lee, "Enhancing Query Expansion for Semantic Retrieval of Spoken Content with Automatically Discovered Acoustic Patterns", the 38th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP'13), Vancouver, Canada, May 2013
  47. Yun-Chiao Li, Hung-yi Lee, Cheng-Tao Chung, Chun-an Chan, and Lin-shan Lee, "Towards Unsupervised Semantic Retrieval of Spoken Content with Query Expansion based on Automatically Discovered Acoustic Patterns", the 10th biannual IEEE workshop on Automatic Speech Recognition and Understanding (ASRU'13), Olomouc, Czech Republic, December 2013 icon
  48. Sz-Rung Shiang, Hung-yi Lee, Lin-shan Lee, "Supervised Spoken Document Summarization Based on Structured Support Vector Machine with Utterance Clusters as Hidden Variables", the 14th Annual Conference of the International Speech Communication Association (INTERSPEECH'13), Lyon, France, August 2013
  49. Tsung-Hsien Wen, Aaron Heidel, Hung-yi Lee, Yu Tsao, Lin-shan Lee, "Recurrent Neural Network Based Language Model Personalization by Social Network Crowdsourcing", the 14th Annual Conference of the International Speech Communication Association (INTERSPEECH'13), Lyon, France, August 2013 (one of the 12 finalists for the best student paper award)icon
  50. Ching-Feng Yeh, Hung-yi Lee and Lin-shan Lee, "Speaking Rate Normalization with Lattice-based Context-dependent Phoneme Duration Modeling for Personalized Speech Recognizers on Mobile Devices", the 14th Annual Conference of the International Speech Communication Association (INTERSPEECH'13), Lyon, France, August 2013
  51. Tsung-Hsien Wen, Hung-yi Lee, Pei-Hao Su, Lin-shan Lee, " Interactive Spoken Content Retrieval by Extended Query Model and Continuous State Space Markov Decision Process", the 38th IEEE International Conference on Acoustics, Speech and Signal Processing Vancouver, Canada, May 2013 icon
  52. Hung-yi Lee, Tsung-Hsien Wen, Lin-shan Lee, "Improved Semantic Retrieval of Spoken Content by Language models Enhanced with Acoustic Similarity Graph", the 4th IEEE Workshop on Spoken Language Technology (SLT'12), Miami, Florida, December 2012
  53. Tsung-Hsien Wen, Hung-yi Lee, Lin-shan Lee, "Personalized Language Modeling by Crowd Sourcing with Social Network Data for Voice Access of Cloud Applications", the 4th IEEE Workshop on Spoken Language Technology (SLT'12), Miami, Florida, December 2012icon
  54. Hung-yi Lee, Yu-yu Chou, Yow-Bang Wang, Lin-shan Lee, "Supervised Spoken Document Summarization Jointly Considering Utterance Importance and Redundancy by Structured Support Vector Machine", the 13th Annual Conference of the International Speech Communication Association (INTERSPEECH'12), Portland, Oregon, September 2012
  55. Hung-yi Lee, Po-wei Chou, Lin-shan Lee, "Open-Vocabulary Retrieval of Spoken Content with Shorter/Longer Queries Considering Word/Subword-based Acoustic Feature Similarity", the 13th Annual Conference of the International Speech Communication Association (INTERSPEECH'12), Portland, Oregon, September 2012
  56. Hung-yi Lee, Yun-nung Chen, Lin-shan Lee, "Utterance-level Latent Topic Transition Modeling for Spoken Documents and its Application in Automatic Summarization", the 37th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP'12), Kyoto, Japan, March 2012
  57. Tsung-Hsien Wen, Hung-yi Lee, Lin-shan Lee, "Interactive Spoken Content Retrieval with Different Types of Actions Optimized by a Markov Decision Process", the 13th Annual Conference of the International Speech Communication Association (INTERSPEECH'12), Portland, Oregon, September 2012 (one of the 10 finalists for the best student paper award)
  58. Tsung-wei Tu, Hung-yi Lee, Lin-shan Lee, "Semantic Query Expansion and Context-based Discriminative Term Modeling for Spoken Document Retrieval", the 37th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP'12), Kyoto, Japan, March 2012 (IEEE Spoken Language Processing Student Travel Grant)
  59. Yun-Nung Chen, Yu Huang, Hung-yi Lee, Lin-shan Lee, "Unsupervised Two-Stage Keyword Extraction from Spoken Documents by Topic Coherence and Support Vector Machine", the 37th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP'12), Kyoto, Japan, March 2012
  60. Ching-Feng Yeh, Aaron Heidel, Hung-yi Lee, Lin-shan Lee, "Recognition of Highly Imbalanced Code-mixed Bilingual Speech with Frame-level Language Detection based on Blurred Posteriorgram", the 37th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP'12), Kyoto, Japan, March 2012
  61. Hung-yi Lee, Yun-nung Chen, Lin-shan Lee, "Improved Speech Summarization and Spoken Term Detection with Graphical Analysis of Utterance Similarities", the 3rd Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC 2011), Xi'an, China, October 2011
  62. Hung-yi Lee, Tsung-wei Tu, Chia-ping Chen, Chao-yu Huang, Lin-shan Lee , "Improved Spoken Term Detection Using Support Vector Machines based on Lattice Context Consistency", the 36th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP'11), Prague, Czech Republic, May 2011
  63. Tsung-wei Tu, Hung-yi Lee, Lin-shan Lee, "Improved Spoken Term Detection using Support Vector Machines with Acoustic and Context Features from Pseudo-relevance Feedback", the 9th biannual IEEE workshop on Automatic Speech Recognition and Understanding (ASRU'11), Hawaii, December 2011 (one of the 5 finalists for the best student paper award)
  64. Yun-nung Chen, Chia-ping Chen, Hung-yi Lee, Chun-an Chan, Lin-shan Lee, "Improved Spoken Term Detection with Graph-based Re-ranking in Feature Space", the 36th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP'11), Prague, Czech Republic, May 2011
  65. Hung-yi Lee, Chia-ping Chen, Ching-feng Yeh, Lin-shan Lee, "A Framework Integrating Different Relevance Feedback Scenarios and Approaches for Spoken Term Detection", the 3rd IEEE Workshop on Spoken Language Technology (SLT'10), Berkeley, California, December 2010
  66. Hung-yi Lee, Chia-ping Chen, Ching-feng Yeh, Lin-shan Lee, "Improved Spoken Term Detection by Discriminative Training of Acoustic Models based on User Relevance Feedback", the 11th Annual Conference of the International Speech Communication Association (INTERSPEECH'10), Makuhari, Japan, September 2010
  67. Hung-yi Lee and Lin-shan Lee, "Integrating Recognition and Retrieval with User Feedback: A New Framework for Spoken Term Detection", the 35th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP'10), Dallas, Texas, March 2010 (cited in textbook)
  68. Chia-ping Chen, Hung-yi Lee, Ching-feng Yeh, Lin-shan Lee, "Improved Spoken Term Detection by Feature Space Pseudo-Relevance Feedback", the 11th Annual Conference of the International Speech Communication Association (INTERSPEECH'10), Makuhari, Japan, September 2010
  69. Yu-Hui Chen, Chia-Chen Chou, Hung-yi Lee, Lin-shan Lee, "An Initial Attempt to Improve Spoken Term Detection by Learning Optimal Weights for Different Indexing Features", the 35th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP'10), Dallas, Texas, March 2010 (cited in textbook)
  70. Hung-yi Lee, Yueh-Lien Tang, Hao Tang, Lin-shan Lee, "Spoken Term Detection from Bilingual Spontaneous Speech Using Code-switched Lattice-based Structures for Words and Subword Units", the 8th biannual IEEE workshop on Automatic Speech Recognition and Understanding, (ASRU'09), Merano, Italy, December 2009
  71. Chao-hong Meng, Hung-yi Lee, Lin-shan Lee, "Improved Lattice-based Spoken Document Retrieval by Directly Learning from the evaluation Measures", the 34th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP'09), Taipei, Taiwan, April 2009

Journal

  1. Hung-yi Lee, Bo-Hsiang Tseng, Tsung-Hsien Wen, Yu Tsao, "Personalizing Recurrent Neural Network Based Language Model by Social Network," IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 25, no. 3, pp. 519-530, March 2017 icon
  2. Lin-shan Lee, James Glass, Hung-yi Lee, Chun-an Chan, "Spoken Content Retrieval —Beyond Cascading Speech Recognition with Text Retrieval," IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol.23, no.9, pp.1389-1420, Sept. 2015 icon
  3. Hung-yi Lee, Ching-feng Yeh, Yun-Nung Chen, Yu Huang, Sheng-Yi Kong and Lin-shan Lee, “Spoken Knowledge Organization by Semantic Structuring and a Prototype Course Lecture System for Personalized Learning”, IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol.22, no.5, pp.883-898, May 2014 icon (Figure 9 of the article selected as journal cover)
  4. Hung-yi Lee, Po-wei Chou, Lin-shan Lee, Improved open-vocabulary spoken content retrieval with word and subword lattices using acoustic feature similarity, Computer Speech & Language, Volume 28, Issue 5, pp. 1045-1065, Sept. 2014icon
  5. Hung-yi Lee, Lin-shan Lee, "Improved Semantic Retrieval of Spoken Content by Document/Query Expansion with Random Walk over Acoustic Similarity Graphs," IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol.22, no.1, pp.80-94, Jan. 2014icon(Figure 2 of the article selected as journal cover)
  6. Hung-yi Lee, Lin-shan Lee, "Enhanced Spoken Term Detection Using Support Vector Machines and Weighted Pseudo Examples," IEEE Transactions on Audio, Speech, and Language Processing, vol.21, no.6, pp.1272-1284, June 2013icon
  7. Hung-yi Lee, Chia-ping Chen, Lin-shan Lee, "Integrating Recognition and Retrieval with Relevance Feedback for Spoken Term Detection," IEEE Transactions on Audio, Speech, and Language Processing, vol.20, no.7, pp.2095-2110, Sept. 2012icon
  8. Yi-cheng Pan, Hung-yi Lee, Lin-shan Lee, "Interactive Spoken Document Retrieval With Suggested Key Terms Ranked by a Markov Decision Process", IEEE Transactions on Audio, Speech, and Language Processing, vol.20, issue.2, pp. 632-645, Feb. 2012icon