Conference

  1. Chia-Hao Shen, Janet Y. Sung, Hung-Yi Lee, "Language Transfer of Audio Word2Vec: Learning Audio Segment Representations without Target Language Data", arXiv preprint, Sept. 2017 icon
  2. Chia-Wei Ao, Hung-yi Lee, "Query-by-example Spoken Term Detection using Attention-based Multi-hop Networks", arXiv preprint, Sept. 2017 icon
  3. Pin-Jung Chen, I-Hung Hsu, Yi Yao Huang, Hung-Yi Lee, "Mitigating the Impact of Speech Recognition Errors on Chatbot using Sequence-to-sequence Model", the 12th biannual IEEE workshop on Automatic Speech Recognition and Understanding (ASRU'17), Okinawa, Japan, December 2017
  4. Shun Po Chuang, Chia-Hung Wan, Pang-Chi Huang, Chi-Yu Yang, Hung-Yi Lee, "Seeing and Hearing Too: Audio Representation for Video Captioning" the 12th biannual IEEE workshop on Automatic Speech Recognition and Understanding (ASRU'17), Okinawa, Japan, December 2017 icon
  5. Zih-Wei Lin, Tzu-Wei Sung, Hung-Yi Lee, Lin-Shan Lee, "Personalized Word Representations Carrying Personalized Semantics Learned from Social Network Posts", the 12th biannual IEEE workshop on Automatic Speech Recognition and Understanding (ASRU'17), Okinawa, Japan, December 2017 icon
  6. Tzu-Ray Su, Hung-Yi Lee, "Learning Chinese Word Representations From Glyphs Of Characters", Conference on Empirical Methods in Natural Language Processing (EMNLP), Copenhagen, Denmark, Sept. 2017 icon
  7. Yu-Hsuan Wang, Cheng-Tao Chung, Hung-yi Lee, "Gate Activation Signal Analysis for Gated Recurrent Neural Networks and Its Correlation with Phoneme Boundaries", the 18th Annual Conference of the International Speech Communication Association (INTERSPEECH'17), Stockholm, Sweden, August 2017 icon
  8. Bo-Ru Lu, Frank Shyu, Yun-Nung Chen, Hung-Yi Lee, Lin-Shan Lee, "Order-Preserving Abstractive Summarization for Spoken Content based on Connectionist Temporal Classification", the 18th Annual Conference of the International Speech Communication Association (INTERSPEECH'17), Stockholm, Sweden, August 2017 icon
  9. Wei-Jen Ko, Bo-Hsiang Tseng, Hung-yi Lee, "Recurrent Neural Network based Language Modeling with Controllable External Memory", the 42th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP'17), New Orleans, March 2017 icon
  10. Cheng-Kuan Wei, Cheng-Tao Chung, Hung-yi Lee, Lin-Shan Lee, "Personalized Acoustic Modeling by Weakly Supervised Multi-task Deep Learning using Acoustic Tokens Discovered from Unlabeled Data", the 42th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP'17), New Orleans, March 2017
  11. Lang-Chi Yu, Hung-yi Lee, Lin-Shan Lee, “Abstractive Headline Generation for Spoken Content by Attentive Recurrent Neural Networks with ASR Error Modeling”, the 6th IEEE Workshop on Spoken Language Technology (SLT'16), San Diego, Dec. 2016 icon
  12. Wei Fang, Juei-Yang Hsu, Hung-yi Lee, Lin-Shan Lee, "Hierarchical Attention Model for Improved Machine Comprehension of Spoken Content", the 6th IEEE Workshop on Spoken Language Technology (SLT'16), San Diego, Dec. 2016 code icon
  13. Da-Rong Liu, Shun-Po Chuang, Hung-yi Lee, "Attention-based Memory Selection Recurrent Network for Language Modeling", arXiv preprint, November 2016 icon
  14. Bo-Hsiang Tseng, Sheng-syun Shen, Hung-Yi Lee, Lin-Shan Lee, "Towards Machine Comprehension of Spoken Content: Initial TOEFL Listening Comprehension Test by Machine", the 17th Annual Conference of the International Speech Communication Association (INTERSPEECH'16), San Francisco, Sept. 2016 (one of the 12 finalists for the best student paper award) icon
  15. Yen-Chen Wu, Tzu-Hsiang Lin, Yang-De Chen, Hung-Yi Lee, Lin-Shan Lee, "Interactive Spoken Content Retrieval by Deep Reinforcement Learning", the 17th Annual Conference of the International Speech Communication Association (INTERSPEECH'16), San Francisco, Sept. 2016 icon
  16. Yu-An Chung, Chao-Chung Wu, Chia-Hao Shen, Hung-Yi Lee, Lin-Shan Lee, "Audio Word2Vec: Unsupervised Learning of Audio Segment Representations Using Sequence-to-Sequence Autoencoder", the 17th Annual Conference of the International Speech Communication Association (INTERSPEECH'16), San Francisco, Sept. 2016 icon
  17. Sheng-syun Shen, Hung-Yi Lee, "Neural Attention Models for Sequence Classification: Analysis and Application to Key Term Extraction and Dialogue Act Detection", the 17th Annual Conference of the International Speech Communication Association (INTERSPEECH'16), San Francisco, Sept. 2016 icon
  18. Yi-Hsiu Liao, Hung-yi Lee, Lin-shan Lee, "Towards Structured Deep Neural Network for Automatic Speech Recognition", the 11th biannual IEEE workshop on Automatic Speech Recognition and Understanding (ASRU'15), Arizona, December 2015 icon
  19. Bo-Hsiang Tseng, Hung-yi Lee, Lin-Shan Lee, "Personalizing Universal Recurrent Neural Network Language Model with User Characteristic Features by Social Network Crowdsourcing", the 11th biannual IEEE workshop on Automatic Speech Recognition and Understanding (ASRU'15), Arizona, December 2015 icon
  20. Cheng-Tao Chung, Cheng-Yu Tsai, Hsiang-Hung Lu, Chia-Hsiang Liu, Hung-yi Lee, Lin-shan Lee, "An Iterative Deep Learning Framework for Unsupervised Discovery of Speech Features and Linguistic Units with Applications on Spoken Term Detection", the 11th biannual IEEE workshop on Automatic Speech Recognition and Understanding (ASRU'15), Arizona, December 2015icon
  21. Sheng-syun Shen, Hung-yi Lee, Shang-wen Li, Victor Zue and Lin-shan Lee, "Structuring Lectures in Massive Open Online Courses (MOOCs) for Efficient Learning by Linking Similar Sections and Predicting Prerequisites", the 16th Annual Conference of the International Speech Communication Association (INTERSPEECH'15), Dresden, Germany, Sept. 2015 icon
  22. Hung-tsung Lu, Yuan-ming Liou, Hung-yi Lee and Lin-shan Lee, "Semantic Retrieval of Personal Photos using a Deep Autoencoder Fusing Visual Features with Speech Annotations Represented as Word/Paragraph Vectors", the 16th Annual Conference of the International Speech Communication Association (INTERSPEECH'15), Dresden, Germany, Sept. 2015icon
  23. Ching-Feng Yeh, Yuan-ming Liou, Hung-yi Lee and Lin-shan Lee, "Personalized Speech Recognizer with Keyword-based Personalized Lexicon and Language Model using Word Vector Representations", the 16th Annual Conference of the International Speech Communication Association (INTERSPEECH'15), Dresden, Germany, Sept. 2015
  24. Hung-yi Lee, Yu Zhang, Ekapol Chuangsuwanich, James Glass, "Graph-based Re-ranking using Acoustic Feature Similarity between Search Results for Spoken Term Detection on Low-resource Languages", the 15th Annual Conference of the International Speech Communication Association (INTERSPEECH'14), Singapore, Sept. 2014 icon
  25. Han Lu, Sheng-syun Shen, Sz-Rung Shiang, Hung-yi Lee and Lin-shan Lee, "Alignment of Spoken Utterances with Slide Content for Easier Learning with Recorded Lectures using Structured Support Vector Machine (SVM)", the 15th Annual Conference of the International Speech Communication Association (INTERSPEECH'14), Singapore, Sept. 2014
  26. Sz-Rung Shiang, Hung-yi Lee and Lin-shan Lee, "Spoken Question Answering Using Tree-structured Conditional Random Fields and Two-layer Random Walk", the 15th Annual Conference of the International Speech Communication Association (INTERSPEECH'14), Singapore, Sept. 2014
  27. Yuan-ming Liou, Yi-sheng Fu, Hung-yi Lee and Lin-shan Lee, "Semantic Retrieval of Personal Photos using Matrix Factorization and Two-layer Random Walk Fusing Sparse Speech Annotations with Visual Features", the 15th Annual Conference of the International Speech Communication Association (INTERSPEECH'14), Singapore, Sept. 2014
  28. Hung-yi Lee, Ting-yao Hu, How Jing, Yun-Fan Chang, Yu Tsao, Yu-Cheng Kao, Tsang-Long Pao, "Ensemble of Machine Learning and Acoustic Segment Model Techniques for Speech Emotion and Autism Spectrum Disorders Recognition", the 14th Annual Conference of the International Speech Communication Association (INTERSPEECH'13), Lyon, France, August 2013
  29. Hung-yi Lee, Yu-yu Chou, Yow-Bang Wang, Lin-shan Lee, "Unsupervised Domain Adaptation for Spoken Document Summarization with Structured Support Vector Machine", the 38th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP'13), Vancouver, Canada, May 2013
  30. Hung-yi Lee, Yun-Chiao Li, Cheng-Tao Chung, Lin-shan Lee, "Enhancing Query Expansion for Semantic Retrieval of Spoken Content with Automatically Discovered Acoustic Patterns", the 38th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP'13), Vancouver, Canada, May 2013
  31. Yun-Chiao Li, Hung-yi Lee, Cheng-Tao Chung, Chun-an Chan, and Lin-shan Lee, "Towards Unsupervised Semantic Retrieval of Spoken Content with Query Expansion based on Automatically Discovered Acoustic Patterns", the 10th biannual IEEE workshop on Automatic Speech Recognition and Understanding (ASRU'13), Olomouc, Czech Republic, December 2013 icon
  32. Sz-Rung Shiang, Hung-yi Lee, Lin-shan Lee, "Supervised Spoken Document Summarization Based on Structured Support Vector Machine with Utterance Clusters as Hidden Variables", the 14th Annual Conference of the International Speech Communication Association (INTERSPEECH'13), Lyon, France, August 2013
  33. Tsung-Hsien Wen, Aaron Heidel, Hung-yi Lee, Yu Tsao, Lin-shan Lee, "Recurrent Neural Network Based Language Model Personalization by Social Network Crowdsourcing", the 14th Annual Conference of the International Speech Communication Association (INTERSPEECH'13), Lyon, France, August 2013 (one of the 12 finalists for the best student paper award)icon
  34. Ching-Feng Yeh, Hung-yi Lee and Lin-shan Lee, "Speaking Rate Normalization with Lattice-based Context-dependent Phoneme Duration Modeling for Personalized Speech Recognizers on Mobile Devices", the 14th Annual Conference of the International Speech Communication Association (INTERSPEECH'13), Lyon, France, August 2013
  35. Tsung-Hsien Wen, Hung-yi Lee, Pei-Hao Su, Lin-shan Lee, " Interactive Spoken Content Retrieval by Extended Query Model and Continuous State Space Markov Decision Process", the 38th IEEE International Conference on Acoustics, Speech and Signal Processing Vancouver, Canada, May 2013 icon
  36. Hung-yi Lee, Tsung-Hsien Wen, Lin-shan Lee, "Improved Semantic Retrieval of Spoken Content by Language models Enhanced with Acoustic Similarity Graph", the 4th IEEE Workshop on Spoken Language Technology (SLT'12), Miami, Florida, December 2012
  37. Tsung-Hsien Wen, Hung-yi Lee, Lin-shan Lee, "Personalized Language Modeling by Crowd Sourcing with Social Network Data for Voice Access of Cloud Applications", the 4th IEEE Workshop on Spoken Language Technology (SLT'12), Miami, Florida, December 2012icon
  38. Hung-yi Lee, Yu-yu Chou, Yow-Bang Wang, Lin-shan Lee, "Supervised Spoken Document Summarization Jointly Considering Utterance Importance and Redundancy by Structured Support Vector Machine", the 13th Annual Conference of the International Speech Communication Association (INTERSPEECH'12), Portland, Oregon, September 2012
  39. Hung-yi Lee, Po-wei Chou, Lin-shan Lee, "Open-Vocabulary Retrieval of Spoken Content with Shorter/Longer Queries Considering Word/Subword-based Acoustic Feature Similarity", the 13th Annual Conference of the International Speech Communication Association (INTERSPEECH'12), Portland, Oregon, September 2012
  40. Hung-yi Lee, Yun-nung Chen, Lin-shan Lee, "Utterance-level Latent Topic Transition Modeling for Spoken Documents and its Application in Automatic Summarization", the 37th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP'12), Kyoto, Japan, March 2012
  41. Tsung-Hsien Wen, Hung-yi Lee, Lin-shan Lee, "Interactive Spoken Content Retrieval with Different Types of Actions Optimized by a Markov Decision Process", the 13th Annual Conference of the International Speech Communication Association (INTERSPEECH'12), Portland, Oregon, September 2012 (one of the 10 finalists for the best student paper award)
  42. Tsung-wei Tu, Hung-yi Lee, Lin-shan Lee, "Semantic Query Expansion and Context-based Discriminative Term Modeling for Spoken Document Retrieval", the 37th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP'12), Kyoto, Japan, March 2012 (IEEE Spoken Language Processing Student Travel Grant)
  43. Yun-Nung Chen, Yu Huang, Hung-yi Lee, Lin-shan Lee, "Unsupervised Two-Stage Keyword Extraction from Spoken Documents by Topic Coherence and Support Vector Machine", the 37th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP'12), Kyoto, Japan, March 2012
  44. Ching-Feng Yeh, Aaron Heidel, Hung-yi Lee, Lin-shan Lee, "Recognition of Highly Imbalanced Code-mixed Bilingual Speech with Frame-level Language Detection based on Blurred Posteriorgram", the 37th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP'12), Kyoto, Japan, March 2012
  45. Hung-yi Lee, Yun-nung Chen, Lin-shan Lee, "Improved Speech Summarization and Spoken Term Detection with Graphical Analysis of Utterance Similarities", the 3rd Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC 2011), Xi'an, China, October 2011
  46. Hung-yi Lee, Tsung-wei Tu, Chia-ping Chen, Chao-yu Huang, Lin-shan Lee , "Improved Spoken Term Detection Using Support Vector Machines based on Lattice Context Consistency", the 36th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP'11), Prague, Czech Republic, May 2011
  47. Tsung-wei Tu, Hung-yi Lee, Lin-shan Lee, "Improved Spoken Term Detection using Support Vector Machines with Acoustic and Context Features from Pseudo-relevance Feedback", the 9th biannual IEEE workshop on Automatic Speech Recognition and Understanding (ASRU'11), Hawaii, December 2011 (one of the 5 finalists for the best student paper award)
  48. Yun-nung Chen, Chia-ping Chen, Hung-yi Lee, Chun-an Chan, Lin-shan Lee, "Improved Spoken Term Detection with Graph-based Re-ranking in Feature Space", the 36th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP'11), Prague, Czech Republic, May 2011
  49. Hung-yi Lee, Chia-ping Chen, Ching-feng Yeh, Lin-shan Lee, "A Framework Integrating Different Relevance Feedback Scenarios and Approaches for Spoken Term Detection", the 3rd IEEE Workshop on Spoken Language Technology (SLT'10), Berkeley, California, December 2010
  50. Hung-yi Lee, Chia-ping Chen, Ching-feng Yeh, Lin-shan Lee, "Improved Spoken Term Detection by Discriminative Training of Acoustic Models based on User Relevance Feedback", the 11th Annual Conference of the International Speech Communication Association (INTERSPEECH'10), Makuhari, Japan, September 2010
  51. Hung-yi Lee and Lin-shan Lee, "Integrating Recognition and Retrieval with User Feedback: A New Framework for Spoken Term Detection", the 35th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP'10), Dallas, Texas, March 2010 (cited in textbook)
  52. Chia-ping Chen, Hung-yi Lee, Ching-feng Yeh, Lin-shan Lee, "Improved Spoken Term Detection by Feature Space Pseudo-Relevance Feedback", the 11th Annual Conference of the International Speech Communication Association (INTERSPEECH'10), Makuhari, Japan, September 2010
  53. Yu-Hui Chen, Chia-Chen Chou, Hung-yi Lee, Lin-shan Lee, "An Initial Attempt to Improve Spoken Term Detection by Learning Optimal Weights for Different Indexing Features", the 35th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP'10), Dallas, Texas, March 2010 (cited in textbook)
  54. Hung-yi Lee, Yueh-Lien Tang, Hao Tang, Lin-shan Lee, "Spoken Term Detection from Bilingual Spontaneous Speech Using Code-switched Lattice-based Structures for Words and Subword Units", the 8th biannual IEEE workshop on Automatic Speech Recognition and Understanding, (ASRU'09), Merano, Italy, December 2009
  55. Chao-hong Meng, Hung-yi Lee, Lin-shan Lee, "Improved Lattice-based Spoken Document Retrieval by Directly Learning from the evaluation Measures", the 34th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP'09), Taipei, Taiwan, April 2009

Journal

  1. Hung-yi Lee, Bo-Hsiang Tseng, Tsung-Hsien Wen, Yu Tsao, "Personalizing Recurrent Neural Network Based Language Model by Social Network," IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 25, no. 3, pp. 519-530, March 2017 icon
  2. Lin-shan Lee, James Glass, Hung-yi Lee, Chun-an Chan, "Spoken Content Retrieval —Beyond Cascading Speech Recognition with Text Retrieval," IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol.23, no.9, pp.1389-1420, Sept. 2015 icon
  3. Hung-yi Lee, Ching-feng Yeh, Yun-Nung Chen, Yu Huang, Sheng-Yi Kong and Lin-shan Lee, “Spoken Knowledge Organization by Semantic Structuring and a Prototype Course Lecture System for Personalized Learning”, IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol.22, no.5, pp.883-898, May 2014 icon (Figure 9 of the article selected as journal cover)
  4. Hung-yi Lee, Po-wei Chou, Lin-shan Lee, Improved open-vocabulary spoken content retrieval with word and subword lattices using acoustic feature similarity, Computer Speech & Language, Volume 28, Issue 5, pp. 1045-1065, Sept. 2014icon
  5. Hung-yi Lee, Lin-shan Lee, "Improved Semantic Retrieval of Spoken Content by Document/Query Expansion with Random Walk over Acoustic Similarity Graphs," IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol.22, no.1, pp.80-94, Jan. 2014icon(Figure 2 of the article selected as journal cover)
  6. Hung-yi Lee, Lin-shan Lee, "Enhanced Spoken Term Detection Using Support Vector Machines and Weighted Pseudo Examples," IEEE Transactions on Audio, Speech, and Language Processing, vol.21, no.6, pp.1272-1284, June 2013icon
  7. Hung-yi Lee, Chia-ping Chen, Lin-shan Lee, "Integrating Recognition and Retrieval with Relevance Feedback for Spoken Term Detection," IEEE Transactions on Audio, Speech, and Language Processing, vol.20, no.7, pp.2095-2110, Sept. 2012icon
  8. Yi-cheng Pan, Hung-yi Lee, Lin-shan Lee, "Interactive Spoken Document Retrieval With Suggested Key Terms Ranked by a Markov Decision Process", IEEE Transactions on Audio, Speech, and Language Processing, vol.20, issue.2, pp. 632-645, Feb. 2012icon