Slide 1 (Lab531, 1:20:43)

  • 1. Speech Production and Source Model
  • 7. Waveform plots of typical vowel sounds - Voiced
  • 3. Speech Production and Source Model
  • 4. Slide 4
  • 5. Waveform plots of typical consonant sounds
  • 6. Slide 4
  • 7. Speech Production and Source Model
  • 8. Slide 4
  • 9. Speech Production and Source Model
  • 10. Waveform plots of typical vowel sounds - Voiced
  • 11. Speech Production and Source Model
  • 12. Slide 4
  • 13. Waveform plots of typical consonant sounds
  • 14. Waveform plot of a sentence
  • 15. Time and Frequency Domains (P.12 of 2.0)
  • 16. Frequency domain spectra of speech signals
  • 17. Frequency Domain
  • 18. Frequency domain spectra of speech signals
  • 19. Time and Frequency Domains (P.12 of 2.0)
  • 20. Waveform plot of a sentence
  • 21. Waveform plots of typical consonant sounds
  • 22. Slide 4
  • 23. Waveform plots of typical consonant sounds
  • 24. Waveform plot of a sentence
  • 25. Time and Frequency Domains (P.12 of 2.0)
  • 26. Frequency domain spectra of speech signals
  • 27. Frequency Domain
  • 28. Input/Output Relationship for Time/Frequency Domains
  • 29. Spectrogram
  • 30. Spectrogram
  • 31. Formant Frequencies
  • 32. Formant frequency contours
  • 33. Voiced/unvoiced, Pitch/tone, Vocal tract, Frequency domain/formant frequency, Spectrogram representation, Speech Source Model
  • 34. Slide 16
  • 35. Voiced/unvoiced, Pitch/tone, Vocal tract, Frequency domain/formant frequency, Spectrogram representation, Speech Source Model
  • 36. Slide 16
  • 37. Voiced/unvoiced, Pitch/tone, Vocal tract, Frequency domain/formant frequency, Spectrogram representation, Speech Source Model
  • 38. Slide 16
  • 39. Speech Source Model
  • 40. Simplified Speech Source Model
  • 41. Speech Source Model
  • 42. Feature Extraction - MFCC
  • 43. Pre-emphasis
  • 44. Feature Extraction - MFCC
  • 45. Pre-emphasis
  • 46. Why pre-emphasis?
  • 47. Pre-emphasis
  • 48. Feature Extraction - MFCC
  • 49. Speech Source Model
  • 50. Simplified Speech Source Model
  • 51. Speech Source Model
  • 52. Slide 16
  • 53. Voiced/unvoiced, Pitch/tone, Vocal tract, Frequency domain/formant frequency, Spectrogram representation, Speech Source Model
  • 54. Formant frequency contours
  • 55. Formant Frequencies
  • 56. Spectrogram
  • 57. Spectrogram
  • 58. Input/Output Relationship for Time/Frequency Domains
  • 59. Frequency Domain
  • 60. Frequency domain spectra of speech signals
  • 61. Frequency Domain
  • 62. Input/Output Relationship for Time/Frequency Domains
  • 63. Spectrogram
  • 64. Spectrogram
  • 65. Formant Frequencies
  • 66. Formant frequency contours
  • 67. Voiced/unvoiced, Pitch/tone, Vocal tract, Frequency domain/formant frequency, Spectrogram representation, Speech Source Model
  • 68. Slide 16
  • 69. Speech Source Model
  • 70. Simplified Speech Source Model
  • 71. Speech Source Model
  • 72. Feature Extraction - MFCC
  • 73. Pre-emphasis
  • 74. Why pre-emphasis?
  • 75. Why Windowing?
  • 76. Waveform plot of a sentence
  • 77. Why Windowing?
  • 78. Waveform plot of a sentence
  • 79. Why Windowing?
  • 80. Waveform plot of a sentence
  • 81. Slide 25
  • 82. Waveform plot of a sentence
  • 83. Slide 25
  • 84. Effect of Windowing (1)
  • 85. Input/Output Relationship for Time/Frequency Domains (P.10 of 7.0)
  • 86. Windowing
  • 87. Input/Output Relationship for Time/Frequency Domains (P.10 of 7.0)
  • 88. Effect of Windowing (1)
  • 89. Input/Output Relationship for Time/Frequency Domains (P.10 of 7.0)
  • 90. Windowing
  • 91. Effect of Windowing (2)
  • 92. Windowing
  • 93. Effect of Windowing (2)
  • 94. Windowing
  • 95. Input/Output Relationship for Time/Frequency Domains (P.10 of 7.0)
  • 96. Effect of Windowing (1)
  • 97. Slide 25
  • 98. Waveform plot of a sentence
  • 99. Slide 25
  • 100. Effect of Windowing (1)
  • 101. Input/Output Relationship for Time/Frequency Domains (P.10 of 7.0)
  • 102. Windowing
  • 103. Effect of Windowing (2)
  • 104. DFT and Mel-filter-bank Processing
  • 105. Peripheral Processing for Human Perception
  • 106. Mel-scale Filter Bank
  • 107. Peripheral Processing for Human Perception
  • 108. Mel-scale Filter Bank
  • 109. Peripheral Processing for Human Perception
  • 110. DFT and Mel-filter-bank Processing
  • 111. Peripheral Processing for Human Perception
  • 112. Mel-scale Filter Bank
  • 113. Why Filter-bank Processing?
  • 114. Mel-scale Filter Bank
  • 115. Peripheral Processing for Human Perception
  • 116. DFT and Mel-filter-bank Processing
  • 117. Peripheral Processing for Human Perception
  • 118. Mel-scale Filter Bank
  • 119. Peripheral Processing for Human Perception
  • 120. DFT and Mel-filter-bank Processing
  • 121. Peripheral Processing for Human Perception
  • 122. Mel-scale Filter Bank
  • 123. Why Filter-bank Processing?
  • 124. Feature Extraction - MFCC
  • 125. Logarithmic Operation and IDFT
  • 126. Why Log Energy Computation?
  • 127. Logarithmic Operation and IDFT
  • 128. Why Log Energy Computation?
  • 129. Logarithmic Operation and IDFT
  • 130. Why Log Energy Computation?
  • 131. Logarithmic Operation and IDFT
  • 132. Why Log Energy Computation?
  • 133. Why Inverse DFT?
  • 134. Why Log Energy Computation?
  • 135. Logarithmic Operation and IDFT
  • 136. Why Log Energy Computation?
  • 137. Why Inverse DFT?
  • 138. Why Log Energy Computation?
  • 139. Logarithmic Operation and IDFT
  • 140. Feature Extraction - MFCC
  • 141. Why Filter-bank Processing?
  • 142. Mel-scale Filter Bank
  • 143. Peripheral Processing for Human Perception
  • 144. DFT and Mel-filter-bank Processing
  • 145. Effect of Windowing (2)
  • 146. Windowing
  • 147. Input/Output Relationship for Time/Frequency Domains (P.10 of 7.0)
  • 148. Windowing
  • 149. Effect of Windowing (2)
  • 150. DFT and Mel-filter-bank Processing
  • 151. Peripheral Processing for Human Perception
  • 152. Mel-scale Filter Bank
  • 153. Why Filter-bank Processing?
  • 154. Feature Extraction - MFCC
  • 155. Logarithmic Operation and IDFT
  • 156. Why Log Energy Computation?
  • 157. Why Inverse DFT?
  • 158. Speech Production and Source Model (P.3 of 7.0)
  • 159. Slide 39
  • 160. Frequency domain spectra of speech signals (P.8 of 7.0)
  • 161. Frequency Domain (P.9 of 7.0)
  • 162. Input/Output Relationship for Time/Frequency Domains (P.10 of 7.0)
  • 163. Slide 43
  • 164. Input/Output Relationship for Time/Frequency Domains (P.10 of 7.0)
  • 165. Frequency Domain (P.9 of 7.0)
  • 166. Frequency domain spectra of speech signals (P.8 of 7.0)
  • 167. Slide 39
  • 168. Speech Production and Source Model (P.3 of 7.0)
  • 169. Why Inverse DFT?
  • 170. Why Log Energy Computation?
  • 171. Logarithmic Operation and IDFT
  • 172. Why Log Energy Computation?
  • 173. Why Inverse DFT?
  • 174. Why Log Energy Computation?
  • 175. Logarithmic Operation and IDFT
  • 176. Feature Extraction - MFCC
  • 177. Why Filter-bank Processing?
  • 178. Feature Extraction - MFCC
  • 179. Logarithmic Operation and IDFT
  • 180. Why Log Energy Computation?
  • 181. Why Inverse DFT?
  • 182. Speech Production and Source Model (P.3 of 7.0)
  • 183. Slide 39
  • 184. Frequency domain spectra of speech signals (P.8 of 7.0)
  • 185. Frequency Domain (P.9 of 7.0)
  • 186. Input/Output Relationship for Time/Frequency Domains (P.10 of 7.0)
  • 187. Slide 43
  • 188. Derivatives
  • 189. Slide 43
  • 190. Slide 43
  • 191. Derivatives
  • 192. (xi, yi)
  • 193. Derivatives
  • 194. (xi, yi)
  • 195. Derivatives
  • 196. Slide 43
  • 197. Derivatives
  • 198. (xi, yi)
  • 199. Why Delta Coefficients?
  • 200. (xi, yi)
  • 201. Derivatives
  • 202. (xi, yi)
  • 203. Why Delta Coefficients?
  • 204. (xi, yi)
  • 205. Why Delta Coefficients?
  • 206. (xi, yi)
  • 207. Derivatives
  • 208. (xi, yi)
  • 209. Why Delta Coefficients?
  • 210. Slide 47
  • 211. Why Delta Coefficients?
  • 212. Slide 47
  • 213. Why Delta Coefficients?
  • 214. Slide 47
  • 215. End-point Detection
  • 216. End-point Detection
  • 217. End-point Detection
  • 218. Slide 47
  • 219. End-point Detection
  • 220. End-point Detection
  • 221. End-point Detection
  • 222. End-point Detection
  • 223. End-point Detection
  • 224. End-point Detection
  • 225. Websites related to phonetics, signal waveforms, and spectral characteristics