2017
  1. Kuan-Yu Chen, Shih-Hung Liu, Berlin Chen, and Hsin-Min Wang, "An Information Distillation Framework for Extractive Summarization," accepted to appear in IEEE/ACM Transactions on Audio, Speech, and Language Processing.
  2. Jen-Cheng Hou, Syu-Siang Wang, Ying-Hui Lai, Yu Tsao, Hsiu-Wen Chang, and Hsin-Min Wang, "Audio-Visual Speech Enhancement Based on Multimodal Deep Convolutional Neural Networks," accepted to appear in IEEE Transactions on Emerging Topics in Computational Intelligence.
  3. Hsin-Te Hwang, Yi-Chiao Wu, Yu-Huai Peng, Chin-Cheng Hsu, Yu Tsao, Hsin-Min Wang, Yih-Ru Wang, and Sin-Horng Chen, "Voice Conversion Based on Locally Linear Embedding," accepted to appear in Journal of Information Science and Engineering.
  4. Hsin-Te Hwang, Yi-Chiao Wu, Syu-Siang Wang, Chin-Cheng Hsu, Yu Tsao, Hsin-Min Wang, Yih-Ru Wang, and Sin-Horng Chen, "Locally Linear Embedding Based Post-filtering for Speech Enhancement," accepted to appear in Journal of Information Science and Engineering.
  5. Yu-Ding Lu, Yu Tsao, Hung-Shin Lee, Hsin-Min Wang, "A Replay Spoofing Detection System Based on Discriminative Autoencoders," accepted to appear in International Journal of Computational Linguistics and Chinese Language Processing. (in Chinese)
  6. Tien-Hong Lo, Ying-Wen Chen, Berlin Chen, Kuan-Yu Chen, and Hsin-Min Wang, "Exploring Query Intent and Neural Network Modeling Techniques for Spoken Document Retrieval," accepted to appear in International Journal of Computational Linguistics and Chinese Language Processing. (in Chinese)
  7. Shih-Hung Liu, Kuan-Yu Chen, Yu-Lun Hsieh, Berlin Chen, Hsin-Min Wang, Hsu-Chun Yen, and Wen-Lian Hsu, "A Position-Aware Language Modeling Framework for Extractive Broadcast News Speech Summarization," ACM Transactions on Asian and Low-Resource Language Information Processing, 16(4), Article 27, August 2017.
  8. Wen-Li Wei, Jen-Chun Lin, Tyng-Luh Liu, Yi-Hsuan Yang, Hsin-Min Wang, Hsiao-Rong Tyan, and Hong-Yuan Mark Liao, "Deep-Net Fusion to Classify Shots in Concert Videos," in Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2017), New Orleans, LA, USA, March 2017.
  9. Po-Yuan Shih, Chia-Ping Chen, and Hsin-Min Wang, "Speech Emotion Recognition with Skew-Robust Neural Networks," in Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2017), New Orleans, LA, USA, March 2017.
  10. Kuan-Yu Chen, Shih-Hung Liu, Berlin Chen, and Hsin-Min Wang, "A Locality-Preserving Essence Vector Modeling Framework for Spoken Document Retrieval," in Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2017), New Orleans, LA, USA, March 2017.
  11. Hung-Shin Lee, Yu-Ding Lu, Chin-Cheng Hsu, Yu Tsao, Hsin-Min Wang, and Shyh-Kang Jeng, "Discriminative Autoencoders for Speaker Verification," in Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2017), New Orleans, LA, USA, March 2017.
  12. Shih-Hung Liu, Kuan-Yu Chen, Berlin Chen, Hsin-Min Wang, and Wen-Lian Hsu, "Leveraging Manifold Learning for Extractive Broadcast News Summarization," in Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2017), New Orleans, LA, USA, March 2017.
  13. Yi-Chiao Wu, Hsin-Te Hwang, Syu-Siang Wang, Chin-Cheng Hsu, Ying-Hui Lai, Yu Tsao, and Hsin-Min Wang, "A Locally Linear Embbeding Based Postfiltering Approach for Speech Enhancement," in Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2017), New Orleans, LA, USA, March 2017.
  14. Yi-Chiao Wu, Hsin-Te Hwang, Syu-Siang Wang, Chin-Cheng Hsu, Yu Tsao, and Hsin-Min Wang, "A Post-filtering Approach Based on Locally Linear Embedding Difference Compensation for Speech Enhancement," in Proc. Interspeech2017, Stockholm, Sweden, August 2017.
  15. Ming-Han Yang, Hung-Shin Lee, Yu-Ding Lu, Kuan-Yu Chen, Yu Tsao, Berlin Chen, and Hsin-Min Wang, "Discriminative Autoencoders for Acoustic Modeling," in Proc. Interspeech2017, Stockholm, Sweden, August 2017.
  16. Chin-Cheng Hsu, Hsin-Te Hwang, Yi-Chiao Wu, Yu Tsao, and Hsin-Min Wang, "Voice Conversion from Unaligned Corpora Using Variational Autoencoding Wasserstein Generative Adversarial Networks," in Proc. Interspeech2017, Stockholm, Sweden, August 2017.
  17. Ying-Wen Chen, Kuan-Yu Chen, Hsin-Min Wang, and Berlin Chen, "Exploring the Use of Significant Words Language Modeling for Spoken Document Retrieval," in Proc. Interspeech2017, Stockholm, Sweden, August 2017.
  18. Chia-Lung Wu, Hsiang-Ping Hsu, Syu-Siang Wang, Jeih-Weih Hung, Ying-Hui Lai, Hsin-Min Wang, and Yu Tsao, "Wavelet Speech Enhancement Based on Robust Principal Component Analysis," in Proc. Interspeech2017, Stockholm, Sweden, August 2017.
  19. Jen-Chun Lin, Wen-Li Wei, James Yang, Hsin-Min Wang, and Hong-Yuan Mark Liao, "Automatic Music Video Generation Based on Simultaneous Soundtrack Recommendation and Video Editing," in Proc. ACM Multimedia Conference 2017, Mountain View, CA, USA, October 2017.
  20. Cheng-Jo Ray Chang, Hung-Shin Lee, Hsin-Min Wang, Jyh-Shing Roger Jang, "基於i-vector與PLDA並使用GMM-HMM強制對位之自動語者分段標記系統," in Proc. The 29th ROCLING Conference on Computational Linguistics and Speech Processing (ROCLING2017), Taipei, Taiwan, November 2017. (in Chinese)
  21. Yu-Huai Peng, Chin-Cheng Hsu, Yi-Chiao Wu, Hsin-Te Hwang, Yi-Wen Liu, Yu Tsao, and Hsin-Min Wang, "Fast Locally Linear Embedding Algorithm for Exemplar-based Voice Conversion," in Proc. APSIPA Annual Summit and Conference (APSIPA ASC 2017), Kuala Lumpur, Malaysia, December 2017.
  22. Ming-Hsiang Su, Chung-Hsien Wu, Kun-Yi Huang, Qian-Bei Hong, and Hsin-Min Wang, "Personality Trait Perception from Speech Signals Using Multiresolution Analysis and Convolutional Neural Networks," in Proc. APSIPA Annual Summit and Conference (APSIPA ASC 2017), Kuala Lumpur, Malaysia, December 2017.
  23. Tien-Hong Lo, Ying-Wen Chen, Kuan-Yu Chen, Hsin-Min Wang, and Berlin Chen, "Neural Relevance-Aware Query Modeling for Spoken Document Retrieval," in Proc. IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU 2017), Okinawa, Japan, December 2017.

2016
  1. Yu-Ren Chien, Hsin-Min Wang, and Shyh-Kang Jeng, "Alignment of Lyrics With Accompanied Singing Audio Based on Acoustic-Phonetic Vowel Likelihood Modeling," IEEE/ACM Transactions on Audio, Speech, and Language Processing, 24(11), pp. 1998-2008, November 2016.
  2. Kuan-Yu Chen, Shih-Hung Liu, Berlin Chen, Hsin-Min Wang, and Hsin-Hsi Chen, "Exploring the Use of Unsupervised Query Modeling Techniques for Speech Recognition and Summarization," Speech Communication, 80, pp. 49-59, June 2016.
  3. Jen-Chun Lin, Wen-Li Wei, and Hsin-Min Wang, "DEMV-Matchmaker: Emotional Temporal Course Representation and Deep Similarity Matching for Automatic Music Video Generation," in Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2016), Shanghai, China, March 2016.
  4. Kuan-Yu Chen, Shih-Hung Liu, Berlin Chen, and Hsin-Min Wang, "Improved Spoken Document Summarization with Coverage Modeling Techniques," in Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2016), Shanghai, China, March 2016.
  5. Yi-Chiao Wu, Hsin-Te Hwang, Chin-Cheng Hsu, Yu Tsao, and Hsin-Min Wang, "Locally Linear Embedding for Exemplar-Based Spectral Conversion," in Proc. Interspeech2016, San Francisco, USA, September 2016.
  6. Hung-Shin Lee, Yu Tsao, Chi-Chun Lee, Hsin-Min Wang, Wei-Cheng Lin, Wei-Chen Chen, Shan-Wen Hsiao, and Shyh-Kang Jeng, "Minimization of Regression and Ranking Losses with Shallow Neural Networks on Automatic Sincerity Evaluation," in Proc. Interspeech2016, San Francisco, USA, September 2016.
  7. Shih-Hung Liu, Kuan-Yu Chen, Yu-Lun Hsieh, Berlin Chen, Hsin-Min Wang, Hsu-Chun Yen, and Wen-Lian Hsu, "Exploring Word Mover’s Distance and Semantic-Aware Embedding Techniques for Extractive Broadcast News Summarization," in Proc. Interspeech2016, San Francisco, USA, September 2016.
  8. Yu-Lun Hsieh, Shih-Hung Liu, Kuan-Yu Chen, Hsin-Min Wang, Wen-Lian Hsu, and Berlin Chen, "Exploiting Sequence-to-Sequence Generation Framework for Automatic Abstractive Summarization," in Proc. The 28th ROCLING Conference on Computational Linguistics and Speech Processing (ROCLING2016), Tainan, Taiwan, September 2016. (in Chinese)
  9. Kuan-Yu Chen, Shih-Hung Liu, Berlin Chen, Hsin-Min Wang, and Hsin-Hsi Chen, "Novel Word Embedding and Translation-based Language Modeling for Extractive Speech Summarization," in Proc. ACM Multimedia Conference 2016, Amsterdam, The Netherlands, October 2016, (SHORT PAPER, acceptance rate = 30%)
  10. Jen-Chun Lin, Wen-Li Wei, and Hsin-Min Wang, "Automatic Music Video Generation Based on Emotion-Oriented Pseudo Song Prediction and Matching," in Proc. ACM Multimedia Conference 2016, Amsterdam, The Netherlands, October 2016, (SHORT PAPER, acceptance rate = 30%)
  11. Chin-Cheng Hsu, Hsin-Te Hwang, Yi-Chiao Wu, Yu Tsao, and Hsin-Min Wang, "Dictionary Update for NMF-based Voice Conversion Using an Encoder-Decoder Network," in Proc. International Symposium on Chinese Spoken Language Processing (ISCSLP2016), Tianjin, China, October 2016.
  12. Chin-Cheng Hsu, Hsin-Te Hwang, Yi-Chiao Wu, Yu Tsao and Hsin-Min Wang, "Voice Conversion from Non-parallel Corpora Using Variational Auto-encoder," in Proc. APSIPA Annual Summit and Conference (APSIPA ASC 2016), Jeju Island, Korea, December 2016.
  13. Jen-Cheng Hou, Syu-Siang Wang, Ying-Hui Lai, Jen-Chun Lin, Yu Tsao, Hsiu-Wen Chang, and Hsin-Min Wang, "Audio-Visual Speech Enhancement using Deep Neural Networks," in Proc. APSIPA Annual Summit and Conference (APSIPA ASC 2016), Jeju Island, Korea, December 2016.
  14. Kuan-Yu Chen, Shih-Hung Liu, Berlin Chen and Hsin-Min Wang, "A Novel Paragraph Embedding Method for Spoken Document Summarization," in Proc. APSIPA Annual Summit and Conference (APSIPA ASC 2016), Jeju Island, Korea, December 2016.
  15. Shih-Hung Liu, Kuan-Yu Chen, Yu-Lun Hsieh, Berlin Chen, Hsin-Min Wang, Hsu-Chun Yen and Wen-Lian Hsu, "Exploiting Graph Regularized Nonnegative Matrix Factorization for Extractive Speech Summarization," in Proc. APSIPA Annual Summit and Conference (APSIPA ASC 2016), Jeju Island, Korea, December 2016.
  16. Kuan-Yu Chen, Shih-Hung Liu, Berlin Chen and Hsin-Min Wang, "Learning to Distill: The Essence Vector Modeling Framework," in Proc. International Conference on Computational Linguistics (COLING2016), Osaka, Japan, December 2016.

2015
  1. Kai-Wun Shih, Kuan-Yu Chen, Shih-Hung Liu, Hsin-Min Wang, and Berlin Chen, "Extractive spoken document summarization with representation learning techniques," International Journal of Computational Linguistics and Chinese Language Processing, 20(2), pp. 65-86, December 2015. (in Chinese)
  2. Ting-Hao Chang, Hsiao-Tsung Hung, Kuan-Yu Chen, Hsin-Min Wang, and Berlin Chen, "Investigating modulation spectrum factorization techniques for robust speech recognition," International Journal of Computational Linguistics and Chinese Language Processing, 20(2), pp. 87-106, December 2015. (in Chinese)
  3. Kuan-Yu Chen, Hsin-Min Wang, and Hsin-Hsi Chen, "A Probabilistic Framework for Chinese Spelling Check," ACM Transactions on Asian and Low-Resource Language Information Processing - Special Issue on Chinese Spell Checking, 14(4), Article 15: 17 pages, October 2015.
  4. Yu-Ren Chien, Hsin-Min Wang, and Shyh-Kang Jeng, "An Acoustic-Phonetic Model of F0 Likelihood for Vocal Melody Extraction," IEEE/ACM Transactions on Audio, Speech, and Language Processing, 23(9), pp. 1457-1468, September 2015.
  5. Kuan-Yu Chen, Shih-Hung Liu, Berlin Chen, Hsin-Min Wang, Ea-Ee Jan, Wen-Lian Hsu, and Hsin-Hsi Chen, "Extractive Broadcast News Summarization Leveraging Recurrent Neural Network Language Modeling Techniques," IEEE/ACM Transactions on Audio, Speech, and Language Processing, 23(8), pp. 1322-1334, August 2015.
  6. Shih-Hung Liu, Kuan-Yu Chen, Berlin Chen, Hsin-Min Wang, Hsu-Chun Yen, and Wen-Lian Hsu, "Combining Relevance Language Modeling and Clarity Measure for Extractive Speech Summarization," IEEE/ACM Trans. on Audio, Speech, and Language Processing, 23(6), pp. 957-969, June 2015.
  7. Ju-Chiang Wang, Yi-Hsuan Yang, Hsin-Min Wang, and Shyh-Kang Jeng, "Modeling the Affective Content of Music with a Gaussian Mixture Model," IEEE Transactions on Affective Computing, 6(1), pp. 56-68, January-March, 2015. Source codes are available at http://slam.iis.sinica.edu.tw/demo/AEG/.
  8. Ju-Chiang Wang, Hsin-Min Wang, and Gert Lanckriet, "A Histogram Density Modeling Approach to Music Emotion Recognition," in Proc. The 40th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2015), Brisbane, Australia, April 2015. Source codes are available at https://github.com/asriverwang/HDM_codes.
  9. Kuan-Yu Chen, Hsin-Min Wang, Berlin Chen, and Hsin-Hsi Chen, "I-Vector based Language Modeling for Query Representation," in Proc. The 40th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2015), Brisbane, Australia, April 2015.
  10. Kuan-Yu Chen, Shih-Hung Liu, Hsin-Min Wang, Berlin Chen and Hsin-Hsi Chen, "Leveraging Word Embeddings for Spoken Document Summarization," in Proc. Interspeech2015, Dresden, Germany, September 2015.
  11. Shih-Hung Liu, Kuan-Yu Chen, Berlin Chen, Hsin-Min Wang, Hsu-Chun Yen and Wen-Lian Hsu, "Positional Language Modeling for Extractive Broadcast News Speech Summarization," in Proc. Interspeech2015, Dresden, Germany, September 2015.
  12. Jen-Chun Lin, Wen-Li Wei, and Hsin-Min Wang, "EMV-matchmaker: Emotional Temporal Course Modeling and Matching for Automatic Music Video Generation," in Proc. ACM Multimedia Conference (ACMMM2015), Brisbane, Australia, October 2015. (Short Paper)
  13. Kuan-Yu Chen, Kai-Wun Shih, Shih-Hung Liu, Berlin Chen, and Hsin-Min Wang, "Incorporating Paragraph Embeddings and Density Peaks Clustering for Spoken Document Summarization," IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU2015), Scottsdale, Arizona, USA, December 13-17, 2015.
  14. Shih-Hung Liu, Hung-Shih Lee, Hsiao-Tsung Hung, Kuan-Yu Chen, Berlin Chen, Hsin-Min Wang, Hsu-Chun Yen, and Wen-Lian Hsu, "Incorporating Proximity Information in Relevance Language Modeling for Extractive Speech Summarization," APSIPA Annual Summit and Conference (APSIPA ASC 2015), Hong Kong, December 16-19, 2015.
  15. Syu-Siang Wang, Hsin-Te Hwang, Ying-Hui Lai, Yu Tsao, Xugang Lu, Hsin-Min Wang and Borching Su, "Improving Denoising Auto-encoder Based Speech Enhancement With the Speech Parameter Generation Algorithm," APSIPA Annual Summit and Conference (APSIPA ASC 2015), Hong Kong, December 16-19, 2015.
  16. Hsin-Te Hwang, Yu Tsao, Hsin-Min Wang, Yih-Ru Wang and Sin-Horng Chen, "A Probabilistic Interpretation for Artificial Neural Network-based Voice Conversion," APSIPA Annual Summit and Conference (APSIPA ASC 2015), Hong Kong, December 16-19, 2015.

2014
  1. Hung-Yi Lo, Shou-De Lin, and Hsin-Min Wang, "Generalized k-Labelsets Ensemble for Multi-Label and Cost-Sensitive Classification," IEEE Transactions on Knowledge and Data Engineering, 26(7), pp. 1679-1691, July 2014.
  2. Berlin Chen, Yi-Wen Chen, Kuan-Yu Chen, Hsin-Min Wang, and Kuen-Tyng Yu, "Enhancing query formulation for spoken document retrieval,"Journal of Information Science and Engineering, 30(3), pp. 553-569, May 2014.
  3. Kuan-Yu Chen, Hung-Shin Lee, Hsin-Min Wang, Berlin Chen, and Hsin-Hsi Chen, "I-vector Based Language Modeling for Spoken Document Retrieval," in Proc. The 39th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2014), Florence, Italy, May 2014.
  4. Shih-Hung Liu, Kuan-Yu Chen, Yu-Lun Hsieh, Berlin Chen, Hsin-Min Wang, Hsu-Chun Yen, and Wen-Lian Hsu, "Effective Pseudo-relevance Feedback for Language Modeling in Extractive Speech Summarization," in Proc. The 39th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2014), Florence, Italy, May 2014.
  5. Chin-Chia Michael Yeh, Ju-Chiang Wang, Yi-Hsuan Yang, and Hsin-Min Wang, "Improving Music Auto-Tagging by Intra-Song Instance Bagging," in Proc. The 39th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2014), Florence, Italy, May 2014.
  6. Hung-Shin Lee, Yu Tsao, Yun-Fan Chang, Hsin-Min Wang, and Shyh-Kang Jeng, "Speaker Verification Using Kernel-Based Binary Classifiers with Binary Operation Derived Features," in Proc. The 39th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2014), Florence, Italy, May 2014.
  7. Shuo-Yang Wang, Ju-Chiang Wang, Yi-Hsuan Yang, and Hsin-Min Wang, "Towards Time-Varying Music Auto-Tagging based on CAL500 Expansion," in Proc. IEEE International Conference on Multimedia & Expo (ICME 2014), Chengdu, China, July 2014.
  8. Kuan-Yu Chen, Shih-Hung Liu, Berlin Chen, Hsin-Min Wang, Wen-Lian Hsu, and Hsin-Hsi Chen, "A Recurrent Neural Network Language Modeling Framework for Extractive Speech Summarization," in Proc. IEEE International Conference on Multimedia & Expo (ICME 2014), Chengdu, China, July 2014.
  9. Hung-Shin Lee, Yu Tsao, Hsin-Min Wang, and Shyh-Kang Jeng, "Clustering-Based I-Vector Formulation for Speaker Recognition," in Proc. Interspeech2014, Singapore, September 2014.
  10. Shih-Hung Liu, Kuan-Yu Chen, Yu-Lun Hsieh, Berlin Chen, Hsin-Min Wang, Hsu-Chun Yen, and Wen-Lian Hsu, "Enhanced Language Modeling for Extractive Speech Summarization with Sentence Relatedness Information," in Proc. Interspeech2014, Singapore, September 2014.
  11. How Jing, Ting-Yao Hu, Hung-Shin Lee, Wei-Chen Chen, Chi-Chun Lee, Yu Tsao, and Hsin-Min Wang, "Ensemble of Machine Learning Algorithms for Cognitive and Physical Speaker Load Detection," in Proc. Interspeech2014, Singapore, September 2014.
  12. Shih-Hung Liu, Kuan-Yu Chen, Yu-Lun Hsieh, Hsin-Min Wang, Wen-Lian Hsu, and Berlin Chen, "Investigating Novel Sentence Modeling Techniques for Extractive Speech Summarization," in Proc. The 26th ROCLING Conference on Computational Linguistics and Speech Processing, Jhongli, Taiwan, September 2014. (in Chinese)
  13. Kuan-Yu Chen, Shih-Hung Liu, Berlin Chen, Ea-Ee Jan, Hsin-Min Wang, Wen-Lian Hsu, and Hsin-Hsi Chen, "Leveraging Effective Query Modeling Techniques for Speech Recognition and Summarization," in Proc. Conference on Empirical Methods in Natural Language Processing (EMNLP2014), Doha, Qatar, October 2014. (Short Paper)
  14. Ju-Chiang Wang, Ming-Chi Yen, Yi-Hsuan Yang, and Hsin-Min Wang, "Automatic Set List Identification and Song Segmentation of Full-Length Concert Videos," in Proc. International Society for Music Information Retrieval Conference (ISMIR2014), Taipei, Taiwan, October 2014.
  15. Shih-Hung Liu, Kuan-Yu Chen, Berlin Chen, Ea-Ee Jan, Hsin-Min Wang, Hsu-Chun Yen, and Wen-Lian Hsu, "A Margin-based Discriminative Modeling Approach for Extractive Speech Summarization," in Proc. APSIPA Annual Summit and Conference (APSIPA ASC 2014), Siem Reap, Cambodia, December 2014.
  16. Jen-Chun Lin, Wen-Li Wei, Chung-Hsien Wu, and Hsin-Min Wang, "Emotion Recognition of Conversational Affective Speech Using Temporal Course Modeling-Based Error Weighted Cross-Correlation Model," in Proc. APSIPA Annual Summit and Conference (APSIPA ASC 2014), Siem Reap, Cambodia, December 2014.

2013
  1. Meng-Sung Wu, Chia-Ping Chen, and Hsin-Min Wang, "Query-document Relevance Topic Models," in Proc. The 17th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD2013), Lecture Notes in Artificial Intelligence 7819, Gold Coast, Australia, April 2013.
  2. Hung-Shin Lee, Yu-Chin Shih, Hsin-Min Wang, and Shyh-Kang Jeng, "Subspace-based Phonotactic Language Recognition Using Multivariate Dynamic Linear Models," in Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing (ICASSP2013), Vancouver, Canada, May 2013.
  3. Kuan-Yu Chen, Hsin-Min Wang, Berlin Chen, and Hsin-Hsi Chen, "Weighted Matrix Factorization for Spoken Document Retrieval," in Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing (ICASSP2013), Vancouver, Canada, May 2013.
  4. Yi-Wen Chen, Kuan-Yu Chen, Hsin-Min Wang, and Berlin Chen, "Effective Pseudo-Relevance Feedback for Spoken Document Retrieval," in Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing (ICASSP2013), Vancouver, Canada, May 2013.
  5. Hsin-Te Hwang, Yu Tsao, Hsin-Min Wang, Yih-Ru Wang, and Sin-Horng Chen, "Alleviating the Over-Smoothing Problem in GMM-Based Voice Conversion with Discriminative Training," in Proc. Interspeech2013, Leon, France, August 2013.
  6. Shih-Hung Liu, Kuan-Yu Chen, Hsin-Min Wang, Wen-Lian Hsu, and Berlin Chen, "Improved Sentence Modeling Techniques for Extractive Speech Summarization," in Proc. The 25th ROCLING Conference on Computational Linguistics and Speech Processing, Kaohsiung, Taiwan, October 2013. (in Chinese)
  7. How Jing, Yu Tsao, Kuan-Yu Chen, and Hsin-Min Wang, "Naive Bayes Classifier for Document Classification," in Proc. International Joint Conference on Natural Language Processing (IJCNLP2013), Nagoya, Japan, October 2013.
  8. Kuan-Yu Chen, Hung-Shin Lee, Chung-Han Lee, Hsin-Min Wang, and Hsin-Hsi Chen, "A Study of Language Modeling for Chinese Spelling Check," in Proc. The Seventh SIGHAN Workshop on Chinese Language Processing (SIGHAN2013), Nagoya, Japan, October 2013.
  9. Zhonghua Li, Ju-Chiang Wang, Jingli Cai, Zhiyan Duan, Hsin-Min Wang and Ye Wang, "Non-Reference Audio Quality Assessment for Online Live Music Recordings," in Proc. ACM International Conference on Multimedia (ACMMM2013), Barcelona, Spain, October 2013. (Full Paper)
  10. Hsin-Te Hwang, Yu Tsao, Hsin-Min Wang, Yih-Ru Wang, Sin-Horng Chen, "Incorporating Global Variance in the Training Phase of GMM-based Voice Conversion," in Proc. APSIPA Annual Summit and Conference (APSIPA ASC 2013), Kaohsiung, Taiwan, October 2013.

2012
  1. Kuan-Yu Chen, Hsin-Min Wang, and Berlin Chen, "Spoken Document Retrieval Leveraging Unsupervised and Supervised Topic Modeling Techniques," IEICE Trans. on Information and Systems, E95-D(5), pp. 1195-1205, May 2012.
  2. Ju-Chiang Wang, Yi-Hsuan Yang, Hsin-Min Wang, and Shyh-Kang Jeng, "The Acoustic Emotion Gaussians Model for Emotion-based Music Annotation and Retrieval," in Proc. ACM International Conference on Multimedia (ACMMM2012), Nara, Japan, October 2012. (Full Paper)
  3. Hung-Yi Lo, Shou-De Lin, and Hsin-Min Wang, "Generalized k-Labelset Ensemble for Multi-label Classification," in Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing (ICASSP2012), Kyoto, Japan, March 2012.
  4. Ju-Chiang Wang, Hsin-Min Wang, and Shyh-Kang Jeng, "Playing with Tagging: A Real-Time Tagging Music Player," in Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing (ICASSP2012), Kyoto, Japan, March, 2012.
  5. Meng-Sung Wu and Hsin-Min Wang, "A Term Association Translation Model for Naive Bayes Text Classification," in Proc. The 16th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD2012), Lecture Notes in Artificial Intelligence 7301, Kuala Lumpur, Malaysia, May 2012.
  6. Yu-Ren Chien, Hsin-Min Wang, and Shyh-Kang Jeng, "Simulated Formant Modeling of Accompanied Singing Signals for Vocal Melody Extraction," in Proc. The 9th Sound and Music Computing Conference (SMC2012), Copenhagen, Denmark, July 2012.
  7. Kuan-Yu Chen, Hao-Chin Chang, Berlin Chen and Hsin-Min Wang, "Word Relevance Modeling for Speech Recognition," in Proc. Interspeech2012, Portland, Oregon, USA, September 2012.
  8. Yu-Chin Shih, Hung-Shin Lee, Hsin-Min Wang, and Shyh-Kang Jeng, "Subspace-Based Feature Representation and Learning for Language Recognition," in Proc. Interspeech2012, Portland, Oregon, USA, September 2012.
  9. Hsin-Te Hwang, Yu Tsao, Hsin-Min Wang, Yih-Ru Wang and Sin-Horng Chen, "A Study of Mutual Information for GMM-Based Spectral Conversion," in Proc. Interspeech2012, Portland, Oregon, USA, September 2012.
  10. Ju-Chiang Wang, Yi-Hsuan Yang, I-Hong Jhuo, Yen-Yu Lin and Hsin-Min Wang, "The Acousticvisual Emotion Gaussians Model for Automatic Generation of Music Video," in Proc. ACM International Conference on Multimedia (ACMMM2012), October 2012. (First Prize in Multimedia Grand Challenge)
  11. Ju-Chiang Wang, Yi-Hsuan Yang, Kaichun Chang, Hsin-Min Wang, and Shyh-Kang Jeng, "Exploring the Relationship between Categorical and Dimensional Emotion Semantics of Music," in Proc. ACM Workshop on Music Information Retrieval with User-Centered and Multimodal Strategies (MIRUM2012), Nara, Japan, November 2012. (in conjunction with ACM Multimedia 2012)
  12. Meng-Sung Wu and Hsin-Min Wang, "Term Relevance Dependency Model for Text Classification," in Proc. International Conference on Pattern Recognition (ICPR2012), Tsukuba Science City, Japan, November 2012.
  13. Ju-Chiang Wang, Yi-Hsuan Yang, Hsin-Min Wang, and Shyh-Kang Jeng, "Personalized Music Emotion Recognition via Model Adaptation," in Proc. APSIPA Annual Summit and Conference (ASC), Hollywood, California, USA, December 2012.
  14. Hsin-Te Hwang, Yu Tsao, Hsin-Min Wang, Yih-Ru Wang, and Sin-Horng Chen, "Exploring Mutual Information for GMM-Based Spectral Conversion," in Proc. International Symposium on Chinese Spoken Language Processing (ISCSLP2012), Hong Kong, December 2012.

2011
  1. Hung-Yi Lo, Ju-Chiang Wang, Hsin-Min Wang, and Shou-De Lin, "Cost-sensitive Multi-label Learning for Audio Tag Annotation and Retrieval," IEEE Trans. on Multimedia, 13(3), pp. 518-529, June 2011.
  2. Ju-Chiang Wang, Yu-Chin Shih, Meng-Sung Wu, Hsin-Min Wang and Shyh-Kang Jeng, "Colorizing Tags in Tag Cloud: A Novel Query-by-Tag Music Search System," in Proc. ACM Multimedia 2011, pp. 293-302, Scottsdale, Arizona, USA, November 2011. (Full Paper) (SoTag Web)
  3. Hung-Yi Lo, Shou-De Lin, and Hsin-Min Wang, "Audio Tag Annotation and Retrieval Using Tag Count Information," in Proc. International Conference on MultiMedia Modeling (MMM2011), Taipei, Taiwan, January 2011.
    Lecture Notes in Computer Science, LNCS6325, Springer.
  4. Hung-Yi Lo, Ju-Chiang Wang, Hsin-Min Wang, Shou-De Lin, "Cost-Sensitive Stacking for Audio Tag Annotation and Retrieval," in Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing (ICASSP2011), Prague, Czech Republic, may 2011.
  5. S.-W. Sun, Y.-C. F. Wang, Y.-L. Hung, C.-L. Chang, K.-C. Chen, S.-S. Cheng, H.-M. Wang, and H.-Y. M. Liao, "Automatic Annotation of Web Videos," in Proc. IEEE International Conference on Multimedia & Expo (ICME 2011), Barcelona, Spain, July 2011.
  6. Ju-Chiang Wang, Meng-Sung Wu, Hsin-Min Wang and Shyh-Kang Jeng, "Query by Multi-tags with Multi-level Preferences for Content-based Music Retrieval," in Proc. IEEE International Conference on Multimedia & Expo (ICME 2011), Barcelona, Spain, July 2011.
  7. Ju-Chiang Wang, Hung-Shin Lee, Hsin-Min Wang, and Shyh-Kang Jeng, "Learning the Similarity of Audio Music in Bag-of-frames Representation from Tagged Music Data," in Proc. The 12th International Society for Music Information Retrieval Conference (ISMIR2011), Miami,Florida, USA, October 2011.
  8. Ju-Chiang Wang, Meng-Sung Wu, Hsin-Min Wang and Shyh-Kang Jeng, "A Content-based Music Search System Using Query by Multi-tags with Multi-levels of Preference," Demo Session, The 12th International Society for Music Information Retrieval Conference (ISMIR2011), Miami,Florida, USA, October 2011. (SoTag DEMO)
  9. Yu-Ren Chien, Hsin-Min Wang, and Shyh-Kang Jeng, "An Acoustic-Phonetic Approach to Vocal Melody Extraction," in Proc. The 12th International Society for Music Information Retrieval Conference (ISMIR2011), Miami,Florida, USA, October 2011.
  10. Ju-Chiang Wang, Meng-Sung Wu, Hsin-Min Wang and Shyh-Kang Jeng, "Music Tag Annotation and Clustering Using Latent Music Semantic Analysis," in Proc. International Workshop on Computer Music and Audio Technology (WOCMAT 2011), Taipei, Taiwan, December 2011.

2010
  1. Chih-Yi Chiu and Hsin-Min Wang, "Time-series Linear Search for Video Copies Based on Compact Signature Manipulation and Containment Relation Modeling," IEEE Trans. on Circuits and Systems for Video Technology, 20(11), pp. 1603 - 1613, November 2010.
  2. Chih-Yi Chiu, Hsin-Min Wang, and Chu-Song Chen, "Fast Min-hashing Indexing and Robust Spatio-temporal Matching for Detecting Video Copies," ACM Trans. on Multimedia Computing, Communications, and Applications, 6(2), Article 10: 1-23, March 2010.
  3. Shih-Sian Cheng, Hsin-Min Wang, and Hsin-Chia Fu, "BIC-based Speaker Segmentation Using Divide-and-Conquer Strategies with Application to Speaker Diarization," IEEE Trans. on Audio, Speech, and Language Processing, 18(1), pp. 141-157, Jan. 2010. There are some errors in Section VI-A4, please see the corrections.
  4. Chih-Yi Chiu, Dimitrios Bountouridis, Ju-Chiang Wang, and Hsin-Min Wang, "Background music identification through content filtering and min-hash matching," in Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing (ICASSP2010), Dallas, Texas, USA, March 2010.
  5. Hung-Yi Lo, Ju-Chiang Wang, and Hsin-Min Wang, "Homogeneous Segmentation and Classifier Ensemble for Audio Tag Annotation and Retrieval," in Proc. IEEE International Conference on Multimedia & Expo (ICME 2010), Singapore, July 2010.
  6. Hung-Shin Lee, Hsin-Min Wang, and Berlin Chen, "A Discriminative and Heteroscedastic Linear Feature Transformation for Multiclass Classification," in Proc. International Conference on Pattern Recognition (ICPR2010), Istanbul, Turkey, August 2010.
  7. Chih-Yi Chiu, Wei-Ming Chang, Po-Chih Lin, Hsin-Min Wang, and Shi-Nine Yang, "Detecting Pitching Frames in Baseball Game Video Using Markov Random Walk," in Proc. International Conference on Image Processing (ICIP2010), Hong Kong, September 2010.
  8. I-Fan Chen, Shih-Sian Cheng, and Hsin-Min Wang, "Phonetic Subspace Mixture Model for Speaker Diarization," in Proc. Interspeech2010, Makuhari, Japan, September 2010.
  9. Shih-Sian Cheng, I-Fan Chen, and Hsin-Min Wang, "Bayesian Speaker Recognition Using Gaussian Mixture Model and Laplace Approximation," in Proc. Interspeech2010, Makuhari, Japan, September 2010.
  10. Ju-Chiang Wang, Hung-Yi Lo, Shyh-Kang Jeng and Hsin-Min Wang, "Audio Classification Using Semantic Transformation and Classifier Ensemble," in Proc. The 2010 Workshop on Computer Music and Audio Technology (WOCMAT2010), Taoyuan, Taiwan, November 2010.
  11. Meng-Sung Wu and Hsin-Min Wang, "Semantic Associative Topic Models for Information Retrieval," in Proc. The 2010 Conference on Technologies and Applications of Artificial Intelligence (TAAI 2010), Hsinchu, Taiwan, November 2010, (in Chinese).
  12. Hung-Yi Lo and Hsin-Min Wang, "Phone Boundary Refinement Using Ranking Methods," in Proc. International Symposium on Chinese Spoken Language Processing (ISCSLP2010), Tainan, Taiwan, November 2010.
  13. Yi-Hsiang Chao, Wei-Ho Tsai and Hsin-Min Wang, "Speaker Verification Using Support Vector Machine with LLR-based Sequence Kernels," in Proc. International Symposium on Chinese Spoken Language Processing (ISCSLP2010), Tainan, Taiwan, November 2010.
  14. Ju-Chiang Wang, Hung-Shin Lee, Shyh-Kang Jeng and Hsin-Min Wang, "Posterior Weighted Bernoulli Mixture Model for Music Tag Annotation and Retrieval," in Proc. APSIPA ASC 2010, Singapore, December 2010.
  15. Meng-Sung Wu, Hung-Shin Lee, and Hsin-Min Wang, "Exploiting Semantic Associative Information in Topic Modeling," in Proc. IEEE Workshop on Spoken Language Technology (SLT2010), Berkeley, California, USA, December 2010.

2009
  1. Yi-Hsiang Chao, Wei-Ho Tsai, Hsin-Min Wang, and Ruei-Chuan Chang, "Improving the Characterization of the Alternative Hypothesis via Minimum Verification Error Training with Applications to Speaker Verification," Pattern Recognition, 42(7), pp. 1351-1360, July 2009.
  2. Yi-Hsiang Chao, Wei-Ho Tsai and Hsin-Min Wang, "Improving GMM-UBM Speaker Verification Using Discriminative Feedback Adaptation," Computer Speech and Language, 23(3), pp. 376-388, July 2009.
  3. Shih-Sian Cheng, Hsin-Chia Fu, and Hsin-Min Wang, "Model-based Clustering by Probabilistic Self-organizing Maps," IEEE Trans. on Neural Networks, 20(5), pp. 805-826, May 2009. demonstration
  4. Wei-Ho Tsai and Hsin-Min Wang, "Evolutionary Minimization of the Rand Index for Speaker Clustering," Computer Speech and Language, 23(2), pp.165-175, April 2009.
  5. Shih-Hsiang Lin, Berlin Chen, and Hsin-Min Wang, "A Comparative Study of Probabilistic Ranking Models for Chinese Spoken Document Summarization," ACM Trans. on Asian Language Information Processing, 8(1), pp. 3:1-3:23, March 2009.
  6. Yi-Ting Chen, Berlin Chen, Hsin-Min Wang, "A Probabilistic Generative Framework for Extractive Broadcast News Speech Summarization," IEEE Trans. on Audio, Speech and Language Processing, 17(1), pp.95-106, January 2009.
  7. Shih-Sian Cheng, "Probabilistic Model-based Clustering and Its Applications," PhD thesis, National Chiao Tung University, May 2009.
  8. Yi-Hsiang Chao, "Discriminative Training Methods for Speaker Verification," PhD thesis, National Chiao Tung University, January 2009.
  9. I-Fan Chen and Hsin-Min Wang, "Articulatory Feature Asynchrony Analysis and Compensation in Detection-Based ASR," in Proc. Interspeech2009, Brighton, UK, Sept 2009.
  10. Shih-Sian Cheng, Chun-Han Tseng, Chia-Ping Chen, and Hsin-Min Wang, "Speaker Diarization Using Divide-and-Conquer," in Proc. Interspeech2009, Brighton, UK, Sept 2009.
  11. Hsin-Min Wang and Berlin Chen, "Mandarin Chinese Broadcast News Retrieval and Summarization Using Probabilistic Generative Models," in Proc. APSIPA ASC 2009, Sapporo, Japan, Oct 2009.
  12. Yu-Ren Chien and Hsin-Min Wang, "Vocality-Sensitive Melody Extraction from Popular Songs," in Proc. APSIPA ASC 2009, Sapporo, Japan, Oct 2009.
  13. Jen-Wei Kuo, Pu-Jen Cheng, and Hsin-Min Wang, "Learning to Rank from Bayesian Decision Inference," in Proc. ACM Conference on Information and Knowledge Management (CIKM2009), Hong Kong, Nov 2009.

2008
  1. Hung-Ming Yu, Wei-Ho Tsai, and Hsin-Min Wang, "A Query-by-Singing System for Retrieving Karaoke Music," IEEE Trans. on Multimedia, 10(8), pp. 1626-1637, December 2008.
  2. Yi-Hsiang Chao, Wei-Ho Tsai, Hsin-Min Wang, and Ruei-Chuan Chang, "Using Kernel Discriminant Analysis to Improve the Characterization of the Alternative Hypothesis for Speaker Verification," IEEE Trans. on Audio, Speech and Language Processing, 16(8), pp. 1675-1684, November 2008.
  3. Wei-Ho Tsai, Hung-Ming Yu, and Hsin-Min Wang, "Using the Similarity of Main Melodies to Identify Cover Versions of Popular Songs for Music Document Retrieval," Journal of Information Science and Engineering, 24(6), pp. 1669-1687, November 2008.
  4. Shih-Sian Cheng, Hsin-Min Wang, Hsin-Chia Fu, "Bic-based Audio Segmentation by Divide-and-Conquer," in Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing (ICASSP2008), Las Vegas, USA, March 2008.
  5. Shih-Hsiang Lin, Yi-Ting Chen, Hsin-Min Wang, Berlin Chen, "A Comparative Study of Probabilistic Ranking Models for Spoken Document Summarization," in Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing (ICASSP2008), Las Vegas, USA, March 2008.
  6. Chih-Yi Chiu and Hsin-Min Wang, "A Novel Video Matching Framework for Copy Detection," in Proc. The 21th IPPR Conference on Computer Vision, Graphics and Image Processing (CVGIP2008), Taipei, Taiwan, August 2008.
  7. Hsin-Min Wang, Jen-Wei Kuo, and Hung-Yi Lo, "Towards A Phoneme Labeled Mandarin Chinese Speech Corpus," in Proc. Oriental COCOSDA 2008: International Conference on Speech Databases and Assessments, Kyoto, Japan, November 2008.
  8. Yi-Hsiang Chao, Wei-Ho Tsai, and Hsin-Min Wang, "Discriminative Feedback Adaptation for GMM-UBM Speaker Verification," in Proc. International Symposium on Chinese Spoken Language Processing (ISCSLP2008), Kunming, China, December 2008.
  9. I-Fan Chen and Hsin-Min Wang, "An Investigation of Phonological Feature Systems Used in Detection-Based ASR," in Proc. International Symposium on Chinese Spoken Language Processing (ISCSLP2008), Kunming, China, December 2008.

2007
  1. Yi-Hsiang Chao, Hsin-Min Wang, and Ruei-Chuan Chang, "A Novel Characterization of the Alternative Hypothesis Using Kernel Discriminant Analysis for LLR-based Speaker Verification," International Journal of Computational Linguistics and Chinese Language Processing, 12(3), pp. 255-272, 2007. (Special Issue on "Invited Papers from ISCSLP 2006")
  2. Wei-Ho Tsai and Hsin-Min Wang, "Automatic Identification of the Sung Language in Popular Music Recordings," Journal of New Music Research, 36(2), pp. 105 - 114, 2007.
  3. Wei-Ho Tsai, Shih-Sian Cheng, and Hsin-Min Wang, "Automatic Speaker Clustering Using a Voice Characteristic Reference Space and Maximum Purity Estimation," IEEE Trans. on Audio, Speech, and Language Processing, 15(4), pp. 1461-1474, 2007.
  4. Hung-Yi Lo and Hsin-Min Wang, "Phonetic Boundary Refinement Using Support Vector Machine," in Proc. IEEE Int. Conf. Acoustics, Speech, Signal processing (ICASSP2007), Hawaii, USA, April 2007.
  5. Yi-Hsiang Chao, Wei-Ho Tsai, Hsin-Min Wang, Ruei-Chuan Chang, "Improved Methods For Characterizing The Alternative Hypothesis Using Minimum Verification Error Training For LLR-Based Speaker Verification," in Proc. IEEE Int. Conf. Acoustics, Speech, Signal processing (ICASSP2007), Hawaii, USA, April 2007.
  6. Wei-Ho Tsai and Hsin-Min Wang, "Speaker Clustering Based on Minimum Rand Index," in Proc. IEEE Int. Conf. Acoustics, Speech, Signal processing (ICASSP2007), Hawaii, USA, April 2007.
  7. Ping-Han Lee, Lu-Jong Chu, Yi-Ping Hung, Sheng-Wen Shih, Chu-Song Chen, and Hsin-Min Wang, "Cascading Multimodal Verification Using Face, Voice and Iris Information," in Proc. IEEE Conference on Multimedia and Expo (ICME2007), Beijing, China, July 2007.
  8. Jen-Wei Kuo, Hung-Yi Lo, and Hsin-Min Wang, "Improved HMM/SVM Methods for Automatic Phoneme Segmentation," in Proc. The Tenth European Conference on Speech Communication and Technology (Interspeech2007 - Eurospeech), Antwerp, Belgium, Aug 2007.
  9. Yi-Hsiang Chao, Wei-Ho Tsai, Shih-Sian Cheng, Hsin-Min Wang, and Ruei-Chuan Chang, "Evolutionary Minimum Verification Error Learning of the Alternative Hypothesis Model for LLR-based Speaker Verification," in Proc. The Tenth European Conference on Speech Communication and Technology (Interspeech2007 - Eurospeech), Antwerp, Belgium, Aug 2007.
  10. Yi-Ting Chen, Hsuan-Sheng Chiu, Hsin-Min Wang and Berlin Chen, "A Unified Probabilistic Generative Framework for Extractive Spoken Document Summarization," in Proc. The Tenth European Conference on Speech Communication and Technology (Interspeech2007 - Eurospeech), Antwerp, Belgium, Aug 2007.
  11. Shih-Sian Cheng, Hsin-Chia Fu, and Hsin-Min Wang, "CEM, EM, and DAEM Algorithms for Learning Self-Organizing Maps," in Proc. IEEE International Workshop on. Machine Learning for Signal Processing (MLSP2007), Thessaloniki, Greece, Aug 2007.
  12. Yi-Ting Chen, Shih-Hsiang Lin, Hsin-Min Wang, and Berlin Chen, "Spoken Document Summarization Using Relevant Information," in Proc. IEEE Automatic Speech Recognition and Understanding Workshop (ASRU2007), Kyoto, Japan, Dec 2007.

2006
  1. Sin-Horng Chen, Chiu-yu Tseng, and Hsin-Min Wang, "Chapter 4: Tone Modeling for Speech Synthesis," Advances in Chinese Spoken Language Processing, edited by Chin-Hui Lee, Haizhou Li, Lin-shan Lee, Ren-Hua Wang, and Qiang Huo, pp. 77-98, World Scientific, 2006.
  2. Berlin Chen, Hsin-Min Wang, and Lin-shan Lee, "Chapter 13: Spoken Document Retrieval and Summarization," Advances in Chinese Spoken Language Processing, edited by Chin-Hui Lee, Haizhou Li, Lin-shan Lee, Ren-Hua Wang, and Qiang Huo, pp. 301-320, World Scientific, 2006.
  3. Wei-Ho Tsai and Hsin-Min Wang, "Speech Utterance Clustering Based on the Maximization of Within-cluster Homogeneity of Speaker Voice Characteristics," The Journal of the Acoustical Society of America, Vol. 120, No. 3, pp. 1631-1645, September 2006.
  4. Jen-Wei Kuo, Shih-Hung Liu, Hsin-Min Wang, and Berlin Chen, "An Empirical Study on Word Error Minimization Approaches for Mandarin Large Vocabulary Continuous Speech Recognition," International Journal of Computational Linguistics and Chinese Language Processing, vol. 11, no. 3, pp. 201-222, September 2006. (Special Issue on "Selected Papers from ROCLING XVII")
  5. Chuang-Hua Chueh, Hsin-Min Wang, and Jen-Tzung Chien, "A Maximum Entropy Approach for Semantic Language Modeling," International Journal of Computational Linguistics and Chinese Language Processing, vol. 11, no. 1, pp. 37-56, March 2006.
  6. Wei-Ho Tsai and Hsin-Min Wang, "Automatic singer recognition of popular music recordings via estimation and modeling of solo vocal signals," IEEE Trans. on Audio, Speech and Language Processing, vol. 14, no. 1, pp. 330-341, Jan 2006.
  7. Wei-Ho Tsai and Hsin-Min Wang, "On maximizing the within-cluster homogeneity of speaker voice characteristics for speech utterance clustering," in Proc. IEEE Int. Conf. Acoustics, Speech, Signal processing (ICASSP2006), Touluse, France, May 2006.
  8. Yi-Hsiang Chao, Wei-Ho Tsai, Hsin-Min Wang, and Ruei-Chuan Chang, "A Kernel-based Discrimination Framework for Solving Hypothesis Testing Problems with Application to Speaker Verification," in Proc. International Conference on Pattern Recognition (ICPR2006), Hong Kong, Aug 2006.
  9. Shih-Sian Cheng, Yi-Hsiang Chao, Hsin-Min Wang, and Hsin-Chia Fu, "A Prototypes-Embedded Genetic K-means Algorithm," in Proc. International Conference on Pattern Recognition (ICPR2006), Hong Kong, Aug 2006.
  10. Jen-Wei Kuo and Hsin-Min Wang, "Minimum Boundary Error Training for Automatic Phonetic Segmentation," in Proc. The Ninth International Conference on Spoken Language Processing (Interspeech 2006 - ICSLP) , Pittsburgh, Pennsylvania, USA, Sept 2006.
  11. Yi-Hsiang Chao, Wei-Ho Tsai, Hsin-Min Wang and Ruei-Chuan Chang, "Improving the Characterization of the Alternative Hypothesis via Kernel Discriminant Analysis for Likelihood Ratio-based Speaker Verification," in Proc. The Ninth International Conference on Spoken Language Processing (Interspeech 2006 - ICSLP) , Pittsburgh, Pennsylvania, USA, Sept 2006.
  12. Hung-Ming Yu, Wei-Ho Tsai, and Hsin-Min Wang, "A Music Retrieval System based on Query-by-singing for Karaoke Jukebox," in Proc. Asian Information Retrieval Symposium (AIRS2006), Singapore, Oct 2006.
    Lecture Notes in Computer Science, LNCS4182, Springer.
  13. Jen-Wei Kuo and Hsin-Min Wang, "A Minimum Boundary Error Framework for Automatic Phonetic Segmentation," in Proc. International Symposium on Chinese Spoken Language Processing (ISCSLP2006), Singapore, Dec 2006.
    Lecture Notes in Artificial Intelligence, LNAI4274, Springer.
  14. Shih-Sian Cheng, Yeong-Yuh Xu, Hsin-Min Wang and Hsin-Chia Fu, "Automatic Construction of Regression Class Tree for MLLR via Model-based Hierarchical Clustering," in Proc. International Symposium on Chinese Spoken Language Processing (ISCSLP2006), Singapore, Dec 2006.
    Lecture Notes in Artificial Intelligence, LNAI4274, Springer.
  15. Tzan-Hwei Chen, Berlin Chen and Hsin-Min Wang, "On Using Entropy Information to Improve Posterior Probability based Confidence Measures," in Proc. International Symposium on Chinese Spoken Language Processing (ISCSLP2006), Singapore, Dec 2006.
    Lecture Notes in Artificial Intelligence, LNAI4274, Springer.
  16. Yi-Ting Chen, Suhan Yu, Hsin-Min Wang and Berlin Chen, "Extractive Chinese Spoken Document Summarization Using Probabilistic Ranking Models," in Proc. International Symposium on Chinese Spoken Language Processing (ISCSLP2006), Singapore, Dec 2006.
    Lecture Notes in Artificial Intelligence, LNAI4274, Springer.
  17. Yi-Hsiang Chao, Hsin-Min Wang and Ruei-Chuan Chang, "A Novel Alternative Hypothesis Characterization Using Kernel Classifiers for LLR-based Speaker Verification," in Proc. International Symposium on Chinese Spoken Language Processing (ISCSLP2006), Singapore, Dec 2006. (best student paper award)
    Lecture Notes in Artificial Intelligence, LNAI4274, Springer.

2005
  1. Hsin-Min Wang, Berlin Chen, Jen-Wei Kuo, and Shih-Sian Cheng, "MATBN: A Mandarin Chinese Broadcast News Corpus," International Journal of Computational Linguistics and Chinese Language Processing, 10(2), pp. 219-236, June 2005.
  2. Chiu-yu Tseng, Shao-huang Pin, Yehlin Lee, Hsin-Min Wang, Yong-cheng Chen, "Fluent speech prosody: framework and modeling," Speech Communication, 46(3-4), pp. 284-309, July 2005.
  3. Wei-Ho Tsai, Shih-Sian Cheng, Yi-Hsiang Chao, and Hsin-Min Wang, "Clustering speech utterances by speaker using eigenvoice-motivated vector space model," in Proc. IEEE Int. Conf. Acoustics, Speech, Signal processing (ICASSP2005), Philadelphia, USA, March 2005.
  4. Yi-Hsiang Chao, Hsin-Min Wang, and Ruei-chuan Chang, "GMM-Based Bhattacharyya Kernel Fisher Discriminant Analysis For Speaker Recognition," in Proc. IEEE Int. Conf. Acoustics, Speech, Signal processing (ICASSP2005), Philadelphia, USA, March 2005.
  5. Wei-Ho Tsai and Hsin-Min Wang, "On the extraction of vocal-related information to facilitate the management of popular music collections," in Proc. IEEE/ACM Joint Conference on Digital Libraries (JCDL2005), Denver, USA, June 2005.
  6. Hsien-Ting Cheng, Yi-Hsiang Chao, Shih-Liang Yeh, Chu-Song Chen, Hsin-Min Wang, and Yi-Ping Hung, "An efficient approach to multi-modal person identity verification by fusing face and voice information," in Proc. IEEE Conference on Multimedia and Expo (ICME2005), Amsterdam, The Netherlands, July 2005.
  7. Wei-Ho Tsai and Hsin-Min Wang, "Speaker Clustering of Unknown Utterances Based on Maximum Purity Estimation," in Proc. European Conference on Speech Communication and Technology (Eurospeech2005), Lisbon, Portugal, Sept 2005.
  8. Wei-Ho Tsai, Hung-Ming Yu, and Hsin-Min Wang, "A Query-by-example Technique for Retrieving Cover Versions of Popular Songs with Similar Melodies," in Proc. The sixth International Conference on Music Information Retrieval (ISMIR2005), London, UK, Sept 2005.
  9. Hung-Ming Yu, Wei-Ho Tsai, and Hsin-Min Wang, "A Query-by-singing Technique for Retrieving Polyphonic Objects of Popular Music," in Proc. Asian Information Retrieval Symposium (AIRS2005), Jeju Island, Korea, Oct 2005. Lecture Notes in Computer Science, LNCS3689, Springer.

2004
  1. Shih-Sian Cheng, Hsin-Min Wang, and Hsin-Chia Fu, "A Model-selection-based Self-splitting Gaussian Mixture Learning with Application to Speaker Identification," EURASIP Journal on Applied Signal Processing, 2004(17), pp. 2626-2639, Dec 2004.
  2. Wei-Ho Tsai, Dwight Rodgers, and Hsin-Min Wang, "Blind Clustering of Popular Music Recordings Based on Singer Voice Characteristics," Computer Music Journal, 28(3), pp. 68-78, Fall 2004. The preliminary version also appeared in Proc. The fourth International Conference on Music Information Retrieval (ISMIR 2003), Baltimore, USA, Oct 2003. (PDF)
  3. Berlin Chen, Hsin-Min Wang, and Lin-shan Lee, "A Discriminative Hmm/N-Gram-Based Retrieval Approach for Mandarin Spoken Documents," ACM Trans. on Asian Language Information Processing, 3(2), pp. 128-145, June 2004.
  4. Helen Meng, Berlin Chen, Sanjeev Khudanpur, Gina-Anne Levow, Wai-kit Lo, Douglas Oard, Patrick Schone, Karen Tang, Hsin-Min Wang, and Jianqiang Wang, "Mandarin-English Information (MEI): Investigating Translingual Speech Retrieval," Computer Speech and Language, 18(2), pp. 163-179, April 2004.
  5. Hsin-Min Wang, Shi-sian Cheng, and Yong-cheng Chen, "The SoVideo Mandarin Chinese broadcast news retrieval system," International Journal of Speech Technology, 7(2-3), pp. 189-202, April-July 2004.
  6. Wei-Ho Tsai and Hsin-Min Wang, "Automatic Detection and Tracking of Target Singer in Multi-Singer Music Recordings," in Proc. IEEE Int. Conf. Acoustics, Speech, Signal processing (ICASSP2004), Montreal, Quebec, Canada, May 2004.
  7. Wei-Ho Tsai and Hsin-Min Wang, "A query-by-example framework to retrieve music documents by singer," in Proc. IEEE Conference on Multimedia and Expo (ICME2004), Taipei, Taiwan, June 2004.
  8. Yin-cheng Chen, Tan-Hsu Tan, Hsin-Min Wang, and Wei-Ho Tsai, "Performance Evaluation and Analysis of Mandarin Speech Recognition over Bluetooth Communication Environments," in Proc. ROCLING XVI: Conference on Computational Linguistics and Speech Processing, Taipei, Taiwan, Sept 2004. (in Chinese)
  9. Wei-Ho Tsai, Shih-Sian Cheng and Hsin-Min Wang, "Speaker Clustering of Speech Utterances Using A Voice Characteristic Reference Space," in Proc. International Conference on Spoken Language Processing (ICSLP2004), Jeju Island, Korea, Oct 2004.
  10. Shih-Sian Cheng and Hsin-Min Wang, "METRIC-SEQDAC: A Hybrid Approach for Audio Segmentation," in Proc. International Conference on Spoken Language Processing (ICSLP2004), Jeju Island, Korea, Oct 2004.
  11. Berlin Chen, Jen-Wei Kuo, Yao-Min Huang, and Hsin-Min Wang, "Statistical Chinese Spoken Document Retrieval Using Latent Topical Information," in Proc. International Conference on Spoken Language Processing (ICSLP2004), Jeju Island, Korea, Oct 2004.
  12. Wei-Ho Tsai and Hsin-Min Wang, "Towards automatic identification of singing language in popular music recordings," in Proc. International Conference on Music Information Retrieval (ISMIR2004), Barcelona, Spain, Oct 2004.
  13. ShaoHuang Pin, Yehlin Lee, Yong-cheng Chen, Hsin-Min Wang, and Chiu-yu Tseng, "A Mandarin TTS system with an integrated prosodic model," in Proc. International Symposium on Chinese Spoken Language Processing (ISCSLP2004), Hong Kong, Dec 2004.
  14. Chuang-Hua Chueh, Jen-Tzung Chien, and Hsin-Min Wang, "A maximum entropy approach for integrating semantic information in statistical language models," in Proc. International Symposium on Chinese Spoken Language Processing (ISCSLP2004), Hong Kong, Dec 2004. (best student paper award)
  15. Chih-Hsien Huang, Jen-Tzung Chien, and Hsin-Min Wang, "A new eigenvoice approach to speaker adaptation," in Proc. International Symposium on Chinese Spoken Language Processing (ISCSLP2004), Hong Kong, Dec 2004.

2003
  1. Hsin-Min Wang, Shi-sian Cheng, and Yong-cheng Chen, "The SoVideo broadcast news retrieval system for Mandarin Chinese," in Proc. ISCA Workshop on Multilingual Spoken Document Retrieval (MSDR2003), Hong Kong, April 2003.
  2. Kuan-Ting Chen, Shui-Lung Chuang, Frank Seide,Hsin-Min Wang, Lee-Feng Chien, and Eric Chang, "New word learning for spoken document processing through discovery of comparable texts from external resources," in Proc. ISCA Workshop on Multilingual Spoken Document Retrieval (MSDR2003), Hong Kong, April 2003.
  3. Hsin-Min Wang, "MATBN 2002: A Mandarin Chinese broadcast news corpus," in Proc. ISCA & IEEE Workshop on Spontaneous Speech Processing and Recognition (SSPR2003), Tokyo, April 2003.
  4. Shi-sian Cheng and Hsin-Min Wang, "A Sequential Metric-based Audio Segmentation Method via The Bayesian Information Criterion," in Proc. European Conference on Speech Communication and Technology (Eurospeech2003), Geneva, Switzerland, Sept 2003.
  5. Wei-Ho Tsai, Hsin-Min Wang, and Dwight Rodgers, "Automatic Singer Identification of Popular Music Recordings via Estimation and Modeling of Solo Vocal Signal," in Proc. European Conference on Speech Communication and Technology (Eurospeech2003), Geneva, Switzerland, Sept 2003.
  6. Wai-kit Lo, Yuk-chi Li, Gina Levow, Hsin-Min Wang, and Helen Meng, "Multi-scale Document Expansion in English-Mandarin Cross-Language Spoken Document Retrieval," in Proc. European Conference on Speech Communication and Technology (Eurospeech2003), Geneva, Switzerland, Sept 2003.
  7. Wei-Ho Tsai, Hsin-Min Wang, Dwight Rodgers, Shi-sian Cheng, and Hung-Min Yu, "Blind Clustering of Popular Music Recordings Based on Singer Voice Characteristics," in Proc. The fourth International Conference on Music Information Retrieval (ISMIR 2003), Baltimore, USA, Oct 2003.

2002
  1. Berlin Chen, Hsin-Min Wang, and Lin-shan Lee, "Discriminating Capabilities of Syllable-based Features and Approaches of Utilizing Them for Voice Retrieval of Speech Information in Mandarin Chinese," IEEE Trans. on Speech and Audio Processing, vol. 10, no. 5, pp. 303-314, July 2002.
  2. Bor-shen Lin, Berlin Chen, Hsin-Min Wang, and Lin-shan Lee, "A Hierarchical Tag-Graph Search Scheme with Layered Grammar Rules for Spontaneous Speech Understanding," Pattern Recognition Letters, 23(7), pp. 819-831, May 2002. The preliminary version also appeared in Proc. International Conference on Spoken Language Processing (ICSLP98), Sydney, Australia, Dec. 1998. (PDF)
  3. Mei-fang Huang, Kuan-ting Chen and Hsin-Min Wang, "Towards Retrieval of Video Archives based on The Speech Content," in Proc. International Symposium on Chinese Spoken Language Processing (ISCSLP2002), Taipei, Aug 2002.
  4. Hsin-Min Wang, "Speech Information Retrieval for Mandarin Chinese," in Proc. International Conference on Digital Archive Technologies (ICDAT2002), Taipei, Dec 2002.

2001
  1. Hsin-Min Wang and Berlin Chen, "Content-based Language Models for Spoken Document Retrieval," International Journal of Computer Processing of Oriental Languages, 14(2), pp. 193-209, June 2001. The preliminary version also appeared in Proc. International Workshop on Information Retrieval with Asian Languages (IRAL2000), Hong Kong, Sept. 2000, pp. 149-155. (PDF)
  2. Bor-shen Lin, Hsin-Min Wang, and Lin-shan Lee, "A Distributed Agent Architecture for Intelligent Multi-Domain Spoken Dialogue Systems," IEICE Trans. on Information and Systems, E84-D(9), pp. 1217-1230, Sept. 2001.
  3. Helen Meng, Berlin Chen,....., Hsin-Min Wang, and Jianqiang Wang, "Mandarin English Information (MEI): Investigating Translingual Speech Retrieval," in Proc. Human Language Technology Conference (HLT2001), San Diego, March 2001.
  4. Hsin-Min Wang, Helen Meng, Patrick Schone, Berlin Chen and Wai-kit Lo, "Multi-Scale Audio Indexing for Translingual Spoken Document Retrieval," in Proc. IEEE Int. Conf. Acoustics, Speech, Signal processing (ICASSP2001), Salt Lake City, USA, May 2001.
  5. Kuan-ting Chen and Hsin-Min Wang, "Eigenspace-based Maximum A Posteriori Linear Regression for Rapid Speaker Adaptation," in Proc. IEEE Int. Conf. Acoustics, Speech, Signal processing (ICASSP2001), Salt Lake City, USA, May 2001.
  6. Hsin-Min Wang, Berlin Chen, Liang-jui Shen, and Chao-chi Chang, "A Voice-Activated Web-based Mandarin Chinese Spoken Document Retrieval System," in Proc. The 19th International Conference on Computer Processing of Oriental Languages (ICCPOL2001), Seoul Korea, May 2001, pp. 403-408.
  7. Kuan-ting Chen and Hsin-Min Wang, "Eigenspace-based Linear Transformation Approach for Rapid Speaker Adaptation," in Proc. ISCA Workshop on Adaptation Methods for Speech Recognition, Sophia Antipolis France, Aug. 2001.
  8. Berlin Chen, Hsin-Min Wang, and Lin-shan Lee, "Improved Spoken Document Retrieval by Exploring Extra Acoustic and Linguistic Cues," in Proc. The 7th European Conference on Speech Communication and Technology (Eurospeech2001), Aalborg Demark, Sept. 2001.
  9. Berlin Chen, Hsin-Min Wang, and Lin-shan Lee, "An HMM/N-gram-based Linguistic Approach for Mandarin Spoken Document Retrieval," in Proc. The 7th European Conference on Speech Communication and Technology (Eurospeech2001), Aalborg Demark, Sept. 2001.
  10. Jeih-wei Hung, Hsin-Min Wang, and Lin-shan Lee, "Comparative Analysis for Data-Driven Temporal Filters Obtained via Principal Component Analysis," in proc. The 7th European Conference on Speech Communication and Technology (Eurospeech2001), Aalborg Demark, Sept. 2001.
  11. Hsin-Min Wang and Berlin Chen, "Comparison of Word and Subword Indexing Techniques for Mandarin Chinese Spoken Document Retrieval," in Proc. The 2nd IEEE Pacific-Rim Conference on Multimedia (PCM2001), Beijing, Oct 2001, pp. 606-613. Lecture Notes in Computer Science, LNCS2195, Springer.

2000
  1. Lee-feng Chien, Hsin-Min Wang, Bo-ren Bai and Sung-chien Lin, "A Spoken Access Approach for Chinese Text and Speech Information Retrieval," Journal of the American Society for Information Science, 51(4), pp. 313-323, 2000.
  2. Hsin-Min Wang, Yu-hsueh Chou, and Berlin Chen, "Browsing the Chinese Web Pages Using Mandarin Speech," International Journal of Computer Processing of Oriental Languages, 13(1), pp. 35-51, March 2000. The preliminary version also appeared in Proc. The 18th International Conference on Computer Processing of Oriental Languages (ICCPOL'99), Tokushima Japan, March 1999, pp. 503-508. (PS) (PDF)
  3. Hsin-Min Wang, "Mandarin Spoken Document Retrieval based on Syllable Lattice Matching," Pattern Recognition Letters, 21(6-7), pp. 615-624, June 2000. The preliminary version also appeared in Proc. Int. Workshop on Information Retrieval with Asian Languages (IRAL'99), Taipei, Nov. 1999, pp. 48-54. (PS) (PDF)
  4. Lee-feng Chien and Hsin-Min Wang, "Exploration of Robust Techniques for Mandarin Spoken Information Retrieval," Journal of the Chinese Institute of Electrical Engineering, 7(2), pp. 113-121, 2000.
  5. Bo-ren Bai, Berlin Chen, and Hsin-Min Wang, "Syllable-based Chinese Text/Spoken Document Retrieval Using Text/Speech Queries," International Journal of Pattern Recognition and Artificial Intelligence, 14(5), pp. 603-616, Aug. 2000. The preliminary version also appeared in Proc. The 2nd International Conference on Multimodal Interface (ICMI'99), Hong-kong, Jan. 1999, pp. II46-II51. (PS) (PDF)
  6. Hsin-Min Wang, "Experiments in Syllable-based Retrieval of Broadcast News Speech in Mandarin Chinese," Speech Communication, 32(1-2), pp. 49-60, Sept. 2000.
  7. Helen Meng, Sanjeev Khudanpur, Douglas W. Oard, and Hsin-Min Wang, "Mandarin English Information (MEI)," in Proc. Topic Detection and Tracking Workshop (TDT-3), Vienna, Virginia, USA, Feb. 2000. (PS) (PDF) (Slides)
  8. Helen Meng, Sanjeev Khudanpur, Gina Levow, Douglas W. Oard, and Hsin-Min Wang, "Mandarin English Information (MEI) - Investigating Translingual Speech Retrieval," in Proc. NAACL Workshop on Embedded Machine Translation, Seattle, Washington, USA, May 2000. (PS) (PDF)
  9. Berlin Chen, Hsin-Min Wang, and Lin-shan Lee, "Retrieval of Broadcast News Speech in Mandarin Chinese Collected in Taiwan Using Syllable-level Statistical Characteristics," in Proc. IEEE Int. Conf. Acoustics, Speech, Signal processing (ICASSP2000), Istanbul, Turkey, June 2000. (PS) (PDF)
  10. Wen-ping Hsieh, Berlin Chen, Kuan-ting Chen, and Hsin-Min Wang, "Initial Experiments on Recognition of Internet-Accessible Compressed Mandarin Speech," in Proc. International Symposium on Chinese Spoken Language Processing (ISCSLP2000), Beijing, Oct. 2000. (PS) (PDF)
  11. Jeih-weih Hung, Hsin-Min Wang, and Lin-shan Lee, "Automatic Metric-based Speech Segmentation for Broadcast News via Principle Component Analysis," in Proc. International Conference on Spoken Language Processing (ICSLP2000), Beijing, Oct. 2000. (PS) (PDF)
  12. Berlin Chen, Hsin-Min Wang, and Lin-shan Lee, "Retrieval of Mandarin Broadcast News using Spoken Queries," in Proc. International Conference on Spoken Language Processing (ICSLP2000), Beijing, Oct. 2000. (PS) (PDF)
  13. Kuan-ting Chen, Wen-wei Liau, Hsin-Min Wang, and Lin-shan Lee, "Fast Speaker Adaptation using Eigenspace-based Maximum Likelihood Linear Regression," in Proc. International Conference on Spoken Language Processing (ICSLP2000), Beijing, Oct. 2000. (PS) (PDF)

1999
  1. Jia-lin Shen, Hsin-Min Wang, Ren-yuan Lyu, and Lin-shan Lee, "Automatic Selection of Phonetically Distributed Sentence Sets for Speaker Adaptation With Application to Large Vocabualry Mandarin Speech Recognition," Computer Speech and Language, vol. 13, no. 1, pp. 79-97, Jan. 1999.
  2. Bor-shen Lin and Hsin-Min Wang, "Handling Multiple Topics in Spoken Dialogue Systems using Inference Trees," in Proc. The 18th International Conference on Computer Processing of Oriental Languages (ICCPOL'99), Tokushima Japan, March 1999, pp. 293-296. (PS) (PDF)
  3. Bor-shen Lin, Hsin-Min Wang, and Lin-shan Lee, "Consistent Dialogue across Concurrent Topics based on An Expert System Model," in Proc. European Conference on Speech Communication and Technology (EUROSPEECH'99), Budapest Hungary, Sept. 1999, pp. 1427-1430. (PS) (PDF)
  4. Lee-feng Chien and Hsin-Min Wang, "Exploration of Spoken Access for Chinese Text and Speech Information Retrieval," in Proc. Internationl Symposium on Signal Processing and Intelligent System (ISSPIS'99), Guangzhou, China, Nov. 1999, pp. 578-583.
  5. Bor-shen Lin, Hsin-Min Wang, and Lin-shan Lee, "A Distributed Architecture for Cooperative Spoken Dialogue Agents with Coherent Dialogue State and History," in Proc. IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU'99), Keystone, Colorado, USA, Dec. 1999. (PS) (PDF)

1998
  1. Hsin-Min Wang, "Statistical Analysis of Mandarin Acoustic Units and Automatic Extraction of Phonetically Rich Sentences Based Upon A Very Large Chinese Text Corpus," International Journal of Computational Linguistics and Chinese Language Processing, vol. 3, no. 2, pp. 93-114, August 1998.
  2. Bor-shen Lin, Hsin-Min Wang, Bo-ren Bai, and Berlin Chen, "A prototype of Mandarin voice memo system," in Proc. IEEE International Conference on Consumer Electronics (ICCE98), Los Angeles, June 1998, pp. 88-89.
  3. Hsin-Min Wang, Yu-hsueh Chou, and Berlin Chen, "Surfing the Chinese Web pages by unconstrained Mandarin speech," in Proc. IEEE International Conference on Consumer Electronics (ICCE98), Los Angeles, June 1998, pp. 84-85.
  4. Berlin Chen and Hsin-Min Wang, "A vocabulary-flexible key-phrase spotting system for Mandarin Chinese," in Proc. The Fourth International Symposium on Real-time and Media Systems (RAMS98), Taipei, Sept. 1998, pp. 176-180.
  5. Hsin-Min Wang, Bor-shen Lin, Bo-ren Bai, and Berlin Chen, "Towards a Mandarin voice memo system," in Proc. International Conference on Spoken Language Processing (ICSLP98), Sydney, Australia, Dec. 1998. (PS) (PDF)
  6. Berlin Chen, Hsin-Min Wang, Lee-feng Chien, and Lin-shan Lee, "A*-admissible key-phrase spotting with sub-syllable level utterance verification," in Proc. International Conference on Spoken Language Processing (ICSLP98), Sydney, Australia, Dec. 1998. (PS) (PDF)
  7. Bo-ren Bai, Berlin Chen, Hsin-Min Wang, Lee-feng Chien, and Lin-shan Lee, "Large-Vocabulary Chinese Text/Speech Information Retrieval Using Mandarin Speech Queries," in Proc. International Symposium on Chinese Spoken Language Processing (ISCSLP98), Singapore, Dec. 1998, pp. 284-289. (PS) (PDF)

1997
  1. Hsin-Min Wang, Tai-hsuan Ho, Rung-chiung Yang, Jia-lin Shen, Bo-ren Bai, Jenn-chau Hong, Wei-peng Chen, Tong-lo Yu, and Lin-shan Lee, "Complete recognition of continuous Mandarin speech for Chinese language with very large vocabulary using limited training data," IEEE Trans. on Speech and Audio Processing, vol. 5, no. 2, pp. 195-200, March 1997.
  2. Bo-ren Bai, Hsin-Min Wang, and Lin-shan Lee, "Log-likelihood score normalization techniques for recognition of Chinese keywords with large vocabulary," in Proc. Int. Conf. Computer Processing of Oriental Languages, Hong-kong, April 1997, vol. 1, pp. 186-191.
  3. Lee-feng Chien, Sung-chien Lin, Jenn-chau Hong, Ming-chiuan Chen, Hsin-Min Wang, Jia-lin Shen, Keh-jiann Chen, and Lin-shan Lee, "Internet Chinese information retrieval using unconstrained Mandarin speech queries based on a client-server architecture and a PAT-tree-based language models," in Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing, Munich, Germany, May 1997, vol. 2, pp. 1155-1158.
  4. Hsin-Min Wang, Bor-shen Lin and Bo-ren Bai, "Voice retrieval of Mandarin speech database," in Proc. Int. Workshop on Information Retrieval with Asian Languages, Tsukuba-City, Japan, 1997, pp. 185-190.
  5. Bo-ren Bai, Hsin-Min Wang, and Lin-shan Lee, "A word-length-dependent confidence measure for large vocabulary Chinese keyword spotting," in Proc. IEEE Region 10 Annual Conference (TENCON97), Brisbane, Australia, Dec. 1997, pp. 595-598.
  6. Bor-shen Lin, Hsin-Min Wang, and Lin-shan Lee, "A key-phrase understanding framework integrating real world knowledge with speech recognition with initial application in voice memo systems for Chinese language," in Proc. IEEE Region 10 Annual Conference (TENCON97), Brisbane, Australia, Dec. 1997, pp. 157-160.

1996
  1. Chih-Heng Lin, Chien-Hsing Wu, Pei-Yih Ting, and Hsin-Min Wang, "Frameworks for recognition of Mandarin syllables with tone using sub-syllabic units," Speech Communication, vol. 18, no. 2, pp. 175-190, 1996.

1995
  1. Hsin-Min Wang, Jia-lin Shen, Yen-ju Yang, Chiu-yu Tseng, and Lin-shan Lee, " Complete recognition of continuous Mandarin speech for Chinese language with very large vocabulary but limited training data," in Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing, Detroit, Michigan U.S.A, 1995, pp. 61-64.
  2. Hsin-Min Wang and Lin-shan Lee, "A fast approximate algorithm for parametric variable duration HMM's for speech recognition," in Proc. Int. Conf. Computer Processing of Oriental Languages, Hawaii, 1995, pp. 4-7.
  3. Tai-hsuan Ho, Hsin-Min Wang, Lee-feng Chien, and Lin-shan Lee, "Fast and accurate continuous speech recognition for Chinese language with very large vocabulary," in Proc. European Conf. on Speech Communication and Technology, Madrid, Spain, 1995, vol. I, pp. 211-214.

1994
  1. Hsin-Min Wang and Lin-shan Lee, "Tone recognition for continuous Mandarin speech with limited training data using selected context-dependent hidden Markov models," Journal of The Chinese Institute of Engineers, vol. 17, no. 6, pp. 775-784, 1994.
  2. Hsin-Min Wang, Renyuan Lyu, Jia-lin Shen, and Lin-shan Lee, "Mandarin syllable recognition in continuous speech under limited training data with sub-syllabi c acoustic modeling," International Journal of Computer Processing of Chinese and Oriental Languages, vol. 8, pp. 1-16, Dec. 1994.
  3. Hsin-Min Wang, Yuan-cheng Chang, and Lin-shan Lee, "An algorithm for automatically selecting phonetically balanced sentences from a large corpus for training and testing a speech recognition system," in Proc. Int. Conf. Computer Processing of Oriental Languages, Taejon, Korea, 1994, pp. 207-210.
  4. Renyuan Lyu, Hsin-Min Wang, Shiao-Hong Hwang, Chiu-yu Tseng, and Lin-shan Lee, " A comparison of different acoustic units applied to isolated/continuous large-vocabulary Mandarin speech recognition," in Proc. Int. Conf. Computer Processing of Oriental Languages, Taejon, Korea, 1994, pp. 211-214.
  5. Hsin-Min Wang, Renyuan Lyu, Jia-lin Shen, and Lin-shan Lee, "An initial study on large-vocabulary continuous Mandarin speech recognition with limited training data based on sub-syllabic models," in Proc. Int. Computer Symposium, Hsin-chu, R.O.C., 1994, pp. 1140-1145.
  6. Jia-lin Shen, Hsin-Min Wang, Bo-ren Bai, and Lin-shan Lee, "An initial study on a segmental probability model approach to large-vocabulary continuous Mandarin speech recognition," in Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing, Adelaide, South Australia, 1994, vol. II, pp. 133-136.
  7. Jia-lin Shen, Hsin-Min Wang, Renyuan Lyu, and Lin-shan Lee, "Incremental speaker adaptation using phonetically balanced sentences for Mandarin syllable recognition based on segmental probability models," in Proc. Int. Conf. Spoken Language Processing, Yokohama, Japan, 1994. pp. 443-446.