NUTN-KWS
Whisper Taiwanese Model V0.5
這個模型是由國立臺南大學執行國科會產學合作計畫,使用 openai/whisper-large-v3-turbo 微調的版本,並執行國科會TAIDE台英語家庭先導計畫,與真平出版社合作,使用中小學教材內容及學生學習資料進行模型微調,用於真平教材台語辨識。並與國研院國網中心合作,運用國網中心算力以及TAIDE模型,共同建構中小學台語AI學習模型。 📝 Model Details - Base Model: `openai/whisper-large-v3-turbo` - Fine-tuned for: 台灣閩南語語音辨識 (ASR) - Fine-tuning Framework: Hugging Face Transformers - Training Duration: 使用兩片 V100,大約 180 小時 - Dataset: 自訂資料集、教育部臺灣台語常用詞辭典,大約 90 小時的資料 - Input Format: 16kHz mono WAV - License: CC BY-NC 4.0 APA: - C. S. Lee, M. H. Wang, C. C. Yue, G. Y. Teseng, and Y. Nojima, "Fuzzy Estimation Agent with Knowledge Graph and Quantum Fuzzy Inference Engine for Taiwanese-English Co-Learning," 2025 IFSA World Congress and NAFIPS Annual Meeting (IFSA/NAFIPS 2025), Banff, Alberta, Canada, Aug. 16-19, 2025. - C. S. Lee, M. H. Wang, C. Y. Chen, S. C. Yang, M. Reformat, N. Kubota, and A. Pourabdollah, "Integrating quantum CI and generative AI for Taiwanese/English co-learning," Quantum Machine Intelligence, vol. 6, 64, pp. 1-19, 2024. - C. S. Lee, M. H. Wang, C. Y. Chen, S. C. Yang, M. Reformat, N. Kubota, and A. Pourabdollah, "Quantum fuzzy inference engine with generative AI and TAIDE KG for Taiwanese/English co-learning," 2025 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE 2025), Reims, France, Jul. 6-9, 2025.