花辨直播官方版_花辨直播平台官方app下载_花辨直播免费版app下载

Global EditionASIA 中文雙語Fran?ais
Culture
Home / Culture / Heritage

AI language model hopes to preserve metropolis' local dialect

By ZHENG ZHENG | CHINA DAILY | Updated: 2024-11-22 07:57
Share
Share - WeChat
The language model's development team discussing the project. [Photo by ZHENG ZHENG/CHINA DAILY]

A team of researchers at Shanghai University has unveiled an artificial intelligence language model designed to help preserve and promote the native Shanghai dialect. Named Xiao Hu, the model is capable of speech recognition, text-to-speech conversion, and dialogue between the Shanghai dialect and Mandarin Chinese.

"Our goal extends far beyond merely teaching AI to speak Shanghainese," says Zeng Jun, director of Shanghai University's publicity department. "Our true objective is to leverage this language model as a means to preserve and innovate the development of the Shanghai dialect, enabling more people, especially Shanghai, to become familiar with and use it."

The impetus behind the project stems from the university's deep-rooted excellence in Chinese language studies and its vision of empowering Chinese through AI technology. Renowned scholars, such as Professor Qian Nairong, a leading authority on the research of the Shanghai dialect, and Associate Professor Ding Dimeng, a Shanghai dialect expert, have laid the foundation for this project.

Ding underscored the cultural significance of the Shanghai dialect, describing it as the root of the city's identity, which contains ancient pronunciations and modern loanwords. "The Shanghai dialect is indispensable in this metropolis," Ding says. "I believe this language model can play an important role in language and dialect preservation efforts, accomplishing tasks that individuals cannot."

The urgency of the project cannot be overstated.

According to doctorate candidate Fan Tianyu, a key member of the development team, a recent statistic shows that only 22 percent of Shanghai residents aged 6 to 20 still use the dialect. The dialect's precarious state highlights the challenge of creating and purifying high-quality language data sets.

1 2 Next   >>|
Most Popular
Top
BACK TO THE TOP
English
Copyright 1995 - . All rights reserved. The content (including but not limited to text, photo, multimedia information, etc) published in this site belongs to China Daily Information Co (CDIC). Without written authorization from CDIC, such content shall not be republished or used in any form. Note: Browsers with 1024*768 or higher resolution are suggested for this site.
License for publishing multimedia online 0108263

Registration Number: 130349
FOLLOW US