男女羞羞视频在线观看,国产精品黄色免费,麻豆91在线视频,美女被羞羞免费软件下载,国产的一级片,亚洲熟色妇,天天操夜夜摸,一区二区三区在线电影
Global EditionASIA 中文雙語Fran?ais
China
Home / China / Education

Guideline to develop AI-backed Chinese language database

Digitalization of ancient texts promotes cultural heritage, Mandarin learning

By Zhao Yimeng | China Daily | Updated: 2025-04-01 09:10
Share
Share - WeChat

China is accelerating the digitalization of ancient texts and boosting access to oracle bone script data, aiming to integrate cultural heritage with digital Chinese, officials said on Monday.

The Ministry of Education, the National Language Commission and the Cyberspace Administration of China issued a guideline to promote the digitalization of the Chinese language and characters. The focus is on developing national language resources and large-scale Chinese language models to support artificial intelligence.

The guideline aims to establish a national corpus and strategic language resources information database by 2027. By 2035, the country hopes it will have significantly expanded the presence of the Chinese language in global digital and generative AI scenarios.

Liu Peijun, head of the Department of Language Information Management at the Ministry of Education, said the guideline calls for the digitalization of linguistic and cultural heritage, while promoting the construction of a national digital language and script museum.

It emphasizes advancing key technologies for ancient text digitalization, enhancing the accessibility of oracle bone script data and launching a multilingual digital education program to facilitate Chinese language learning globally, Liu said at a news conference.

A key aspect of this initiative is the development of large-scale linguistic data resources. The guideline outlines a plan to build a national corpus with extensive Chinese language datasets to support AI applications.

Among the pilot projects, Beijing Normal University has launched a large-scale Classical Chinese language model, an AI-driven initiative that sets a new benchmark in the field, Liu said.

Kang Zhen, vice-president of BNU, said the university has developed a range of digital language databases, including a comprehensive holographic Chinese character database, a digital resource of the ancient Chinese dictionary Shuowen Jiezi, and repositories for ancient inscriptions and handwritten texts.

These resources have played a crucial role in linguistic research and cultural preservation, Kang added.

The university's AI Taiyan, a Classical Chinese large language model trained with 1.8 billion parameters, has been designed for high-accuracy interpretation of ancient texts, supporting tasks such as word and phrase explanations, as well as classical-to-modern Chinese translation.

China is also spearheading the construction of a new national corpus to strengthen linguistic infrastructure in the AI era, said Wang Hui, deputy head of the Ministry of Education's Department of Language Application and Administration.

"Currently, most linguistic datasets remain limited to single-text formats and specific academic domains, lacking the scale and diversity required for AI applications," Wang said.

The department has begun planning for the corpus this year, seeking to launch two flagship databases, the Chinese civilization corpus for AI-assisted teaching and research, and the Chinese grand reading system corpus, Wang said.

Top
BACK TO THE TOP
English
Copyright 1995 - . All rights reserved. The content (including but not limited to text, photo, multimedia information, etc) published in this site belongs to China Daily Information Co (CDIC). Without written authorization from CDIC, such content shall not be republished or used in any form. Note: Browsers with 1024*768 or higher resolution are suggested for this site.
License for publishing multimedia online 0108263

Registration Number: 130349
FOLLOW US
 
主站蜘蛛池模板: 新龙县| 滦南县| 东明县| 夏津县| 兰西县| 镶黄旗| 巴彦淖尔市| 中山市| 石狮市| 娱乐| 房产| 大化| 固安县| 巴林左旗| 绍兴市| 平果县| 建始县| 天全县| 云龙县| 安福县| 平潭县| 泽普县| 恩平市| 明光市| 嘉峪关市| 新龙县| 突泉县| 同心县| 嘉荫县| 鲁山县| 汉阴县| 蛟河市| 化州市| 宽城| 通渭县| 甘孜| 银川市| 基隆市| 庆安县| 余姚市| 大渡口区| 尼勒克县| 昭觉县| 新乡县| 崇明县| 印江| 十堰市| 盐池县| 资溪县| 惠州市| 奉化市| 驻马店市| 石泉县| 绥芬河市| 兴宁市| 迁西县| 司法| 精河县| 黔南| 遵义市| 陕西省| 靖远县| 万荣县| 深水埗区| 洱源县| 始兴县| 罗山县| 砀山县| 邻水| 宁河县| 楚雄市| 射阳县| 泸水县| 女性| 报价| 江城| 泰和县| 大英县| 鹤庆县| 安西县| 屯昌县| 沁阳市|