Guideline to develop AI-backed Chinese language database

Digitalization of ancient texts promotes cultural heritage, Mandarin learning

By Zhao Yimeng | China Daily | Updated: 2025-04-01 09:10

China is accelerating the digitalization of ancient texts and boosting access to oracle bone script data, aiming to integrate cultural heritage with digital Chinese, officials said on Monday.

The Ministry of Education, the National Language Commission and the Cyberspace Administration of China issued a guideline to promote the digitalization of the Chinese language and characters. The focus is on developing national language resources and large-scale Chinese language models to support artificial intelligence.

The guideline aims to establish a national corpus and strategic language resources information database by 2027. By 2035, the country hopes it will have significantly expanded the presence of the Chinese language in global digital and generative AI scenarios.

Liu Peijun, head of the Department of Language Information Management at the Ministry of Education, said the guideline calls for the digitalization of linguistic and cultural heritage, while promoting the construction of a national digital language and script museum.

It emphasizes advancing key technologies for ancient text digitalization, enhancing the accessibility of oracle bone script data and launching a multilingual digital education program to facilitate Chinese language learning globally, Liu said at a news conference.

A key aspect of this initiative is the development of large-scale linguistic data resources. The guideline outlines a plan to build a national corpus with extensive Chinese language datasets to support AI applications.

Among the pilot projects, Beijing Normal University has launched a large-scale Classical Chinese language model, an AI-driven initiative that sets a new benchmark in the field, Liu said.

Kang Zhen, vice-president of BNU, said the university has developed a range of digital language databases, including a comprehensive holographic Chinese character database, a digital resource of the ancient Chinese dictionary Shuowen Jiezi, and repositories for ancient inscriptions and handwritten texts.

These resources have played a crucial role in linguistic research and cultural preservation, Kang added.

The university's AI Taiyan, a Classical Chinese large language model with 1.8 billion parameters, is designed for high-accuracy interpretation of ancient texts. It supports tasks such as word and phrase explanation and classical-to-modern Chinese translation.

China is also spearheading the construction of a new national corpus to strengthen linguistic infrastructure in the AI era, said Wang Hui, deputy head of the Ministry of Education's Department of Language Application and Administration.

"Currently, most linguistic datasets remain limited to single-text formats and specific academic domains, lacking the scale and diversity required for AI applications," Wang said.

The department has begun planning for the corpus this year and seeks to launch two flagship databases: the Chinese civilization corpus for AI-assisted teaching and research, and the Chinese grand reading system corpus, Wang said.
