Lab Director Prof. Shi Xiaodong Leads Eleven Faculty Members and Students to the 14th China Workshop on Machine Translation (CWMT2018)

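On October 25, 2018, the 14th China Workshop on Machine Translation (CWMT2018) opened in Wuyishan, Fujian. The workshop was hosted by the Chinese Information Processing Society of China (中国中文信息学会), organized by the Fujian Artificial Intelligence Society (福建省人工智能学会), co-organized by Wuyi University, and sponsored by companies including Kingsoft (金山) and Yunyi (云译). More than 200 attendees from universities, research institutions, and companies in China and abroad took part in two days of in-depth discussion on machine translation.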

ELRA Language Resources Catalogue – Update

We are happy to announce that 2 new Written Corpora and 4 new Speech resources are now available in our catalogue.

ELRA-W0126 Training and test data for Arabizi detection and transliteration
ISLRN: 986-364-744-303-9
The dataset has two parts: a collection of mixed English and Arabizi text, intended to train and test a system for automatically detecting code-switching in such text; and a set of 3,452 Arabizi tokens manually transliterated into Arabic, intended to train and test a system that performs Arabizi-to-Arabic transliteration.
For more information, see: http://catalog.elra.info/en-us/repository/browse/ELRA-W0126/