Publications
Publications that CrossLang has (co-)authored
Blogs

The Process of Building a Custom Machine Translation Engine – From A to Z
What are custom machine translation engines? How are they built? Learn all about it in this blog.

Automated Evaluation of Machine Translation – An Overview
Evaluating machine translation output is no easy feat. In order to quantify the quality of machine translation, the first thought that comes to mind is

Human Evaluation of Machine Translation Quality – A Quick Guide
A couple of years ago, neural machine translation stepped into the limelight. Using deep learning, the quality of machine translation has increased substantially and will

Anonymisation and Machine Translation
What happens when an anonymised text is processed automatically? Read our blog to learn more about how MT deals with anonymised content.

Anonymisation: an Introduction
What does anonymisation entail? And why is it so important? Read all about it here.

Machine Translation in Localization
How can Machine Translation be used in a commercial setting? Read all about it here.
All publications
On Curating HTR Training Datasets for Romanian Language with use of Transcribathon Tool (to appear in 2026)
Authors: S. Gordea, G.C. Cotea, F. Drauschke, J. Salesevic, M. Lamote, T. Vanallemeersch
Tailoring Machine Translation for Scientific Literature through Topic Filtering and Fuzzy Match Augmentation (2025)
Authors: T. Moerman, T. Vanallemeersch, S. Szoc, A. Tezcan
AI4Culture Platform: Upskilling Experts on Multilingual/-modal Tools (2024)
Authors: T. Vanallemeersch, S. Szoc, M. Lamote, F. Everaert, E. Kaldeli
AI4Culture: Towards Multilingual Access for Cultural Heritage Data (2024)
Authors: T. Vanallemeersch, S. Szoc, L. Meeus
Term Translation: Convert or Converse? (2023)
Authors: A. Kostikova, K. Migdisi, S. Szoc, T. Vanallemeersch
Translations and Open Science: Exploring how translation technologies can support multilingualism in scholarly communication (2023)
Authors: S. Fiorini, A. Tezcan, T. Vanallemeersch, S. Szoc, K. Migdisi, L. Meeus, L. Macken
ELRC Action: Covering Confidentiality, Correctness and Cross-linguality (2022)
Authors: Tom Vanallemeersch, Arne Defauw, Sara Szoc, Alina Kramchaninova, Joachim Van den Bogaert & Andrea Lösch
Synthetic Data Generation for Multilingual Domain-Adaptable Question Answering Systems (2022)
Authors: Alina Kramchaninova & Arne Defauw
Automatically Extracting the Semantic Network out of Public Services to Support Cities Becoming Smart Cities (2022)
Authors: Joachim Van den Bogaert, Laurens Meeus, Alina Kramchaninova, Arne Defauw, Sara Szoc, Frederic Everaert, Koen Van Winckel, Anna Bardadym & Tom Vanallemeersch
OCCAM: Cross-lingual Unlocking of Non-digital Texts (2021)
Authors: Laurens Meeus, Joachim Van den Bogaert, Arne Defauw, Oan Stultjens, Sara Szoc, Tom Vanallemeersch, Frederic Everaert & Koen Van Winckel
Validating Quality Estimation in a Computer-Aided Translation Workflow: Speed, Cost and Quality Trade-off (2021)
Authors: Fernando Alva-Manchego, Lucia Specia, Sara Szoc, Tom Vanallemeersch & Heidi Depraetere