DATAmundi

Belgian AI training-data specialist — multilingual speech, text, image, and video datasets. Now part of Summa Linguae Technologies.

AI Training AI Training Data / Multilingual Datasets
New (0 reviews)
Linter, Belgium HQ
35 employees
2007 founded

What is DATAmundi?

DATAmundi is a Belgian AI training-data company founded in 2007 by Gert Van Assche and his wife in Linter, Belgium. Originally a language-services boutique, the company moved into AI training data well before the LLM era and built a reputation for high-quality multilingual datasets across speech, text, image, and video. In December 2021 DATAmundi was acquired by Polish language-services group Summa Linguae Technologies, which subsequently consolidated its AI-data offering under the DATAmundi brand and relaunched as DATAmundi.ai — "data of the world". The company supplies ethically-sourced, bias-aware training data to AI labs and enterprises through a hybrid model: a ~35-person core team in Belgium directs project delivery against a broader contributor / language-expert network mobilised per engagement. Services include speech data collection and transcription, multilingual text annotation, image and video labelling, MT post-editing, search relevance evaluation, and human-in-the-loop QA. Specialism is European and rarer-language coverage where larger competitors struggle to staff quality contributors.

Mission & values

Supply ethically-sourced, bias-aware multilingual training data — built by language experts, audited by humans — to power AI systems that work as well in every language as they do in English.

Qualifications

DATAmundi hires across two tracks. Core team (Belgium / Summa Linguae offices in Poland and elsewhere) covers project management, linguistics, quality, data engineering, sales, and account management; these are local-employment roles typically based in Belgium or other EU Summa Linguae hubs. Contributor track covers freelance / per-project annotators, transcribers, voice talent, evaluators, and language experts — recruited per engagement against specific language pairs and domains (especially European and rare languages). Contributor work is fully remote with a country / native-language match per project. Applications go through the careers section of datamundi.ai and recruitment-page postings on the company's LinkedIn.

Leadership

V

Véronique Özkaya

Group Leadership (Global Expansion & Innovation)

25+ years in global content services, AI, and data solutions. Drives DATAmundi's expansion strategy across the combined Summa Linguae + DATAmundi entity.

E

Emanuele Di Rosa

Chief Technology Officer

~20 years in AI, software development, and language technologies. Leads DATAmundi's AI tooling, data pipelines, and annotation platforms.

A

Asier Pereda Jayo

Chief Revenue Officer

Leads global revenue strategy across DATAmundi's AI labs and enterprise customer base.

S

Sophie Murphy

Chief Operating Officer

Drives operational delivery across project management, quality, and contributor mobilisation.

G

Gert Van Assche

Founder (now exited)

Co-founded DATAmundi (BVBA) in 2007 in Linter, Belgium. Managing partner through the Summa Linguae acquisition in December 2021.

Hiring process

  1. 1

    Browse roles

    Visit datamundi.ai and review open contributor or core-team postings; LinkedIn often carries the most current listings.

  2. 2

    Apply

    Submit application via the listing — contributor track usually asks for language pairs, prior annotation/voice/transcription experience, and a CV.

  3. 3

    Screening

    Recruiter or project lead confirms language fluency, availability, and country / time-zone fit.

    About 3 days

  4. 4

    Skills test

    Language-specific assessment (transcription accuracy, annotation pilot, voice sample, MT post-editing trial).

    About 5 days

  5. 5

    Onboarding

    Sign engagement paperwork, complete project-specific guidelines, and begin paid work.

    About 3 days

Funding

StageAcquired
Investors
Summa Linguae Technologies (acquirer, December 2021)

Awards & recognition

  • Founded 2007 in Linter, Belgium · 2007

    DATAmundi

  • Acquired by Summa Linguae Technologies · 2021

    MultiLingual / Slator

  • Rebranded to DATAmundi.ai ("data of the world") · 2024

    DATAmundi

Company information

Frequently asked questions

Who owns DATAmundi?
DATAmundi was acquired by Polish language-services group Summa Linguae Technologies in December 2021. The DATAmundi.ai brand is Summa Linguae's consolidated AI training-data offering.
Who founded DATAmundi?
Gert Van Assche and his wife in Linter, Belgium, in 2007.
What kind of work does DATAmundi offer?
Speech data collection and transcription, multilingual text annotation, image and video labelling, MT post-editing, search relevance evaluation, and human-in-the-loop QA — with particular depth in European and rarer languages.
Can I sign up as a contributor?
Yes — contributor / language-expert applications go through the datamundi.ai careers section and LinkedIn postings. Engagements are project-based with language and country match.
Are roles remote?
Contributor work is fully remote (with country / native-language match per project). Core-team roles are typically based in Belgium or other Summa Linguae EU hubs.
What languages does DATAmundi cover?
A broad European book plus rarer languages where larger competitors struggle to staff quality contributors — staffed per project against client requirements.

Stay in the loop.

One email per week, 5 hand-picked roles.