Our mission is to increase data efficiency by the factor of 10^6.

Pleias is building the data infrastructure layer for enterprise agentic AI. Efficient, competitive and adapted to entreprise and institutions constraints.

Its dual-product stack - Stratum for AI-native enterprise data processing and Synth for synthetic data generation - enables organizations to train and implement language models that rival systems 100× larger.

What Makes Us Different

Frontier Efficiency
Frontier Efficiency

+200× more training-efficient. Our engineered data reaches state-of-the-art in ~100B tokens, where raw data needs trillions

Designed For Sensitive Use Cases
Designed For Sensitive Use Cases

Can run entirely on your own infrastructure - on-premise or on-device. Your data never leaves your walls and never touches an external API.

Fully Auditable and Compliant
Fully Auditable and Compliant

Every data point is rights-cleared and traceable to its source. Common Corpus - our 2-trillion-token open dataset - is EU AI Act-compliant by construction

Speed To Production
Speed To Production

Our data tooling does the slow, messy prep that stalls most projects - so internal AI use cases ship in weeks instead of months.

Founding Team

Pierre-Carl Langlais
Pierre-Carl Langlais
Credentials

PhD Information Science Sorbonne Center for AI & Sciences Po Médialab

Research Interest

Open data infrastructure, digital humanities, AI governance

Ivan Yamshchikov
Ivan Yamshchikov
Credentials

PhD Financial Mathematics Research Professor, CAIRO — THWS; ex-Yandex R&D Lead; ex-ABBYY AI Evangelist

Research Interest

Language models, generative AI, applied mathematics

Anastasia Stasenko
Anastasia Stasenko
Credentials

PhD Philosophy, ENS Ulm Associate Senior Lecturer, Sorbonne-Nouvelle; ex-Hachette Publishing

Research Interest

Philosophy of AI, language and cognition, publishing technology

Team

Pieter Delobelle
Pieter Delobelle
Lead AI Scientist

PhD, KU Leuven; ex-Aleph Alpha, ex-Apple

Hanna Shcharbakova
Hanna Shcharbakova
AI Engineer

M.Sc. University of Lorraine & Saarland University; B.A. Higher School of Economics

Yannick Detrois
Yannick Detrois
AI Scientist

M.Eng. EPFL

Anton Changalidi
Anton Changalidi
Lead AI Engineer

M.Sc. Maastricht University; B.Eng. ITMO

Carlos Rosas
Carlos Rosas
Lead Data Scientist

PhD candidate, ENS ULM; M.S. Sorbonne Université

Neil Si Smail
Neil Si Smail
AI Engineer

M.Eng. CentraleSupélec

Iaroslav Neverov
Iaroslav Neverov
Full-stack AI Engineer

M.Eng. École 42

Benjamin Burtin
Benjamin Burtin
AI engineer

M.Eng. CentraleSupélec

Pavel Chizhov
Pavel Chizhov
AI Scientist

PhD candidate, THWS; ex-Yandex

Mohamed Chenene
Mohamed Chenene
AI Engineer

M.Eng. CentraleSupélec; ex-Engie Research Lab

Enrico Milli
Enrico Milli
Full-stack AI Engineer

BSc, University of Greenwich

Vishnu Prasad
Vishnu Prasad
AI Engineer

M.Eng. THWS

Vaiva Kazanaviciute
Vaiva Kazanaviciute
Full-Stack AI engineer

M.Eng. Ecole 42

Pandora Langlais
Pandora Langlais
Data Analyst

M.A. École du Louvre

Our Partners & Ecosystem

Nvidia
Mozilla
AI Alliance
Wikimedia Foundation
thws
Scaleway
Nvidia
Mozilla
AI Alliance
Wikimedia Foundation
thws
Scaleway
Nvidia
Mozilla
AI Alliance
Wikimedia Foundation
thws
Scaleway
Nvidia
NvidiaOpen Source Collaboration (Nemotron-Personas)
Mozilla
MozillaLocal AI Builders 1st Cohort
AI Alliance
AI AllianceOpen Trusted Data Initiative Lead
Wikimedia Foundation
Wikimedia FoundationStrategic Partner
thws
thwsAcademic partnership
Scaleway
ScalewayInfrastructure partner