Our mission is to increase data efficiency by the factor of 10^6.

Pleias is building the data infrastructure layer for enterprise agentic AI. Efficient, competitive and adapted to entreprise and institutions constraints.

Its dual-product stack - Stratum for AI-native enterprise data processing and Synth for synthetic data generation - enables organizations to train and implement language models that rival systems 100× larger.

Our mission is to increase data efficiency by the factor of 10^6.

Pleias is building the data infrastructure layer for enterprise agentic AI. Efficient, competitive and adapted to entreprise and institutions constraints.

Its dual-product stack - Stratum for AI-native enterprise data processing and Synth for synthetic data generation - enables organizations to train and implement language models that rival systems 100× larger.

What Makes Us Different

Frontier Efficiency

Frontier Efficiency

+200× more training-efficient. Our engineered data reaches state-of-the-art in ~100B tokens, where raw data needs trillions

Designed For Sensitive Use Cases

Designed For Sensitive Use Cases

Can run entirely on your own infrastructure - on-premise or on-device. Your data never leaves your walls and never touches an external API.

Fully Auditable and Compliant

Fully Auditable and Compliant

Every data point is rights-cleared and traceable to its source. Common Corpus - our 2-trillion-token open dataset - is EU AI Act-compliant by construction

Speed To Production

Speed To Production

Our data tooling does the slow, messy prep that stalls most projects - so internal AI use cases ship in weeks instead of months.

Founding Team

Pierre-Carl Langlais

Pierre-Carl Langlais

Credentials

PhD Information Science Sorbonne Center for AI & Sciences Po Médialab

Research Interest

Data efficiency, digital humanities, AI governance

LinkedIn Twitter Email

Ivan Yamshchikov

Ivan Yamshchikov

Credentials

PhD Financial Mathematics Research Professor, CAIRO — THWS; ex-Yandex R&D Lead; ex-ABBYY AI Evangelist

Research Interest

Language models, AI and creativity, applied mathematics

Anastasia Stasenko

Anastasia Stasenko

Credentials

PhD Philosophy, ENS Ulm Associate Professor at Paris-Dauphine University; ex-Hachette Publishing

Research Interest

Philosophy of AI, language and cognition, ethics and safety

Team

Pieter Delobelle

Pieter Delobelle

Lead AI Scientist

PhD, KU Leuven; ex-Aleph Alpha, ex-Apple

Hanna Shcharbakova

Hanna Shcharbakova

AI Engineer

M.Sc. University of Lorraine & Saarland University; B.A. Higher School of Economics

Yannick Detrois

Yannick Detrois

AI Scientist

M.Eng. EPFL

Anton Changalidi

Anton Changalidi

Lead AI Engineer

M.Sc. Maastricht University; B.Eng. ITMO

Carlos Rosas

Carlos Rosas

Lead Data Scientist

PhD candidate, ENS ULM; M.S. Sorbonne Université

Iaroslav Neverov

Iaroslav Neverov

Full-stack AI Engineer

M.Eng. École 42

Enrico Milli

Enrico Milli

Full-stack AI Engineer

BSc, University of Greenwich

Benjamin Burtin

Benjamin Burtin

AI engineer

M.Eng. CentraleSupélec

Neil Si Smail

Neil Si Smail

AI Engineer

M.Eng. CentraleSupélec

Pavel Chizhov

Pavel Chizhov

AI Scientist

PhD candidate, THWS; ex-Yandex

Mohamed Chenene

Mohamed Chenene

AI Engineer

M.Eng. CentraleSupélec; ex-Engie Research Lab

Vishnu Prasad

Vishnu Prasad

AI Engineer

M.Eng. THWS

Vaiva Kazanaviciute

Vaiva Kazanaviciute

Full-Stack AI engineer

M.Eng. Ecole 42

Pandora Langlais

Pandora Langlais

Data Analyst

M.A. École du Louvre

Our Partners & Ecosystem

AI Alliance

Wikimedia Foundation

AI Alliance

Wikimedia Foundation

AI Alliance

Wikimedia Foundation

NvidiaOpen Source Collaboration (Nemotron-Personas)

MozillaLocal AI Builders 1st Cohort

AI Alliance

AI AllianceOpen Trusted Data Initiative Lead

Wikimedia Foundation

Wikimedia FoundationStrategic Partner

thwsAcademic partnership

ScalewayInfrastructure partner

products

Stratum Synth Common Corpus

research

HuggingFace Github Datasets Papers

company

Contact Partners Use Cases Blog / News Team About

legal

GDPR Notice Cookie Policy Terms of Use Legal Mentions

contact

Station F, 5 Parv. Alan Turing 75013, Paris contact@pleias.fr

©2026 Pleias. All rights reserved

LinkedIn GitHub HuggingFace