A Generative Foundation Model for Heterogeneous Tabular Data
Director of ML Research · Principal Scientist · SAP AI
Foundation models, agents, structured data, generative UI
Research leadership in foundation models, enterprise data, and applied AI systems.
My work connects machine learning research with deployed AI systems, from representation learning and foundation models for structured data to agentic workflows and AI-native interfaces.
I lead ML research at the SAP AI CTO Office, where my team works on foundation models for enterprise data. As a scientific advisor to startups, I also work on agentic AI systems and generative UI. My research spans the full stack: from representation learning and self-supervised pre-training to autonomous agent runtimes and AI-native interface generation.
I’m particularly interested in the coordination economics of agentic AI — how protocol-mediated agent systems change the cost structure of enterprise software, and what that implies for how we build and deploy AI at scale.
Previously: postdoc at MIT CSAIL and Harvard Medical School (representation learning, multimodal medical imaging). Ph.D. summa cum laude from TU Munich. Member of ELLIS.
| Area | Focus |
|---|---|
| Agentic AI | Autonomous agent runtimes, MCP-based tool calling, multi-agent orchestration, self-growing tool libraries |
| Foundation Models for Structured Data | Tabular pre-training, relational reasoning, enterprise-scale data representation |
| generative UI | AI-driven dynamic interface generation, adaptive UX, intent-to-UI synthesis |
| Knowledge-Augmented LLMs | RAG, privacy-preserving fine-tuning, domain adaptation |
| Representation Learning | Contrastive and self-supervised methods, multimodal alignment |
A Generative Foundation Model for Heterogeneous Tabular Data
TableFactory: Generating Semantically Linked Tabular Data via Multi-Agent Behavioral Simulation
Statistically Indistinguishable, Operationally Distinct: A Formal Barrier for Tabular Foundation Models
Tabular Foundation Model for Generative Modelling
The Headless Firm: How AI Reshapes Enterprise Boundaries
SALT-KG: A Benchmark for Semantics-Aware Learning on Enterprise Tables
Table Dissolution: Adding Salt To Your Data
SALT integrated into RelBench
Foundation Models for Tabular Data in Enterprises
Contrastive Perplexity for Controlled Generation: An Application in Detoxifying Large Language Model
SALT: Sales Autocompletion Linked Business Tables Dataset
PORTAL: Scalable Tabular Foundation Models via Content-Specific Tokenization
Low-shot contrastive learning of sentence representations
Self-supervised sentence representation learning
Contrastive Language Model Refinement for Commonsense Reasoning
Contrastive Self-Supervised Learning for Commonsense Reasoning
Co-organizer: Self-Supervised Learning for Reasoning and Perception
Self-supervised representation learning for medical imaging
Commonsense reasoning in AI
Contrastive self-supervised commonsense reasoning
Representation learning for medical imaging
Multi-Domain Learning
Commonsense reasoning
Deep generative models for visual learning
DeepNAT
Former interns who have gone on to research roles at top labs and faculty positions:
Enrico Fini — Member of Technical Staff at Microsoft AI (previously Research Scientist at Apple Machine Learning Research)
Stefan Lionar — Ph.D. Student in Computer Science at National University of Singapore (NUS) & Sea AI Lab
Jan Nikolas Morshuis — Doctoral Researcher at the University of Tübingen (Cluster of Excellence “Machine Learning” / Tübingen AI Center)
Aiham Taleb — Senior Applied Scientist – Generative AI at Amazon Web Services (AWS) / Generative AI Innovation Center
Artur Speiser — AI Research Scientist at Exscientia, formerly Ph.D. in Machine Learning in Science at University of Tübingen
Jannik Wolff — Machine Learning Research Scientist & Research Associate at BIFOLD, Ph.D. Candidate at TU Berlin (Machine Learning Group)
Daniel Dorda — Scientific Software Engineer at uniqFEED and researcher/teaching assistant at ETH Zürich (Computer Graphics / Computer Science)
Max Bain — Research Scientist at Google DeepMind, previously Member of Technical Staff at Reka and Ph.D. at VGG, University of Oxford
Mahdyar Ravanbakhsh — (Principal) Research Scientist at Zalando SE in Berlin
Oleksiy Ostapenko — Research Scientist at ServiceNow AI Research (Foundation Models Lab); Ph.D. from MILA / Université de Montréal
Colin Samplawski — Ph.D. Student at UMass Amherst (REML group) and Advanced Computer Scientist at SRI International (NuSCI Research Group)
Mihai M. Puscas — Computer Vision Researcher at SPORTTOTAL.TV (previously Huawei Research, Dublin)
Sandro Pezzelle — Assistant Professor in Responsible AI at ILLC, University of Amsterdam; Scientific Advisor at IVADO Labs
Frederik Pahde — Research Associate / Ph.D. Candidate at Fraunhofer HHI & TU Berlin (previously Applied ML Scientist at Amazon)
Shailza Jolly — Research Scientist at Amazon Alexa AI in Berlin (Ph.D. in Generative AI from TU Kaiserslautern / DFKI)
Denis Dushi — Machine Learning Engineer at Amazon
Vadim Tschernezki — Ph.D. Student in Computer Vision at the Department of Engineering Science, University of Oxford (VGG)
Robin C. Geyer — Ph.D. Student at the Institute for Machine Learning, ETH Zürich
Last updated — July 2026