Director of Machine Learning Research & Principal Scientist
đ SAP AI CTO Office | đĄ LLMs, Foundation Models, Structured Data AI
đ LinkedIn âą Google Scholar
Building intelligent systems that connect language, knowledge, and enterprise data.
I am Director of Machine Learning Research & Principal Scientist in the SAP AI CTO Office, where I lead research on large language models (LLMs), foundation models, and structured data AI.
My mission is to advance how enterprises leverage AI â by designing systems that can reason over complex structured data, adapt to limited supervision, and integrate domain-specific knowledge with the general capabilities of LLMs.
I oversee strategic research programs that bridge fundamental machine learning and enterprise-scale deployment, turning scientific innovation into robust, trustworthy AI platforms.
This includes developing foundation models for structured and relational data, knowledge-augmented LLMs, and agentic AI systems that coordinate decision-making and automation across business processes.
Over the past decade, Iâve built and led multidisciplinary research teams translating state-of-the-art methods into product-ready technologies â shaping how AI is applied in large-scale enterprise systems.
Before joining SAP, I conducted postdoctoral research at MIT CSAIL and Harvard Medical School, focusing on representation learning and multimodal modeling for medical imaging.
I hold a Ph.D. in Computer Science (summa cum laude) from the Technical University of Munich (TUM) and am a member of the European Laboratory for Learning and Intelligent Systems (ELLIS).
[2025.11] - A paper accepted (spotlight) at the EurIPSâ25 Workshop on AI for Tabular Data
[2025.07] - A paper accepted at the SIGMODâ25 Workshop on Data Management for End-to-End Machine Learning
[2025.07] - SALT officially integrated into RelBench.
[2025.05] - New pre-print available on foundation models for tabular data in enterprises
[2025.05] - Paper accepted at ACLÂ 2025Â â Contrastive Perplexity for Controlled Generation: An Application in Detoxifying Large Language Model
[2024.10] - Two papers accepted at the NeurIPSâ24 Table Representation Learning Workshop
[2023.05] - Paper accepted at ACL 2023 on low-shot contrastive learning of sentence representations.
[2022.02] Paper accepted at ACL 2022 on self-supervised sentence representation learning
[2021.08] Paper at EMNLP 2021 on Contrastive Language Model Refinement for Commonsense Reasoning
[2021.08] Paper at EMNLP 2021 on Contrastive Self-Supervised Learning for Commonsense Reasoning
[2021.04] Acceptance of co-organized at ICML 2021 workshop on Self-Supervised Learning for Reasoning and Perception
[2021.02] Paper accepted at IPMI 2021 on self-supervised representation learning for medical imaging (acceptance rate 30.0%)
[2020.09] Presentation on commonsense reasoning in AI
[2020.04] Paper accepted at ACL 2020 on contrastive self-supervised commonsense reasoning (acceptance rate of 17.6%)
[2020.02] Paper accepted at NeuroImage
[2019.10.20] Paper on Multi-Domain Learning accepted at ICCV 2019 (acceptance rate 25.0%)
[2019.05.14] Short-paper on commonsense reasoning accepted at ACL 2019 (acceptance rate 18.2%)
[2019.02.25] Paper accepted at CVPR 2019 (acceptance rate 25.2%)
[2017.02.01] Paper accepted at NeuroImage
I aim to advance AI systems that reason over structured data, adapt efficiently, and integrate domain knowledge to support enterprise decision-making.
At SAP, I drive the interface between fundamental AI research and strategic enterprise transformation, shaping how LLMs evolve for real-world impact.
Iâve had the privilege of mentoring exceptional research interns who have since moved on to leading academic labs and industrial AI teams:
Enrico Fini â Member of Technical Staff at Microsoft AI (previously Research Scientist at Apple Machine Learning Research)
Stefan Lionar â Ph.D. Student in Computer Science at National University of Singapore (NUS) & Sea AI Lab
Jan Nikolas Morshuis â Doctoral Researcher at the University of TĂŒbingen (Cluster of Excellence âMachine Learningâ / TĂŒbingen AI Center)
Aiham Taleb â Senior Applied Scientist â Generative AI at Amazon Web Services (AWS) / Generative AI Innovation Center
Artur Speiser â AI Research Scientist at Exscientia, formerly Ph.D. in Machine Learning in Science at University of TĂŒbingen
Jannik Wolff â Machine Learning Research Scientist & Research Associate at BIFOLD, Ph.D. Candidate at TU Berlin (Machine Learning Group)
Daniel Dorda â Scientific Software Engineer at uniqFEED and researcher/teaching assistant at ETH ZĂŒrich (Computer Graphics / Computer Science)
Max Bain â Research Scientist at Google DeepMind, previously Member of Technical Staff at Reka and Ph.D. at VGG, University of Oxford
Mahdyar Ravanbakhsh â (Principal) Research Scientist at Zalando SE in Berlin
Oleksiy Ostapenko â Research Scientist at ServiceNow AI Research (Foundation Models Lab); Ph.D. from MILA / UniversitĂ© de MontrĂ©al
Colin Samplawski â Ph.D. Student at UMass Amherst (REML group) and Advanced Computer Scientist at SRI International (NuSCI Research Group)
Mihai M. Puscas â Computer Vision Researcher at SPORTTOTAL.TV (previously Huawei Research, Dublin)
Sandro Pezzelle â Assistant Professor in Responsible AI at ILLC, University of Amsterdam; Scientific Advisor at IVADO Labs
Frederik Pahde â Research Associate / Ph.D. Candidate at Fraunhofer HHI & TU Berlin (previously Applied ML Scientist at Amazon)
Shailza Jolly â Research Scientist at Amazon Alexa AI in Berlin (Ph.D. in Generative AI from TU Kaiserslautern / DFKI)
Denis Dushi â Machine Learning Engineer at Amazon
Vadim Tschernezki â Ph.D. Student in Computer Vision at the Department of Engineering Science, University of Oxford (VGG)
Robin C. Geyer â Ph.D. Student at the Institute for Machine Learning, ETH ZĂŒrich
Status to the best of my knowledge as of November 2025.
Last updated â 27 November 2025