Prof. Dr. Sebastian Schelter
Research Group Lead
Research Group Lead | BIFOLD
Professor | Technische Universität Berlin
Sebastian Schelter is a Full Professor at the Berlin Institute for the Foundations of Learning and Data (BIFOLD) and Technische Universität Berlin. His research is focused on the intersection of data management and machine learning with the goal to foster the responsible management of data and to democratise data science technologies.
The research of his group is accompanied by efficient and scalable open source implementations, many of which are applied in real world use cases, for example in the Amazon Web Services cloud and in large European e-commerce platforms.
In the past, he has been an assistant professor at the University of Amsterdam, a faculty fellow at New York University, a senior applied scientist at Amazon Research and a research intern at Twitter and IBM Almaden in California. His research contributions have been recognized with an ACM SIGMOD Systems Award, an ACM SIGMOD Best Demo Runner Up Award, and a Best Paper Runner Up Award from the Table Representation Learning workshop at NeurIPS.
2018 | Moore-Sloan Data Science Fellowship |
2015 | Amazon Education Research Grant Award |
2012 | IBM Faculty Award (with Volker Markl) |
- Data Management for End-to-End Machine Learning
- Responsible Data Management
- Massively Parallel Data Processing
- Scalable Data Mining; Recommender Systems
- Computational Social Science
- Apache Software Foundation (emeritus)
- Association for Computing Machinery (ACM)
- Electronic Frontier Foundation
- Deutscher Hochschulverband
Sebastian Schelter, Shubha Guha, Stefan Grafberger
Automated Provenance-Based Screening of ML Data Preparation Pipelines
Sebastian Schelter, Stefan Grafberger
Messy Code Makes Managing ML Pipelines Difficult? Just Let LLMs Rewrite the Code!
Zeyu Zhang, Paul Groth, Iacer Calixto, Sebastian Schelter
AnyMatch -- Efficient Zero-Shot Entity Matching with a Small Language Model
Tim Januschowski, Jan Gasthaus, Yuyang (Bernie) Wang, Syama Rangapuram, Caner Turkmen, Jasper Zschiegner, Lorenzo Stella, Michael Bohlke-Schneider, Danielle Maddix Robinson, Konstantinos Benidis, Alexander Alexandrov, Christos Faloutsos, Sebastian Schelter
A Flexible Forecasting Stack
Stefan Grafberger, Paul Groth, Sebastian Schelter
Towards Interactively Improving ML Data Preparation Code via "Shadow Pipelines"
Publication Highlight - Snapcase
At the VLDB 2024 conference, the BIFOLD Research Group DEEM Lab introduced "Snapcase," a demo paper that addresses the concept of machine unlearning.
Reviewing VLDB 2024
Four BIFOLD research groups participated in the 50th International Conference on Very Large Databases in Guangzhou, China, taking place from August 26 to 30, 2024.
Newly appointed BIFOLD Professor: Sebastian Schelter
As of June 1st, 2024, Sebastian Schelter is a full Professor at the Berlin Institute for the Foundations of Learning and Data (BIFOLD) and Technische Universität Berlin. He chairs the Management of Data Science Processes group (DEEM Lab), whose research is focused on the intersection of data management and machine learning.
BIFOLD at the 2024 ACM SIGMOD/PODS Conference
BIFOLD researchers presented four research papers, two demos, one workshop paper and were of a panel at the 2024 ACM SIGMOD/ PODS Conference in Santiago, Chile.
8 researchers represented BIFOLD at SIGMOD 2023
Eight members of the BIFOLD team took the chance to showcase their recent work at SIGMOD 2023 in Seattle through a diverse array of presentations, including research papers, workshop papers, and a demo paper – all of them underscoring the institute's commitment to cutting-edge research in the field of data management.