Menu:

CIDR 2022 Program


Get the program as Calendar entries.

Talk videos will be posted after each conference day on YouTube.

Sunday

4:00pm

Registration

(Room: Seacliff Lounge and Terrace)

5:30pm

Dinner

(Room: Sunset Restaurant)

Monday

7:00am

Breakfast

(hotel guests only)

8:30am

Opening and Welcome

(Room: Santa Cruz) 8:30 - 08:45 Fatma Özcan (Google), Peter Boncz (CWI), Thomas Neumann (TUM)

8:45am

Keynote 1

(Room: Santa Cruz) 8:45 - 10:00 Chair: Fatma Özcan (Google) Lessons from Building Databricks Matei Zaharia (Stanford and Databricks)

10:00am

Break

10:30am

Session 1: Modern Data Storage

Chair: Natacha Crooks (Room: Santa Cruz) 10:30 - 10:50 Append is Near: Log-based Data Management on ZNS SSDs Devashish R Purandare (University of California, Santa Cruz); Peter Wilcox (UC Santa Cruz); Heiner Litz (UC Santa Cruz); Shel Finkelstein (University of California at Santa Cruz, USA) 10:50 - 11:10 SSDs Striking Back: The Storage Jungle and Its Implications to Persistent Indexes Kaisong Huang (Simon Fraser University); Darien Imai (Simon Fraser University); Tianzheng Wang (Simon Fraser University); Dong Xie (Penn State University) 11:10 - 11:30 Are You Sure You Want to Use MMAP in Your Database Management System? Andrew Crotty (Carnegie Mellon University)*; Viktor Leis (Friedrich-Alexander-Universität Erlangen-Nürnberg); Andrew Pavlo (Carnegie Mellon University) 11:30 - 11:50 D-RDMA: Bringing Zero-Copy RDMA to Database Systems André Ryser (University of Fribourg); Alberto Lerner (University of Friborug)*; Alex Forencich (University of California - San Diego); Philippe Cudre-Mauroux (Exascale Infolab, Fribourg University) 11:50 - 11:55 Micro-architectural Analysis of OLAP Systems on Persistent Memory Jie Liang Ang (National University of Singapore)*; Jefferson Chu (National University of Singapore); Hieu Le Trung (Bytedance); Jiajun Liu (NUS); Jiong He (Bytedance Ltd.); Qian Lin (ByteDance); Bingsheng He (National University of Singapore) 11:55 - 12:00 Runtime Encoding Execution in AnalyticDB: Efficient Query Executor for Cloud-native Database Qiaoyi Ding (Alibaba)

12:00pm

Lunch

(Room: Sunset Restaurant)

Women in DB: Discussion and Socialization (Open to All CIDR Attendees)

(Room: online & Sunset Patio) Chairs: Yuanyuan Tian & Alekh Jindal

01:00pm

Session 2: New Systems

Chair: Tilmann Rabl (Room: Santa Cruz) 1:00 - 1:20 A Progress Report on DBOS: A Database-oriented Operating System Qian Li (Stanford University)*; Peter Kraft (Stanford University); Kostis Kaffes (Stanford University); Athinagoras Skiadopoulos (Stanford University); Deeptaanshu Kumar (Carnegie Mellon University); Jason Li (MIT); Michael Cafarella (MIT); Goetz Graefe (Google); Jeremy Kepner (MIT Lincoln Laboratory); Christos Kozyrakis (Stanford University); Michael Stonebraker (MIT); Lalith Suresh (VMware Research); Matei Zaharia (Stanford and Databricks) 1:20 - 1:40 Mach: A Pluggable Metrics Storage Engine for the Age of Observability Franco Solleza (Brown University)*; Andrew Crotty (Carnegie Mellon University); Suman Karumuri (Slack); Nesime Tatbul (Intel Labs and MIT); Stan Zdonik (Brown University) 1:40 - 2:00 Darwin: Scale-In Stream Processing Lawrence Benson (Hasso Plattner Institute, University of Potsdam)*; Tilmann Rabl (HPI, University of Potsdam) 2:00 - 2:20 GRainDB: A Relational-core Graph-Relational DBMS Guodong Jin (Renmin University of China)*; Nafisa Anzum (University of Waterloo); Semih Salihoglu (University of Waterloo) 2:20 - 2:25 Amalur: Next-generation Data Integration in Data Lakes Rihan Hai (TU Delft)*; Christos Koutras (TU Delft); Andra Ionescu (TU Delft); Asterios Katsifodimos (TU Delft) 2:25 - 2:30 Photon: A High-Performance Query Engine for the Lakehouse Alexander Behm (Databricks); Shoumik Palkar (Databricks)*

2:30pm

Break

3:00pm

Session 3: Video Analytics

Chair: Jyoti Leeka (Room: Santa Cruz) 3:00 - 3:20 VOCAL: Video Organization and Interactive Compositional AnaLytics Maureen Daum (University of Washington)*; Enhao Zhang (University of Washington); Dong He (University of Washington); Magdalena Balazinska (UW); Brandon Haynes (Microsoft); Ranjay Krishna (Stanford University); Apryle Craig (University of Washington); Aaron Wirsing (University of Washington) 3:20 - 3:40 VIVA: An End-to-End System for Interactive Video Analytics Daniel Kang (Stanford University)*; Francisco Romero (Stanford University); Peter D Bailis (Stanford University); Christos Kozyrakis (Stanford University); Matei Zaharia (Stanford and Databricks) 03:40 - 03:50 Sponsor Talk by Google

5:30pm

Dinner

(Room: Sunset Restaurant)

7:30pm

Start-up Session

(Room: Santa Cruz) Chair: Chris Ré (Stanford) 7:30 - 8:30

Tuesday

7:00am

Breakfast

(Room: Sunset Restaurant)

8:45am

Keynote 2

(Room: Santa Cruz) Chair: Peter Boncz (CWI) 8:45 - 10:00 Title: The Diverse Challenges of Video Data Management Magdalena Balazinska (UW)

10:00am

Break

10:30am

Session 4: Cloud Computing

Chair: Florin Rusu (Room: Santa Cruz) 10:30 - 10:50 CompuCache: Remote Computable Caching using Spot VMs Qizhen Zhang (University of Pennsylvania)*; Philip A Bernstein (Microsoft Research); Daniel S Berger (Microsoft Research); Badrish Chandramouli (Microsoft Research); Vincent Liu (University of Pennsylvania); Boon Thau Loo (Univ. of Pennsylvania) 10:50 - 11:10 Self-Organizing Data Containers Samuel Madden (MIT)*; Jialin Ding (MIT); Tim Kraska (MIT); Sivaprasad Sudhir (MIT); David E Cohen (Intel); Timothy Mattson (Intel); Nesime Tatbul (Intel Labs and MIT) 11:10 - 11:30 Farview: Disaggregated Memory with Operator Off-loading for Database Engines Dario Korolija (ETH Zurich)*; Dimitrios Koutsoukos (ETHZ); Kimberly Keeton (Unaffiliated); Konstantin Taranov (ETH Zurich); Dejan Milojicic (Hewlett Packard Labs); Gustavo Alonso (ETHZ) 11:30 - 11:50 Decoupled Transactions: Low Tail Latency Online Transactions Atop Jittery Servers Pat Helland (Salesforce)* 11:50 - 11:55 A Network Use for Incomplete Knowledge Management Anduo Wang (Temple University)*; Fangping Lan (Temple University) 11:55 - 12:00 Correctness in Stream Processing: Challenges and Opportunities Caleb Stanford (University of Pennsylvania)*; Konstantinos Kallas (University of Pennsylvania); Rajeev Alur (University of Pennsylvania)

12:00pm

Lunch

(Room: Sunset Restaurant)

01:00pm

Session 5: Data Exploration

Chair: Yuanyuan Tian (Room: Santa Cruz) 1:00 - 1:20 Kyrix-J: Visual Discovery of Connected Datasets in a Data Lake Wenbo Tao (MIT)*; Adam Sah (Independent); Leilani Battle (University of Washington); Remco Chang (Tufts University); Michael Stonebraker (MIT) 1:20 - 1:40 Building a Shared Conceptual Model of Complex, Heterogeneous Data Systems: A Demonstration Michael R Anderson (University of Michigan); Yuze Lou (University of Michigan); Jiayun Zou (University of Michigan); Michael Cafarella (MIT)*; Sarah Chasins (University of California, Berkeley); Doug Downey (Allen Institute for Artificial Intelligence); Tian Gao (University of Michigan); Kexin Huang (University of Michigan); Dinghao Shen (University of Michigan); Jenny Vo-Phamhi (University of Michigan); Yitong Wang (University of Michigan); Yuning Wang (University of Michigan); Anna Zeng (MIT) 1:40 - 2:00 Knowledge Graph Exploration Systems: are we lost? Matteo Lissandrini (Aalborg University)*; Davide Mottin (Aarhus University); Katja Hose (Aalborg University); Torben Bach Pedersen (Aalborg University) 2:00 - 2:05 Data Management Opportunities for Foundation Models Laurel Orr (Stanford University)*; Karan Goel (Stanford); Christopher Re (Stanford University) 2:05 - 2:10 Towards NLP-Enhanced Data Profiling Tools Immanuel Trummer (Cornell)* 2:10 - 2:20 Sponsor Talk by Snowflake 2:20 - 2:30 Sponsor Talk by Oracle

2:30pm

Break

3:00pm

Session 6: Data Science

Chair: Carsten Binnig (Room: Santa Cruz) 3:00 - 3:20 DAPHNE: An Open and Extensible System Infrastructure for Integrated Data Analysis Pipelines Patrick Damme (Graz University of Technology & Know-Center GmbH)*; Marius Birkenbach (KAI); Constantinos Bitsakos (NTUA); Matthias Boehm (Graz University of Technology); Philippe Bonnet (IT Univ Copenhagen, Denmark); Florina M. Ciorba (Technical University of Dresden, Germany / University of Basel, Switzerland); Mark Dokter (Know-Center GmbH); Pawel Dowgiallo (Intel); Ahmed Eleliemy (University of Basel); Christian Faerber (Intel Corporation); Georgios Goumas (National Technical University of Athens); Dirk Habich (TU Dresden); Niclas Hedam (IT University of Copenhagen); Marlies Hofer (AVL List GmbH); Wenjun Huang (German Aerospace Center); Kevin Innerebner (Graz University of Technology); Vasileios Karakostas (National Technical University of Athens (NTUA)); Roman Kern (KNOW-CENTER GmbH); Tomaž Kosar (University of Maribor); Alexander Krause (TU Dresden); Daniel Krems (AVL List GmbH); Andreas Laber (Infineon); Wolfgang Lehner (TU Dresden); Eric Mier (TU Dresden); Marcus Paradies (German Aerospace Center); Bernhard Peischl (); Gabrielle Poerwawinata (University of Basel); Stratos Psomadakis (ICCS/NTUA); Tilmann Rabl (HPI, University of Potsdam); Piotr Ratuszniak (Intel Technology Poland); Pedro Silva (HPI, University of Potsdam); Nikolai Skuppin (German Aerospace Center (DLR)); Andreas Starzacher (Infineon); Benjamin Steinwender (KAI GmbH); Ilin Tolovski (Hasso Plattner Institute); Pinar Tozun (IT University of Copenhagen); Wojciech Ulatowski (Intel); Yuanyuan Wang (Technical University of Munich (TUM); German Aerospace Center (DLR)); Izajasz Wrosz (Intel); Aleš Zamuda (University of Maribor); Ce Zhang (ETH); Xiaoxiang Zhu (Technical University of Munich (TUM); German Aerospace Center (DLR) 3:20 - 3:40 Augmenting Decision Making via Interactive What-If Analysis Sneha Gathani (Sigma Computing)*; Madelon Hulsebos (Sigma Computing); James Gale (Sigma Computing); Peter Haas (University of Massachusetts Amherst); Cagatay Demiralp (Sigma Computing) 03:40 - 03:45 Towards Observability for Machine Learning Pipelines Shreya Shankar (University of California Berkeley)*; Aditya Parameswaran (University of California, Berkeley) 3:45 - 3:50 Screening Native Machine Learning Pipelines with ArgusEyes Sebastian Schelter (University of Amsterdam)*; Stefan Grafberger (University of Amsterdam); Shubha Guha (University of Amsterdam); Olivier Sprangers (University of Amsterdam); Bojan Karlas (ETH Zurich); Ce Zhang (ETH) 03:50 - 03:55 Making Table Understanding Work in Practice Madelon Hulsebos (Sigma Computing)*; Sneha Gathani (Sigma Computing); James Gale (Sigma Computing); Isil Dillig (UT Austin); Paul Groth (University of Amsterdam); Cagatay Demiralp (Sigma Computing) 3:55 - 4:00 Examples are All You Need: Iterative Data Discovery by Example in Data Lakes El Kindi Rezig (MIT)*; Anshul Bhandari (National Institute of Technology Hamirpur); Anna Fariha (Microsoft); Benjamin Price (MIT Lincoln Laboratory); Allan Vanterpool (United States Air Force); Andrew Bowne (United States Air Force); Lindsey McEvoy (U.S. Air Force AI Accelerator (SAF/AQ)); Vijay Gadepally (MIT Lincoln Laboratory)

5:30pm

Dinner

(Room: Sunset Restaurant)

7:30pm

Gong Show

(Room: Santa Cruz) Chair: Andy Pavlo (CMU) 7:30 - 8:30

Wednesday

7:00am

Breakfast

(Room: Sunset Restaurant)

08:30am

Session 7: Query Processing

Chair: Matthias Böhm (Room: Santa Cruz) 08:30 - 08:50 The 3D Hash Join: Building On Non-Unique Join Attributes Daniel Flachs (University of Mannheim)*; Magnus Mueller (University of Mannheim); Guido Moerkotte (University of Mannheim) 08:50 - 09:10 Memory Efficient Scheduling of Query Pipeline Execution Lukas Landgraf (TU Dresden)*; Wolfgang Lehner (TU Dresden); Florian Wolf (SAP SE); Alexander Boehm (SAP SE) 09:10 - 09:30 Introducing a Query Acceleration Path for Analytics in SQLite3 Martin Prammer (University of Wisconsin - Madison)*; Suryadev Sahadevan Rajesh (University of Wisconsin - Madison); Junda Chen (University of Wisconsin-Madison); Jignesh Patel (UW - Madison) 09:30 - 09:50 Accelerating Python UDFs in Vectorized Query Execution Steffen Kläbe (Actian Corp.)*; Robert DeSantis (Actian Corp); Stefan Hagedorn (TU Ilmenau); Kai-Uwe Sattler (TU Ilmenau) 09:50 - 10:10 Boosting Efficiency of External Pipelines by Blurring Application Boundaries Anna Herlihy (EPFL)*; Periklis Chrysogelos (EPFL); Anastasia Ailamaki (EPFL) 10:10 - 10:20 Sponsor Talk by SAP 10:20 - 10:30 Sponsor Talk by Microsoft

10:30am

Break

(Room: Sunset Restaurant)

11:00am

Session 8: ML and Query Optimization

Chair: Fatma Özcan (Google) (Room: Santa Cruz) 11:00 - 11:20 Workload-driven, Lazy Discovery of Data Dependencies for Query Optimization Jan Kossmann (Hasso Plattner Institute)*; Felix Naumann (Hasso Plattner Institute); Daniel Lindner (Hasso Plattner Institute); Thorsten Papenbrock (Philipps University of Marburg) 11:20 - 11:40 A Unified Transferable Model for ML-Enhanced DBMS Ziniu Wu (Massachusetts Institute of Technology)*; Pei Yu (); Peilun Yang (University of Technology Sydney); Rong Zhu (Alibaba Group); Yuxing Han (ByteDance); Yaliang Li (Alibaba Group); Defu Lian (University of Science and Technology of China); Kai Zeng (Alibaba Group); Jingren Zhou (Alibaba Group) 11:40 - 12:00 One Model to Rule them All: Towards Zero-Shot Learning for Databases Benjamin Hilprecht (TU Darmstadt)*; Carsten Binnig (TU Darmstadt) 12:00 - 12:20 Machine Learning, Linear Algebra, and More: Is SQL All You Need? Mark Blacher (Friedrich Schiller University Jena)*; Joachim Giesen (Friedrich Schiller University Jena); Sören Laue (Friedrich Schiller University Jena / Data Assessment Solutions GmbH Hannover); Julien Klaus (Friedrich Schiller University Jena); Viktor Leis (Friedrich-Alexander-Universität Erlangen-Nürnberg) 12:20 - 12:25 DataFarm: Farm Your ML-based Query Optimizer’s Food! – Human-Guided Training Data Generation – Robin P. van de Water (TU Berlin)*; Francesco Ventura (TU Berlin); Zoi Kaoudi (TU Berlin); Jorge Arnulfo Quiane Ruiz (TU Berlin); Volker Markl (Technische Universität Berlin) 12:25 - 12:30 Can Transfer Learning be used to build a Query Optimizer? Yunjia Zhang (University of Wisconsin-Madison)*; Yannis Chronis (University of Wisconsin Madison); Jignesh Patel (UW - Madison); Theodoros Rekatsinas (University of Wisconsin-Madison)

12:30pm

Lunch

(Room: Sunset Restaurant)