SUNDAY JANUARY 4, 2015

4PM REGISTRATION OPENS (ASILOMAR CHECK-IN)

6PM DINNER (SEASCAPE RESTAURANT)

7PM INFORMAL EVENING GET-TOGETHER  (CHAPEL)

MONDAY JANUARY 5 , 2015

8:30AM-10:15AM  SESSION 1: REAL-TIME DATA (CHAIR: R. JOHNSON)

JUST-IN-TIME DATA VIRTUALIZATION: LIGHTWEIGHT DATA MANAGEMENT

WITH VIDA (PDF) (SLIDES)

Manos Karpathiotakis (EPFL);  Ioannis Alagiannis (EPFL);  Thomas Heinis (EPFL & Imperial College); Miguel Branco (EPFL);  Anastasia Ailamaki (EPFL) 9_CIDR15_Slides_Paper9.pdf

JUST-IN-TIME DATA STRUCTURES (PDF) (SLIDES)

Oliver Kennedy (University at Buffalo); Lukasz Ziarek (University at Buffalo)

 

DATA STREAM WAREHOUSING IN TIDALRACE (PDF) 

Theodore Johnson (AT&T Labs – Research);  Vladislav Shkapenyuk (AT&T Labs – Research) 

 

LIQUID: UNIFYING NEARLINE AND OFFLINE BIG DATA INTEGRATION (PDF) 

Raul Castro Fernandez (Imperial College); Peter Pietzuch (Imperial College); Jay Kreps (Confluent Inc.); Neha Narkhede (Confluent Inc.);

Jun Rao (Confluent Inc.); Joel Koshy (LinkedIn Inc.);  Dong Lin (LinkedIn Inc.); Chris Riccomini (LinkedIn Inc.); Guozhang Wang (LinkedIn Inc.)

 

10:15AM-10:45AM: BREAK

10:45AM-12:00PM Session 2: COLUMN STORES (CHAIR: I. PANDIS)  (SLIDES)

IMPALA: A MODERN, OPEN-SOURCE SQL ENGINE FOR HADOOP (PDF) (SLIDES)

Marcel Kornacker (Cloudera); Alexander Behm (Cloudera);  Victor Bittorf (Cloudera);  Taras Bobrovytsky (Cloudera); Casey Ching (Cloudera); Alan Choi (Cloudera); Justin Erickson (Cloudera);  Martin Grund (Cloudera); Daniel Hecht (Cloudera); Matthew Jacobs (Cloudera);  Ishaan Joshi (Cloudera);  Lenni Kuff (Cloudera); Dileep Kumar (Cloudera); Alex Leblang (Cloudera); Nong Li (Cloudera); Ippokratis Pandis (Cloudera); Henry Robinson (Cloudera); David Rorke (Cloudera); Silvius Rus (Cloudera); John Russell (Cloudera); Dimitris Tsirogiannis (Cloudera); Skye Wanderman-Milne (Cloudera); Michael Yoder (Cloudera)

 

SHILPA LAWANDE et al.: VERTICA (SLIDES)

 

12:00PM-1:15PM LUNCH

1:15PM-2:30PM Session 3: DATA-DRIVEN SCIENCE (CHAIR: H. KUNO)

HEPHAESTUS: DATA REUSE FOR ACCELERATING SCIENTIFIC DISCOVERY (PDF)

Jennie Duggan (Northwestern EECS); Michael L. Brodie (MIT CSAIL)

 

DATAHUB: COLLABORATIVE DATA SCIENCE & DATASET VERSION MANAGEMENT

AT SCALE (PDF) (SLIDES)

Anant Bhardwaj (MIT);  Souvik Bhattacherjee (U. Maryland); Amit Chavan (U. Maryland); Amol Deshpande(U. Maryland); Aaron J. Elmore (MIT & U. Chicago); Samuel Madden (MIT); Aditya Parameswaran (MIT & U. Illinois)

 

BUILDING HIGHLY-OPTIMIZED, LOW-LATENCY PIPELINES FOR GENOMIC DATA

ANALYSIS  (PDF)

Yanlei Diao (University of Massachusetts Amherst); Abhishek Roy (University of Massachusetts Amherst); Toby Bloom (New York Genome Center)

 

2:30PM-2:45PM BREAK

2:45PM-4:00PM  Session 4: DATA INTEGRATION (CHAIR: M. STONEBRAKER)  (SLIDES)

APPLYING WEBTABLES IN PRACTICE (PDF)

Sreeram Balakrishnan (Google Research); Alon Halevy (Google Research);  Boulos Harb (Google Research); Hongrae Lee (Google Research); Jayant Madhavan (Google Research); Afshin Rostamizadeh (Google Research); Warren Shen (Google Research); Kenneth Wilder (Google Research); Fei Wu (Google Research); Cong Yu (Google Research)

 

FINDING QUALITY IN QUANTITY: THE CHALLENGE OF DISCOVERING VALUABLE SOURCES FOR INTEGRATION  (PDF) (SLIDES)

Theodoros Rekatsinas (University of Maryland);  Xin Luna Dong (Google Inc.); Lise Getoor (UC Santa Cruz); Divesh Srivastava (AT&T Labs-Research)

LOOKING AT EVERYTHING IN CONTEXT (PDF) (SLIDES) 

Zachary G. Ives (University of Pennsylvania); Zhepeng Yan(University of Pennsylvania);  Nan Zheng(University of Pennsylvania);  Brian Litt (University of Pennsylvania); Joost B. Wagenaar (University of Pennsylvania)

 

4:00PM-4:15PM: BREAK

4:15PM-5:30PM Session 5: STORAGE SYSTEMS (CHAIR: A. AILAMAKI)

INSTANT RECOVERY FOR MAIN MEMORY DATABASES  (PDF) (SLIDES)

Ismail Oukid (Technische Universität Dresden & SAP SE); Wolfgang Lehner (Technische Universität Dresden); Thomas Kissinger (Technische Universität Dresden); Thomas Willhalm (Intel GmbH); Peter Bumbulis (SAP SE)

INVISIBLE GLUE: SCALABLE SELF-TUNNING MULTI-STORES (PDF)  (SLIDES) 

Francesca Bugiotti (INRIA & U. Paris-Sud); Damian Bursztyn (INRIA & U. Paris-Sud); Alin Deutsch (UC San Diego); Ioana Ileana (Telecom ParisTech); Ioana Manolescu (INRIA & U. Paris-Sud)

 

IMMUTABILITY CHANGES EVERYTHING (PDF)

Pat Helland (Salesforce.com)

 

5:30PM-7:15PM BREAK AND DINNER

7:15PM  GONG SHOW (CHAIR: Y. DIAO)

Tracking Personal Data Use: Provenance and Trust

Lucja Kot (Cornell University) (PDF)

 

XCloud: Extensible Performance Management for  Cloud Data Services

Olga Papaemmanouil (Brandeis University) (PDF)

 

The Case for Small Data Management (PDF)

Jens Dittrich (Saarland University)

 

Modelling the Evolving World (PDF)

Anja Gruenheid (ETH)

 

The Big Data - Same Humans Problem  (PDF)

Alexandros Labrinidis (University of Pittsburgh)

 

Smoothing non-uniform communication latencies for OLTP (PDF)

Danica Porobic (EPFL)

 

Synthesizing Data Programs

Mike Cafarella (University of Michigan) (PDF)

 

Mastering Situation Awareness: The Next Frontier? (PDF)

Dieter Gawlick (Oracle)

 

Breathing Life into Database Textbooks (PDF)

Arnab Nandi (The Ohio State  University)

 

Big Data Space Fungus (PDF)

Martin Kersten (CWI)

 

Heisenberg Was on the Write Track (PDF) (SLIDES)

Patrick Helland (Salesforce.com)

 

A Multicore Database is not a Distributed System (PDF)

Neha Narula (MIT CSAIL)

 

Towards Generating Application-Specific Data Management Systems (PDF)

Alvin Cheung (University of Washington)

 

Robust Data Transformations (PDF)

Alekh Jindal (MIT CSAIL)

 

Data Visualization Management Systems (PDF)

Eugene Wu (MIT CSAIL)

 

Data, Technology and the Challenges of Abundance

Joseph Hellerstein (UC Berkeley) (PDF)

 

Towards Systematic Data Center Design (PDF)

Avrilia Floratou (IBM Almaden)

 

Desiderata for a Big Data Language (PDF)

David Maier (Portland State University)

 

Big Data Analytics with Access Control

Johannes Gehrke (Microsoft)

 

The Case for Invariant-Based Concurrency Control  (PDF)

Peter Bailis (UC Berkeley)

 

Big Data Science Needs Big Data Middleware (PDF)

Bill Howe  (University of Washington)

 

Verdict: A System for Stochastic Query Planning (PDF)

Barzan Mozafari (University of Michigan)

 

 

TUESDAY, JANUARY 6, 2015

 

8:30AM-9:30AM: KEY NOTE TALK BY RAVI RAJVAR (INTEL): SPECIALIZED EVOLUTION OF THE GENERAL PURPOSE CPU  (PDF)

9:30AM-10:00AM BREAK      

10:00AM-11:45PM  Session 6 QUERY PROCESSING (UNDER NEW ASSUMPTIONS) (CHAIR: S. FINKELSTEIN)  (SLIDES)

THE CASE AGAINST SPECIALIZED GRAPH ANALYTICS ENGINES (PDF) (SLIDES)

Jing Fan (University of Wisconsin); Adalbert Gerald Soosai Raj (University of Wisconsin); Jignesh M. Patel (University of Wisconsin)

FPGA-BASED MULTITHREADING FOR IN-MEMORY HASH JOINS  (PDF) (SLIDES)

Robert J. Halstead (University of California);  Ildar Absalyamov (University of California);  Walid A. Najjar (University of California); Vassilis J. Tsotras (University of California)

 

RAISING AUTHORIZATION AWARENESS IN A DBMS (PDF)

Abhijeet Mohapatra (Stanford University); Ravi Ramamurthy (Microsoft Research);

Raghav Kaushik (Microsoft Research)

 

TUPLEWARE: ''BIG'' DATA, BIG ANALYTICS, SMALL CLUSTERS (PDF)

Andrew Crotty (Brown University);  Alex Galakatos (Brown University);

Kayhan Dursun (Brown University); Tim Kraska (Brown University);

Ugur Cetintemel (Brown  University); Stan Zdonik  (Brown University)

 

 

11:45AM-1:15PM LUNCH

1:15PM-3:00PM  Session 7: CLOUD (CHAIR: A. FEKETE)   (SLIDES)

CHANGING THE FACE OF DATABASE CLOUD SERVICES WITH PERSONALIZED

SERVICE LEVEL AGREEMENTS (PDF) (SLIDES)

Jennifer Ortiz (University of Washington);  Victor Teixeira de Almeida (University of Washington & PETROBRAS S.A.); Magdalena Balazinska (University of Washington)

 

DATABASE OPTIMIZATION IN THE CLOUD: WHERE COSTS, PARTIAL RESULTS,

AND CONSUMER CHOICE MEET (PDF)

Willis Lang (Microsoft Gray Systems Lab); Rimma V. Nehme (Microsoft Gray Systems Lab);

Ian Rae (Microsoft Gray Systems Lab)

 

WANALYTICS: ANALYTICS FOR A GEO-DISTRIBUTED DATA-INTENSIVE WORLD (PDF) (SLIDES)

Ashish Vulimiri (UIUC); Carlo Curino (Microsoft); Brighten Godfrey (UIUC); Konstantinos Karanasos (Microsoft); George Varghese (Microsoft)

 

HIGH PERFORMANCE TRANSACTIONS IN DEUTERONOMY  (PDF) (SLIDES)

Justin Levandoski (Microsoft Research); David Lomet (Microsoft Research); Sudipta Sengupta (Microsoft Research);  Ryan Stutsman (Microsoft Research); Rui Wang (Microsoft Research)

 

3:00PM-6:00PM: FREE TIME

6:00PM-7:15PM DINNER

7:15PM-8:30PM  Session 8:  DATA INTEGRATION II (CHAIR: T. KRASKA)

AN INFORMATION PROVIDER’S WISH LIST FOR A NEXT GENERATION

BIG DATA END-TO-END INFORMATION SYSTEM  (PDF)

Mona M. Vernon (Thomson Reuters); Brian Ulicny (Thomson Reuters); Dan Bennett (Thomson Reuters)

 

DATAXFORMER: LEVERAGING THE WEB FOR SEMANTIC TRANSFORMATIONS

(PDF) (SLIDES)

Ziawasch Abedjan (CSAIL MIT); John Morcos (University of Waterloo); Michael Gubanov (CSAIL MIT); Ihab F. Ilyas (University of Waterloo); Michael Stonebraker(CSAIL MIT);  Paolo Papotti (Qatar Computing Research Institute);

Mourad Ouzzani (Qatar Computing Research Institute)

 

PREDICTIVE INTERACTION FOR DATA TRANSFORMATION  (PDF)

Jeffrey Heer (U. Washington & Trifacta Inc.); Joseph M. Hellerstein (UC Berkeley & Trifacta Inc.);  Sean Kandel (Trifacta Inc.)

8:30pm-8:45pm GONG SHOW AWARDS CEREMONY (CHAIR: YANLEI DIAO)

 

WEDNESDAY, JANUARY 7, 2015

8:30AM-10:15AM Session 9: MODEL MANAGEMENT (CHAIR: C. RE)

MANAGEMENT OF FLEXIBLE SCHEMA DATA IN RDBMSs

OPPORTUNITIES UND LIMITATIONS FOR NOSQL – (PDF) (SLIDES)  

Zhen Hua Liu (Oracle Corporation); Dieter Gawlick (Oracle Corporation)

 

CAPTURING THE LAWS OF (DATA) NATURE (PDF) (SLIDES)

Hannes Mühleisen (CWI, Amsterdam); Martin Kersten (CWI, Amsterdam); Stefan Manegold (CWI, Amsterdam)

 

COMBINING DATABASES AND SIGNAL PROCESSING IN PLATO (PDF)

Yannis Katsis (UC San Diego); Yoav Freund (UC San Diego);  Yannis Papakonstantinou (UC San Diego)

 

THE MISSING PIECE IN COMPLEX ANALYTICS: LOW LATENCY, SCALABLE MODEL MANAGEMENT AND SERVING WITH VELOX  (PDF)

Daniel Crankshaw (UC Berkeley AMPLab); Peter Bailis (UC Berkeley AMPLab);  Joseph E. Gonzalez(UC Berkeley AMPLab);  Haoyuan Li (UC Berkeley AMPLab);  Zhao Zhang (UC Berkeley AMPLab);  Michael J. Franklin (UC Berkeley AMPLab);  Ali Ghodsi (UC Berkeley AMPLab);  Michael I. Jordan (UC Berkeley AMPLab)

 

10:15AM-10:45AM BREAK

10:45AM-12:00PM Session 10:  KNOWLEDGE MANAGEMENT (CHAIR: M. CAFARELLA)

YAGO3: A KNOWLEDGE BASE FROM MULTILINGUAL WIKIPEDIAS (PDF) (SLIDES)

Farzaneh Mahdisoltani (Max Planck Institute); Joanna Biega (Max Planck Institute); Fabian M. Suchanek (Télécom ParisTech)

 

MANAGING GENERAL AND INDIVIDUAL KNOWLEDGE IN CROWD MINING APPLICATIONS (PDF) (SLIDES)  

Yael Amsterdamer (Tel Aviv University);  Susan B. Davidson (University of Pennsylvania); Anna Kukliansky (Tel Aviv University);  Tova Milo (Tel Aviv University);  Slava Novgorodov (Tel Aviv University);  Amit Somech (Tel Aviv University)

 

DATA WRANGLING: THE CHALLENGING YOURNEY FROM THE WILD TO THE LAKE (PDF) (SLIDES)

Ignacio Terrizzano (IBM Research);  Peter Schwarz (IBM Research);  Mary Roth (IBM Research);  John E. Colino IBM Research)