Skip to main content

Array Databases: Big Data Cubes in SQL

Prof. Dr. Peter Baumann, Jacobs University, Bremen, Germany

Date: 09-Jan-2017 , 4pm

Abstract

Array Databases close a gap in the database ecosystem by adding modeling, storage, and processing support on multi-dimensional arrays. Declarative array query languages provide on-demand analytics on massive arrays while allowing internal optimization, parallelization, and data distribution.

Such "datacubes" appear as spatio-temporal sensor, image, simulation, and statistics data in all science and engineering domains, and beyond. For example, 2-D satellite imagery, 2-D x/y/t image timeseries and x/y/z geophysical voxel data, and 4-D x/y/z/t climate data contribute to today's data deluge in the Earth sciences. Virtual observatories in the Space sciences routinely generate Petabytes of such data. Life sciences deal with microarray data, confocal microscopy, human brain data. In business, financial risk analysis is an array-intensive application. Some researchers even consider matrices a suitable paradigm for the processing of large graphs.

We present and discuss concepts and implementation of the pioneer Array DBMS, rasdaman, which is in operational use on 100+ TB databases. Further, we present applications and datacube standards rasdaman has shaped, such as the ISO SQL/MDA (Multi-Dimensional Arrays) candidate standard.

About the Presenter

Dr. Peter Baumann is Professor of Computer Science at Jacobs University, Bremen, Germany where he researches on scalable multi-dimensional array databases and their application in science and engineering. With his work on algebra, query languages, and efficient architectures culminating in the rasdaman array DBMS he has coined the research field of array databases. He has published 100+ book chapters and journal and conference articles, holds international patents on array database technology, and has received numerous international innovation awards for his work. The rasdaman technology is in operational use on 250+ TB array databases. In 2014, rasdaman has been ranked sole winner of the Big Data Challenge posed by T-Systems as part of the Copernicus Master competition.

Peter Baumann is active, often leading contributor to standardization in the Open Geospatial Consortium (OGC) and ISO bodies, being editor of a series of adopted international standards. In OGC he is chairing the "Big Earth Data" working groups and is editor of a dozen adopted standards, among them the WCS suite which recently has been adopted by the European legal framework for a common Spatial Data nfrastructure, INSPIRE. In ISO, he has initiated and is leading development of the forthcoming SQL / MDA standard which extends SQL with n-D arrays, and on modernizing the ISO Big Geo Data standards. In the Research Data Alliance (RDA), he co-chairs the Big Data and Geospatial Interest Groups as well as the Array Database Assessment Working Group. In 2014, OGC has honored his contribution to Big Data standardization with its prestigious Kenneth Gardels Award.

See www.peter-baumann.org for more information.

 

Location: TU Berlin, E-N, 7th floor (DIMA), Room: E-N 719