Please use this identifier to cite or link to this item: https://hdl.handle.net/10316/100598
Title: 3iCubing: An Interval Inverted Index Approach to Data Cubes
Authors: Domingues, Marco
Silva, Rodrigo Rocha 
Bernardino, Jorge 
Keywords: Big data; data cube; inverted index; OLAP
Issue Date: 2022
Serial title, monograph or event: IEEE Access
Volume: 10
Abstract: The increase in the amounts of information used to analyze data is problematic since the memory necessary to store and process it is getting quite big. The interval inverted index representation was developed to reduce the required memory to store data, and Frag-Cubing is one of the most popular algorithms. In this paper, we propose two new data cubing algorithms: 3iCubing and M3iCubing. 3iCubing is a Frag-Cubing-based algorithm that uses the interval inverted index representation, while M3iCubing uses both a normal and interval inverted index data representation. The algorithms were compared using synthetic and real data sets in indexation and querying operations, both runtime and memory-wise. The experimental evaluation shows that 3iCubing can considerably reduce the memory needed to index a data set, reducing around 25% of the memory used by Frag-Cubing. Moreover, the results show that the interval inverted index representation is dependent on the data skewness to reduce the memory consumption, having positive results with highly skewed and real-world data sets.
URI: https://hdl.handle.net/10316/100598
ISSN: 2169-3536
DOI: 10.1109/ACCESS.2022.3142449
Rights: openAccess
Appears in Collections:I&D CISUC - Artigos em Revistas Internacionais

Files in This Item:
Show full item record

SCOPUSTM   
Citations

1
checked on Apr 15, 2024

WEB OF SCIENCETM
Citations

1
checked on Apr 2, 2024

Page view(s)

71
checked on Apr 16, 2024

Download(s)

28
checked on Apr 16, 2024

Google ScholarTM

Check

Altmetric

Altmetric


This item is licensed under a Creative Commons License Creative Commons