Please use this identifier to cite or link to this item: https://hdl.handle.net/10316/113871
Title: On the accuracy of code complexity metrics: A neuroscience-based guideline for improvement
Authors: Hao, Gao
Hijazi, Haytham 
Durães, João
Medeiros, Julio 
Couceiro, Ricardo
Lam, Chan Tong
Teixeira, César A. 
Castelhano, João 
Castelo-Branco, Miguel 
Carvalho, Paulo de 
Madeira, Henrique 
Keywords: code complexity metrics; code comprehension; EEG; cognitive load; mental effort; code refactoring; code constructs
Issue Date: 2022
Publisher: Frontiers Media S.A.
Project: This work was funded in part by the BASE (Biofeedback Augmented Software Engineering) project under Grant POCI-01-0145-FEDER-031581, by the Centro de Informática e Sistemas da Universidade de Coimbra (CISUC), and in part by Coimbra Institute for Biomedical Imaging and Translational Research (CIBIT), Institute of Nuclear Sciences Applied to Health (ICNAS), and the University of Coimbra under Grant PTDC/PSI-GER/30852/2017 | CONNECT-BCI.
Serial title, monograph or event: Frontiers in Neuroscience
Volume: 16
Abstract: Complexity is the key element of software quality. This article investigates the problem of measuring code complexity and discusses the results of a controlled experiment to compare different views and methods to measure code complexity. Participants (27 programmers) were asked to read and (try to) understand a set of programs, while the complexity of such programs is assessed through different methods and perspectives: (a) classic code complexity metrics such as McCabe and Halstead metrics, (b) cognitive complexity metrics based on scored code constructs, (c) cognitive complexity metrics from state-of-the-art tools such as SonarQube, (d) human-centered metrics relying on the direct assessment of programmers' behavioral features (e.g., reading time, and revisits) using eye tracking, and (e) cognitive load/mental effort assessed using electroencephalography (EEG). The human-centered perspective was complemented by the subjective evaluation of participants on the mental effort required to understand the programs using the NASA Task Load Index (TLX). Additionally, the evaluation of the code complexity is measured at both the program level and, whenever possible, at the very low level of code constructs/code regions, to identify the actual code elements and the code context that may trigger a complexity surge in the programmers' perception of code comprehension difficulty. The programmers' cognitive load measured using EEG was used as a reference to evaluate how the different metrics can express the (human) difficulty in comprehending the code. Extensive experimental results show that popular metrics such as V(g) and the complexity metric from SonarSource tools deviate considerably from the programmers' perception of code complexity and often do not show the expected monotonic behavior. 
The article summarizes the findings in a set of guidelines to improve existing code complexity metrics, particularly state-of-the-art metrics such as cognitive complexity from SonarSource tools.
URI: https://hdl.handle.net/10316/113871
ISSN: 1662-4548
DOI: 10.3389/fnins.2022.1065366
Rights: openAccess
Appears in Collections: I&D CIBIT - Artigos em Revistas Internacionais
I&D ICNAS - Artigos em Revistas Internacionais
I&D CISUC - Artigos em Revistas Internacionais

This item is licensed under a Creative Commons License.