Bernabé-Díaz, J.A., Franco, M., Vivo, J.-M., Quesada-Martínez, M., Fernández-Breis, J.T. (2022) “An automated process for supporting decisions in clustering-based data analysis”, Computer Methods and Programs in Biomedicine, 219:106765

José Antonio Bernabé-Díaz, Jesualdo T. Fernández-Breis (Dept. Informática y Sistemas, Universidad de Murcia, IMIB-Arrixaca), Manuel Franco, Juana-María Vivo (Dept. Statistics and Operations Research, University of Murcia, IMIB-Arrixaca) and Manuel Quesada (Operations Research Center, University Miguel Hernández of Elche)

Abstract

Background and objective: Biomedical researchers and practitioners commonly use metrics to measure and evaluate properties of individuals, instruments, models, methods, or datasets. Because there is no standardized procedure for validating a metric, it is often assumed that a metric appropriate for analyzing one dataset in a domain will also be appropriate for other datasets in the same domain. Such generalizability cannot be taken for granted, however, since the behavior of a metric can vary across scenarios. The objective of this paper is to study that behavior, so that a metric's reliability can be assessed before any conclusion is drawn about biomedical datasets.

Methods: We present a method to support the evaluation of the behavior of quantitative metrics on datasets. The approach assesses a metric through clustering-based data analysis and supports the decision-making process for choosing the optimal classification. It applies two important criteria from unsupervised classification validation, both calculated on the clusterings generated by the metric: the stability and the goodness of the clusters. Our evaluomeR tool makes the method readily available to biomedical researchers.

Results: The analytical power of the method is shown by applying it to analyze (1) the behavior of the impact factor metric for a series of journal categories; (2) which structural metrics provide a better partitioning of the content of a repository of biomedical ontologies; and (3) the sources of heterogeneity in effect size metrics of biomedical primary studies.

Conclusions: Statistical properties such as the stability and goodness of classifications allow for a useful analysis of the behavior of quantitative metrics, which can support decisions about which metrics to apply to a given dataset.
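To make the two criteria concrete, here is a minimal Python sketch of how a single metric could be scored by the stability and goodness of the clusterings it induces. It is an illustration under stated assumptions, not the evaluomeR implementation or its API: it assumes k-means on the metric's raw values, the average silhouette width as the goodness measure, and a bootstrap Jaccard index as the stability measure. The abstract names the criteria but not these specific choices. All function names and the toy values are hypothetical.

```python
import random
from statistics import mean


def kmeans_1d(values, k, iters=100):
    """Lloyd's algorithm on scalar values; returns one cluster label per value."""
    ordered = sorted(values)
    # Deterministic start: centers taken at evenly spaced ranks of the sorted values.
    centers = [ordered[(2 * i + 1) * len(ordered) // (2 * k)] for i in range(k)]
    labels = []
    for _ in range(iters):
        new_labels = [min(range(k), key=lambda c: abs(v - centers[c])) for v in values]
        if new_labels == labels:
            break  # assignments no longer change
        labels = new_labels
        for c in range(k):
            members = [v for v, lab in zip(values, labels) if lab == c]
            if members:
                centers[c] = mean(members)
    return labels


def silhouette(values, labels):
    """Goodness (assumed measure): average silhouette width, higher is better."""
    clusters = {}
    for v, lab in zip(values, labels):
        clusters.setdefault(lab, []).append(v)
    if len(clusters) < 2:
        return 0.0
    widths = []
    for v, lab in zip(values, labels):
        own = clusters[lab]
        if len(own) == 1:
            widths.append(0.0)  # usual convention for singleton clusters
            continue
        a = sum(abs(v - u) for u in own) / (len(own) - 1)
        b = min(mean([abs(v - u) for u in other])
                for key, other in clusters.items() if key != lab)
        widths.append((b - a) / max(a, b) if max(a, b) > 0 else 0.0)
    return mean(widths)


def jaccard_stability(values, k, n_boot=50, seed=0):
    """Stability (assumed measure): mean best-match Jaccard index between the
    original clusters and the clusters found on bootstrap resamples."""
    rng = random.Random(seed)
    n = len(values)
    base = kmeans_1d(values, k)
    base_clusters = [{i for i in range(n) if base[i] == c} for c in range(k)]
    scores = []
    for _ in range(n_boot):
        idx = [rng.randrange(n) for _ in range(n)]  # resample with replacement
        boot_labels = kmeans_1d([values[i] for i in idx], k)
        boot_clusters = [{i for i, lab in zip(idx, boot_labels) if lab == c}
                         for c in range(k)]
        drawn = set(idx)
        for orig in base_clusters:
            kept = orig & drawn  # compare only points present in the resample
            if kept:
                scores.append(max(len(kept & b) / len(kept | b)
                                  for b in boot_clusters if b))
    return mean(scores)


# Hypothetical metric values with two visibly separated groups.
values = [0.8, 0.9, 1.0, 1.1, 1.2, 5.0, 5.2, 5.4, 5.5, 5.8]
good2 = silhouette(values, kmeans_1d(values, 2))
good3 = silhouette(values, kmeans_1d(values, 3))
stab2 = jaccard_stability(values, 2)
print(f"k=2 goodness={good2:.3f} stability={stab2:.3f}; k=3 goodness={good3:.3f}")
```

Comparing these scores across candidate values of k, and across metrics, is one way to support the kind of decision the abstract describes: a metric whose clusterings are both stable under resampling and well separated is a more reliable basis for partitioning a dataset.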

N. Allouch, Luis A. Guardiola, & A. Meca (2024). Measuring productivity in networks: A game-theoretic approach. Socio-Economic Planning Sciences, 91, 101783.

N. Allouch (University of Kent – School of Economics), Luis A. Guardiola (Departamento de Métodos Cuantitativos para la Economía y Empresa, Universidad de Murcia), Ana Meca (I.U. Centro de Investigación Operativa, Universidad Miguel Hernández de Elche) Abstract: Measuring individual productivity (or equivalently distributing the overall productivity) in a network structure of workers displaying peer effects has been [...]

Scientific Articles

Lola Fernández-Gómez, José A. Sánchez-Zapata, José A. Donázar, Xavier Barber, & Jomar M. Barbosa (2024). Ecosystem productivity drives the breeding success of an endangered top avian scavenger in a changing grazing pressure context. Science of The Total Environment, 910, 168553.

Lola Fernández-Gómez (Department of Applied Biology, Centro de Investigación e Innovación Alimentaria, Universidad Miguel Hernández de Elche), José A. Sánchez-Zapata (Department of Applied Biology, Centro de Investigación e Innovación Alimentaria, Universidad Miguel Hernández de Elche), José A. Donázar (Department of Conservation Biology, Estación Biológica de Doñana, CSIC), Xavier Barber (Operations Research Center, University Miguel Hernández [...]

Scientific Articles

Juan F. Monge, & José L. Ruiz (2023). Setting closer targets based on non-dominated convex combinations of Pareto-efficient units: A bi-level linear programming approach in Data Envelopment Analysis. European Journal of Operational Research, 311, 1084-1096.

Juan F. Monge (Operations Research Center, University Miguel Hernández of Elche) and José L. Ruiz (Operations Research Center, University Miguel Hernández of Elche) Abstract: Data Envelopment Analysis (DEA) very often sets unrealistic targets, which require from the decision-making units (DMUs) a huge amount of effort, perhaps non-assumable, for their achievement. For the identification of best practices in [...]


Published by salonso on 26 April 2022
