Powered by OpenAIRE graph
Found an issue? Give us feedback
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/ ZENODOarrow_drop_down
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/
ZENODO
Report
Data sources: ZENODO
addClaim

Adaptation of Tabular Evaluation Metrics for Benchmarking Multimodal Generative Models and Cross-Domain Generalization

Authors: Assignee Research;

Adaptation of Tabular Evaluation Metrics for Benchmarking Multimodal Generative Models and Cross-Domain Generalization

Abstract

Generative models have revolutionized multiple domains, yet their application to tabular data remains underexplored. Evaluating generative models for tabular data presents unique challenges due to structural complexity, large-scale variability, and mixed data types, making it difficult to intuitively capture intricate patterns. Existing evaluation metrics offer only partial insights, lacking a comprehensive measure of generative performance. To address this limitation, we propose three novel evaluation metrics: FAED, FPCAD, and RFIS. Our extensive experimental analysis, conducted on three stanResearch goal: Can the proposed tabular data evaluation metrics be adapted for benchmarking multimodal generative models (e.g., combining tabular and text data), and how does their performance compare to domain-specific metrics in terms of robustness and cross-domain generalization?Autonomous synthesis report generated by Assignee Research. Tribunal consensus score: 8.0/10.

Powered by OpenAIRE graph
Found an issue? Give us feedback