Estimating the uncertainty of average F1 scores

Part of book or chapter of book English OPEN
Zhang, Dell; Wang, J.; Zhao, X.

In multi-class text classification, the performance (effectiveness) of a classifier is usually measured by micro-averaged and macro-averaged F1 scores. However, the scores themselves do not tell us how reliable they are in terms of forecasting the classifier's future pe...