Unlike T271799 having individual regression for each component. In this task, we’ll use all 15 components (title, caption, etc.) as input, fit a logistic regression with these scores as features and ratings as target variable.
It seems there are a lot sparse features, so we’ll check how to deal the logistic regression with sparse features.
Output will be information about the accuracy for the model with different metrics (f1 scores, balanced accuracy, etc) and the coefficients for the individual component.