Logarithmic loss minimization leads to well-behaved probabilistic outputs.

Hinge loss leads to some (not guaranteed) sparsity in the dual, but it doesn't help with probability estimation. Instead, it penalizes misclassifications (that's why it's so useful for determining margins): a diminishing hinge loss comes with fewer points falling on the wrong side of the margin.

So, summarizing:

  • Logarithmic loss ideally leads to better probability estimation at the cost of not actually optimizing for accuracy

  • Hinge loss ideally leads to better accuracy and some sparsity at the cost of not actually estimating probabilities

In ideal scenarios, each respective method would excel in their domain (accuracy vs probability estimation). However, due to the No-Free-Lunch Theorem, it is not possible to know, a priori, if the model choice is optimal.

Answer from Firebug on Stack Exchange
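The trade-off summarized above can be checked numerically; here is a minimal sketch in plain Python (the margin value of 3.0 is made up for illustration):

```python
import math

def hinge_loss(margin):
    """Hinge loss on the margin m = y * f(x), with labels y in {-1, +1}."""
    return max(0.0, 1.0 - margin)

def logistic_loss(margin):
    """Logarithmic (logistic) loss on the same margin: log(1 + exp(-m))."""
    return math.log1p(math.exp(-margin))

# A confidently correct prediction (margin = 3): the hinge loss is
# exactly zero, so the point no longer influences training (the source
# of dual sparsity), while the logistic loss is small but never zero,
# so it keeps shaping the probability estimates even for points that
# are already well classified.
print(hinge_loss(3.0))     # 0.0
print(logistic_loss(3.0))  # ~0.049
```

A point inside the margin (say, margin 0.5) is penalized by both losses, which is why both still drive the classifier toward correct classification.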
Related results:

  • Quora, "What is the advantage/disadvantage of Hinge-loss compared to cross-entropy?": Cross entropy (or log loss), hinge loss (SVM loss), squared loss, etc. are different forms of loss function. Log loss in the classification context gives logistic regression, while the hinge loss gives support vector machines.

  • Medium, "A comparison between MSE, Cross Entropy, and Hinge Loss" (October 2, 2023): the main difference between the hinge loss and the cross entropy loss is that the former arises from trying to maximize the margin between the decision boundary and the data points, thus attempting to ensure that each point is correctly and confidently classified.

  • Cross Validated, "hinge loss vs logistic loss advantages and disadvantages/limitations" (April 14, 2015), with the related questions "Is there a good illustrative example where the hinge loss (SVM) gives a higher accuracy than the logistic loss?" and "Disadvantages of cross entropy loss compared to SVM loss".

  • Cross Validated, "softmax+cross entropy compared with square regularized hinge loss for CNNs" (October 15, 2021): squared regularized hinge loss can be transformed into dual form to induce a kernel and find the support vectors.

  • r/learnmachinelearning, "Why do we use log-loss in logistic regression instead of just taking the absolute difference between expected probability and actual value for each instance?" (April 26, 2023): absolute error is usually avoided because it makes a "V"-shaped gradient, and sharp corners are bad in general for gradient-based optimization; this is the same reason MSE or RMSE is used instead of absolute error for regression tasks.

  • Topcoder, "Concepts of Loss Functions - What, Why and How": hinge loss is easier to compute than the cross-entropy loss, and it is faster to train via gradient descent, since a lot of the time the gradient is 0 and the weights do not have to be updated.

  • Medium, "Understanding Perceptron Loss Function, Hinge Loss, Binary Cross Entropy, and the Sigmoid Function" (July 24, 2024).

  • Baeldung, "Differences Between Hinge Loss and Logistic Loss" (February 28, 2025).

  • rohanvarma.me, "Picking Loss Functions - A comparison between MSE, Cross Entropy, and Hinge Loss".
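The Topcoder observation above (the hinge gradient is exactly zero for points classified beyond the margin, so no weight update is needed for them) can be verified with a small sketch in plain Python; the weight vector and data points below are made up:

```python
def hinge_subgradient(w, x, y):
    """Subgradient of max(0, 1 - y * <w, x>) with respect to w."""
    margin = y * sum(wi * xi for wi, xi in zip(w, x))
    if margin >= 1.0:
        # Correctly classified beyond the margin: zero subgradient,
        # so gradient descent performs no update for this point.
        return [0.0] * len(w)
    # Inside the margin or misclassified: a nonzero update.
    return [-y * xi for xi in x]

w = [1.0, 2.0]
print(hinge_subgradient(w, [2.0, 0.0], +1))  # beyond margin -> [0.0, 0.0]
print(hinge_subgradient(w, [2.0, 0.0], -1))  # misclassified -> [2.0, 0.0]
```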

A second answer from the same Stack Exchange thread:

@Firebug had a good answer (+1). In fact, I had a similar question here.

What are the impacts of choosing different loss functions in classification to approximate 0-1 loss

I just want to add one more big advantage of logistic loss: its probabilistic interpretation. An example can be found in UCLA's "Logit Regression | R Data Analysis Examples".

Specifically, logistic regression is a classical model in the statistics literature. (See "What does the name 'Logistic Regression' mean?" for the naming.) There are many important concepts related to logistic loss, such as maximum likelihood estimation, likelihood ratio tests, and the binomial assumptions. Here are some related discussions.

Likelihood ratio test in R

Why isn't Logistic Regression called Logistic Classification?

Is there i.i.d. assumption on logistic regression?

Difference between logit and probit models
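The probabilistic interpretation mentioned above amounts to maximizing the Bernoulli log-likelihood of the labels, which is exactly minimizing the logarithmic loss. A minimal sketch in plain Python, on made-up 1-D data:

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

# Made-up 1-D data: the label tends to be 1 when x is positive.
data = [(-2.0, 0), (-1.0, 0), (-0.5, 0), (0.5, 1), (1.0, 1), (2.0, 1)]

# Gradient ascent on the Bernoulli log-likelihood
#   sum_i [ y_i * log p_i + (1 - y_i) * log(1 - p_i) ],
# with p_i = sigmoid(w * x_i); its gradient in w is
#   sum_i (y_i - p_i) * x_i.
w, lr = 0.0, 0.1
for _ in range(1000):
    w += lr * sum((y - sigmoid(w * x)) * x for x, y in data)

# Unlike a raw SVM score, the fitted model outputs a probability for
# each x, which is the "well-behaved probabilistic output" of log loss.
print(sigmoid(w * 2.0))   # close to 1
print(sigmoid(w * -2.0))  # close to 0
```

(The toy data is linearly separable, so w would keep growing slowly without regularization; a fixed number of steps is enough for illustration.)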

Find elsewhere
  • DataMonje, "A Beginner's Guide to Loss Functions for Classification Algorithms" (November 18, 2022): hinge loss is a specific loss function used by support vector machines (SVM).

  • Milvus, "What are some common loss functions?": cross-entropy is ideal for probabilistic classification, while hinge loss suits models aiming for clear decision boundaries.

  • Medium (Analytics Vidhya), "Overview of loss functions for Machine Learning" (February 18, 2021): hinge loss is used for support vector machines and classifies with -1 and 1 rather than 0 and 1.

  • ScienceDirect, "Hinge Loss Function - an overview": the hinge loss encourages the network to maximize the margin around the decision boundary separating the two classes, which can lead to better generalization performance than using cross-entropy.

  • CS231n, "Linear Classification": to be precise, the SVM classifier uses the hinge loss, also sometimes called the max-margin loss, while the Softmax classifier uses the cross-entropy loss.

  • MachineLearningMastery, "How to Choose Loss Functions When Training Deep Learning Neural Networks" (August 25, 2020): an alternative to cross-entropy for binary classification problems is the hinge loss function.

  • DataCamp, "Loss Functions in Machine Learning Explained" (December 4, 2024): loss is a mathematical quantification of the difference between a model's prediction and the actual target value; cross entropy, a term from information theory, measures the difference between two probability distributions.

  • Wikipedia, "Loss functions for classification": the hinge loss is quite attractive, as bounds can be placed on the difference between the expected risk and the sign of the hinge loss function.

  • Quora, "Why do people prefer Cross Entropy Loss to Hinge Loss in classification task?"