Logarithmic loss minimization leads to well-behaved probabilistic outputs.

Hinge loss leads to some (not guaranteed) sparsity in the dual, but it doesn't help with probability estimation. Instead, it punishes misclassifications (that's why it's so useful for determining margins): a diminishing hinge loss comes with fewer misclassifications across the margin.

So, summarizing:

  • Logarithmic loss ideally leads to better probability estimation at the cost of not actually optimizing for accuracy

  • Hinge loss ideally leads to better accuracy and some sparsity at the cost of not actually estimating probabilities

In ideal scenarios, each method would excel in its own domain (accuracy vs. probability estimation). However, due to the No-Free-Lunch theorem, it is not possible to know, a priori, whether the model choice is optimal.

Answer from Firebug on Stack Exchange
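
A small sketch of that contrast, using scikit-learn (the synthetic data and model settings below are my own illustrative choices, not part of the answer): a model trained with log loss exposes probability estimates, while a hinge-loss model exposes only margins.

    # Log loss (logistic regression) yields probability estimates;
    # hinge loss (linear SVM) yields only signed margins / class labels.
    from sklearn.datasets import make_classification
    from sklearn.linear_model import LogisticRegression
    from sklearn.svm import LinearSVC

    X, y = make_classification(n_samples=500, n_features=5, random_state=0)

    logreg = LogisticRegression().fit(X, y)                # minimizes log loss
    svm = LinearSVC(loss="hinge", dual=True).fit(X, y)     # minimizes hinge loss

    x_new = X[:3]
    print(logreg.predict_proba(x_new))    # P(y|x) estimates, a byproduct of log loss
    print(svm.decision_function(x_new))   # distances to the hyperplane, no probabilities
    # LinearSVC has no predict_proba: hinge loss does not model P(y|x).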
Baeldung (baeldung.com)
Differences Between Hinge Loss and Logistic Loss | Baeldung on Computer Science
February 28, 2025 - Secondly, while hinge loss produces a hyperplane separating the classes, it doesn't give us any information on how certain it is about the membership of the samples. Logistic loss, on the other hand, models the conditional probability and thus, the probability estimates are its byproducts.

Another answer from the same Stack Exchange thread

@Firebug had a good answer (+1). In fact, I had a similar question here.

What are the impacts of choosing different loss functions in classification to approximate 0-1 loss

I just want to add more on another big advantage of logistic loss: its probabilistic interpretation. An example can be found in UCLA - Advanced Research - Statistical Methods and Data Analysis - Computing Logit Regression | R Data Analysis Examples.

Specifically, logistic regression is a classical model in the statistics literature. (See What does the name "Logistic Regression" mean? for the naming.) There are many important concepts related to logistic loss, such as maximum likelihood estimation, likelihood ratio tests, and the binomial assumption. Here are some related discussions (a small numerical sketch of the likelihood view follows the links).

Likelihood ratio test in R

Why isn't Logistic Regression called Logistic Classification?

Is there i.i.d. assumption on logistic regression?

Difference between logit and probit models
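
As a small numerical sketch of that probabilistic interpretation (my own illustration, not taken from the links above): the average log loss minimized by logistic regression is exactly the negative mean Bernoulli log-likelihood, so fitting by minimizing log loss is maximum likelihood estimation.

    # Average log loss == negative mean Bernoulli log-likelihood.
    import numpy as np
    from sklearn.datasets import make_classification
    from sklearn.linear_model import LogisticRegression
    from sklearn.metrics import log_loss

    X, y = make_classification(n_samples=300, random_state=1)
    p = LogisticRegression().fit(X, y).predict_proba(X)[:, 1]   # P(y=1 | x)

    # Bernoulli log-likelihood of the observed labels under those probabilities
    loglik = np.mean(y * np.log(p) + (1 - y) * np.log(1 - p))

    print(log_loss(y, p))   # scikit-learn's (mean) log loss
    print(-loglik)          # identical up to floating point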

Discussions

logistic regression - What's the relationship between an SVM and hinge loss? - Data Science Stack Exchange (datascience.stackexchange.com)
My colleague and I are trying to wrap our heads around the difference between logistic regression and an SVM. Clearly they are optimizing different objective functions. Is an SVM as simple as sayin...
machine learning - What are the impacts of choosing different loss functions in classification to approximate 0-1 loss - Cross Validated (stats.stackexchange.com)
We know that some objective functions are easier to optimize and some are hard. And there are many loss functions that we want to use but are hard to use, for example 0-1 loss. So we find some proxy loss functions to do the work. For example, we use hinge loss or logistic ...
Is support vector machine just about simplifying logistic regression formula? If so, why this name?

No. The main difference between the cost functions is that cross-entropy loss (CEL) penalizes based on how far the prediction is from the answer. So if something is predicted with CEL as class 1 with probability 0.51 and it is actually class 1, it is penalized more strongly than if it had been predicted with probability 0.99; but with the hinge loss used in SVMs, a correct prediction that clears the margin incurs the same (zero) loss whether you barely clear it or predict with high confidence. However, both methods are penalized by "distance" when they predict the wrong answer.

r/learnmachinelearning (reddit.com), July 12, 2020
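
To put numbers on that example (my own arithmetic with the standard formulas; the margins used on the hinge side are illustrative assumptions):

    # Log loss for a correct class-1 prediction at p = 0.51 vs p = 0.99,
    # and hinge loss for correct predictions at different margins.
    import math

    for p in (0.51, 0.99):
        print(f"log loss at p={p}: {-math.log(p):.3f}")
    # ~0.673 at p=0.51 vs ~0.010 at p=0.99: low confidence on the right answer still costs a lot.

    def hinge(margin):           # margin = y * f(x), with y in {-1, +1}
        return max(0.0, 1.0 - margin)

    for m in (1.1, 5.0):         # both correct and beyond the margin
        print(f"hinge loss at margin {m}: {hinge(m):.3f}")
    # Both are 0: once a correct prediction clears the margin, extra confidence changes nothing.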
Why do we use log-loss in logistic regression instead of just taking the absolute difference between expected probability and actual value for each instance?
You can try it and see if it works 🤷‍♂️ Absolute error is usually avoided because it makes a "V"-shaped loss; sharp corners are bad in general for gradient-based optimization. It's the same reason we use MSE or RMSE instead of absolute error for regression tasks.

r/learnmachinelearning (reddit.com), April 26, 2023
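
A tiny sketch of that gradient behaviour (my own illustration of the standard derivatives, not from the thread): the absolute error pushes with the same force no matter how close the prediction is, and flips sign abruptly at the corner, while the log-loss gradient scales with how wrong the prediction is.

    # Gradients with respect to the predicted probability p, for a true label y = 1.
    import numpy as np

    y = 1.0
    p = np.array([0.2, 0.5, 0.9, 0.99])

    # Absolute error |y - p|: gradient is a constant -1 for every p < y,
    # and jumps to +1 for p > y -- the sharp "V" corner sits exactly at p = y.
    abs_grad = -np.sign(y - p)

    # Log loss -log(p): gradient -1/p is large when the prediction is badly wrong
    # and approaches -1 smoothly as p -> 1, with no corner.
    log_grad = -1.0 / p

    print(abs_grad)   # [-1, -1, -1, -1]
    print(log_grad)   # roughly [-5, -2, -1.11, -1.01]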

Wikipedia (en.wikipedia.org)
Loss functions for classification - Wikipedia
January 12, 2026 - The square loss function is both convex and smooth. However, the square loss function tends to penalize outliers excessively, leading to slower convergence rates (with regards to sample complexity) than for the logistic loss or hinge loss functions.

Techkluster (techkluster.com)
Differences Between Hinge Loss and Logistic Loss – TechKluster
Information Gain: Logistic loss can be interpreted as measuring the information gain when the predicted probability aligns with the true label. Support Vector Machines (SVMs): Hinge loss is a natural fit for SVMs, where maximizing the margin between classes is a primary objective.

AIML.com (aiml.com)
How does hinge loss differ from logistic loss? - AIML.com
March 26, 2023 - As can be seen in the graphs above, hinge loss is non-differentiable at the margin, which means that the optimization problem is no longer smooth. Logistic, or cross-entropy, loss does not suffer from such a problem and also allows for the computation of predicted probabilities rather than just class labels, ...

Yuan Du (yuan-du.com)
Loss Functions in Machine Learning and LTR | Yuan Du
August 10, 2022 - The binomial log-likelihood loss ... Hinge loss, exponential loss, and logistic loss have very similar tails, giving zero penalty to points well inside their margin and linear or exponential penalty to points on the wrong side and far away...

Stanford University (cs229.stanford.edu, PDF)
CS229 Supplemental Lecture Notes, John Duchi - Binary classification
... machine learning procedures; in particular, the logistic loss ϕ_logistic is logistic regression, the hinge loss ϕ_hinge gives rise to so-called support vector machines, and the exponential loss gives rise to the classical version of boosting, both of which we will explore in more depth ...

Medium (medium.com)
Understanding loss functions: Hinge loss | by Kunal Chowdhury | Analytics Vidhya | Medium
January 18, 2024 - Almost all classification models are based on some kind of loss. E.g., logistic regression has logistic loss (Fig 4: exponential), SVM has hinge loss (Fig 4: Support Vector), etc.

NISER (niser.ac.in, PDF)
Hinge Loss in Support Vector Machines - Chandan Kumar Sahu and Maitrey Sharma
February 7, 2023 - Figure: The support vector loss function (hinge loss), compared to the negative log-likelihood loss (binomial deviance) for logistic regression, squared-error loss, and a "Huberized" version of the squared hinge loss.

Quora (quora.com)
What are the advantages of hinge loss over log loss? - Quora
Answer: Hinge loss is easier to compute than log loss. Ditto for its derivative or subgradient. Hinge loss also induces sparsity in the solution, if the ML weights are a linear combination of the training observations.

Quora (quora.com)
What is the advantage/disadvantage of Hinge-loss compared to cross-entropy? - Quora
Answer (1 of 2): Cross Entropy (or Log Loss), Hinge Loss (SVM Loss), Squared Loss etc. are different forms of loss functions. Log Loss in the classification context gives Logistic Regression, while the Hinge Loss is Support Vector Machines. Logistic ...

DataCamp (datacamp.com)
Loss Functions in Machine Learning Explained | DataCamp
December 4, 2024 - When BCE is utilized as a component ... training. Hinge Loss is a loss function utilized within machine learning to train classifiers that optimize to increase the margin between data points and the decision boundary....

Analytics Vidhya (analyticsvidhya.com)
What is Hinge loss in Machine Learning?
December 23, 2024 - Hinge loss in machine learning, a key loss function in SVMs, enhances model robustness by penalizing incorrect or marginal predictions.

Kaggle (kaggle.com)
Hands-On Guide To Loss Functions

Number Analytics (numberanalytics.com)
Hinge Loss: The Ultimate Guide for ML Practitioners
June 11, 2025 - Hinge loss is particularly useful when the goal is to maximize the margin between classes, whereas logistic loss is more suitable for problems that require a probabilistic interpretation. Hinge loss is widely used in binary and multi-class classification problems.

Texas A&M University (people.tamu.edu, PDF)
A Unified View of Loss Functions in Supervised Learning - Shuiwang Ji
The loss function used in logistic regression is $\ell(y_i, s_i) = \log(1 + e^{-y_i s_i})$. Hinge loss (support vector machines): the support vector machines employ hinge loss to obtain a classifier ...

Rohanvarma (rohanvarma.me)
Picking Loss Functions - A comparison between MSE, Cross Entropy, and Hinge Loss
The MSE loss is therefore better suited to regression problems, and the cross-entropy loss provides us with faster learning when our predictions differ significantly from our labels, as is generally the case during the first several iterations of model training. We've also compared and contrasted the cross-entropy loss and hinge loss, and discussed how using one over the other leads to our models learning in different ways.

Wikipedia (en.wikipedia.org)
Hinge loss - Wikipedia
January 26, 2026 - The hinge loss is used for "maximum-margin" classification, most notably for support vector machines (SVMs). For an intended output t = ±1 and a classifier score y, the hinge loss of the prediction y is defined as $\ell(y) = \max(0, 1 - t \cdot y)$.
Top answer (1 of 3) to "What are the impacts of choosing different loss functions in classification to approximate 0-1 loss" (Cross Validated)

Some of my thoughts, may not be correct though.

I understand the reason we have such design (for hinge and logistic loss) is we want the objective function to be convex.

Convexity is surely a nice property, but I think the most important reason is we want the objective function to have non-zero derivatives, so that we can make use of the derivatives to solve it. The objective function can be non-convex, in which case we often just stop at some local optima or saddle points.

and interestingly, it also penalizes correctly classified instances if they are weakly classified. It is a really strange design.

I think such a design sort of advises the model to not only make the right predictions, but also be confident about those predictions. If we don't want correctly classified instances to get punished, we can, for example, move the hinge loss (blue) to the left by 1, so that they no longer incur any loss. But I believe this often leads to worse results in practice.

what are the prices we need to pay by using different "proxy loss functions", such as hinge loss and logistic loss?

IMO, by choosing different loss functions we are bringing different assumptions into the model. For example, the logistic regression loss (red) assumes a Bernoulli distribution, while the MSE loss (green) assumes Gaussian noise.


Following the least squares vs. logistic regression example in PRML, I added the hinge loss for comparison.

As shown in the figure, hinge loss and logistic regression / cross entropy / log-likelihood / softplus have very close results, because their objective functions are close (figure below), while MSE is generally more sensitive to outliers. Hinge loss does not always have a unique solution because it's not strictly convex.

However, one important property of hinge loss is that data points far away from the decision boundary (on the correct side of the margin) contribute nothing to the loss, so the solution will be the same with those points removed.

In the context of SVMs, the remaining points are called support vectors. The SVM additionally uses a regularizer term to ensure the maximum-margin property and a unique solution.
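
As a quick numerical check of that support-vector property (a sketch using scikit-learn; the blob dataset and C value are my own illustrative choices): refitting a linear SVM on its support vectors alone should reproduce essentially the same decision boundary.

    # Points outside the margin carry zero hinge loss and zero dual weight,
    # so dropping them should not change the fitted hyperplane.
    from sklearn.datasets import make_blobs
    from sklearn.svm import SVC

    X, y = make_blobs(n_samples=200, centers=2, random_state=0)

    full = SVC(kernel="linear", C=1.0).fit(X, y)
    sv = full.support_                                   # indices of the support vectors
    refit = SVC(kernel="linear", C=1.0).fit(X[sv], y[sv])

    print("full fit: w =", full.coef_[0], " b =", full.intercept_[0])
    print("SV only:  w =", refit.coef_[0], " b =", refit.intercept_[0])
    # The two solutions should agree up to solver tolerance.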

Another answer (2 of 3) to the same question

Posting a late reply, since there is a very simple answer which has not been mentioned yet.

what are the prices we need to pay by using different "proxy loss functions", such as hinge loss and logistic loss?

When you replace the non-convex 0-1 loss function by a convex surrogate (e.g. hinge loss), you are now actually solving a different problem than the one you intended to solve (which is to minimize the number of classification mistakes). So you gain computational tractability (the problem becomes convex, meaning you can solve it efficiently using tools of convex optimization), but in the general case there is actually no way to relate the error of the classifier that minimizes a "proxy" loss to the error of the classifier that minimizes the 0-1 loss. If what you truly cared about was minimizing the number of misclassifications, I argue that this really is a big price to pay.

I should mention that this statement is worst-case, in the sense that it holds for any distribution $\mathcal D$. For some "nice" distributions, there are exceptions to this rule. The key example is of data distributions that have large margins w.r.t the decision boundary - see Theorem 15.4 in Shalev-Shwartz, Shai, and Shai Ben-David. Understanding machine learning: From theory to algorithms. Cambridge university press, 2014.
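
One standard fact worth adding here (a supplement of mine, not part of the original answer): the hinge loss does at least upper-bound the 0-1 loss pointwise, since $\mathbf{1}[y f(x) \le 0] \le \max(0, 1 - y f(x))$, and therefore

$$ \mathbb{E}\,\mathbf{1}[Y f(X) \le 0] \;\le\; \mathbb{E}\,\max(0, 1 - Y f(X)). $$

So driving the surrogate risk down does control the misclassification rate from above; the catch, as this answer explains, is that the minimizer of the bound need not be the minimizer of the 0-1 risk itself.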