Get the latest tech news

Adventures in Imbalanced Learning and Class Weight


Finally turning that stone

I set up a rudimentary imbalanced classification pipeline with scikit-learn ’s make_classification and DecisionTreeClassifier, and created an empirical version of the above plot, using class_sep as a proxy for the tradeoff curve. As for why the plot looks different for bigger values of \(\alpha\), my hunch is that the tradeoff curve isn’t symmetric, allowing the classifier to get a decent recall without sacrificing precision entirely. After publishing the post it’s been pointed out to me that there are tutorials that specifically demonstrate how inverse proportion weighting (or stratified under- / oversampling, which is pretty equivalent) improves imbalanced classification performance.

Get the Android app

Or read this on Hacker News

Read more on:

Photo of class

class

Photo of Adventures

Adventures

Photo of weight

weight

Related news:

News photo

Numerical Linear Algebra Class in Julia TUM

News photo

It's School time: Adventures in hacking an old Kindle

News photo

TSMC mulls massive 1000W-class multi-chiplet processors with 40X the performance of standard models | A 9.5x reticle size SiP on a massive substrate.