SHAP Library in Python
Every profession has its own unique toolbox, full of items that are essential to its work. Painters have their brushes and canvas…
Oct 31, 2020

K-Prototypes clustering — for when you’re clustering continuous and categorical data
Clustering is one of the most popular types of unsupervised machine learning. Clustering techniques allow us to group data objects into…
Oct 23, 2020

Data exploration made easy — subplots in Matplotlib
Doing more work with less effort is the name of the game in coding. For data scientists, this is a huge advantage to using tools like…
Oct 17, 2020

Bayesian vs Frequentist approach to finding probability
The comparison between the way things are and the way things ought to be is one that is made frequently. Good ice cream should be…
Oct 9, 2020

Data transformation in ML — Standardization vs Normalization
Many things in life come in a variety of shapes, sizes, flavors, etc. It is this variety that is said to be “the spice of life”…
Oct 1, 2020

Connecting to SQL Server in Python
“SQL is an important tool for any data scientist” — is my entry for the understatement of the year.
Sep 26, 2020

Chi-square Test for Feature Reduction in Python
“There are two kinds of people in this world…”…
Sep 16, 2020

Parametric vs non-parametric statistical tests in Python
Once one has a good understanding of the data they have to work with, they next need to decide what they aim to answer with this…
Sep 6, 2020

Levels of Data Measurement
It’s 10pm — do you know what Level of Measurement your Data is?
Aug 30, 2020

Hypertune your process before you hypertune your parameters:
Why it is important to know what your end goal is…
Jun 5, 2020