Zach ZazuetaSHAP Library in PythonEvery profession has their unique toolbox, full of items that are essential to their work. Painters have their brushes and canvas…4 min read·Oct 31, 2020--1--1
Zach ZazuetaK-Prototypes clustering — for when you’re clustering continuous and categorical dataClustering is one of the most popular types of unsupervised machine learning. Clustering techniques allow us to group data objects into…7 min read·Oct 23, 2020--2--2
Zach ZazuetaData exploration made easy — subplots in MatplotlibDoing more work with less effort is the name of the game in coding. For data scientists, this is a huge advantage to using tools like…4 min read·Oct 17, 2020----
Zach ZazuetaBayesian vs Frequentist approach to finding probabilityThe comparison between the way things are and the way things ought to be is one that is made frequently. Good ice cream should be…3 min read·Oct 9, 2020----
Zach ZazuetaData transformation in ML — Standardization vs NormalizationMany things in life come in a variety of shapes, sizes, flavors, etc. It is this variety that is said to be “the spice of life”…4 min read·Oct 1, 2020----
Zach ZazuetaConnecting to SQL Server in Python“SQL is an important tool for any data scientist” — is my entry for the understatement of the year.4 min read·Sep 26, 2020----
Zach ZazuetaChi-square Test for Feature Reduction in Python“There are two kinds of people in this world…”…7 min read·Sep 16, 2020--1--1
Zach ZazuetaParametric vs non-parametric statistical tests in PythonOnce one has a good understanding of the data they have to work with, they next need to decide what they aim to answer with this…5 min read·Sep 6, 2020----
Zach ZazuetaLevels of Data MeasurementIt’s 10pm — do you know what Level of Measurement your Data is?5 min read·Aug 30, 2020----
Zach ZazuetaHypertune your process before you hypertune your parameters:Why it is important to know what your end goal is…5 min read·Jun 5, 2020----