diff --git a/docs/notes/predictive-modeling/classification/index.qmd b/docs/notes/predictive-modeling/classification/index.qmd index d1ed891..7d19f27 100644 --- a/docs/notes/predictive-modeling/classification/index.qmd +++ b/docs/notes/predictive-modeling/classification/index.qmd @@ -3,6 +3,9 @@ ## Classification Objectives +A classification problem is a supervised learning task, where we have the labels. + +In **classification**, the variable we are trying to predict is categorical or discrete (e.g. whether or not an applicant will default on a loan). ## Classification Models @@ -15,7 +18,6 @@ Classification Models: ## Classification Metrics - Classification Metrics: + Accuracy diff --git a/docs/notes/predictive-modeling/ml-foundations/index.qmd b/docs/notes/predictive-modeling/ml-foundations/index.qmd index 431f033..ae656ec 100644 --- a/docs/notes/predictive-modeling/ml-foundations/index.qmd +++ b/docs/notes/predictive-modeling/ml-foundations/index.qmd @@ -47,6 +47,7 @@ Example supervised learning tasks include: + **Classification**, where the variable we are trying to predict is categorical or discrete (e.g. whether or not an applicant will default on a loan). + ### Unsupervised Learning In contrast, **unsupervised learning** deals with data that lacks labeled outcomes. The model is tasked with finding patterns or groupings in the data without any explicit guidance. While supervised learning focuses on predicting specific outcomes, unsupervised learning seeks to uncover hidden structures or relationships within the data. @@ -76,6 +77,8 @@ Machine learning problem formulation refers to the process of clearly defining t + **Evaluation Metrics**: Establishing how the model's success will be measured. This could involve metrics like accuracy, precision, recall for classification problems, or r-squared or mean squared error for regression problems. +![Illustration of Mean Squared Error (MSE), a regression metric.](../../../images/mse-eq.png) + Proper problem formulation ensures that the right machine learning approach is chosen and that the model development process is aligned with the business or research objectives. diff --git a/docs/notes/predictive-modeling/regression/index.qmd b/docs/notes/predictive-modeling/regression/index.qmd index 40fea45..1e51dc2 100644 --- a/docs/notes/predictive-modeling/regression/index.qmd +++ b/docs/notes/predictive-modeling/regression/index.qmd @@ -2,7 +2,9 @@ ## Regression Objectives +A regression problem is a supervised learning task, where we have the labels. +In a **regression**, the variable we are trying to predict is continuous (e.g. housing prices). ## Regression Models @@ -13,10 +15,8 @@ Regression Models: + Lasso Regression + etc. - ## Regression Metrics - Regression Metrics: + R^2 Score @@ -24,3 +24,5 @@ Regression Metrics: + Mean Absolute Error + Root Mean Square Error + etc. + +![Illustration of Mean Squared Error (MSE).](../../../images/mse-eq.png)