The following notebook presents a thought process of predicting a continuous variable through Machine Learning methods. More specifically, we want to predict house prices based on multiple features using regression analysis.
As an example, we will use a dataset of house sales in King County, where Seattle is located.
In the first part of the analysis, we set up the context using map visualization, and highlighted the association between the variables in our dataset.
This is, for example, a map of King County showing the average house price per zipcode. We can see the disparities between the different zipcodes. The location of the houses should play an important role in our regression model.