There is a clear linear association between the variables (r = 0.7), indicating a strong positive relationship. sqft_living should be a good predicator of house price. (note: sqft_living distribution is also skewed to the right)
Let's do the same with the 7 remaining continuous variables:
sqft_lot
sqft_above (i.e., sqft_above = sqft_living - sqft_basement)
sqft_basement
sqft_living15, the average house square footage of the 15 closest neighbours
sqft_lot15, the average lot square footage of the 15 closest neighbours
yr_built
yr_renovated
lat
long