Use Excel to analyse the Boston House Price dataset

Crystal X
4 min readNov 4, 2024

The Boston House Price dataset was collected in the early 1970’s and published in 1978 by Harrison and Rubinfield in their paper entitled, “Hedonic Prices and the Demand for Clean Air”. This dataset has 506 entries that represent aggregated data about 14 features for homes in the various suburbs in Boston, Massachusetts.

I first became aware of this dataset when I was initially studying machine learning and using Python’s machine learning library, SciKit Learn. This dataset is no longer in the SciKit Learn website, and I believe that it is because it made reference to a certain demographic of the population.

I have been able to obtain a copy of the Boston House Prices dataset in Excel and feel that it would be a great idea to answer a few statistical questions concerning that datase.

Below is the Excel spreadsheet that I downloaded.

  1. It does not have any suburb names so I had to include an index for each suburb, which I placed in column N of the worksheet.
  2. On column P of the worksheet, I ranked the median values because that information will be crucial in the analysis.

--

--

Crystal X
Crystal X

Written by Crystal X

I have over five decades experience in the world of work, being in fast food, the military, business, non-profits, and the healthcare sector.