Let’s take a look of Seattle AirBnB Open Data using Python

Which features are most related to homestay cost? How do seasonal costs change? What is different between superhost and regular host? Let’s get the answers by analyzing the data in python

Photo by MILKOVÍ on Unsplash


Seattle AirBnB Open Data describes the listing activity of AirBnB homestays in Seattle, WA until 2016. It is consists of 3 files

How do seasonal costs change?

Below graphs show the mean, quantile prices by months.

  1. Average prices are highest in summer and increase slightly in December. Perhaps summer holidays, year-end events, and Christmas were influential.
  2. Prices fluctuate mainly at high price points, and cheap rooms do not change even when the season changes.

Which features are most related to homestay cost?

By using `pandas.DataFrame.corr` can easily do correlation analysis. Let’s show the heatmap of correlation matrix.

Correlations related to reviews

Correlations related to bathrooms, accommodates, bedrooms, beds

What is different between superhost and regular host?

  1. Superhosts almost slightly outperformed all score of reviews. However, there are far more reviews written. It could be because more customers are staying at accommodation on the superhost, or it could be because the superhost encourages customers to actively write reviews.
  2. Another difference is that on average the superhost has fewer rooms than the rest of the hosts.
  3. Even though the average number of beds, toilets, and number of people in a room operated by superhosts are slightly smaller There is a shorter minimum/maximum period for customers to stay, and more guests can be included. But the difference is not big.

Assuming that the higher the number of reviews, the more customers it attracts, the more likely a superhost will have other factors that can attract more customers. For example, they may have been running a homestay for a long time, or their location may be better.

Date the host started hosting

AirBnB superhost do not requires long hosting period. Host can become a superhost even if hosting period is less than 12 months. Hosting for a long time doesn’t mean host will become a superhost.

Location differences between superhosts and regular hosts

It was not possible to identify nearby facilities with the given data. So I marked the host’s location on the map. Red is superhost, blue is regular host.

It can be seen that there is not a big difference in location.

(FIXED — My analysis on this point was performed incorrectly, So I wrote there is a lot of differences so modified the article..… really sorry… 😢 🙏🙏🙏🙏🙏🙏)


  1. Accommodation prices increase in summer and December and are lowest at the beginning of the year. However, the lower the price, the less the volatility.
  2. The size of the accommodation has a big impact on the price. Price and number of reviews have a weak negative correlation.
  3. The biggest distinction between the superhost and the rest is the number of reviews. The rating is slightly higher than the rest, but it is about the same level.

2020.12.8 ~ 2022.6.9 군복무중 Serving in the South Korean Military Service

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store