1 Data Source: Amsterdam availability data scraped from AirBnB on December 24th. Question: What are the popular neighborhoods in Amsterdam?

The data does not show the most popular neighborhoods:

It is biased, as it only shows information from AirBnB.

It is affected by an exceptional circumstance: the 24th of December.

Solution:

I would either scrape the AirBnB website for more data (across the whole year) and/or include other websites, or I would reframe the question to:

What are the most popular AirBnB neighborhoods at Christmas?
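
To illustrate why a single scrape date can mislead, here is a minimal sketch that ranks neighborhoods by the share of listings that are booked, first for one date only and then averaged over several dates. Everything in it is an assumption made for illustration: the record layout, the function name `rank_by_mean_booked_share`, the placeholder neighborhood names, and the numbers, which are invented and are not real AirBnB data. Note also that "not available" is only a rough proxy for "booked", since hosts can block dates themselves.

```python
from collections import defaultdict

# Hypothetical snapshots: (scrape_date, neighborhood, listings_total, listings_available).
# All values are invented for illustration; they are not real AirBnB figures.
snapshots = [
    ("dec-24", "neighborhood_a", 100, 10),
    ("dec-24", "neighborhood_b", 100, 40),
    ("jul-15", "neighborhood_a", 100, 50),
    ("jul-15", "neighborhood_b", 100, 20),
    ("mar-10", "neighborhood_a", 100, 60),
    ("mar-10", "neighborhood_b", 100, 30),
]


def rank_by_mean_booked_share(rows, dates=None):
    """Rank neighborhoods by booked share, averaged over the selected dates.

    If `dates` is None, every snapshot is used.
    """
    shares = defaultdict(list)
    for date, hood, total, available in rows:
        if dates is not None and date not in dates:
            continue
        if total == 0:
            continue  # avoid dividing by zero for empty neighborhoods
        shares[hood].append((total - available) / total)
    means = {hood: sum(values) / len(values) for hood, values in shares.items()}
    # Highest mean booked share first.
    return sorted(means.items(), key=lambda item: item[1], reverse=True)


christmas_only = rank_by_mean_booked_share(snapshots, dates={"dec-24"})
all_year = rank_by_mean_booked_share(snapshots)

print("Christmas only:", christmas_only)
print("All dates:     ", all_year)
```

With these made-up numbers the ranking flips: `neighborhood_a` leads on the Christmas snapshot, while `neighborhood_b` leads once the other dates are included.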

2 Data Source: Mental health services use on September 12, 2001 in San Francisco, CA and New York City, NY. Question: How do patterns of mental health service use vary between cities?

The data cannot answer the question, as it would be heavily biased by the 9/11 attacks in New York the day before.

Solution:

I would reframe the question to: What was the impact of the 9/11 attacks on the mental health of New York's population compared to other cities in the US?

3 Data Source: Armenian Pub Survey. Question: What are the most common reasons Armenians visit local pubs?

The survey only contains data about Armenian pubs and the people who visit them, who are not necessarily Armenian. It does not suit the question we are trying to answer.

Solution:

Reframe the question to: What are the most common reasons why people visit Armenian pubs?

Or, if the survey includes information about the origin of the visitors:

What are the most common reasons why people of nationality X visit Armenian pubs?
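
If the survey does record the visitors' origin, the reframed question could be answered with a simple filtered count. The sketch below assumes hypothetical field names (`nationality`, `reason`) and invented responses; the real survey's columns and values may differ.

```python
from collections import Counter

# Hypothetical survey rows; field names and answers are invented for illustration.
responses = [
    {"nationality": "Armenian", "reason": "meet friends"},
    {"nationality": "Armenian", "reason": "live music"},
    {"nationality": "Armenian", "reason": "meet friends"},
    {"nationality": "Other", "reason": "try local beer"},
]


def top_reasons(rows, nationality=None, n=3):
    """Return the n most common reasons, optionally for a single nationality."""
    counts = Counter(
        row["reason"]
        for row in rows
        if nationality is None or row["nationality"] == nationality
    )
    return counts.most_common(n)


armenian_reasons = top_reasons(responses, nationality="Armenian")
all_reasons = top_reasons(responses)

print(armenian_reasons)
print(all_reasons)
```

Calling `top_reasons` without a nationality answers the broader question about all visitors to Armenian pubs.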

