Python 3.6 Jupyter Notebook

Social analytics blueprint

This notebook contains advanced exercises that are only applicable to students who wish to deepen their understanding and qualify for bonus marks on this course. You will be able to achieve 100% for this notebook by completing Exercise 1. An optional, additional exercise (Exercise 2) can be completed to qualify for bonus marks.

Your completion of the notebook exercises will be graded based on your ability to do the following:

Understand: Do your comments show evidence that you recall and understand technical concepts?

Notebook objectives:

By the end of this notebook, you will be expected to understand how to create a blueprint for social physics experiments.

List of exercises:

  • Exercise 1: Create a blueprint for a big data and social analytics project.
  • Exercise 2 [Advanced]: Provide an overview of typical resource requirements, and Phase 0 activities for your selected project.

Notebook introduction

This notebook is intended to serve as a blueprint that you can use in your future endeavors. You are expected to provide a high-level blueprint of a potential study, because an in-depth blueprint is beyond the scope of this course. Exercise 1 will not be graded in detail, therefore one or two sentences for each of the numbered sections is sufficient.

Note: Providing more information regarding your interests, experience, and background will allow for better context during the assessment. However, this is optional and does not need to be provided.

The outline below is provided in an attempt to structure your thoughts. Keep in mind that typical studies, whether in academic or commercial applications, typically do not follow a straight path. As a result of this, the output is usually produced after many iterations, deviations, and potential changes in course. Living labs are longitudinal and adaptive by nature. In science, researchers uncover insights that may be different from what they had originally anticipated, and, in business, individuals may uncover new and unexpected business and improvement opportunities. You need to ensure that you spend enough time in the exploration phase of real-life projects, while ensuring that you deliver actionable results in the required timeframes. A good approach is to start with a minimum viable product, and add features to this product as your project matures.

Exercise 1 Start.

Instructions

Based on the content provided in Module 6's Notebook 1 (an overview of big data projects) and Module 7's Notebook 1 (an overview of the “Friends and Family” study), create a social analytics blueprint for your own topic of interest. Provide a short introductory paragraph stating your choice of industry and topic, and a description of your plan or expectations for each of the items listed below (in one or two sentences).

Note: Refer to the image of the scientific method, in Module 7's Notebook 1, as a refresher of the typical workflow observed when creating a social analytics blueprint and carrying out a big data project.

0. Introductory paragraph

Your introductory paragraph in the cell below, stating your choice of industry and topic.

Your markdown answer here.


In [ ]:

1. Make observations

Typically, this follows the research and context phase of the project plan. List the observations that you would like to study in your project.

Your markdown answer here.


In [ ]:

2. Think of interesting questions

List a few of the potential questions that you would like to answer in your project.

Your markdown answer here.


In [ ]:

3. Formulate a hypothesis

State one or two hypotheses that relate to the questions listed in the previous answer.

Your markdown answer here.


In [ ]:

4. Develop testable predictions

Briefly describe how you would develop testable predictions.

Your markdown answer here.


In [ ]:

5. Gather data to test predictions, and execute the analysis phase

You do not need to complete the execution aspect of this phase. Simply provide one or two observations on where you expect to source the required data, and execute the planned analysis.

Your markdown answer here.


In [ ]:

Once this phase is complete, you will likely have to:

6. Refine, alter, expand, or reject your hypothesis

Can you think of additional uses for (or other parties interested in) the data in your hypothetical use case? List them below.

Your markdown answer here.


In [ ]:

After you have completed this phase, you will need to repeat phases four and five of the scientific method. Alternatively, if you do not find a need to refine, alter, expand, or reject your hypothesis, you can proceed to the final phase, which is to:

7. Develop general theories

Assuming that your analysis was successful, and that you were able to develop a general theory to better understand a specific topic, what changes would you propose in order to benefit from the insights gained?

Your markdown answer here.


In [ ]:


Exercise 1 End.

Exercise complete:

This is a good time to "Save and Checkpoint".

Exercise 2 [Advanced] Start.

Instructions

Many of the concepts that you are required to perform in order to execute a project (from beginning to end) do not form part of the core focus of this course.

Based on your experience, and with reference to the project you outlined in Exercise 1, complete the following:

  1. Provide a brief overview of the people, processes, and technological aspects that you will need to address in order to exploit the value contained in the data set you identified. You may also consult Sivarajah et al. (2017) as well as Bernard Marr blog post "Where Big Data Projects Fail" for further information.
  2. Refer to the project checklist in M6_NB1, which lists the typical activities that make up the phases in a project plan. Phase 1 is covered to a large extent by Exercise 1. Provide a short description of the Phase 0 activities that you would have to address to motivate for, and initiate, the project in your environment (academic or commercial). Include one or two sentences on how you would plan for the project to be terminated.

Your markdown answer here.


In [ ]:


In [ ]:


Exercise 2 [Advanced] End.

Exercise complete:

This is a good time to "Save and Checkpoint".

Submit your notebook

Please make sure that you:

  • Perform a final "Save and Checkpoint";
  • Download a copy of the notebook in ".ipynb" format to your local machine using "File", "Download as", and "IPython Notebook (.ipynb)"; and
  • Submit a copy of this file to the Online Campus.

Note:

A model answer will not be provided for this notebook.

References

  1. Uthayasankar Sivarajah, Muhammad Mustafa Kamal, Zahir Irani, Vishanth Weerakkody. 2017. "Critical analysis of Big Data challenges and analytical methods." Journal of Business Research: 70:263-286. https://www.researchgate.net/publication/306051907_Critical_analysis_of_Big_Data_challenges_and_analytical_methods. (Open Access)
  2. Marr, Bernard. "Where big data projects fail." http://www.datasciencecentral.com/profiles/blogs/where-big-data-projects-fail. Last accessed: 25 January 2017.

In [ ]: