Answer: Using Planet Express customer data from January 3001-3005, determine how likely previous customers are to request a repeat delivery using demographic information (profession, company size, location) and previous delivery data (days since last delivery, number of total deliveries).
Answer: There were 525 deliveries in our dataset. There were 25 observations with missing data that were dropped from this analysis. The final analytic sample was 500. This data was collected from January 3001-3005.
There were three common professions- Account Manager, Warehouse Manger, and Alien Intake. All others were combined into a fourth category: Other.
Similarily there were 4 locations in this data set which had 20 or more deliveries; they were included in this analysis while all others were grouped into the "Other" category. "Days since last delivery" and "number of deliveries" are continuous variables ranging from 0-360 days and 1-100 deliveries, respectively.
| Variable | Description | Type of Variable |
|---|---|---|
| Profession | Title of the account owner | categorical |
| Company Size | 1- small, 2- medium, 3- large | categorical |
| Location | planet of the company | categorical |
| Days Since Last Delivery | integer | continuous |
| Number of Deliveries | integer | continuous |
Mean (STD) or counts for 2 of the 4 variables
| Variable | Mean (STD) or Frequency (%) |
|---|---|
| Number of Deliveries | 50.0 (10) |
| Earth | 50 (10%) |
| Amphibios 9 | 100 (20%) |
| Bogad | 100 (20%) |
| Colgate 8 | 100 (20%) |
| Other | 150 (30%) |
Answer: We completed a logistic regression using Statsmodels v. XX. We calculated the probability of a customer placing another order with Planet Express.
Customers from large companies had 2.0 (CI 1.9, 2.1) the odds of of placing another order with Planet Express compared to customers from small companies.
Our findings indicate that customers have a higher probability of returning if they are from a large company. Next steps could include exploring the difference by statisfaction levels, as measured by a survey.