In [1]:
# Configure Jupyter so figures appear in the notebook
%matplotlib inline
# Configure Jupyter to display the assigned value after an assignment
%config InteractiveShell.ast_node_interactivity='last_expr_or_assign'
# import functions from the modsim.py module
from modsim import *
# set the random number generator
np.random.seed(7)
This notebook presents a solution to an exercise from Modeling and Simulation in Python. It uses features from the first four chapters to answer a question related to queueing theory, which is the study of systems that involve waiting in lines, also known as "queues".
Suppose you are designing the checkout area for a new store. There is room for two checkout counters and a waiting area for customers. You can make two lines, one for each counter, or one line that serves both counters.
In theory, you might expect a single line to be better, but it has some practical drawbacks: in order to maintain a single line, you would have to install rope barriers, and customers might be put off by what seems to be a longer line, even if it moves faster.
So you'd like to check whether the single line is really better and by how much. Simulation can help answer this question.
As we did in the bikeshare model, we'll assume that a customer is equally likely to arrive during any timestep. I'll denote this probability using the Greek letter lambda, $\lambda$, or the variable name lam
. The value of $\lambda$ probably varies from day to day, so we'll have to consider a range of possibilities.
Based on data from other stores, you know that it takes 5 minutes for a customer to check out, on average. But checkout times are highly variable: most customers take less than 5 minutes, but some take substantially more. A simple way to model this variability is to assume that when a customer is checking out, they have the same probability of finishing up during each time step. I'll denote this probability using the Greek letter mu, $\mu$, or the variable name mu
.
If we choose $\mu=1/5$, the average number of time steps for each checkout will be 5 minutes, which is consistent with the data.
In [2]:
# Solution goes here
Test this function by creating a System
object with lam=1/8
and mu=1/5
.
In [3]:
# Solution goes here
Write an update function that takes as parameters x
, which is the total number of customer in the store, including the one checking out; t
, which is the number of minutes that have elapsed in the simulation, and system
, which is a System
object.
If there's a customer checking out, it should use flip
to decide whether they are done. And it should use flip
to decide if a new customer has arrived.
It should return the total number of customers at the end of the time step.
In [4]:
# Solution goes here
Test your function by calling it with x=1
, t=0
, and the System
object you created. If you run it a few times, you should see different results.
In [5]:
# Solution goes here
Now we can run the simulation. Here's a version of run_simulation
that creates a TimeSeries
with the total number of customers in the store, including the one checking out.
In [6]:
def run_simulation(system, update_func):
"""Simulate a queueing system.
system: System object
update_func: function object
"""
x = 0
results = TimeSeries()
results[0] = x
for t in linrange(0, system.duration):
x = update_func(x, t, system)
results[t+1] = x
return results
Call run_simulation
with your update function and plot the results.
In [7]:
# Solution goes here
After the simulation, we can compute L
, which is the average number of customers in the system, and W
, which is the average time customers spend in the store. L
and W
are related by Little's Law:
$L = \lambda W$
Where $\lambda$ is the arrival rate. Here's a function that computes them.
In [8]:
def compute_metrics(results, system):
"""Compute average number of customers and wait time.
results: TimeSeries of queue lengths
system: System object
returns: L, W
"""
L = results.mean()
W = L / system.lam
return L, W
Call compute_metrics
with the results from your simulation.
In [9]:
# Solution goes here
Since we don't know the actual value of $\lambda$, we can sweep through a range of possibilities, from 10% to 80% of the completion rate, $\mu$. (If customers arrive faster than the completion rate, the queue grows without bound. In that case the metrics L
and W
just depend on how long the store is open.)
Create an array of values for lam
.
In [10]:
# Solution goes here
Write a function that takes an array of values for lam
, a single value for mu
, and an update function.
For each value of lam
, it should run a simulation, compute L
and W
, and store the value of W
in a SweepSeries
.
It should return the SweepSeries
.
In [11]:
# Solution goes here
Call your function to generate a SweepSeries
, and plot it.
In [12]:
# Solution goes here
In [13]:
# Solution goes here
If we imagine that this range of values represents arrival rates on different days, we can use the average value of W
, for a range of values of lam
, to compare different queueing strategies.
In [14]:
# W_avg = sweep.mean()
The model I chose for this system is a common model in queueing theory, in part because many of its properties can be derived analytically.
In particular, we can derive the average time in the store as a function of $\mu$ and $\lambda$:
$W = 1 / (\mu - \lambda)$
The following function plots the theoretical value of $W$ as a function of $\lambda$.
In [15]:
def plot_W(lam_array, mu):
"""Plot the theoretical mean wait time.
lam_array: array of values for `lam`
mu: probability of finishing a checkout
"""
W = 1 / (mu - lam_array)
plot(lam_array, W, 'g-')
Use this function to plot the theoretical results, then plot your simulation results again on the same graph. How do they compare?
In [16]:
# Solution goes here
In [17]:
# Solution goes here
Use this update function to simulate the system, plot the results, and print the metrics.
In [18]:
# Solution goes here
Since we have two checkout counters now, we can consider values for $\lambda$ that exceed $\mu$.
Create a new array of values for lam
from 10% to 160% of mu
.
In [19]:
# Solution goes here
Use your sweep function to simulate the two server, one queue scenario with a range of values for lam
.
Plot the results and print the average value of W
across all values of lam
.
In [20]:
# Solution goes here
In [21]:
# Solution goes here
To simulate the scenario with two separate queues, we need two state variables to keep track of customers in each queue.
Write an update function that takes x1
, x2
, t
, and system
as parameters and returns x1
and x2
as return values. f you are not sure how to return more than one return value, see compute_metrics
.
When a customer arrives, which queue do they join?
In [22]:
# Solution goes here
Write a version of run_simulation
that works with this update function.
In [23]:
# Solution goes here
Test your functions by running a simulation with a single value of lam
.
In [24]:
# Solution goes here
Sweep a range of values for lam
, plot the results, and print the average wait time across all values of lam
.
How do the results compare to the scenario with two servers and one queue.
In [25]:
# Solution goes here
In [26]:
# Solution goes here
In [27]:
# Solution goes here
In [ ]: