In this lecture, we'll go over how to make "decisions" over the course of your code depending on the values certain variables take. We'll also introduce exceptions and how to handle them gracefully. By the end of the lecture, you should be able to
In fact, let's go ahead and look at the problem of finding the maximum value in a list.
In [1]:
x = [51, 65, 56, 19, 11, 49, 81, 59, 45, 73]
If we want to figure out the maximum value, we'll obviously need a loop to check each element of the list (which we know how to do), and a variable to store the maximum.
In [2]:
max_val = 0
for element in x:
# ... now what?
pass
We also know we can check relative values, like max_val < element
. If this evaluates to True
, we know we've found a number in the list that's bigger than our current candidate for maximum value. But how do we execute code until this condition, and this condition alone?
Enter if / elif / else
statements! (otherwise known as "conditionals")
We can use the keyword if
, followed by a statement that evaluates to either True
or False
, to determine whether or not to execute the code. For a straightforward example:
In [3]:
x = 5
if x < 5:
print("How did this happen?!") # Spoiler alert: this won't happen.
if x == 5:
print("Working as intended.")
In conjunction with if
, we also have an else
clause that we can use to execute whenever the if
statement doesn't:
In [4]:
x = 5
if x < 5:
print("How did this happen?!") # Spoiler alert: this won't happen.
else:
print("Correct.")
This is great! We can finally finish computing the maximum element of a list!
In [5]:
x = [51, 65, 56, 19, 11, 49, 81, 59, 45, 73]
max_val = 0
for element in x:
if max_val < element:
max_val = element
print("The maximum element is: {}".format(max_val))
We can test conditions! But what if we have multifaceted decisions to make? Let's look at a classic example: assigning letter grades from numerical grades.
In [6]:
student_grades = {
'Jen': 82,
'Shannon': 75,
'Natasha': 94,
'Benjamin': 48,
}
We know the 90-100 range is an "A", 80-89 is a "B", and so on. We can't do just a standard "if / else", since we have more than two possibilities here.
The third and final component of conditionals is the elif
statement (short for "else if").
elif
allows us to evaluate as many options as we'd like, all within the same conditional context (this is important). So for our grading example, it might look like this:
In [7]:
for student, grade in student_grades.items():
letter = ''
if grade >= 90:
letter = "A"
elif grade >= 80:
letter = "B"
elif grade >= 70:
letter = "C"
elif grade >= 60:
letter = "D"
else:
letter = "F"
print("{}'s letter grade: {}".format(student, letter))
Ok, that's neat. But there's still one more edge case: what happens if we want to enforce multiple conditions simultaneously?
To illustrate, let's go back to our example of finding the maximum value in a list, and this time, let's try to find the second-largest value in the list. For simplicity, let's say we've already found the largest value.
In [8]:
x = [51, 65, 56, 19, 11, 49, 81, 59, 45, 73]
max_val = 81 # We've already found it!
second_largest = 0
Here's the rub: we now have two constraints to enforce--the second largest value needs to be larger than pretty much everything in the list, but also needs to be smaller than the maximum value. Not something we can encode using if
/ elif
/ else
.
Instead, we'll use two more keywords integral to conditionals: and
and or
.
In [9]:
for element in x:
if second_largest < element and element < max_val:
second_largest = element
print("The second-largest element is: {}".format(second_largest))
second_largest < element
, is the same as before: if our current estimate of the second largest element is smaller than the latest element we're looking at, it's definitely a candidate for second-largest.element < max_val
, is what ensures we don't just pick the largest value again. This enforces the constraint that the current element we're looking at is also less than our known maximum value.and
keyword glues these two conditions together: it requires that they BOTH be True before the code inside the statement is allowed to execute.It would be easy to replicate this with "nested" conditionals:
In [10]:
second_largest = 0
for element in x:
if second_largest < element:
if element < max_val:
second_largest = element
print("The second-largest element is: {}".format(second_largest))
...but your code starts getting a little unwieldy with so many indentations.
You can glue as many comparisons as you want together with and
; the whole statement will only be True
if every single condition evaluates to True. This is what and
means: everything must be True.
The other side of this coin is or
. Like and
, you can use it to glue together multiple constraints. Unlike and
, the whole statement will evaluate to True as long as at least ONE condition is True. This is far less stringent than and
, where ALL conditions had to be True.
In [11]:
numbers = [1, 2, 5, 6, 7, 9, 10]
for num in numbers:
if num == 2 or num == 4 or num == 6 or num == 8 or num == 10:
print("{} is an even number.".format(num))
In this contrived example, I've glued together a bunch of constraints. Obviously, these constraints are mutually exclusive; a number can't be equal to both 2 and 4 at the same time, so num == 2 and num == 4
would never evaluate to True. However, using or
, only one of them needs to be True for the statement underneath to execute.
There's a little bit of intuition to it.
One other important tidbit, concerning not only conditionals, but also lists and booleans: the not
keyword.
An often-important task in data science, when you have a list of things, is querying whether or not some new piece of information you just received is already in your list. You could certainly loop through the list, asking "is my new_item == list[item i]?". But, thankfully, there's a better way:
In [12]:
import random
list_of_numbers = [random.randint(1, 100) for i in range(10)] # Generaets 10 random numbers, between 1 and 100.
if 13 not in list_of_numbers:
print("Aw man, my lucky number isn't here!")
Notice a couple things here--
if
statement asks if the number 13 is NOT found in list_of_numbers
True
--meaning the number is NOT found--it prints the message.If you omit the not
keyword, then the question becomes: "is this number in the list?"
In [13]:
import random
list_of_numbers = [random.randint(1, 2) for i in range(10)] # Generaets 10 random numbers, between 1 and 2. Yep.
if 1 in list_of_numbers:
print("Sweet, found a 1!")
This works for strings as well: 'some_string' in some_list
will return True
if that string is indeed found in the list.
Be careful with this. Typing issues can hit you full force here: if you ask if 0 in some_list
, and it's a list of floats, then this operation will always evaluate to False
.
Similarly, if you ask if "shannon" in name_list
, it will look for the precise string "shannon"
and return False
even if the string "Shannon"
is in the list. With great power, etc etc.
By now, I suspect you've likely seen your fair share of Python crashes.
NotImplementedError
from the homework assignmentsTypeError
from trying to multiply an integer by a stringKeyError
from attempting to access a dictionary key that didn't existIndexError
from referencing a list beyond its actual lengthor any number of other error messages. These are the standard way in which Python (and most other programming languages) handles error messages.
The error is known as an Exception. Some other terminology here includes:
raise NotImplementedError
in your homeworks. In other languages such as Java, an exception is "thrown" instead of "raised", but the meanings are equivalent.Here's a fairly classic example: divide by zero!
Let's say we're designing a simple calculator application that divides two numbers. We'll ask the user for two numbers, divide them, and return the quotient. Seems simple enough, right?
In [14]:
def divide(x, y):
return x / y
In [22]:
divide(11, 0)
D'oh! The user fed us a 0 for the denominator and broke our calculator. Meanie-face.
So we know there's a possibility of the user entering a 0. This could be malicious or simply by accident. Since it's only one value that could crash our app, we could in principle have an if
statement that checks if the denominator is 0. That would be simple and perfectly valid.
But for the sake of this lecture, let's assume we want to try and catch the ZeroDivisionError
ourselves and handle it gracefully.
To do this, we use something called a try / except
block, which is very similar in its structure to if / elif / else
blocks.
First, put the code that could potentially crash your program inside a try
statement. Under that, have a except
statement that defines
In [16]:
def divide_safe(x, y):
quotient = 0
try:
quotient = x / y
except ZeroDivisionError:
print("You tried to divide by zero. Why would you do that?!")
return quotient
Now if our user tries to be snarky again--
In [17]:
divide_safe(11, 0)
Out[17]:
No error, no crash! Just a "helpful" error message.
Like conditionals, you can also create multiple except
statements to handle multiple different possible exceptions:
In [18]:
import random # For generating random exceptions.
num = random.randint(0, 1)
try:
if num == 1:
raise NameError("This happens when you use a variable you haven't defined")
else:
raise ValueError("This happens when you try to multiply a string")
except NameError:
print("Caught a NameError!")
except ValueError:
print("Nope, it was actually a ValueError.")
If you download this notebook or run it with mybinder and re-run the above cell, the exception should flip randomly between the two.
Also like conditionals, you can handle multiple errors simultaneously. If, like in the previous example, your code can raise multiple exceptions, but you want to handle them all the same way, you can stack them all in one except
statement:
In [19]:
import random # For generating random exceptions.
num = random.randint(0, 1)
try:
if num == 1:
raise NameError("This happens when you use a variable you haven't defined")
else:
raise ValueError("This happens when you try to multiply a string")
except (NameError, ValueError): # MUST have the parentheses!
print("Caught...well, some kinda error, not sure which.")
If you're like me, and you're writing code that you know could raise one of several errors, but are too lazy to look up specifically what errors are possible, you can create a "catch-all" by just not specifying anything:
In [20]:
import random # For generating random exceptions.
num = random.randint(0, 1)
try:
if num == 1:
raise NameError("This happens when you use a variable you haven't defined")
else:
raise ValueError("This happens when you try to multiply a string")
except:
print("I caught something!")
Finally--and this is really getting into what's known as control flow (quite literally: "controlling the flow" of your program)--you can tack an else
statement onto the very end of your exception-handling block to add some final code to the handler.
Why? This is code that is only executed if NO exception occurs. Let's go back to our random number example: instead of raising one of two possible exceptions, we'll raise an exception only if we flip a 1.
In [21]:
import random # For generating random exceptions.
num = random.randint(0, 1)
try:
if num == 1:
raise NameError("This happens when you use a variable you haven't defined")
except:
print("I caught something!")
else:
print("HOORAY! Lucky coin flip!")
Again, if you run this notebook yourself and execute the above cell multiple times, you should see it flip between "I caught something!" and "HOORAY!", signifying that the else
clause only executes if no exceptions are raised.
Some questions to discuss and consider:
1: Go back to the if
/ elif
/ else
example about student grades. Let's assume, instead of elif
for the different conditions, you used a bunch of if
statements, e.g. if grade >= 90
, if grade >= 80
, if grade >= 70
, and so on; effectively, you didn't use elif
at all, but just used if
. What would the final output be in this case?
2: We saw that you can add an else
statement to the end of an exception handling block, which will run code in the event that no exception is raised. Why is this useful? Why not add the code you want to run in the try
block itself?
3: With respect to error handling, we discussed try
, except
, and else
statements. There is actually one more: finally
, which executes no matter what, regardless of whether an exception occurs or not. Why would this be useful?