The writer of this code wants to count the mean and median article length for recent articles on gay marriage in the New York Times. This code has several issues, including errors. When they checked their custom functions against the numpy functions, they noticed some discrepancies. Fix the code so it executes properly, retrieves the articles, and outputs the correct result from the custom functions, compared to the numpy functions.



In [1]:

    
import requests # a better package than urllib2



In [2]:

    
def my_mean(input_list):
    list_sum = 0
    list_count = 0
    for el in input_list:
        list_sum += el
        list_count += 1
    return list_sum / list_count



In [81]:

    
def my_median(input_list):
    input_list.sort()
    list_length = len(input_list)
    if list_length % 2 == 1:
        index = int((list_length - 1) / 2)
        return input_list[index]
    else:
        index1 = int(list_length/2)
        index2 = int(index1 - 1)
        return (input_list[index1] + input_list[index2]) / 2



In [4]:

    
api_key = "ffaf60d7d82258e112dd4fb2b5e4e2d6:3:72421680"



In [6]:

    
url = "http://api.nytimes.com/svc/search/v2/articlesearch.json?q=gay+marriage&api-key=%s" % api_key



In [7]:

    
r = requests.get(url)



In [22]:

    
# I'm not counting the things without word counts, because I don't think they should count as articles
wc_list = []
for article in r.json()['response']['docs']:
    if article['word_count']:
        wc_list.append(int(article['word_count']))



In [82]:

    
my_mean(wc_list)









    Out[82]:





720.7777777777778



In [83]:

    
import numpy as np



In [84]:

    
np.mean(wc_list)









    Out[84]:





720.77777777777783



In [85]:

    
my_median(wc_list)









    Out[85]:





684



In [86]:

    
np.median(wc_list)









    Out[86]:





684.0