Analyzing Growth in Diversity of Actors in the Film Industry

By: Eunice Famodimu

Description

This project aims to explore the topic of underrepresentation of black actors in the film industry, and in particular explore the growth of the presence of lead and supporting actors of African descent over the last few decades. The data used in this project is a combination of data obtained through web scraping from various Wikipedia webpages, and data extracted from excel and csv files from the Internet.

I will be comparing underrepresentation of black actors in Academy award nominations, and will examine whether we see a growth of black actors in successful films in general. Moreover, we will finally compare growth in the presence of black actors in high grossing films, for female versus male black actors.

Data Sources

Data

I. Web Scraping

The data in this section was scraped from numerous Wikipedia pages that contained lists of actors from the 1800's to the current era, who have African ancestry.


In [884]:
import sys                             # system module 
import pandas as pd # Import pandas
import matplotlib.pyplot as plt # Import matplotlib for graphics
import datetime as dt #Import date time for current data time information
import numpy as np # Import numpy 
import csv # Import csv
import requests as rq # Import requests
from bs4 import BeautifulSoup as bs # Import Beautiful Soup
import seaborn as sns
import scipy.stats.mstats as mstats

%matplotlib inline

In [706]:
# check versions (overkill, but why not?)
print('Python version:', sys.version)
print('Pandas version: ', pd.__version__)
print('Numpy version: ', np.__version__)
print('Today: ', dt.date.today())


Python version: 3.6.0 |Anaconda 4.3.0 (x86_64)| (default, Dec 23 2016, 13:19:00) 
[GCC 4.2.1 Compatible Apple LLVM 6.0 (clang-600.0.57)]
Pandas version:  0.19.2
Numpy version:  1.11.3
Today:  2017-05-11

Web pages


In [241]:
# African American Males Actors

url_AA_Male1_1 = 'https://en.wikipedia.org/w/index.php?title=Category:African-American_male_actors'
url_AA_Male1_2 = '&pageuntil=Cox%2C+Tony%0ATony+Cox+%28actor%29#mw-pages'
url_AA_Male2_1 = 'https://en.wikipedia.org/w/index.php?title=Category:African-American' 
url_AA_Male2_2 = '_male_actors&pagefrom=Cox%2C+Tony%0ATony+Cox+%28actor%29#mw-pages'
url_AA_Male3_1 = 'https://en.wikipedia.org/w/index.php?title=Category:African-American_male_actors'
url_AA_Male3_2 = '&pagefrom=Henry%2C+Joshua%0AJoshua+Henry#mw-pages'
url_AA_Male4_1 = 'https://en.wikipedia.org/w/index.php?title=Category:African-American_male_actors'
url_AA_Male4_2 = '&pagefrom=Luke%2C+Derek%0ADerek+Luke+%28actor%29#mw-pages'
url_AA_Male5_1 = 'https://en.wikipedia.org/w/index.php?title=Category:African-American_male_actors'
url_AA_Male5_2 = '&pagefrom=Richard%2C+Robert%0ARobert+Ri%27chard#mw-pages'
url_AA_Male6_1 = 'https://en.wikipedia.org/w/index.php?title=Category:African-American_male_actors'
url_AA_Male6_2 = '&pagefrom=Ward%2C+Richard%0ARichard+Ward+%28actor%29#mw-pages'

url_AA_Male1 = url_AA_Male1_1 + url_AA_Male1_2
url_AA_Male2 = url_AA_Male2_1 + url_AA_Male2_2
url_AA_Male3 = url_AA_Male3_1 + url_AA_Male3_2
url_AA_Male4 = url_AA_Male4_1 + url_AA_Male4_2
url_AA_Male5 = url_AA_Male5_1 + url_AA_Male5_2
url_AA_Male6 = url_AA_Male6_1 + url_AA_Male6_2

In [242]:
# African American Female Actors

url_AA_Female1 = 'https://en.wikipedia.org/wiki/Category:African-American_actresses'
url_AA_Female2_1 = 'https://en.wikipedia.org/w/index.php?title=Category:African-American_actresses' 
url_AA_Female2_2 = '&pagefrom=Fields%2C+Alexis%0AAlexis+Fields#mw-pages'
url_AA_Female3_1 = 'https://en.wikipedia.org/w/index.php?title=Category:African-American_actresses'
url_AA_Female3_2 = '&pagefrom=Knowles%2C+Solange%0ASolange+Knowles#mw-pages'
url_AA_Female4_1 = 'https://en.wikipedia.org/w/index.php?title=Category:African-American_actresses'
url_AA_Female4_2 = '&pagefrom=Ray%2C+Tanika%0ATanika+Ray#mw-pages'
url_AA_Female5_1 = 'https://en.wikipedia.org/w/index.php?title=Category:African-American_actresses'
url_AA_Female5_2 = '&pagefrom=Williamson%2C+Kenya%0AKenya+D.+Williamson#mw-pages'

url_AA_Female2 = url_AA_Female2_1 + url_AA_Female2_2
url_AA_Female3 = url_AA_Female3_1 + url_AA_Female3_2
url_AA_Female4 = url_AA_Female4_1 + url_AA_Female4_2
url_AA_Female5 = url_AA_Female5_1 + url_AA_Female5_2
African American Actors code

In [243]:
# Get html from browser
AA_Male1 = rq.get(url_AA_Male1)
AA_Male2 = rq.get(url_AA_Male2)
AA_Male3 = rq.get(url_AA_Male3)
AA_Male4 = rq.get(url_AA_Male4)
AA_Male5 = rq.get(url_AA_Male5)
AA_Male6 = rq.get(url_AA_Male6)

In [885]:
# Download successful? - Yes
[
(AA_Male1.status_code == 200),
(AA_Male2.status_code == 200),
(AA_Male3.status_code == 200),
(AA_Male4.status_code == 200),
(AA_Male5.status_code == 200),
(AA_Male6.status_code == 200),
]


Out[885]:
[True, True, True, True, True, True]

In [245]:
# get data as Soup
AA_Male1_soup= bs(AA_Male1.content, 'html.parser')

# get data as Soup
AA_Male2_soup= bs(AA_Male2.content, 'html.parser')

# get data as Soup
AA_Male3_soup= bs(AA_Male3.content, 'html.parser')

# get data as Soup
AA_Male4_soup= bs(AA_Male4.content, 'html.parser')

# get data as Soup
AA_Male5_soup= bs(AA_Male5.content, 'html.parser')

# get data as Soup
AA_Male_6soup= bs(AA_Male6.content, 'html.parser')

In [886]:
# Males Group 1 

li_all_M1 = AA_Male1_soup.find_all('li')
namesM1 = []
for li in li_all_M1:
    try:
        text = li.find('a').get_text()
        namesM1.append(text)
    except AttributeError:
        pass

namesAA_male1= namesM1[2:202]
namesAA_male1


Out[886]:
['50 Cent',
 'Quinton Aaron',
 'Barkhad Abdi',
 'Kareem Abdul-Jabbar',
 'Brandon Adams (actor)',
 'Granville Adams',
 'Tunde Adebimpe',
 'Ilunga Adell',
 'Jovan Adepo',
 'Faysal Ahmed',
 'Franklyn Ajaye',
 'Gbenga Akinnagbe',
 'Ira Aldridge',
 'Flex Alexander',
 'Terry Alexander (actor)',
 'Mahershala Ali',
 'Raymond Allen (television actor)',
 'John Amos',
 'Anthony Anderson',
 'Carl Anderson (singer)',
 'Eddie "Rochester" Anderson',
 'Haskell V. Anderson III',
 'Shedrack Anderson III',
 'André 3000',
 'Ray Aranha',
 'Louis Armstrong',
 'John A. Arneaux',
 'Jacob Artist',
 'Sharif Atkins',
 'Jensen Atwood',
 'Lloyd Avery II',
 'James Avery (actor)',
 'B.G. (rapper)',
 'Obba Babatundé',
 'Ross Bagley',
 'Philip Bailey',
 'Henry Judd Baker',
 'John Bailey (American actor)',
 'Leslie David Baker',
 'Shaun Baker (actor)',
 'Tab Baker',
 'Alimi Ballard',
 'Reginald Ballard',
 'Isaach de Bankolé',
 'Ogie Banks',
 'David Banner',
 'Tsalta Baptiste',
 'Charlie Barnett (comedian)',
 'Ty Barnett',
 'Malcolm Barrett (actor)',
 'Thom Barry',
 'Count Basie',
 'James Baskett',
 'Hinton Battle',
 'Texas Battle',
 'Adrien Beard',
 'Matthew Beard (American actor)',
 'Powhatan Beaty',
 'Beetlejuice (entertainer)',
 'Coby Bell',
 'Darryl M. Bell',
 'Spencer Bell (actor)',
 'Tone Bell',
 'Bill Bellamy',
 'Lou Bellamy',
 'Ben & Eddie',
 'Paul Benjamin',
 'Lamont Bentley',
 'Jason Bernard',
 'Anthony Berry (actor)',
 'Fred Berry',
 'Kendré Berry',
 'Ahmed Best',
 'Willie Best',
 'Big Daddy Kane',
 'Richard Biggs',
 'Lexie Bigham',
 'Beau Billingslea',
 'Black Herman',
 'Black Thought',
 'James R. Black',
 'Taurean Blacque',
 'Sean Blakemore',
 'Billy Blanks',
 'Corbin Bleu',
 'Kwesi Boakye',
 'Michael Boatman',
 "De'Aundre Bonds",
 'Chadwick Boseman',
 'Cameron Boyce',
 'Booker Bradshaw',
 'Wayne Brady',
 'Andre Braugher',
 'Daniel Breaker',
 'Jimmy Bridges (actor)',
 'Spencir Bridges',
 'Todd Bridges',
 'Shannon Briggs',
 'Steelo Brim',
 'Avery Brooks',
 'Mehcad Brooks',
 'Randy Brooks (actor)',
 'Richard Brooks (actor)',
 'Brother Blue',
 'Charles Brown (actor)',
 'Chris Brown',
 'Drew Bundini Brown',
 'Graham Brown (actor)',
 'Jim Brown',
 'Johnny Brown (actor)',
 'Neil Brown Jr.',
 'Orlando Brown (actor)',
 'Rob Brown (actor)',
 'Roger Aaron Brown',
 'Shaun Brown (actor)',
 'Sterling K. Brown',
 'Vis Brown',
 'Wren T. Brown',
 'Roscoe Lee Browne',
 'Bruce Bruce',
 'Hannibal Buress',
 'Gregg Burge',
 'Tituss Burgess',
 'Arthur Burghardt',
 'Eric Burroughs',
 'Sid Burston',
 'LeVar Burton',
 'Tony Burton',
 'Grand L. Bush',
 'Busta Rhymes',
 'Trai Byers',
 'Adolph Caesar',
 'A. J. Calloway',
 'Northern Calloway',
 'Edmund Cambridge',
 'Godfrey Cambridge',
 'Dick Campbell (producer)',
 'Vernon Campbell',
 "Cam'ron",
 'Ron Canada',
 'Nick Cannon',
 'Jerrod Carmichael',
 'Rocky Carroll',
 'Thomas Carroll (martial artist)',
 'Julius Carry',
 'Terrence C. Carson',
 'Jack Carter (actor)',
 'Ralph Carter',
 'T. K. Carter',
 'Terry Carter',
 'Thomas Carter (director)',
 'Dante Carver',
 'Bernie Casey',
 'Reg E. Cathey',
 'Matt Cedeño',
 'Cedric the Entertainer',
 'Al Chalk',
 'Chris Chalk',
 'Rudy Challenger',
 'Michael Chambers',
 'Chamillionaire',
 'Grizz Chapman',
 'Gaius Charles',
 'Vernon Chatman',
 'Don Cheadle',
 'Alan Cherry',
 'Rodney Chester',
 'Morris Chestnut',
 'Chikezie',
 'Alvin Childress',
 'Chingy',
 'Chosen Effect',
 'Rodd Christensen',
 'Julian Christopher',
 'Eugene Clark (actor)',
 'Mystro Clark',
 'Brodus Clay',
 'Rudolph M. Clay',
 'Bill Cobbs',
 'Nat King Cole',
 'Chad Coleman',
 'Gary Coleman',
 'Jim Coleman (actor)',
 'King Coleman',
 'Ryan Coleman',
 'Don Pedro Colley',
 "Rickey D'Shon Collins",
 'Mike Colter',
 'Michael Colyar',
 'Sean Combs',
 'Common (rapper)',
 'Onest Conley',
 'Conphidance',
 'Lawrence Cook (actor)',
 'Coolio',
 'Chuck Cooper (actor)',
 'Jeff Coopwood',
 'Brian Copeland',
 'Bill Cosby',
 'DeForest Covan']

In [887]:
# Males2

li_all_M1_2 = AA_Male2_soup.find_all('li')
namesM1_2 = []
for li in li_all_M1_2:
    try:
        text = li.find('a').get_text()
        namesM1_2.append(text)
    except AttributeError:
        pass
    
namesAA_male2= namesM1_2[2:202]
namesAA_male2


Out[887]:
['Tony Cox (actor)',
 'Adam Jamal Craig',
 'Jermaine Crawford',
 'Lavell Crawford',
 'Terry Crews',
 'Rupert Crosse',
 'Iman Crosson',
 'Scatman Crothers',
 'Ice Cube',
 'Ji-Tu Cumbuka',
 'Rusty Cundieff',
 'Don Curry',
 'Mark Curry (actor)',
 'Vondie Curtis-Hall',
 'RJ Cyler',
 'D.C. Scorpio',
 'Percy Daggs III',
 'Draft:Dalpre Grayer',
 'Billy Daniels',
 'Dee Jay Daniels',
 'Dexter Darden',
 'Keith David',
 'Tommy Davidson',
 'Charles Michael Davis',
 'Clifton Davis',
 "D'Mitch Davis",
 'DeRay Davis',
 'Duane Davis',
 'LaVan Davis',
 'Nore Davis',
 'Ossie Davis',
 'Sammy Davis Jr.',
 'Sammy Davis Sr.',
 'EJay Day',
 'Deezer D',
 'Oscar DeGruy',
 'Bob Delegall',
 'Erik Dellums',
 'Clinton Derricks-Carroll',
 'Cleavant Derricks (actor)',
 'Brandon DeShazer',
 'Hal DeWindt',
 'Keith Diamond (actor)',
 'Dudley Dickerson',
 'Daveed Diggs',
 'Taye Diggs',
 'Thomas Dilward',
 'Brandon J. Dirden',
 'Ivan Dixon',
 'Badja Djola',
 'DMX (rapper)',
 'Colman Domingo',
 'Robert DoQui',
 'Michael Dorn',
 'Doug E. Doug',
 'Gary Dourdan',
 'David Downing (actor)',
 'Doctor Dré',
 'Dr. Dre',
 'Dres (rapper)',
 'Bill Duke',
 'Rockmond Dunbar',
 'Christopher B. Duncan',
 'Michael Clarke Duncan',
 'Charles S. Dutton',
 'Lil Duval',
 'E-40',
 'Greg Eagles',
 'Michael Ealy',
 'David Early',
 'Earthquake (comedian)',
 'Ronnie Eckstine',
 'Dean Edwards',
 'James Edwards (actor)',
 'Lonne Elder III',
 'William Elliott (African American actor)',
 'Nelsan Ellis',
 'Kiko Ellsworth',
 'Mike Epps',
 'Omar Epps',
 'Giancarlo Esposito',
 'Art Evans',
 'Damon Evans (actor)',
 'Mike Evans (actor)',
 'Dwight Ewell',
 'Donald Faison',
 'Frankie Faison',
 'Olamide Faison',
 'Fatso-Fasano',
 'Kent Faulcon',
 'Roshon Fegan',
 'Dave Fennoy',
 'Jessie Lawrence Ferguson',
 'Kareem Ferguson',
 'Jason Finn (actor)',
 'Laurence Fishburne',
 'Ray Fisher (actor)',
 'Brandon Fobbs',
 'Shawn Fonteno',
 'Thomas Mikal Ford',
 'Ken Foree',
 'Stan Foster',
 'Jamie Foxx',
 'Redd Foxx',
 'Carl Franklin',
 'Don Franklin',
 'Marcus Carl Franklin',
 'Al Freeman Jr.',
 'K. Todd Freeman',
 'Morgan Freeman',
 'Arthur French (actor)',
 'Leon Frierson',
 'Donald Fullilove',
 'Ron Funches',
 'James Gaines',
 'Wildman Steve',
 'The Game (rapper)',
 'Richard Gant',
 'George Garner',
 'James Gaylyn',
 'Gee Money',
 'Michael Genet',
 'Jason Winston George',
 'Nathan George',
 'Trestin George',
 'Tyrese Gibson',
 'Lawrence Gilliard Jr.',
 'Charles Sidney Gilpin',
 'Clarence Gilyard',
 'Ron Glass',
 'Roy Glenn',
 'Anwan Glover',
 'Corey Glover',
 'Danny Glover',
 'Donald Glover',
 'Savion Glover',
 'Dedrick D. Gobert',
 'Jesse D. Goins',
 'Cuba Gooding Sr.',
 'Cuba Gooding Jr.',
 'Omar Gooding',
 'Carl Gordon (actor)',
 'King Gordy',
 'George O. Gore II',
 'Louis Gossett Jr.',
 'Noah Gray-Cabey',
 'Bryshere Y. Gray',
 'CeeLo Green',
 'Reuben Greene',
 'Dorian Gregory',
 'Kevin Grevioux',
 'David Alan Grier',
 'Rosey Grier',
 'Eddie Griffin',
 'Khamani Griffin',
 'Lance Gross',
 'Robert Guillaume',
 'Bill Gunn (writer)',
 'Moses Gunn',
 'Richard Parnell Habersham',
 'Jester Hairston',
 'Albert Hall (actor)',
 'Arsenio Hall',
 'J. D. Hall',
 'Kevin Peter Hall',
 'Pooch Hall',
 'Sydney Hall (actor)',
 'Bernie Hamilton',
 'Brandon Hammond',
 'John Hancock (actor)',
 'Kadeem Hardison',
 'Cory Hardrict',
 'Dorian Harewood',
 'Ben Harney (actor)',
 'Hill Harper',
 'Maestro Harrell',
 'David Harris (American actor)',
 'Julius Harris',
 'Ralph Harris (comedian)',
 'Ricky Harris',
 'Robin Harris',
 'Steve Harris (actor)',
 'Wendell B. Harris Jr.',
 'Wood Harris',
 'Kelvin Harrison Jr.',
 'Shawn Harrison (actor)',
 'Kevin Hart',
 'Steve Harvey',
 'Corey Hawkins',
 'Isaac Hayes',
 'Reggie Hayes',
 'Lloyd Haynes',
 'Tiger Haynes',
 'Dennis Haysbert',
 'Bryan Hearne',
 'Gordon Heath',
 'Heavy D',
 'Sherman Hemsley',
 'Stephen Henderson (actor)',
 'Barry Shabaka Henley']

In [888]:
# Males3

li_all_M1_3 = AA_Male3_soup.find_all('li')
namesM1_3 = []
for li in li_all_M1_3:
    try:
        text = li.find('a').get_text()
        namesM1_3.append(text)
    except AttributeError:
        pass
    
namesAA_male3= namesM1_3[2:202]
namesAA_male3


Out[888]:
['Joshua Henry',
 'Darrin Henson',
 'Hilly Hicks',
 'Rodney Hicks',
 'Dulé Hill',
 'Gil Hill',
 'Jon Michael Hill',
 'Roger Hill (actor)',
 'Lawrence Hilton-Jacobs',
 'Gregory Hines',
 'Maurice Hines',
 'Aldis Hodge',
 'Edwin Hodge',
 'Ty Hodges',
 'Dominic Hoffman',
 'Gus Hoffman',
 'Corey Holcomb',
 'André Holland',
 'Christopher Michael Holley',
 'Kene Holliday',
 'Greg Hollimon',
 'Tommy Hollis',
 'Brian Hooks',
 'Kevin Hooks',
 'Robert Hooks',
 'Jermaine Hopkins',
 'Bobby Hosea',
 'Allen Hoskins',
 'Gavin Houston',
 'Sterling Houston',
 'Terrence Howard',
 'Ernie Hudson',
 'Ernie Hudson Jr.',
 'D. L. Hughley',
 'Harold Hunter',
 'Reginald D. Hunter',
 'Philip Hurlic',
 'Rif Hutton',
 'Earle Hyman',
 'Ice-T',
 'James Monroe Iglehart',
 'Eme Ikwuakor',
 'Rex Ingram (actor)',
 'Michael Irvin',
 'Ja Rule',
 'Michael Jace',
 "Alphonso A'Qen-Aten Jackson",
 'Brandon T. Jackson',
 'Leonard Jackson (actor)',
 'Mel Jackson',
 'Merrell Jackson',
 "O'Shea Jackson Jr.",
 'Samuel L. Jackson',
 'Stoney Jackson',
 'Trevor Jackson (performer)',
 'Jeffrey Jacquet',
 'Lord Jamar',
 'Jamel Simmons',
 'James Hewlett',
 'Brad James',
 'Bryton James',
 'Hawthorne James',
 'Paul James (actor)',
 'Peter Francis James',
 'Steve James (actor)',
 'Jay Pharoah',
 'DJ Jazzy Jeff',
 'Kyle Jean-Baptiste',
 'Phillip Jeanmarie',
 'Def Jef',
 'Marc John Jefferies',
 'Brenden Jefferson',
 'Herbert Jefferson Jr.',
 'Adam Jeffries',
 'Herb Jeffries',
 'Larry "Flash" Jenkins',
 'Mykel Shannon Jenkins',
 'Brent Jennings',
 'DeWayne Jessie',
 'Anthony Johnson (actor)',
 'Arnold Johnson (actor)',
 'Clark Johnson',
 'Dots Johnson',
 'Dwayne Johnson',
 'Hassan Johnson',
 'Hisonni Johnson',
 'Keith Johnson (actor)',
 'Kyle Johnson (actor)',
 'Marques Johnson',
 'Noble Johnson',
 'William L. Johnson',
 'Wesley Jonathan',
 'Ian Jones-Quartey',
 'Brian Jones (activist)',
 'Buster Jones',
 'Duane Jones',
 'Ike Jones',
 'James Earl Jones',
 'Jim Jones (rapper)',
 'John Marshall Jones',
 'Mike Jones (rapper)',
 'Orlando Jones',
 'Quincy Jones',
 'Richard T. Jones',
 'Robbie Jones (actor)',
 'Robert Earl Jones',
 'Sam Jones III',
 'Walter Emanuel Jones',
 'Jo D. Jonz',
 'Michael B. Jordan',
 'Amin Joseph',
 'David Joyner (actor)',
 'Christopher Judge',
 'OJ da Juiceman',
 'Max Julien',
 'Justin Hires',
 'Juvenile (rapper)',
 'Khalil Kain',
 'Melvin Ray Kearney II',
 'NaShawn Kearse',
 'Elijah Kelley',
 'Malcolm David Kelley',
 'Jim Kelly (martial artist)',
 'R. Kelly',
 'Keegan-Michael Key',
 'Christian Keyes',
 'Hakeem Khaaliq',
 'Wiz Khalifa',
 'Lincoln Kilpatrick',
 'Arif S. Kinchen',
 'Erik King',
 'Andre Kinney',
 'George Kirby',
 'Craig Kirkwood',
 'Tory Kittles',
 'John Kitzmiller',
 'Chris Knowings',
 'Yaphet Kotto',
 'Lenny Kravitz',
 'KRS-One',
 'Jean-Claude La Marre',
 'Eriq La Salle',
 'Phil LaMarr',
 'Lamon Archey',
 'Charles Lampkin',
 'Charles Lane (filmmaker)',
 'Eric Lane (actor)',
 'James T. Lane',
 'Eric Laneuville',
 'Ted Lange',
 'Bobby Lashley',
 'Jacob Latimore',
 'Nick LaTour',
 'Mark Christopher Lawrence',
 'Martin Lawrence',
 'Scott Lawrence',
 'Richard Lawson (actor)',
 'Clayton LeBouef',
 'Anthony Lee (actor)',
 'Canada Lee',
 'Carl Lee (actor)',
 'Cinqué Lee',
 'Daniel Curtis Lee',
 'Eugene Lee (actor)',
 'Irving Allen Lee',
 'Johnny Lee (actor)',
 'RonReaco Lee',
 'Spike Lee',
 'John Legend',
 'Raw Leiba',
 'Harry Lennix',
 'Leon Isaac Kennedy',
 'Sugar Ray Leonard',
 'Ali LeRoi',
 'James Lesure',
 'Calvin Levels',
 'Emmanuel Lewis',
 'Norm Lewis',
 'Phill Lewis',
 "Lil' JJ",
 'Lil Jon',
 'Lil Wayne',
 'Tom Lister Jr.',
 'Cleavon Little',
 'LL Cool J',
 'Goldie Loc',
 'Spider Loc',
 'Doug Locke',
 'Cirroc Lofton',
 'Avon Long',
 'Jackie Long',
 'Brian Love',
 'Darris Love',
 'Faizon Love',
 'Victor Love',
 'Herbie Lovelle',
 'Ed Lover',
 'Arvie Lowe Jr.',
 'Sam Lucas',
 'Ludacris']

In [889]:
# Males4

li_all_M1_4 = AA_Male4_soup.find_all('li')
namesM1_4 = []
for li in li_all_M1_4:
    try:
        text = li.find('a').get_text()
        namesM1_4.append(text)
    except AttributeError:
        pass
    
namesAA_male4= namesM1_4[2:202]
namesAA_male4


Out[889]:
['Derek Luke (actor)',
 'Bernie Mac',
 'CJ Mac',
 'Mack 10',
 'Anthony Mackie',
 'Noel MacNeal',
 'Peter Macon',
 'Blu Mankuma',
 'Mario (American singer)',
 'Pigmeat Markham',
 'Jaron Marquis',
 'John Marriott (actor)',
 'Don Marshall (actor)',
 'Larry Marshall (actor)',
 'Tim Marshall (radio host)',
 'William Marshall (actor)',
 'Christopher Martin (entertainer)',
 'Dan Martin (actor)',
 'Duane Martin',
 "D'Urville Martin",
 'Jesse L. Martin',
 'Mase',
 'Christopher Massey',
 'Kyle Massey',
 'Master Juba',
 'Master P',
 'James C. Mathis III',
 'Al Matthews',
 'Onzy Matthews',
 'Whitman Mayo',
 'MC Ren',
 'Chi McBride',
 'Billy McClain',
 'Darius McCrary',
 'Suli McCullough',
 'James McDaniel',
 'Jonathan McDaniel',
 'Sam McDaniel',
 'James McEachin',
 'Andrew McFarlane (American actor)',
 'William McGhee',
 'Charles McGregor',
 'Caleb McLaughlin',
 'Frank McRae',
 'Mo McRae',
 'Tim Meadows',
 'Aaron Meeks',
 "E'Shun Melvin",
 'Method Man',
 'Windell Middlebrooks',
 'Qaasim Middleton',
 'Hal Miller (actor)',
 'Omar Benson Miller',
 'Romeo Miller',
 'Derek Minor',
 'Jerry Minor',
 'Robert Lee Minor',
 'Brian Stokes Mitchell',
 'Daryl Mitchell (actor)',
 'Finesse Mitchell',
 'Jason Mitchell (actor)',
 'Kel Mitchell',
 'Scoey Mitchell',
 'Jerod Mixon',
 'Anthony Montgomery',
 'Jim Moody (actor)',
 'Paul Mooney (comedian)',
 'Phil Moore (actor)',
 'Rudy Ray Moore',
 'Shemar Moore',
 'Tim Moore (comedian)',
 'Vic Moore',
 'Mantan Moreland',
 'Cesar A. Moreno',
 'S. Robert Morgan',
 'Tracy Morgan',
 'Garrett Morris',
 'Greg Morris',
 'Lamorne Morris',
 'Phil Morris (actor)',
 'Ernie Morrison',
 "M'fundo Morrison",
 'Joe Morton',
 'Mos Def',
 'Roger E. Mosley',
 'Walter Mosley',
 'Khalid Moultrie',
 'Tahj Mowry',
 'Charlie Murphy',
 'Eddie Murphy',
 'Clarence Muse',
 'Isaiah Mustafa',
 'Ntare Mwine',
 'Lou Myers (actor)',
 'Rashaan Nall',
 'Lane Napper',
 'Nas',
 'Nelly',
 'Nephew Tommy',
 'Alex Newell',
 'Ngo Okafor',
 'Fayard Nicholas',
 'Lance E. Nichols',
 'DeVaughn Nixon',
 'Bill Nunn',
 'Nyambi Nyambi',
 'George T. Odom',
 'Leslie Odom Jr.',
 'Omarion',
 'Afemo Omilami',
 "Patrice O'Neal",
 "Ron O'Neal",
 "Shaquille O'Neal",
 'Roscoe Orman',
 'Adesola Osakalumi',
 'Owen H.M. Smith',
 'Geoffrey Owens',
 'Nathan Owens',
 'Brandon P. Bell',
 'Michael J. Pagan',
 'Harrison Page',
 'Ken Page',
 'Dempsey Pappion',
 'Anthony Ray Parker',
 'Nate Parker',
 'Deantoni Parks',
 'Gordon Parks',
 'Charles Parnell (actor)',
 'Peter Parros',
 'Rahsaan Patterson',
 'Marcus T. Paulk',
 'Allen Payne',
 'Carl Anthony Payne II',
 'Christian Payton',
 'Khary Payton',
 'Jordan Peele',
 'Nathan Pelle',
 'Collins Pennie',
 'Kelly Perine',
 'Harold Perrineau',
 'Felton Perry',
 'Rod Perry (actor)',
 'Rodney Perry',
 'Tyler Perry',
 'Wolfe Perry',
 'Brock Peters',
 'Clarke Peters',
 'Mekhi Phifer',
 'Joseph C. Phillips',
 'James Pickens Jr.',
 'Stack Pierce',
 'Wendell Pierce',
 'Ron Pinkard',
 'Glenn Plummer',
 'Jeris Lee Poindexter',
 'Sidney Poitier',
 'Oscar Polk',
 'Patrik-Ian Polk',
 'DJ Pooh',
 'Albert Popwell',
 'Billy Porter (entertainer)',
 'Clifton Powell',
 'Keith Powell',
 'Will Power (performer)',
 'Ben Powers',
 'Keith T. Powers',
 'Billy Preston',
 'J. A. Preston',
 'Gilbert Price',
 'Prince (musician)',
 'Richard Pryor',
 'Willard E. Pugh',
 'Nathan Purdee',
 'Quincy (actor)',
 'Luther Rackley',
 'Damaine Radcliff',
 'Michael Ralph',
 'Ahmad Rashād',
 'Thalmus Rasulala',
 'Mishon Ratliff',
 'Donnell Rawlings',
 'Lou Rawls',
 'Ray J',
 'Gene Anthony Ray',
 'Raz-B',
 'Chris Redd',
 'Lance Reddick',
 'Redman (rapper)',
 'Markus Redmond',
 'Albert Reed Jr.',
 'George H. Reed',
 'Joshua Elijah Reese',
 'Omar Regan',
 'Tim Reid',
 'Reynaldo Rey',
 'Corey Reynolds',
 'James Reynolds (actor)',
 'Khylin Rhambo',
 'Ving Rhames',
 'Hari Rhodes']

In [890]:
# Males5

li_all_M1_5 = AA_Male5_soup.find_all('li')
namesM1_5 = []
for li in li_all_M1_5:
    try:
        text = li.find('a').get_text()
        namesM1_5.append(text)
    except AttributeError:
        pass
    
namesAA_male5= namesM1_5[2:202]
namesAA_male5


Out[890]:
["Robert Ri'chard",
 'J. August Richards',
 'Kevin Michael Richardson',
 'Marque Richardson',
 'Ron Richardson',
 'Sam Richardson (actor)',
 'Sy Richardson',
 'Deon Richmond',
 'Tequan Richmond',
 'Stephen Rider',
 'Larry Riley (actor)',
 'Robert Christopher Riley',
 'Sean Ringgold',
 'Rodney Allen Rippy',
 'Robert Christian',
 'Davis Roberts',
 'Leonard Roberts',
 'Michael D. Roberts',
 'Tony T. Roberts',
 'Lance Robertson',
 'Paul Robeson',
 'Bill Robinson',
 'Bumper Robinson',
 'Charlie Robinson (actor)',
 'Craig Robinson (actor)',
 'Jimmy Robinson (actor)',
 'Keith Robinson (actor)',
 'Leon Robinson',
 'Matt Robinson (actor)',
 'Roger Robinson (actor)',
 'Jeff Roches',
 'Chris Rock',
 'Tony Rock',
 'Rocko (rapper)',
 'Delvon Roe',
 'Ivan Rogers (actor)',
 'Timmie Rogers',
 'Al Roker',
 'Howard Rollins',
 'Clinton Rosemond',
 'Evan Ross',
 'Ricco Ross',
 'Shavar Ross',
 'Ted Ross',
 'Rotimi (actor)',
 'Richard Roundtree',
 'Andre Royo',
 'Lamman Rucker',
 'RuPaul',
 'Tim Russ',
 'Montae Russell',
 'Nipsey Russell',
 'Ken Sagoes',
 'Raymond St. Jacques',
 'John Salley',
 'Kendrick Sampson',
 'Jeffrey D. Sams',
 'Samwell (entertainer)',
 'Alvin Sanders',
 'Ashton Sanders',
 'C. J. Sanders',
 'Henry G. Sanders',
 'Juelz Santana',
 'Ruben Santiago-Hudson',
 'Bob Sapp',
 'Parker Sawyers',
 'MuMs da Schemer',
 'Harold Scott (director)',
 'Larry B. Scott',
 'Franklyn Seales',
 'Joe Seneca',
 'Sayeed Shahidi',
 'Tupac Shakur',
 'Vicellous Reon Shannon',
 'Monti Sharp',
 'Doc Shaw',
 'Stan Shaw',
 'Al Shearer',
 'Ray Shell',
 'Chaz Lamar Shepherd',
 'Rondell Sheridan',
 'Demetrius Shipp Jr.',
 'Bobby Short',
 'Columbus Short',
 'P. Jay Sidney',
 'Keith Silverstein',
 'Henry Simmons',
 'O. J. Simpson',
 'Sinbad (comedian)',
 'IronE Singleton',
 'Isaac C. Singleton Jr.',
 'Kofi Siriboe',
 'Sisqó',
 'Darryl Sivad',
 'Cle Shaheed Sloan',
 'Tucker Smallwood',
 'Rickey Smiley',
 'Arjay Smith',
 'Brandon Mychal Smith',
 'Bubba Smith',
 'Chris Smith (composer)',
 'Jaden Smith',
 'Jamil Walker Smith',
 'Jason Matthew Smith',
 'Justice Smith',
 'Keith Randolph Smith',
 'Will Smith',
 'Jake Smollett',
 'Jussie Smollett',
 'J. B. Smoove',
 'David Smyrl',
 'Wesley Snipes',
 'Snoop Dogg',
 'Trey Songz',
 'Soulja Boy',
 'Aaron D. Spears',
 'Aries Spears',
 'Chris Spencer (actor)',
 'Christopher St. John',
 'Kristoff St. John',
 'Mathew St. Patrick',
 'Nathaniel Stampley',
 'Keith Stanfield',
 'Fredro Starr',
 'Eddie Steeples',
 'Darryl Stephens',
 'Mel Stewart',
 'Nick Stewart',
 'Timothy Stickney',
 'Sticky Fingaz',
 'Austin Stoker',
 'Michael Strahan',
 'Stretch (rapper)',
 'Astro (rapper)',
 'Woody Strode',
 'Dana Stubblefield',
 'Levi Stubbs',
 'Gary Anthony Sturgis',
 'Jeremy Suarez',
 'Daniel Sunjata',
 'Harold Sylvester',
 'Mr. T',
 'Taimak',
 'Michael Taliferro',
 'Antwon Tanner',
 'Le Tari',
 'Rockne Tarkington',
 'Larenz Tate',
 'Henry E. Taylor III',
 'Lawrence Taylor',
 'Meshach Taylor',
 'Nathaniel Taylor (actor)',
 'Ron Taylor (actor)',
 'Wally Taylor (actor)',
 'Devon Terrell',
 'Marcello Thedford',
 'Billie Thomas',
 'Ernest Lee Thomas',
 'Khleo Thomas',
 'Leon Thomas III',
 'Sean Patrick Thomas',
 'Warren Thomas',
 'William Thomas Jr.',
 "Bobb'e J. Thompson",
 'Kenan Thompson',
 'T.I.',
 'Tony Todd',
 'Tone Lōc',
 'Fred Toones',
 'Guy Torry',
 'Joe Torry',
 'Robert Townsend (actor)',
 'Craig Lamar Traylor',
 'Treach',
 'Trick Daddy',
 'Xavier Lamar Truesdell',
 'Chris Tucker',
 'Lorenzo Tucker',
 'Dan Tullis Jr.',
 'Glynn Turman',
 'Tyrin Turner',
 'Tyler, The Creator',
 'Willie Tyler',
 'Blair Underwood',
 'Usher (singer)',
 'Jessie Usher',
 'Ron van Clief',
 'Mario Van Peebles',
 'Melvin Van Peebles',
 'Courtney B. Vance',
 'Victor Varnado',
 'Reginald VelJohnson',
 'Ben Vereen',
 'Adam Wade (singer)',
 'Shangela Laquifa Wadley',
 'Jimmie Walker',
 'William "Bill" Walker',
 'James Wall (actor)',
 'Basil Wallace',
 'George Wallace (comedian)']

In [891]:
# Males6

li_all_M1_6 = AA_Male_6soup.find_all('li')
namesM1_6 = []
for li in li_all_M1_6:
    try:
        text = li.find('a').get_text()
        namesM1_6.append(text)
    except AttributeError:
        pass
    
namesAA_male6= namesM1_6[2:122]
namesAA_male6


Out[891]:
['Richard Ward (actor)',
 'Vincent M. Ward',
 'Malcolm-Jamal Warner',
 'Michael Warren (actor)',
 'Blue Washington',
 'Denzel Washington',
 'Isaiah Washington',
 'Jascha Washington',
 'Kenneth Washington',
 'Vernon Washington',
 'Damien Dante Wayans',
 'Damon Wayans',
 'Damon Wayans Jr.',
 'Dwayne Wayans',
 'Keenen Ivory Wayans',
 'Marlon Wayans',
 'Shawn Wayans',
 'WC (rapper)',
 'Carl Weathers',
 'Sean Weathers',
 'Jason Weaver',
 'Lee Weaver',
 'Derek Webster (actor)',
 'John Wesley (actor)',
 'Ernie Wheelwright (running back)',
 'Leigh Whipper',
 'Denzel Whitaker',
 'Forest Whitaker',
 'Kenn Whitaker',
 'Al White',
 'Brian J. White',
 "De'voreaux White",
 'Jaleel White',
 'Michael Jai White',
 'Slappy White',
 'Steve White (actor)',
 'Tye White',
 'Charles Malik Whitfield',
 'Dondre Whitfield',
 'Vantile Whitfield',
 'Napoleon Whiting',
 'Isiah Whitlock Jr.',
 'Ernest Whitman',
 'Jack Wiggins',
 'Ralph Wilcox (actor)',
 'Wayne Wilderson',
 'Tristan Wilds',
 'Billy Dee Williams',
 "Chino 'Fats' Williams",
 'Chris Williams (actor)',
 'Christopher Williams (singer)',
 'Clarence Williams (musician)',
 'Clarence Williams III',
 'Cress Williams',
 'Curtis Williams',
 'Darnell Williams',
 'Dick Anthony Williams',
 'Doug Williams (comedian)',
 'Gary Anthony Williams',
 'Gregory Alan Williams',
 'Hal Williams',
 'J. D. Williams',
 'Jesse Williams (actor)',
 'Katt Williams',
 'Melvin Williams (actor)',
 'Micah Stephen Williams',
 'Mike Williams (boxer)',
 'Michael K. Williams',
 'Redaric Williams',
 'Saul Williams',
 'Spencer Williams (actor)',
 'Stephen Tyrone Williams',
 'Steven Williams',
 'Todd Williams (actor)',
 'Tyler James Williams',
 'Tyrel Jackson Williams',
 'Victor Williams',
 'Fred Williamson',
 'Mykelti Williamson',
 "Dap 'Sugar' Willie",
 'Victor Willis',
 'Larry Wilmore',
 'Marc Wilmore',
 'Bryce Wilson',
 "De'Angelo Wilson",
 'Demond Wilson',
 'Dooley Wilson',
 'Dorien Wilson',
 'Flip Wilson',
 'Frank H. Wilson',
 'Lisle Wilson',
 'Reno Wilson',
 'Theodore Wilson',
 'Troy Winbush',
 'Paul Winfield',
 'Michael Winslow',
 'John Witherspoon (actor)',
 'Jeffery Wood',
 'Bokeem Woodbine',
 'Shaheed Woods',
 'D. B. Woodside',
 'Wayne Woodson',
 'Michael-Leon Wooley',
 'Carl Wright (actor)',
 'Dorsey Wright',
 'Jeffrey Wright',
 'Michael Wright (actor)',
 'Samuel E. Wright',
 'Tom Wright (actor)',
 'Xzibit',
 'Yahya Abdul-Mateen II',
 'Cedric Yarbrough',
 'Malik Yoba',
 'Jade Yorker',
 'Bruce A. Young',
 'Chazz Young',
 'Lee Thompson Young',
 'Marlon Young',
 'Otis Young',
 'William Allen Young']

In [892]:
# Noteable male actors of African descent, not included in Wikipedia page

notable1 = [
'Djimon Hounsou', 'Chiwetel Ejiofor', 'David Oyelowo', 'Adewale Akinnuoye-Agbaje', 'Idris Elba', 'Omar Sy, John Boyega', 'Nonso Anozie', 'Isaach De Bankole', 'Gbenga Akinnagbe', 
'Peter Mensah', 'Richard Ayoade', 'Tongayi Chirisa', 'Dayo Okeniyi', 'Yaphet Kotto', 'Hugh Quarshie', 'Ntare Guma Mbaho Mwine', 'John Kani', 'Edi Gathegi', 'Sahr Ngaujah', 'Presley Chweneyagae', 
'Eriq Ebouaney', 'Aml Ameen', 'N!xau, Hakeem Kae-Kazim', 'Ato Essandoh', 'Fana Mokoena', 'Daniel Kaluuya']

In [893]:
# Add List
namesAA_male= namesAA_male1 + namesAA_male2 + namesAA_male3 + namesAA_male4 + namesAA_male5 + namesAA_male6 + notable1
namesAA_male


Out[893]:
['50 Cent',
 'Quinton Aaron',
 'Barkhad Abdi',
 'Kareem Abdul-Jabbar',
 'Brandon Adams (actor)',
 'Granville Adams',
 'Tunde Adebimpe',
 'Ilunga Adell',
 'Jovan Adepo',
 'Faysal Ahmed',
 'Franklyn Ajaye',
 'Gbenga Akinnagbe',
 'Ira Aldridge',
 'Flex Alexander',
 'Terry Alexander (actor)',
 'Mahershala Ali',
 'Raymond Allen (television actor)',
 'John Amos',
 'Anthony Anderson',
 'Carl Anderson (singer)',
 'Eddie "Rochester" Anderson',
 'Haskell V. Anderson III',
 'Shedrack Anderson III',
 'André 3000',
 'Ray Aranha',
 'Louis Armstrong',
 'John A. Arneaux',
 'Jacob Artist',
 'Sharif Atkins',
 'Jensen Atwood',
 'Lloyd Avery II',
 'James Avery (actor)',
 'B.G. (rapper)',
 'Obba Babatundé',
 'Ross Bagley',
 'Philip Bailey',
 'Henry Judd Baker',
 'John Bailey (American actor)',
 'Leslie David Baker',
 'Shaun Baker (actor)',
 'Tab Baker',
 'Alimi Ballard',
 'Reginald Ballard',
 'Isaach de Bankolé',
 'Ogie Banks',
 'David Banner',
 'Tsalta Baptiste',
 'Charlie Barnett (comedian)',
 'Ty Barnett',
 'Malcolm Barrett (actor)',
 'Thom Barry',
 'Count Basie',
 'James Baskett',
 'Hinton Battle',
 'Texas Battle',
 'Adrien Beard',
 'Matthew Beard (American actor)',
 'Powhatan Beaty',
 'Beetlejuice (entertainer)',
 'Coby Bell',
 'Darryl M. Bell',
 'Spencer Bell (actor)',
 'Tone Bell',
 'Bill Bellamy',
 'Lou Bellamy',
 'Ben & Eddie',
 'Paul Benjamin',
 'Lamont Bentley',
 'Jason Bernard',
 'Anthony Berry (actor)',
 'Fred Berry',
 'Kendré Berry',
 'Ahmed Best',
 'Willie Best',
 'Big Daddy Kane',
 'Richard Biggs',
 'Lexie Bigham',
 'Beau Billingslea',
 'Black Herman',
 'Black Thought',
 'James R. Black',
 'Taurean Blacque',
 'Sean Blakemore',
 'Billy Blanks',
 'Corbin Bleu',
 'Kwesi Boakye',
 'Michael Boatman',
 "De'Aundre Bonds",
 'Chadwick Boseman',
 'Cameron Boyce',
 'Booker Bradshaw',
 'Wayne Brady',
 'Andre Braugher',
 'Daniel Breaker',
 'Jimmy Bridges (actor)',
 'Spencir Bridges',
 'Todd Bridges',
 'Shannon Briggs',
 'Steelo Brim',
 'Avery Brooks',
 'Mehcad Brooks',
 'Randy Brooks (actor)',
 'Richard Brooks (actor)',
 'Brother Blue',
 'Charles Brown (actor)',
 'Chris Brown',
 'Drew Bundini Brown',
 'Graham Brown (actor)',
 'Jim Brown',
 'Johnny Brown (actor)',
 'Neil Brown Jr.',
 'Orlando Brown (actor)',
 'Rob Brown (actor)',
 'Roger Aaron Brown',
 'Shaun Brown (actor)',
 'Sterling K. Brown',
 'Vis Brown',
 'Wren T. Brown',
 'Roscoe Lee Browne',
 'Bruce Bruce',
 'Hannibal Buress',
 'Gregg Burge',
 'Tituss Burgess',
 'Arthur Burghardt',
 'Eric Burroughs',
 'Sid Burston',
 'LeVar Burton',
 'Tony Burton',
 'Grand L. Bush',
 'Busta Rhymes',
 'Trai Byers',
 'Adolph Caesar',
 'A. J. Calloway',
 'Northern Calloway',
 'Edmund Cambridge',
 'Godfrey Cambridge',
 'Dick Campbell (producer)',
 'Vernon Campbell',
 "Cam'ron",
 'Ron Canada',
 'Nick Cannon',
 'Jerrod Carmichael',
 'Rocky Carroll',
 'Thomas Carroll (martial artist)',
 'Julius Carry',
 'Terrence C. Carson',
 'Jack Carter (actor)',
 'Ralph Carter',
 'T. K. Carter',
 'Terry Carter',
 'Thomas Carter (director)',
 'Dante Carver',
 'Bernie Casey',
 'Reg E. Cathey',
 'Matt Cedeño',
 'Cedric the Entertainer',
 'Al Chalk',
 'Chris Chalk',
 'Rudy Challenger',
 'Michael Chambers',
 'Chamillionaire',
 'Grizz Chapman',
 'Gaius Charles',
 'Vernon Chatman',
 'Don Cheadle',
 'Alan Cherry',
 'Rodney Chester',
 'Morris Chestnut',
 'Chikezie',
 'Alvin Childress',
 'Chingy',
 'Chosen Effect',
 'Rodd Christensen',
 'Julian Christopher',
 'Eugene Clark (actor)',
 'Mystro Clark',
 'Brodus Clay',
 'Rudolph M. Clay',
 'Bill Cobbs',
 'Nat King Cole',
 'Chad Coleman',
 'Gary Coleman',
 'Jim Coleman (actor)',
 'King Coleman',
 'Ryan Coleman',
 'Don Pedro Colley',
 "Rickey D'Shon Collins",
 'Mike Colter',
 'Michael Colyar',
 'Sean Combs',
 'Common (rapper)',
 'Onest Conley',
 'Conphidance',
 'Lawrence Cook (actor)',
 'Coolio',
 'Chuck Cooper (actor)',
 'Jeff Coopwood',
 'Brian Copeland',
 'Bill Cosby',
 'DeForest Covan',
 'Tony Cox (actor)',
 'Adam Jamal Craig',
 'Jermaine Crawford',
 'Lavell Crawford',
 'Terry Crews',
 'Rupert Crosse',
 'Iman Crosson',
 'Scatman Crothers',
 'Ice Cube',
 'Ji-Tu Cumbuka',
 'Rusty Cundieff',
 'Don Curry',
 'Mark Curry (actor)',
 'Vondie Curtis-Hall',
 'RJ Cyler',
 'D.C. Scorpio',
 'Percy Daggs III',
 'Draft:Dalpre Grayer',
 'Billy Daniels',
 'Dee Jay Daniels',
 'Dexter Darden',
 'Keith David',
 'Tommy Davidson',
 'Charles Michael Davis',
 'Clifton Davis',
 "D'Mitch Davis",
 'DeRay Davis',
 'Duane Davis',
 'LaVan Davis',
 'Nore Davis',
 'Ossie Davis',
 'Sammy Davis Jr.',
 'Sammy Davis Sr.',
 'EJay Day',
 'Deezer D',
 'Oscar DeGruy',
 'Bob Delegall',
 'Erik Dellums',
 'Clinton Derricks-Carroll',
 'Cleavant Derricks (actor)',
 'Brandon DeShazer',
 'Hal DeWindt',
 'Keith Diamond (actor)',
 'Dudley Dickerson',
 'Daveed Diggs',
 'Taye Diggs',
 'Thomas Dilward',
 'Brandon J. Dirden',
 'Ivan Dixon',
 'Badja Djola',
 'DMX (rapper)',
 'Colman Domingo',
 'Robert DoQui',
 'Michael Dorn',
 'Doug E. Doug',
 'Gary Dourdan',
 'David Downing (actor)',
 'Doctor Dré',
 'Dr. Dre',
 'Dres (rapper)',
 'Bill Duke',
 'Rockmond Dunbar',
 'Christopher B. Duncan',
 'Michael Clarke Duncan',
 'Charles S. Dutton',
 'Lil Duval',
 'E-40',
 'Greg Eagles',
 'Michael Ealy',
 'David Early',
 'Earthquake (comedian)',
 'Ronnie Eckstine',
 'Dean Edwards',
 'James Edwards (actor)',
 'Lonne Elder III',
 'William Elliott (African American actor)',
 'Nelsan Ellis',
 'Kiko Ellsworth',
 'Mike Epps',
 'Omar Epps',
 'Giancarlo Esposito',
 'Art Evans',
 'Damon Evans (actor)',
 'Mike Evans (actor)',
 'Dwight Ewell',
 'Donald Faison',
 'Frankie Faison',
 'Olamide Faison',
 'Fatso-Fasano',
 'Kent Faulcon',
 'Roshon Fegan',
 'Dave Fennoy',
 'Jessie Lawrence Ferguson',
 'Kareem Ferguson',
 'Jason Finn (actor)',
 'Laurence Fishburne',
 'Ray Fisher (actor)',
 'Brandon Fobbs',
 'Shawn Fonteno',
 'Thomas Mikal Ford',
 'Ken Foree',
 'Stan Foster',
 'Jamie Foxx',
 'Redd Foxx',
 'Carl Franklin',
 'Don Franklin',
 'Marcus Carl Franklin',
 'Al Freeman Jr.',
 'K. Todd Freeman',
 'Morgan Freeman',
 'Arthur French (actor)',
 'Leon Frierson',
 'Donald Fullilove',
 'Ron Funches',
 'James Gaines',
 'Wildman Steve',
 'The Game (rapper)',
 'Richard Gant',
 'George Garner',
 'James Gaylyn',
 'Gee Money',
 'Michael Genet',
 'Jason Winston George',
 'Nathan George',
 'Trestin George',
 'Tyrese Gibson',
 'Lawrence Gilliard Jr.',
 'Charles Sidney Gilpin',
 'Clarence Gilyard',
 'Ron Glass',
 'Roy Glenn',
 'Anwan Glover',
 'Corey Glover',
 'Danny Glover',
 'Donald Glover',
 'Savion Glover',
 'Dedrick D. Gobert',
 'Jesse D. Goins',
 'Cuba Gooding Sr.',
 'Cuba Gooding Jr.',
 'Omar Gooding',
 'Carl Gordon (actor)',
 'King Gordy',
 'George O. Gore II',
 'Louis Gossett Jr.',
 'Noah Gray-Cabey',
 'Bryshere Y. Gray',
 'CeeLo Green',
 'Reuben Greene',
 'Dorian Gregory',
 'Kevin Grevioux',
 'David Alan Grier',
 'Rosey Grier',
 'Eddie Griffin',
 'Khamani Griffin',
 'Lance Gross',
 'Robert Guillaume',
 'Bill Gunn (writer)',
 'Moses Gunn',
 'Richard Parnell Habersham',
 'Jester Hairston',
 'Albert Hall (actor)',
 'Arsenio Hall',
 'J. D. Hall',
 'Kevin Peter Hall',
 'Pooch Hall',
 'Sydney Hall (actor)',
 'Bernie Hamilton',
 'Brandon Hammond',
 'John Hancock (actor)',
 'Kadeem Hardison',
 'Cory Hardrict',
 'Dorian Harewood',
 'Ben Harney (actor)',
 'Hill Harper',
 'Maestro Harrell',
 'David Harris (American actor)',
 'Julius Harris',
 'Ralph Harris (comedian)',
 'Ricky Harris',
 'Robin Harris',
 'Steve Harris (actor)',
 'Wendell B. Harris Jr.',
 'Wood Harris',
 'Kelvin Harrison Jr.',
 'Shawn Harrison (actor)',
 'Kevin Hart',
 'Steve Harvey',
 'Corey Hawkins',
 'Isaac Hayes',
 'Reggie Hayes',
 'Lloyd Haynes',
 'Tiger Haynes',
 'Dennis Haysbert',
 'Bryan Hearne',
 'Gordon Heath',
 'Heavy D',
 'Sherman Hemsley',
 'Stephen Henderson (actor)',
 'Barry Shabaka Henley',
 'Joshua Henry',
 'Darrin Henson',
 'Hilly Hicks',
 'Rodney Hicks',
 'Dulé Hill',
 'Gil Hill',
 'Jon Michael Hill',
 'Roger Hill (actor)',
 'Lawrence Hilton-Jacobs',
 'Gregory Hines',
 'Maurice Hines',
 'Aldis Hodge',
 'Edwin Hodge',
 'Ty Hodges',
 'Dominic Hoffman',
 'Gus Hoffman',
 'Corey Holcomb',
 'André Holland',
 'Christopher Michael Holley',
 'Kene Holliday',
 'Greg Hollimon',
 'Tommy Hollis',
 'Brian Hooks',
 'Kevin Hooks',
 'Robert Hooks',
 'Jermaine Hopkins',
 'Bobby Hosea',
 'Allen Hoskins',
 'Gavin Houston',
 'Sterling Houston',
 'Terrence Howard',
 'Ernie Hudson',
 'Ernie Hudson Jr.',
 'D. L. Hughley',
 'Harold Hunter',
 'Reginald D. Hunter',
 'Philip Hurlic',
 'Rif Hutton',
 'Earle Hyman',
 'Ice-T',
 'James Monroe Iglehart',
 'Eme Ikwuakor',
 'Rex Ingram (actor)',
 'Michael Irvin',
 'Ja Rule',
 'Michael Jace',
 "Alphonso A'Qen-Aten Jackson",
 'Brandon T. Jackson',
 'Leonard Jackson (actor)',
 'Mel Jackson',
 'Merrell Jackson',
 "O'Shea Jackson Jr.",
 'Samuel L. Jackson',
 'Stoney Jackson',
 'Trevor Jackson (performer)',
 'Jeffrey Jacquet',
 'Lord Jamar',
 'Jamel Simmons',
 'James Hewlett',
 'Brad James',
 'Bryton James',
 'Hawthorne James',
 'Paul James (actor)',
 'Peter Francis James',
 'Steve James (actor)',
 'Jay Pharoah',
 'DJ Jazzy Jeff',
 'Kyle Jean-Baptiste',
 'Phillip Jeanmarie',
 'Def Jef',
 'Marc John Jefferies',
 'Brenden Jefferson',
 'Herbert Jefferson Jr.',
 'Adam Jeffries',
 'Herb Jeffries',
 'Larry "Flash" Jenkins',
 'Mykel Shannon Jenkins',
 'Brent Jennings',
 'DeWayne Jessie',
 'Anthony Johnson (actor)',
 'Arnold Johnson (actor)',
 'Clark Johnson',
 'Dots Johnson',
 'Dwayne Johnson',
 'Hassan Johnson',
 'Hisonni Johnson',
 'Keith Johnson (actor)',
 'Kyle Johnson (actor)',
 'Marques Johnson',
 'Noble Johnson',
 'William L. Johnson',
 'Wesley Jonathan',
 'Ian Jones-Quartey',
 'Brian Jones (activist)',
 'Buster Jones',
 'Duane Jones',
 'Ike Jones',
 'James Earl Jones',
 'Jim Jones (rapper)',
 'John Marshall Jones',
 'Mike Jones (rapper)',
 'Orlando Jones',
 'Quincy Jones',
 'Richard T. Jones',
 'Robbie Jones (actor)',
 'Robert Earl Jones',
 'Sam Jones III',
 'Walter Emanuel Jones',
 'Jo D. Jonz',
 'Michael B. Jordan',
 'Amin Joseph',
 'David Joyner (actor)',
 'Christopher Judge',
 'OJ da Juiceman',
 'Max Julien',
 'Justin Hires',
 'Juvenile (rapper)',
 'Khalil Kain',
 'Melvin Ray Kearney II',
 'NaShawn Kearse',
 'Elijah Kelley',
 'Malcolm David Kelley',
 'Jim Kelly (martial artist)',
 'R. Kelly',
 'Keegan-Michael Key',
 'Christian Keyes',
 'Hakeem Khaaliq',
 'Wiz Khalifa',
 'Lincoln Kilpatrick',
 'Arif S. Kinchen',
 'Erik King',
 'Andre Kinney',
 'George Kirby',
 'Craig Kirkwood',
 'Tory Kittles',
 'John Kitzmiller',
 'Chris Knowings',
 'Yaphet Kotto',
 'Lenny Kravitz',
 'KRS-One',
 'Jean-Claude La Marre',
 'Eriq La Salle',
 'Phil LaMarr',
 'Lamon Archey',
 'Charles Lampkin',
 'Charles Lane (filmmaker)',
 'Eric Lane (actor)',
 'James T. Lane',
 'Eric Laneuville',
 'Ted Lange',
 'Bobby Lashley',
 'Jacob Latimore',
 'Nick LaTour',
 'Mark Christopher Lawrence',
 'Martin Lawrence',
 'Scott Lawrence',
 'Richard Lawson (actor)',
 'Clayton LeBouef',
 'Anthony Lee (actor)',
 'Canada Lee',
 'Carl Lee (actor)',
 'Cinqué Lee',
 'Daniel Curtis Lee',
 'Eugene Lee (actor)',
 'Irving Allen Lee',
 'Johnny Lee (actor)',
 'RonReaco Lee',
 'Spike Lee',
 'John Legend',
 'Raw Leiba',
 'Harry Lennix',
 'Leon Isaac Kennedy',
 'Sugar Ray Leonard',
 'Ali LeRoi',
 'James Lesure',
 'Calvin Levels',
 'Emmanuel Lewis',
 'Norm Lewis',
 'Phill Lewis',
 "Lil' JJ",
 'Lil Jon',
 'Lil Wayne',
 'Tom Lister Jr.',
 'Cleavon Little',
 'LL Cool J',
 'Goldie Loc',
 'Spider Loc',
 'Doug Locke',
 'Cirroc Lofton',
 'Avon Long',
 'Jackie Long',
 'Brian Love',
 'Darris Love',
 'Faizon Love',
 'Victor Love',
 'Herbie Lovelle',
 'Ed Lover',
 'Arvie Lowe Jr.',
 'Sam Lucas',
 'Ludacris',
 'Derek Luke (actor)',
 'Bernie Mac',
 'CJ Mac',
 'Mack 10',
 'Anthony Mackie',
 'Noel MacNeal',
 'Peter Macon',
 'Blu Mankuma',
 'Mario (American singer)',
 'Pigmeat Markham',
 'Jaron Marquis',
 'John Marriott (actor)',
 'Don Marshall (actor)',
 'Larry Marshall (actor)',
 'Tim Marshall (radio host)',
 'William Marshall (actor)',
 'Christopher Martin (entertainer)',
 'Dan Martin (actor)',
 'Duane Martin',
 "D'Urville Martin",
 'Jesse L. Martin',
 'Mase',
 'Christopher Massey',
 'Kyle Massey',
 'Master Juba',
 'Master P',
 'James C. Mathis III',
 'Al Matthews',
 'Onzy Matthews',
 'Whitman Mayo',
 'MC Ren',
 'Chi McBride',
 'Billy McClain',
 'Darius McCrary',
 'Suli McCullough',
 'James McDaniel',
 'Jonathan McDaniel',
 'Sam McDaniel',
 'James McEachin',
 'Andrew McFarlane (American actor)',
 'William McGhee',
 'Charles McGregor',
 'Caleb McLaughlin',
 'Frank McRae',
 'Mo McRae',
 'Tim Meadows',
 'Aaron Meeks',
 "E'Shun Melvin",
 'Method Man',
 'Windell Middlebrooks',
 'Qaasim Middleton',
 'Hal Miller (actor)',
 'Omar Benson Miller',
 'Romeo Miller',
 'Derek Minor',
 'Jerry Minor',
 'Robert Lee Minor',
 'Brian Stokes Mitchell',
 'Daryl Mitchell (actor)',
 'Finesse Mitchell',
 'Jason Mitchell (actor)',
 'Kel Mitchell',
 'Scoey Mitchell',
 'Jerod Mixon',
 'Anthony Montgomery',
 'Jim Moody (actor)',
 'Paul Mooney (comedian)',
 'Phil Moore (actor)',
 'Rudy Ray Moore',
 'Shemar Moore',
 'Tim Moore (comedian)',
 'Vic Moore',
 'Mantan Moreland',
 'Cesar A. Moreno',
 'S. Robert Morgan',
 'Tracy Morgan',
 'Garrett Morris',
 'Greg Morris',
 'Lamorne Morris',
 'Phil Morris (actor)',
 'Ernie Morrison',
 "M'fundo Morrison",
 'Joe Morton',
 'Mos Def',
 'Roger E. Mosley',
 'Walter Mosley',
 'Khalid Moultrie',
 'Tahj Mowry',
 'Charlie Murphy',
 'Eddie Murphy',
 'Clarence Muse',
 'Isaiah Mustafa',
 'Ntare Mwine',
 'Lou Myers (actor)',
 'Rashaan Nall',
 'Lane Napper',
 'Nas',
 'Nelly',
 'Nephew Tommy',
 'Alex Newell',
 'Ngo Okafor',
 'Fayard Nicholas',
 'Lance E. Nichols',
 'DeVaughn Nixon',
 'Bill Nunn',
 'Nyambi Nyambi',
 'George T. Odom',
 'Leslie Odom Jr.',
 'Omarion',
 'Afemo Omilami',
 "Patrice O'Neal",
 "Ron O'Neal",
 "Shaquille O'Neal",
 'Roscoe Orman',
 'Adesola Osakalumi',
 'Owen H.M. Smith',
 'Geoffrey Owens',
 'Nathan Owens',
 'Brandon P. Bell',
 'Michael J. Pagan',
 'Harrison Page',
 'Ken Page',
 'Dempsey Pappion',
 'Anthony Ray Parker',
 'Nate Parker',
 'Deantoni Parks',
 'Gordon Parks',
 'Charles Parnell (actor)',
 'Peter Parros',
 'Rahsaan Patterson',
 'Marcus T. Paulk',
 'Allen Payne',
 'Carl Anthony Payne II',
 'Christian Payton',
 'Khary Payton',
 'Jordan Peele',
 'Nathan Pelle',
 'Collins Pennie',
 'Kelly Perine',
 'Harold Perrineau',
 'Felton Perry',
 'Rod Perry (actor)',
 'Rodney Perry',
 'Tyler Perry',
 'Wolfe Perry',
 'Brock Peters',
 'Clarke Peters',
 'Mekhi Phifer',
 'Joseph C. Phillips',
 'James Pickens Jr.',
 'Stack Pierce',
 'Wendell Pierce',
 'Ron Pinkard',
 'Glenn Plummer',
 'Jeris Lee Poindexter',
 'Sidney Poitier',
 'Oscar Polk',
 'Patrik-Ian Polk',
 'DJ Pooh',
 'Albert Popwell',
 'Billy Porter (entertainer)',
 'Clifton Powell',
 'Keith Powell',
 'Will Power (performer)',
 'Ben Powers',
 'Keith T. Powers',
 'Billy Preston',
 'J. A. Preston',
 'Gilbert Price',
 'Prince (musician)',
 'Richard Pryor',
 'Willard E. Pugh',
 'Nathan Purdee',
 'Quincy (actor)',
 'Luther Rackley',
 'Damaine Radcliff',
 'Michael Ralph',
 'Ahmad Rashād',
 'Thalmus Rasulala',
 'Mishon Ratliff',
 'Donnell Rawlings',
 'Lou Rawls',
 'Ray J',
 'Gene Anthony Ray',
 'Raz-B',
 'Chris Redd',
 'Lance Reddick',
 'Redman (rapper)',
 'Markus Redmond',
 'Albert Reed Jr.',
 'George H. Reed',
 'Joshua Elijah Reese',
 'Omar Regan',
 'Tim Reid',
 'Reynaldo Rey',
 'Corey Reynolds',
 'James Reynolds (actor)',
 'Khylin Rhambo',
 'Ving Rhames',
 'Hari Rhodes',
 "Robert Ri'chard",
 'J. August Richards',
 'Kevin Michael Richardson',
 'Marque Richardson',
 'Ron Richardson',
 'Sam Richardson (actor)',
 'Sy Richardson',
 'Deon Richmond',
 'Tequan Richmond',
 'Stephen Rider',
 'Larry Riley (actor)',
 'Robert Christopher Riley',
 'Sean Ringgold',
 'Rodney Allen Rippy',
 'Robert Christian',
 'Davis Roberts',
 'Leonard Roberts',
 'Michael D. Roberts',
 'Tony T. Roberts',
 'Lance Robertson',
 'Paul Robeson',
 'Bill Robinson',
 'Bumper Robinson',
 'Charlie Robinson (actor)',
 'Craig Robinson (actor)',
 'Jimmy Robinson (actor)',
 'Keith Robinson (actor)',
 'Leon Robinson',
 'Matt Robinson (actor)',
 'Roger Robinson (actor)',
 'Jeff Roches',
 'Chris Rock',
 'Tony Rock',
 'Rocko (rapper)',
 'Delvon Roe',
 'Ivan Rogers (actor)',
 'Timmie Rogers',
 'Al Roker',
 'Howard Rollins',
 'Clinton Rosemond',
 'Evan Ross',
 'Ricco Ross',
 'Shavar Ross',
 'Ted Ross',
 'Rotimi (actor)',
 'Richard Roundtree',
 'Andre Royo',
 'Lamman Rucker',
 'RuPaul',
 'Tim Russ',
 'Montae Russell',
 'Nipsey Russell',
 'Ken Sagoes',
 'Raymond St. Jacques',
 'John Salley',
 'Kendrick Sampson',
 'Jeffrey D. Sams',
 'Samwell (entertainer)',
 'Alvin Sanders',
 'Ashton Sanders',
 'C. J. Sanders',
 'Henry G. Sanders',
 'Juelz Santana',
 'Ruben Santiago-Hudson',
 'Bob Sapp',
 'Parker Sawyers',
 'MuMs da Schemer',
 'Harold Scott (director)',
 'Larry B. Scott',
 'Franklyn Seales',
 'Joe Seneca',
 'Sayeed Shahidi',
 'Tupac Shakur',
 'Vicellous Reon Shannon',
 'Monti Sharp',
 'Doc Shaw',
 'Stan Shaw',
 'Al Shearer',
 'Ray Shell',
 'Chaz Lamar Shepherd',
 'Rondell Sheridan',
 'Demetrius Shipp Jr.',
 'Bobby Short',
 'Columbus Short',
 'P. Jay Sidney',
 'Keith Silverstein',
 'Henry Simmons',
 'O. J. Simpson',
 'Sinbad (comedian)',
 'IronE Singleton',
 'Isaac C. Singleton Jr.',
 'Kofi Siriboe',
 'Sisqó',
 'Darryl Sivad',
 'Cle Shaheed Sloan',
 'Tucker Smallwood',
 'Rickey Smiley',
 'Arjay Smith',
 'Brandon Mychal Smith',
 'Bubba Smith',
 'Chris Smith (composer)',
 'Jaden Smith',
 'Jamil Walker Smith',
 'Jason Matthew Smith',
 'Justice Smith',
 'Keith Randolph Smith',
 'Will Smith',
 'Jake Smollett',
 'Jussie Smollett',
 'J. B. Smoove',
 'David Smyrl',
 'Wesley Snipes',
 'Snoop Dogg',
 'Trey Songz',
 'Soulja Boy',
 'Aaron D. Spears',
 'Aries Spears',
 'Chris Spencer (actor)',
 'Christopher St. John',
 'Kristoff St. John',
 'Mathew St. Patrick',
 'Nathaniel Stampley',
 'Keith Stanfield',
 'Fredro Starr',
 'Eddie Steeples',
 'Darryl Stephens',
 'Mel Stewart',
 'Nick Stewart',
 'Timothy Stickney',
 'Sticky Fingaz',
 'Austin Stoker',
 'Michael Strahan',
 'Stretch (rapper)',
 'Astro (rapper)',
 'Woody Strode',
 'Dana Stubblefield',
 'Levi Stubbs',
 'Gary Anthony Sturgis',
 'Jeremy Suarez',
 'Daniel Sunjata',
 'Harold Sylvester',
 'Mr. T',
 'Taimak',
 'Michael Taliferro',
 'Antwon Tanner',
 'Le Tari',
 'Rockne Tarkington',
 'Larenz Tate',
 'Henry E. Taylor III',
 'Lawrence Taylor',
 'Meshach Taylor',
 'Nathaniel Taylor (actor)',
 'Ron Taylor (actor)',
 'Wally Taylor (actor)',
 'Devon Terrell',
 'Marcello Thedford',
 'Billie Thomas',
 'Ernest Lee Thomas',
 'Khleo Thomas',
 'Leon Thomas III',
 'Sean Patrick Thomas',
 'Warren Thomas',
 'William Thomas Jr.',
 "Bobb'e J. Thompson",
 'Kenan Thompson',
 'T.I.',
 'Tony Todd',
 'Tone Lōc',
 'Fred Toones',
 'Guy Torry',
 'Joe Torry',
 'Robert Townsend (actor)',
 'Craig Lamar Traylor',
 'Treach',
 'Trick Daddy',
 'Xavier Lamar Truesdell',
 'Chris Tucker',
 'Lorenzo Tucker',
 'Dan Tullis Jr.',
 'Glynn Turman',
 'Tyrin Turner',
 'Tyler, The Creator',
 'Willie Tyler',
 'Blair Underwood',
 'Usher (singer)',
 'Jessie Usher',
 'Ron van Clief',
 'Mario Van Peebles',
 'Melvin Van Peebles',
 'Courtney B. Vance',
 'Victor Varnado',
 'Reginald VelJohnson',
 'Ben Vereen',
 'Adam Wade (singer)',
 'Shangela Laquifa Wadley',
 'Jimmie Walker',
 'William "Bill" Walker',
 'James Wall (actor)',
 'Basil Wallace',
 'George Wallace (comedian)',
 ...]

In [254]:
# Final Data Set
d= "Male"
M_AA= [d for i in namesAA_male]

# Race
e= "African Descent"
r_AA= [e for i in namesAA_male]

# Combine
AA_males = pd.DataFrame({'names':namesAA_male, 'sex':M_AA, 'race': r_AA })
AA_males


Out[254]:
names race sex
0 50 Cent African Descent Male
1 Quinton Aaron African Descent Male
2 Barkhad Abdi African Descent Male
3 Kareem Abdul-Jabbar African Descent Male
4 Brandon Adams (actor) African Descent Male
5 Granville Adams African Descent Male
6 Tunde Adebimpe African Descent Male
7 Ilunga Adell African Descent Male
8 Jovan Adepo African Descent Male
9 Faysal Ahmed African Descent Male
10 Franklyn Ajaye African Descent Male
11 Gbenga Akinnagbe African Descent Male
12 Ira Aldridge African Descent Male
13 Flex Alexander African Descent Male
14 Terry Alexander (actor) African Descent Male
15 Mahershala Ali African Descent Male
16 Raymond Allen (television actor) African Descent Male
17 John Amos African Descent Male
18 Anthony Anderson African Descent Male
19 Carl Anderson (singer) African Descent Male
20 Eddie "Rochester" Anderson African Descent Male
21 Haskell V. Anderson III African Descent Male
22 Shedrack Anderson III African Descent Male
23 André 3000 African Descent Male
24 Ray Aranha African Descent Male
25 Louis Armstrong African Descent Male
26 John A. Arneaux African Descent Male
27 Jacob Artist African Descent Male
28 Sharif Atkins African Descent Male
29 Jensen Atwood African Descent Male
... ... ... ...
1116 Lee Thompson Young African Descent Male
1117 Marlon Young African Descent Male
1118 Otis Young African Descent Male
1119 William Allen Young African Descent Male
1120 Djimon Hounsou African Descent Male
1121 Chiwetel Ejiofor African Descent Male
1122 David Oyelowo African Descent Male
1123 Adewale Akinnuoye-Agbaje African Descent Male
1124 Idris Elba African Descent Male
1125 Omar Sy, John Boyega African Descent Male
1126 Nonso Anozie African Descent Male
1127 Isaach De Bankole African Descent Male
1128 Gbenga Akinnagbe African Descent Male
1129 Peter Mensah African Descent Male
1130 Richard Ayoade African Descent Male
1131 Tongayi Chirisa African Descent Male
1132 Dayo Okeniyi African Descent Male
1133 Yaphet Kotto African Descent Male
1134 Hugh Quarshie African Descent Male
1135 Ntare Guma Mbaho Mwine African Descent Male
1136 John Kani African Descent Male
1137 Edi Gathegi African Descent Male
1138 Sahr Ngaujah African Descent Male
1139 Presley Chweneyagae African Descent Male
1140 Eriq Ebouaney African Descent Male
1141 Aml Ameen African Descent Male
1142 N!xau, Hakeem Kae-Kazim African Descent Male
1143 Ato Essandoh African Descent Male
1144 Fana Mokoena African Descent Male
1145 Daniel Kaluuya African Descent Male

1146 rows × 3 columns

African American Actresses


In [894]:
# Get html from browser

AA_Female1 = rq.get(url_AA_Female1)
AA_Female2 = rq.get(url_AA_Female2)
AA_Female3 = rq.get(url_AA_Female3)
AA_Female4 = rq.get(url_AA_Female4)
AA_Female5 = rq.get(url_AA_Female5)

In [895]:
# Download successful? - Yes

[AA_Female1.status_code,
 AA_Female2.status_code,
 AA_Female3.status_code,
 AA_Female4.status_code,
 AA_Female5.status_code
]


Out[895]:
[200, 200, 200, 200, 200]

In [257]:
# get data as Soup
AA_Female1_soup= bs(AA_Female1.content, 'html.parser')

# get data as Soup
AA_Female2_soup= bs(AA_Female2.content, 'html.parser')

# get data as Soup
AA_Female3_soup= bs(AA_Female3.content, 'html.parser')

# get data as Soup
AA_Female4_soup= bs(AA_Female4.content, 'html.parser')

# get data as Soup
AA_Female5_soup= bs(AA_Female5.content, 'html.parser')

In [896]:
# Females Group 1

li_all_F1 = AA_Female1_soup.find_all('li')
namesF1 = []
for li in li_all_F1:
    try:
        text = li.find('a').get_text()
        namesF1.append(text)
    except AttributeError:
        pass
    
namesAA_Female1= namesF1[1:201]
namesAA_Female1


Out[896]:
['Mariann Aalda',
 'Aaliyah',
 'Diahnne Abbott',
 'Loretta Abbott',
 'Donzaleigh Abernathy',
 'Abiola Abrams',
 'Acquanetta',
 'Osceola Macarthy Adams',
 'Yolanda Adams',
 'Uzo Aduba',
 'Erika Alexander',
 'Tatyana Ali',
 'Mary Alice',
 'Lexi Allen',
 'Billie Allen',
 'Debbie Allen',
 'Jonelle Allen',
 'Sasha Allen',
 'Amerie',
 'Maya Angelou',
 'Tichina Arnold',
 'Ashanti (singer)',
 'Karan Ashley',
 'Essence Atkins',
 'Beverly Hope Atkinson',
 'Ruth Attaway',
 'K. D. Aubert',
 'Ashley August',
 'Margaret Avery',
 'Shondrella Avery',
 'Ethel Ayler',
 'Rochelle Aytes',
 "De'Adre Aziza",
 'Vanessa Baden',
 'Cynthia Bailey',
 'Pearl Bailey',
 'Tyra Banks',
 'Tanya Barfield',
 'Etta Moten Barnett',
 'Fantasia Barrino',
 'Angela Bassett',
 'Ryan Michelle Bathe',
 'Simone Battle',
 'Ruth Virginia Bayton',
 'Jennifer Beals',
 'Carlena Beard',
 'Louise Beavers',
 'Zazie Beetz',
 'Nicole Beharie',
 'Patricia Belcher',
 'Felecia M. Bell',
 'Jean Bell',
 'Troian Bellisario',
 'Fran Bennett',
 'Rhona Bennett',
 'Denée Benton',
 'Halle Berry',
 'Troy Beyer',
 'Margot Bingham',
 'Traci Bingham',
 'Nzinga Blake',
 'Tempestt Bledsoe',
 'Mary J. Blige',
 'Tanya Blount',
 'Daphne Blunt',
 'Gail Boggs',
 'Lisa Bonet',
 'Vivian Bonnell',
 'Denise Boutte',
 'Laura Bowman',
 'Kathleen Bradley',
 'Sufe Bradshaw',
 'Da Brat',
 'Tamar Braxton',
 'Toni Braxton',
 'Angelle Brooks',
 'Danielle Brooks',
 'Kimberly Brooks',
 'Candy Brown',
 'Chelsea Brown',
 'Cocoa Brown',
 'Krystal Joy Brown',
 'Olivia Brown',
 'Ruth Brown',
 'Yvette Nicole Brown',
 'Logan Browning',
 'Darmirra Brunson',
 'Hazel Joan Bryant',
 'Joy Bryant',
 'Joyce Bryant',
 'Jasmine Burke',
 'Nakia Burrise',
 'Vinie Burrows',
 'Kandi Burruss',
 'Denise Burse',
 'Anita Bush',
 'L. Scott Caldwell',
 'Monica Calhoun',
 'Frances Callier',
 'Vanessa Bell Calloway',
 'Tisha Campbell-Martin',
 'Maia Campbell',
 'Marietta Canty',
 'Virginia Capers',
 'Mariah Carey',
 'Gabrielle Carmouche',
 'Diahann Carroll',
 'Vinnette Justine Carroll',
 'Buffie Carruth',
 'Lisa Nicole Carson',
 'Nell Carter',
 'Rosalind Cash',
 'Yvette Cason',
 'Lee Chamberlin',
 'Faune A. Chambers',
 'Carol Channing',
 'Annazette Chase',
 'Vanessa Lee Chester',
 'Alice Childress',
 'Ciara',
 'Hope Clarke',
 'Ellen Cleghorne',
 'Kiersey Clemons',
 'Inez Clough',
 'Keyshia Cole',
 'Natalie Cole',
 'Olivia Cole',
 'Monique Coleman',
 'Porscha Coleman',
 'Tyler Collins (singer)',
 'Angell Conwell',
 'Emayatzy Corinealdi',
 'Shamika Cotton',
 'Laverne Cox',
 'Rachel Crow',
 'Zara Cully',
 'Ayesha Curry',
 'Tawny Cypress',
 "Pamella D'Pella",
 'Sheila Dabney',
 'Yaya DaCosta',
 'Clamma Dale',
 'Susan Dalian',
 'Dorothy Dandridge',
 'Ruby Dandridge',
 'Leila Danette',
 'Stacey Dash',
 'Kiami Davael',
 'Shanésia Davis-Williams',
 'Altovise Davis',
 'Ariel Alexandria Davis',
 'Cassi Davis',
 'Dana Davis',
 'Dee Dee Davis',
 'Henrietta Vinton Davis',
 'Lynn Davis (singer)',
 'Mia Amber Davis',
 'Viola Davis',
 'Marie-Alise Recasner',
 'Melissa De Sousa',
 'Danielle Deadwyler',
 'Damita Jo DeBlanc',
 'Ruby Dee',
 'Sylvia del Villard',
 'Venus DeMilo',
 'Gabrielle Dennis',
 'Natalie Desselle-Reid',
 'Loretta Devine',
 'Rah Digga',
 'Victoria Dillard',
 'Djakarta (artist)',
 'Tamara Dobson',
 'Cathy Jenéen Doe',
 'Suzzanne Douglas',
 'Helen Dowdy',
 "Ja'net Dubois",
 'Merrin Dungey',
 'Teala Dunn',
 'Cheryl Dunye',
 'Monique Dupree',
 'Karen Dyer',
 'Dina Eastwood',
 'Megalyn Echikunwoke',
 'Sonya Eddy',
 'Judyann Elder',
 'Kimberly Elise',
 'Missy Elliott',
 'Aunjanue Ellis',
 'Evelyn Ellis',
 'Femi Emiola',
 'Ellia English',
 'Shareeka Epps',
 'Faith Evans',
 'Tiffany Evans',
 'Eve (rapper)',
 'Francine Everett',
 'Lola Falana',
 'Fatima Faloye',
 'Porsha Ferguson',
 'Tyra Ferrell']

In [901]:
# Females Group 2

li_all_F2 = AA_Female2_soup.find_all('li')
namesF2 = []
for li in li_all_F2:
    try:
        text = li.find('a').get_text()
        namesF2.append(text)
    except AttributeError:
        pass
    
namesAA_Female2= namesF2[1:201]
namesAA_Female2


Out[901]:
['Alexis Fields',
 'Chip Fields',
 'Kim Fields',
 'Edwina Findley',
 'Gail Fisher',
 'Patrice Fisher',
 'Gloria Foster',
 'Crystal R. Fox',
 'Vivica A. Fox',
 'Jaimee Foxworth',
 'Minnie Foxx',
 'Karen Fraction',
 'Stacy Francis',
 'Farrah Franklin',
 'Marina Franklin',
 'Tomiko Fraser',
 'Sheila Frazier',
 'Jennia Fredrique',
 'Carlotta Freeman',
 'Jennifer Freeman',
 'Yvette Freeman',
 'Daniele Gaither',
 'Lita Gaithers',
 'Maura Gale',
 'Susie Garrett',
 'Nona Gaye',
 'Gloria Gaynor',
 'Minnie Gentry',
 'Geretta Geretta',
 'Marla Gibbs',
 'Mercedes Gilbert',
 'Nancy Giles',
 'Erica Gimpel',
 'Adele Givens',
 'Robin Givens',
 'Candice Glover',
 'Cat Glover',
 'Montego Glover',
 'Whoopi Goldberg',
 'Renée Elise Goldsberry',
 'Reagan Gomez-Preston',
 "La'Myia Good",
 'Meagan Good',
 'Elena Goode',
 'Deidre Goodwin',
 'Kia Goodwin',
 'Raven Goodwin',
 'Denise Gordy',
 'Galyn Görg',
 'Deneen Graham',
 'Kat Graham',
 'Crystal Celeste Grant',
 'Teresa Graves',
 'Macy Gray',
 'Alexandra Grey',
 'Odessa Warren Grey',
 'Pam Grier',
 'Alvaleta Guess',
 'Jasmine Guy',
 'Sheila Guyse',
 'Tiffany Haddish',
 'Montrose Hagins',
 'Alaina Reed Hall',
 'Daheli Hall',
 'Delores Hall',
 'Irma P. Hall',
 'Juanita Hall',
 'Regina Hall',
 'Kim Hamilton',
 'Lisa Gay Hamilton',
 'Lynn Hamilton (actress)',
 'Shanola Hampton',
 'Juliana Harkavy',
 'Ethel Ernestine Harper',
 'Tanisha Harper',
 'Edna Mae Harris',
 'Theresa Harris',
 'Zelda Harris',
 'Jackée Harry',
 'Georgette Harvey',
 'Kali Hawk',
 'Kim Hawthorne',
 'Lillian Hayman',
 'Kiki Haynes',
 'Shari Headley',
 'Heather Hemmens',
 'Myra Hemmings',
 'Shirley Hemphill',
 'Estelle Hemsley',
 'Gloria Hendry',
 'Gloria Lynne Henry',
 'Taraji P. Henson',
 'Cindy Herron',
 'Jillian Hervey',
 'Susan Heyward',
 'Taral Hicks',
 'Grace Hightower',
 'Lauryn Hill',
 'Keri Hilson',
 'Aisha Hinds',
 'Tiffany Hines',
 'Jerrika Hinton',
 'Hannah Hodson',
 'Lovely Hoffman',
 'Marjean Holden',
 'Jennifer Holliday',
 'Patrice Holloway',
 'Ellen Holly',
 'Telma Hopkins',
 'Lena Horne',
 'Thelma Houston',
 'Whitney Houston',
 'Adina Howard',
 'Gertrude Howard',
 'Melissa Howard',
 'Tanedra Howard',
 'Maria Howell',
 'Erica Hubbard',
 'Janet Hubert',
 'Jennifer Hudson',
 'Yvonne Hudson',
 'Yolanda Hughes-Heying',
 'Amy Hunter',
 'Heather Hunter',
 'Michelle Hurd',
 'Paige Hurd',
 'Michelle Hurst',
 'Hyers Sisters',
 'Idil Ibrahim',
 'Patricia Idlette',
 'Osas Ighodaro',
 'Annie Ilonzeh',
 'Indigo (actress)',
 'Desreta Jackson',
 'Ernestine Jackson',
 'Janet Jackson',
 'Leeah D. Jackson',
 'Rose Jackson (actress)',
 'Shar Jackson',
 'Nastashia Fuller',
 'Skai Jackson',
 'Zaidee Jackson',
 'Jadagrace',
 'Jaz Sinclair',
 'Jazzmun',
 'Juliette Jeffers',
 'Capathia Jenkins',
 'Ella Jenkins',
 'Juanita Jennings',
 'Penny Johnson Jerald',
 'Tracy Camilla Johns',
 'Adrienne-Joi Johnson',
 'Anne-Marie Johnson',
 'Ariyan A. Johnson',
 'Beverly Johnson',
 'Cherie Johnson',
 'Johari Johnson',
 'Kimmarie Johnson',
 'Nicole Randall Johnson',
 'Seba Johnson',
 'Stacii Jae Johnson',
 'Syleena Johnson',
 'Marilyn Joi',
 'Coco Jones',
 'Jill Marie Jones',
 'Jowharah Jones',
 'Julia Jones',
 'Kidada Jones',
 'Leslie Jones (comedian)',
 'Rashida Jones',
 'Rebecca Naomi Jones',
 'Renée Jones',
 'Sarah Jones (stage actress)',
 'Tamala Jones',
 "Ta'Rhonda Jones",
 'Toccara Jones',
 'Claudia Jordan',
 'Rhoda Jordan',
 'Ella Joyce',
 'Shannon Kane',
 'Rosanne Katon',
 'Paula Kelly (actress)',
 "Rae'Ven Larrymore Kelly",
 'Jayne Kennedy',
 'JoNell Kennedy',
 'Germany Kent',
 'Brook Kerr',
 "T'Keyah Crystal Keymáh",
 'Alicia Keys',
 'Simbi Khali',
 'Christel Khalil',
 'Aja Naomi King',
 'Cleo King',
 'Kent Masters King',
 'Mabel King',
 'Regina King',
 'Yolanda King',
 'Keshia Knight Pulliam',
 'Gladys Knight',
 'Christy Knowings']

In [902]:
# Females Group 3

li_all_F3 = AA_Female3_soup.find_all('li')
namesF3 = []
for li in li_all_F3:
    try:
        text = li.find('a').get_text()
        namesF3.append(text)
    except AttributeError:
        pass
    
namesAA_Female3= namesF3[1:201]
namesAA_Female3


Out[902]:
['Solange Knowles',
 'Zoë Kravitz',
 'Patti LaBelle',
 'Deborah Lacey',
 'LaChanze',
 'Lady Cam',
 'The Lady of Rage',
 'Sasha Lane',
 'Sanaa Lathan',
 'Syr Law',
 'Bianca Lawson',
 'Chene Lawson',
 'Denyce Lawton',
 'NeNe Leakes',
 'Sharon Leal',
 'Sabrina Le Beauf',
 'Ledisi',
 'Charmin Lee',
 'Joie Lee',
 'Raquel Lee',
 'Jeni Le Gon',
 'Lillian Lehman',
 'Kasi Lemmons',
 'Rosetta LeNoire',
 'Adriane Lenox',
 'Urylee Leonardos',
 'Ketty Lester',
 'Dawnn Lewis',
 'Jazsmin Lewis',
 'Jenifer Lewis',
 'Toni Lewis',
 'Tina Lifford',
 'Destiny Lightsy',
 'AzMarie Livingston',
 'Tembi Locke',
 'Bellina Logan',
 'LaToya London',
 'Lauren London',
 'Loretta Long',
 'Nia Long',
 'Mammy Lou',
 'Loni Love',
 'Mother Love',
 'Marcella Lowery',
 'Shanti Lowry',
 'LeToya Luckett',
 'Luenell',
 'Donyale Luna',
 'Tanisha Lynn',
 'Moms Mabley',
 'Janet MacLachlan',
 'Tamela Mann',
 'Teal Marchande',
 'Eva Marcille',
 'Meghan Markle',
 'Sonequa Martin-Green',
 'Alex Martin',
 'Helen Martin',
 'Tanjareen Martin',
 'Brandi Chavonne Massey',
 'Michelle Matlock',
 'Chenoa Maxwell',
 'Tristin Mays',
 'Durga McBroom',
 'China Anne McClain',
 'Rose McClendon',
 'LisaRaye McCoy',
 'Etta McDaniel',
 'Hattie McDaniel',
 'Audra McDonald',
 'Vonetta McGee',
 'Stephanie McKay',
 'Demetria McKinney',
 'Nina Mae McKinney',
 'Barbara McNair',
 'Claudia McNeil',
 'Serayah (entertainer)',
 'Armelia McQueen',
 'Butterfly McQueen',
 'Carmen McRae',
 'Cynthia McWilliams',
 'Barbara Meek',
 'S. Epatha Merkerson',
 'Theresa Merritt',
 'Nicki Micheaux',
 'Michael Michele',
 'Janee Michelle',
 'Liz Mikel',
 'Rosalind Miles (actress)',
 'The Millen Sisters',
 'Amanda C. Miller',
 'Cymphonique Miller',
 'Patina Miller',
 'Tangi Miller',
 'Valarie Rae Miller',
 'Kelly Jo Minter',
 'Ella Mitchell',
 'Michelle Mitchenor',
 'Mittie Lawrence',
 'Mona Lisa (singer)',
 'Janelle Monáe',
 "Mo'Nique",
 'Barbara Montgomery',
 'Lynne Moody',
 'Adrienne C. Moore',
 'Jacqueline Moore',
 'Juanita Moore',
 'Kenya Moore',
 'Melba Moore',
 'Debbi Morgan',
 'Diana Morgan (actress)',
 'Michele Morgan (American actress)',
 'Zaena Morisho',
 'Zaena Now',
 'Iona Morris',
 'Dorothy Morrison (actress)',
 'Mari Morrow',
 'Karla Cheatham Mosley',
 'Tamera Mowry',
 'Tia Mowry',
 'Malina Moye',
 'Kiara Muhammad',
 'Mýa',
 'Kortney Nash',
 'Niecy Nash',
 'Naturi Naughton',
 'Elise Neal',
 'Novella Nelson',
 'Toy Newkirk',
 'Paula Newsome',
 'Denise Nicholas',
 'Nichelle Nichols',
 'Jasika Nicole',
 'Danielle Nicolet',
 'Hayley Marie Norman',
 'Maidie Norman',
 'Brandy Norwood',
 'Emma Nyra',
 'Adepero Oduye',
 'Olivia (singer)',
 'Elisabeth Omilami',
 'Yvonne Orji',
 'Ion Overman',
 'Judy Pace',
 'LaWanda Page',
 'Taylour Paige',
 'Keke Palmer',
 'Devika Parikh',
 'Sydney Park (actress)',
 'Nicole Ari Parker',
 'Paula Jai Parker',
 'Taylor Parks',
 'Trina Parks',
 'Teyonah Parris',
 'Karyn Parsons',
 'Tonye Patano',
 'Candice Patton',
 'Paula Patton',
 'Freda Payne',
 'Jo Marie Payton',
 'Felicia Pearson',
 'Beverly Peele',
 'Holly Robinson Peete',
 'CeCe Peniston',
 'Peppermint (drag queen)',
 'Brianna Perry',
 'Netfa Perry',
 'Shauneille Perry',
 'Madison Pettis',
 'Peggy Pettitt',
 'Jada Pinkett Smith',
 'Tonya Pinkins',
 'Karen Pittman',
 'Sydney Tamiia Poitier',
 'Tiffany Pollard',
 'Adina Porter',
 'Pam Potillo',
 'Deborah Pratt',
 'Kyla Pratt',
 'Evelyn Preer',
 'Jeryl Prescott',
 'Joan Pringle',
 'Rain Pryor',
 'Saundra Quarterman',
 'Queen Latifah',
 'Issa Rae',
 'Muriel Rahn',
 'Sheryl Lee Ralph',
 'Franchesca Ramsey',
 'Marion Ramsey',
 'Theresa Randle',
 'Amanda Randolph',
 "Da'Vine Joy Randolph",
 'Lillian Randolph',
 'Condola Rashād',
 'Phylicia Rashad',
 'Raven-Symoné',
 'Gina Ravera',
 'Ola Ray',
 'Tanika Ray']

In [903]:
# Females Group 4

li_all_F4 = AA_Female4_soup.find_all('li')
namesF4 = []
for li in li_all_F4:
    try:
        text = li.find('a').get_text()
        namesF4.append(text)
    except AttributeError:
        pass
    
namesAA_Female4= namesF4[1:201]
namesAA_Female4


Out[903]:
['Tanika Ray',
 'Jazz Raycole',
 'Rebecca Rice (actress)',
 'Veronica Redd',
 'Tracy Reed (American actress)',
 'Vivian Reed (musical theatre actress)',
 'Della Reese',
 'Daphne Maxwell Reid',
 'Retta',
 'Anita Thompson Dickinson Reynolds',
 'Vickilyn Reynolds',
 'Beah Richards',
 'LaTanya Richardson',
 'Salli Richardson',
 'Tijuana Ricks',
 'Amber Riley',
 'Naya Rivera',
 'Eslanda Goode Robeson',
 'Angela Robinson (actress)',
 'Shaun Robinson',
 'Wendy Raquel Robinson',
 'Kitana Kiki Rodriguez',
 'Roxie Roker',
 'Esther Rolle',
 'Rose Rollins',
 'Xosha Roquemore',
 'Anika Noni Rose',
 'Rhonda Ross Kendrick',
 'Angelica Ross',
 'Diana Ross',
 'Tracee Ellis Ross',
 'Tracey Ross',
 'Victoria Rowell',
 'Kelly Rowland',
 'Alice B. Russell',
 'Kimberly Russell',
 'Roz Ryan',
 'Rye Rye',
 'Zoe Saldana',
 'Giovonnie Samuels',
 'Ajai Sanders',
 'Diana Sands',
 'Isabel Sanford',
 'Saniyya Sidney',
 'India Scandrick',
 'Mercedes Scelba-Shorte',
 'Esther Scott',
 'Jill Scott',
 'Judith Scott (American actress)',
 'Kimberly Scott',
 'Amanda Seales',
 'Toni Seawright',
 'Saycon Sengbloh',
 'Attallah Shabazz',
 'Yara Shahidi',
 'Shanice',
 'Keesha Sharp',
 'Telisha Shaw',
 'Angela V. Shelton',
 'Kiki Shepard',
 'Gwen Shepherd',
 'Neferteri Shepherd',
 'Sherri Shepherd',
 'Davetta Sherwood',
 'Alexandra Shipp',
 'Priscilla Shirer',
 'Sicily (actress)',
 'Gabourey Sidibe',
 'Drew Sidora',
 'Leslie Silva',
 'Kimora Lee Simmons',
 'Hilda Simms',
 'Ada "Bricktop" Smith',
 'Anna Deavere Smith',
 'Antonique Smith',
 'Candace Smith',
 'Chelsi Smith',
 'Dwan Smith',
 'Ebonie Smith',
 'Tasha Smith',
 'Toukie Smith',
 'Willow Smith',
 'Jurnee Smollett-Bell',
 'Sonja Sohn',
 'Diane Sommerfield',
 'Sommore',
 'Zoë Soul',
 'Jordin Sparks',
 'Carol Speed',
 'Danielle Spencer (American actress)',
 'Octavia Spencer',
 'Bern Nadette Stanis',
 'Karrine Steffans',
 'Amandla Stenberg',
 'Kellee Stewart',
 'Angie Stone',
 'Madame Sul-Te-Wan',
 'Jazmine Sullivan',
 'Donna Summer',
 'Tika Sumpter',
 'Brooklyn Sudano',
 'Brenda Sykes',
 'Wanda Sykes',
 'Jessica Szohr',
 'Meagan Tandy',
 'Ta-Tanisha',
 'Serria Tawan',
 'Clarice Taylor',
 'Eva Taylor',
 'Mya Taylor',
 'Regina Taylor',
 'Rosslynn Taylor',
 'Teyana Taylor',
 'Barbara Ann Teer',
 'Renee Tenison',
 'Rosie Tenison',
 'Robin Thede',
 'Lynne Thigpen',
 'Michelle Thomas',
 'Rozonda Thomas',
 'Tessa Thompson',
 'Tracie Thoms',
 'Tinashe',
 'Beverly Todd',
 'Berlinda Tolbert',
 'LeShay Tomlinson',
 'LaKendra Tookes',
 'Tammy Townsend',
 'Kali Troy',
 'Toni Trucks',
 'Rachel True',
 'Lisa Tucker (singer)',
 'Tamara Tunie',
 'Tina Turner',
 'Aisha Tyler',
 'Cicely Tyson',
 'Leslie Uggams',
 'Stephanie Umoh',
 'Kianna Underwood',
 'Sheryl Underwood',
 'Gabrielle Union',
 'Brooke Valentine',
 'Danitra Vance',
 'Countess Vaughn',
 'Terri J. Vaughn',
 'Danielle Vega',
 'Cassie Ventura',
 'Thea Vidale',
 'Lark Voorhies',
 'Ernestine Wade',
 'Aida Overton Walker',
 'Arnetia Walker',
 'Nayo Wallace',
 'Quvenzhané Wallis',
 'Caryn Ward',
 'Marlene Warfield',
 'Marsha Warfield',
 'Sharon Warren',
 'Dionne Warwick',
 'Fredi Washington',
 'Kerry Washington',
 'Shirley Washington',
 'Ethel Waters',
 'Myrtle Watkins',
 'Vernee Watson-Johnson',
 'Danièle Watts',
 'Rolonda Watts',
 'Chaunté Wayans',
 'Kim Wayans',
 'Bresha Webb',
 'Veronica Webb',
 'Valerie Wellington',
 'Rutina Wesley',
 'Diamond White (singer)',
 'Gillian Iliana Waters',
 'Jane White',
 'Jessica White',
 'Karen Malina White',
 'Lillias White',
 'Terri White',
 'Lynn Whitfield',
 'Kym Whitley',
 'Whitman Sisters',
 'Samira Wiley',
 'Sharon Wilkins',
 'Cynda Williams',
 'Davida Williams',
 'Jessica Williams (actress)',
 'Kellie Shanygne Williams',
 'Kiely Williams',
 'Kimberly Kevon Williams',
 'Michelle Williams (singer)',
 'Natashia Williams',
 'Shanice Williams (actress)',
 'Vanessa Williams',
 'Vanessa A. Williams',
 'Vesta Williams',
 'Wendy Williams',
 'Afton Williamson',
 'Kenya D. Williamson']

In [904]:
# Females Group 5

li_all_F5 = AA_Female5_soup.find_all('li')
namesF5 = []
for li in li_all_F5:
    try:
        text = li.find('a').get_text()
        namesF5.append(text)
    except AttributeError:
        pass
    
namesAA_Female5= namesF5[1:29]
namesAA_Female5


Out[904]:
['Kenya D. Williamson',
 'Sheila Wills',
 'Ajita Wilson',
 'Chandra Wilson',
 'Debra Wilson',
 'Nancy Wilson (jazz singer)',
 'Yvette Wilson',
 'Deborah Joy Winans',
 'Camille Winbush',
 'Saaphyri Windsor',
 'Oprah Winfrey',
 'Hattie Winston',
 'Alfre Woodard',
 'Charlayne Woodard',
 'Anita Woodley',
 'Carol Woods',
 'Renn Woods',
 'Nicole Wray',
 'Aloma Wright',
 'C. Kelly Wright',
 "N'Bushe Wright",
 'Tanya Wright',
 'Lisa Wu',
 'Mia X',
 'Camille Yarbrough',
 'Yo-Yo (rapper)',
 'Sasheer Zamata',
 'Zendaya']

In [905]:
# Noteworthy female actors of African descent, not included in Wikipedia data

noteable2= ['Thandie Newton', 'Sophie Okonedo', 'Liya Kebede', 'Danai Gurira', 'Rachel Mwanza', 'Enuka Okuma', 
            'Yaya DaCosta', 'Saycon Sengbloh', 'Gugu Mbatha-Raw', 'Adina Porter', 'Ashley Madekwe', 'Annie Ilonzeh',
            "Lupita Nyong'o", 'Akosua Busia', 'Genevieve Nnaji', 'Adepero Oduye']

In [906]:
# Add List

namesAA_female = namesAA_Female1 + namesAA_Female2 + namesAA_Female3 + namesAA_Female4 + namesAA_Female5 + noteable2

In [907]:
# Final List of Black Actresses

f= "Female"
F_AA= [f for i in namesAA_female]

# Race
g= "African Descent"
r_fAA= [g for i in namesAA_female]

# Combine
AA_females = pd.DataFrame({'names':namesAA_female, 'sex':F_AA, 'race': r_fAA })
AA_females


Out[907]:
names race sex
0 Mariann Aalda African Descent Female
1 Aaliyah African Descent Female
2 Diahnne Abbott African Descent Female
3 Loretta Abbott African Descent Female
4 Donzaleigh Abernathy African Descent Female
5 Abiola Abrams African Descent Female
6 Acquanetta African Descent Female
7 Osceola Macarthy Adams African Descent Female
8 Yolanda Adams African Descent Female
9 Uzo Aduba African Descent Female
10 Erika Alexander African Descent Female
11 Tatyana Ali African Descent Female
12 Mary Alice African Descent Female
13 Lexi Allen African Descent Female
14 Billie Allen African Descent Female
15 Debbie Allen African Descent Female
16 Jonelle Allen African Descent Female
17 Sasha Allen African Descent Female
18 Amerie African Descent Female
19 Maya Angelou African Descent Female
20 Tichina Arnold African Descent Female
21 Ashanti (singer) African Descent Female
22 Karan Ashley African Descent Female
23 Essence Atkins African Descent Female
24 Beverly Hope Atkinson African Descent Female
25 Ruth Attaway African Descent Female
26 K. D. Aubert African Descent Female
27 Ashley August African Descent Female
28 Margaret Avery African Descent Female
29 Shondrella Avery African Descent Female
... ... ... ...
814 Anita Woodley African Descent Female
815 Carol Woods African Descent Female
816 Renn Woods African Descent Female
817 Nicole Wray African Descent Female
818 Aloma Wright African Descent Female
819 C. Kelly Wright African Descent Female
820 N'Bushe Wright African Descent Female
821 Tanya Wright African Descent Female
822 Lisa Wu African Descent Female
823 Mia X African Descent Female
824 Camille Yarbrough African Descent Female
825 Yo-Yo (rapper) African Descent Female
826 Sasheer Zamata African Descent Female
827 Zendaya African Descent Female
828 Thandie Newton African Descent Female
829 Sophie Okonedo African Descent Female
830 Liya Kebede African Descent Female
831 Danai Gurira African Descent Female
832 Rachel Mwanza African Descent Female
833 Enuka Okuma African Descent Female
834 Yaya DaCosta African Descent Female
835 Saycon Sengbloh African Descent Female
836 Gugu Mbatha-Raw African Descent Female
837 Adina Porter African Descent Female
838 Ashley Madekwe African Descent Female
839 Annie Ilonzeh African Descent Female
840 Lupita Nyong'o African Descent Female
841 Akosua Busia African Descent Female
842 Genevieve Nnaji African Descent Female
843 Adepero Oduye African Descent Female

844 rows × 3 columns

Combine All


In [913]:
# Complete List of Actors (Male & Female) of African Descent

df_rG = AA_males.append(AA_females)
df_rG


Out[913]:
names race sex
0 50 Cent African Descent Male
1 Quinton Aaron African Descent Male
2 Barkhad Abdi African Descent Male
3 Kareem Abdul-Jabbar African Descent Male
4 Brandon Adams (actor) African Descent Male
5 Granville Adams African Descent Male
6 Tunde Adebimpe African Descent Male
7 Ilunga Adell African Descent Male
8 Jovan Adepo African Descent Male
9 Faysal Ahmed African Descent Male
10 Franklyn Ajaye African Descent Male
11 Gbenga Akinnagbe African Descent Male
12 Ira Aldridge African Descent Male
13 Flex Alexander African Descent Male
14 Terry Alexander (actor) African Descent Male
15 Mahershala Ali African Descent Male
16 Raymond Allen (television actor) African Descent Male
17 John Amos African Descent Male
18 Anthony Anderson African Descent Male
19 Carl Anderson (singer) African Descent Male
20 Eddie "Rochester" Anderson African Descent Male
21 Haskell V. Anderson III African Descent Male
22 Shedrack Anderson III African Descent Male
23 André 3000 African Descent Male
24 Ray Aranha African Descent Male
25 Louis Armstrong African Descent Male
26 John A. Arneaux African Descent Male
27 Jacob Artist African Descent Male
28 Sharif Atkins African Descent Male
29 Jensen Atwood African Descent Male
... ... ... ...
814 Anita Woodley African Descent Female
815 Carol Woods African Descent Female
816 Renn Woods African Descent Female
817 Nicole Wray African Descent Female
818 Aloma Wright African Descent Female
819 C. Kelly Wright African Descent Female
820 N'Bushe Wright African Descent Female
821 Tanya Wright African Descent Female
822 Lisa Wu African Descent Female
823 Mia X African Descent Female
824 Camille Yarbrough African Descent Female
825 Yo-Yo (rapper) African Descent Female
826 Sasheer Zamata African Descent Female
827 Zendaya African Descent Female
828 Thandie Newton African Descent Female
829 Sophie Okonedo African Descent Female
830 Liya Kebede African Descent Female
831 Danai Gurira African Descent Female
832 Rachel Mwanza African Descent Female
833 Enuka Okuma African Descent Female
834 Yaya DaCosta African Descent Female
835 Saycon Sengbloh African Descent Female
836 Gugu Mbatha-Raw African Descent Female
837 Adina Porter African Descent Female
838 Ashley Madekwe African Descent Female
839 Annie Ilonzeh African Descent Female
840 Lupita Nyong'o African Descent Female
841 Akosua Busia African Descent Female
842 Genevieve Nnaji African Descent Female
843 Adepero Oduye African Descent Female

1990 rows × 3 columns


In [914]:
# Formatting - remove all () and everything within
df_rG['names']= df_rG['names'].str.replace(r"\(.*\)","")
# Rename column
df_rG= df_rG.rename(columns={'names': 'actor_name'})

df_rG


Out[914]:
actor_name race sex
0 50 Cent African Descent Male
1 Quinton Aaron African Descent Male
2 Barkhad Abdi African Descent Male
3 Kareem Abdul-Jabbar African Descent Male
4 Brandon Adams African Descent Male
5 Granville Adams African Descent Male
6 Tunde Adebimpe African Descent Male
7 Ilunga Adell African Descent Male
8 Jovan Adepo African Descent Male
9 Faysal Ahmed African Descent Male
10 Franklyn Ajaye African Descent Male
11 Gbenga Akinnagbe African Descent Male
12 Ira Aldridge African Descent Male
13 Flex Alexander African Descent Male
14 Terry Alexander African Descent Male
15 Mahershala Ali African Descent Male
16 Raymond Allen African Descent Male
17 John Amos African Descent Male
18 Anthony Anderson African Descent Male
19 Carl Anderson African Descent Male
20 Eddie "Rochester" Anderson African Descent Male
21 Haskell V. Anderson III African Descent Male
22 Shedrack Anderson III African Descent Male
23 André 3000 African Descent Male
24 Ray Aranha African Descent Male
25 Louis Armstrong African Descent Male
26 John A. Arneaux African Descent Male
27 Jacob Artist African Descent Male
28 Sharif Atkins African Descent Male
29 Jensen Atwood African Descent Male
... ... ... ...
814 Anita Woodley African Descent Female
815 Carol Woods African Descent Female
816 Renn Woods African Descent Female
817 Nicole Wray African Descent Female
818 Aloma Wright African Descent Female
819 C. Kelly Wright African Descent Female
820 N'Bushe Wright African Descent Female
821 Tanya Wright African Descent Female
822 Lisa Wu African Descent Female
823 Mia X African Descent Female
824 Camille Yarbrough African Descent Female
825 Yo-Yo African Descent Female
826 Sasheer Zamata African Descent Female
827 Zendaya African Descent Female
828 Thandie Newton African Descent Female
829 Sophie Okonedo African Descent Female
830 Liya Kebede African Descent Female
831 Danai Gurira African Descent Female
832 Rachel Mwanza African Descent Female
833 Enuka Okuma African Descent Female
834 Yaya DaCosta African Descent Female
835 Saycon Sengbloh African Descent Female
836 Gugu Mbatha-Raw African Descent Female
837 Adina Porter African Descent Female
838 Ashley Madekwe African Descent Female
839 Annie Ilonzeh African Descent Female
840 Lupita Nyong'o African Descent Female
841 Akosua Busia African Descent Female
842 Genevieve Nnaji African Descent Female
843 Adepero Oduye African Descent Female

1990 rows × 3 columns


In [915]:
# Remove Duplicates
df_rG= df_rG.drop_duplicates()

II. Imdb Data

Imdb Data from excel spreadsheet


In [943]:
# Read excel sheet
df= pd.read_excel("movie_metadata.xlsx")
df


Out[943]:
color director_name num_critic_for_reviews duration director_facebook_likes actor_3_facebook_likes actor_2_name actor_1_facebook_likes gross genres ... budget title_year actor_2_facebook_likes imdb_score aspect_ratio movie_facebook_likes Unnamed: 28 Unnamed: 29 Unnamed: 30 28
0 Color James Cameron 723.0 178.0 0.0 855.0 Joel David Moore 1000.0 760505847.0 Action|Adventure|Fantasy|Sci-Fi ... 237000000.0 2009.0 936.0 7.9 1.78 33000 NaN NaN NaN NaN
1 Color Gore Verbinski 302.0 169.0 563.0 1000.0 Orlando Bloom 40000.0 309404152.0 Action|Adventure|Fantasy ... 300000000.0 2007.0 5000.0 7.1 2.35 0 NaN NaN NaN NaN
2 Color Sam Mendes 602.0 148.0 0.0 161.0 Rory Kinnear 11000.0 200074175.0 Action|Adventure|Thriller ... 245000000.0 2015.0 393.0 6.8 2.35 85000 NaN NaN NaN NaN
3 Color Christopher Nolan 813.0 164.0 22000.0 23000.0 Christian Bale 27000.0 448130642.0 Action|Thriller ... 250000000.0 2012.0 23000.0 8.5 2.35 164000 NaN NaN NaN NaN
4 NaN Doug Walker NaN NaN 131.0 NaN Rob Walker 131.0 NaN Documentary ... NaN NaN 12.0 7.1 NaN 0 NaN NaN NaN NaN
5 Color Andrew Stanton 462.0 132.0 475.0 530.0 Samantha Morton 640.0 73058679.0 Action|Adventure|Sci-Fi ... 263700000.0 2012.0 632.0 6.6 2.35 24000 NaN NaN NaN NaN
6 Color Sam Raimi 392.0 156.0 0.0 4000.0 James Franco 24000.0 336530303.0 Action|Adventure|Romance ... 258000000.0 2007.0 11000.0 6.2 2.35 0 NaN NaN NaN NaN
7 Color Nathan Greno 324.0 100.0 15.0 284.0 Donna Murphy 799.0 200807262.0 Adventure|Animation|Comedy|Family|Fantasy|Musi... ... 260000000.0 2010.0 553.0 7.8 1.85 29000 NaN NaN NaN NaN
8 Color Joss Whedon 635.0 141.0 0.0 19000.0 Robert Downey Jr. 26000.0 458991599.0 Action|Adventure|Sci-Fi ... 250000000.0 2015.0 21000.0 7.5 2.35 118000 NaN NaN NaN NaN
9 Color David Yates 375.0 153.0 282.0 10000.0 Daniel Radcliffe 25000.0 301956980.0 Adventure|Family|Fantasy|Mystery ... 250000000.0 2009.0 11000.0 7.5 2.35 10000 NaN NaN NaN NaN
10 Color Zack Snyder 673.0 183.0 0.0 2000.0 Lauren Cohan 15000.0 330249062.0 Action|Adventure|Sci-Fi ... 250000000.0 2016.0 4000.0 6.9 2.35 197000 NaN NaN NaN NaN
11 Color Bryan Singer 434.0 169.0 0.0 903.0 Marlon Brando 18000.0 200069408.0 Action|Adventure|Sci-Fi ... 209000000.0 2006.0 10000.0 6.1 2.35 0 NaN NaN NaN NaN
12 Color Marc Forster 403.0 106.0 395.0 393.0 Mathieu Amalric 451.0 168368427.0 Action|Adventure ... 200000000.0 2008.0 412.0 6.7 2.35 0 NaN NaN NaN NaN
13 Color Gore Verbinski 313.0 151.0 563.0 1000.0 Orlando Bloom 40000.0 423032628.0 Action|Adventure|Fantasy ... 225000000.0 2006.0 5000.0 7.3 2.35 5000 NaN NaN NaN NaN
14 Color Gore Verbinski 450.0 150.0 563.0 1000.0 Ruth Wilson 40000.0 89289910.0 Action|Adventure|Western ... 215000000.0 2013.0 2000.0 6.5 2.35 48000 NaN NaN NaN NaN
15 Color Zack Snyder 733.0 143.0 0.0 748.0 Christopher Meloni 15000.0 291021565.0 Action|Adventure|Fantasy|Sci-Fi ... 225000000.0 2013.0 3000.0 7.2 2.35 118000 NaN NaN NaN NaN
16 Color Andrew Adamson 258.0 150.0 80.0 201.0 Pierfrancesco Favino 22000.0 141614023.0 Action|Adventure|Family|Fantasy ... 225000000.0 2008.0 216.0 6.6 2.35 0 NaN NaN NaN NaN
17 Color Joss Whedon 703.0 173.0 0.0 19000.0 Robert Downey Jr. 26000.0 623279547.0 Action|Adventure|Sci-Fi ... 220000000.0 2012.0 21000.0 8.1 1.85 123000 NaN NaN NaN NaN
18 Color Rob Marshall 448.0 136.0 252.0 1000.0 Sam Claflin 40000.0 241063875.0 Action|Adventure|Fantasy ... 250000000.0 2011.0 11000.0 6.7 2.35 58000 NaN NaN NaN NaN
19 Color Barry Sonnenfeld 451.0 106.0 188.0 718.0 Michael Stuhlbarg 10000.0 179020854.0 Action|Adventure|Comedy|Family|Fantasy|Sci-Fi ... 225000000.0 2012.0 816.0 6.8 1.85 40000 NaN NaN NaN NaN
20 Color Peter Jackson 422.0 164.0 0.0 773.0 Adam Brown 5000.0 255108370.0 Adventure|Fantasy ... 250000000.0 2014.0 972.0 7.5 2.35 65000 NaN NaN NaN NaN
21 Color Marc Webb 599.0 153.0 464.0 963.0 Andrew Garfield 15000.0 262030663.0 Action|Adventure|Fantasy ... 230000000.0 2012.0 10000.0 7.0 2.35 56000 NaN NaN NaN NaN
22 Color Ridley Scott 343.0 156.0 0.0 738.0 William Hurt 891.0 105219735.0 Action|Adventure|Drama|History ... 200000000.0 2010.0 882.0 6.7 2.35 17000 NaN NaN NaN NaN
23 Color Peter Jackson 509.0 186.0 0.0 773.0 Adam Brown 5000.0 258355354.0 Adventure|Fantasy ... 225000000.0 2013.0 972.0 7.9 2.35 83000 NaN NaN NaN NaN
24 Color Chris Weitz 251.0 113.0 129.0 1000.0 Eva Green 16000.0 70083519.0 Adventure|Family|Fantasy ... 180000000.0 2007.0 6000.0 6.1 2.35 0 NaN NaN NaN NaN
25 Color Peter Jackson 446.0 201.0 0.0 84.0 Thomas Kretschmann 6000.0 218051260.0 Action|Adventure|Drama|Romance ... 207000000.0 2005.0 919.0 7.2 2.35 0 NaN NaN NaN NaN
26 Color James Cameron 315.0 194.0 0.0 794.0 Kate Winslet 29000.0 658672302.0 Drama|Romance ... 200000000.0 1997.0 14000.0 7.7 2.35 26000 NaN NaN NaN NaN
27 Color Anthony Russo 516.0 147.0 94.0 11000.0 Scarlett Johansson 21000.0 407197282.0 Action|Adventure|Sci-Fi ... 250000000.0 2016.0 19000.0 8.2 2.35 72000 NaN NaN NaN NaN
28 Color Peter Berg 377.0 131.0 532.0 627.0 Alexander Skarsg’Çrd 14000.0 65173160.0 Action|Adventure|Sci-Fi|Thriller ... 209000000.0 2012.0 10000.0 5.9 2.35 44000 NaN NaN NaN NaN
29 Color Colin Trevorrow 644.0 124.0 365.0 1000.0 Judy Greer 3000.0 652177271.0 Action|Adventure|Sci-Fi|Thriller ... 150000000.0 2015.0 2000.0 7.0 2.00 150000 NaN NaN NaN NaN
... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ...
5013 Color Eric Eason 28.0 79.0 3.0 42.0 Panchito G’_mez 93.0 NaN Drama|Family ... 24000.0 2002.0 46.0 7.0 1.78 61 NaN NaN NaN NaN
5014 Color Uwe Boll 58.0 80.0 892.0 492.0 Katharine Isabelle 986.0 NaN Action|Crime|Thriller ... NaN 2009.0 918.0 6.3 2.35 0 NaN NaN NaN NaN
5015 Black and White Richard Linklater 61.0 100.0 0.0 0.0 Richard Linklater 5.0 1227508.0 Comedy|Drama ... 23000.0 1991.0 0.0 7.1 1.37 2000 NaN NaN NaN NaN
5016 Color Joseph Mazzella NaN 90.0 0.0 9.0 Mikaal Bates 313.0 NaN Crime|Drama|Thriller ... 25000.0 2015.0 25.0 4.8 NaN 33 NaN NaN NaN NaN
5017 Color Travis Legge 1.0 90.0 138.0 138.0 Suzi Lorraine 370.0 NaN Comedy|Romance ... 22000.0 2013.0 184.0 3.3 1.78 200 NaN NaN NaN NaN
5018 Color Alex Kendrick 5.0 120.0 589.0 4.0 Lisa Arnold 51.0 NaN Drama ... 20000.0 2003.0 49.0 6.9 1.85 725 NaN NaN NaN NaN
5019 Color Marcus Nispel 43.0 91.0 158.0 265.0 Brittany Curran 630.0 NaN Horror|Mystery|Thriller ... NaN 2015.0 512.0 4.6 1.85 0 NaN NaN NaN NaN
5020 NaN Brandon Landers NaN 143.0 8.0 8.0 Alana Kaniewski 720.0 NaN Drama|Horror|Thriller ... 17350.0 2011.0 19.0 3.0 NaN 33 NaN NaN NaN NaN
5021 Color Jay Duplass 51.0 85.0 157.0 10.0 Katie Aselton 830.0 192467.0 Comedy|Drama|Romance ... 15000.0 2005.0 224.0 6.6 NaN 297 NaN NaN NaN NaN
5022 Black and White Jim Chuchu 6.0 60.0 0.0 4.0 Olwenya Maina 147.0 NaN Drama ... 15000.0 2014.0 19.0 7.4 NaN 45 NaN NaN NaN NaN
5023 Color Daryl Wein 22.0 88.0 38.0 211.0 Heather Burns 331.0 76382.0 Romance ... 15000.0 2009.0 212.0 6.2 2.35 324 NaN NaN NaN NaN
5024 Color Jason Trost 42.0 78.0 91.0 86.0 Jason Trost 407.0 NaN Sci-Fi|Thriller ... 20000.0 2011.0 91.0 4.0 2.35 835 NaN NaN NaN NaN
5025 Color John Waters 73.0 108.0 0.0 105.0 Mink Stole 462.0 180483.0 Comedy|Crime|Horror ... 10000.0 1972.0 143.0 6.1 1.37 0 NaN NaN NaN NaN
5026 Color Olivier Assayas 81.0 110.0 107.0 45.0 B’atrice Dalle 576.0 136007.0 Drama|Music|Romance ... 4500.0 2004.0 133.0 6.9 2.35 171 NaN NaN NaN NaN
5027 Color Jafar Panahi 64.0 90.0 397.0 0.0 Nargess Mamizadeh 5.0 673780.0 Drama ... 10000.0 2000.0 0.0 7.5 1.85 697 NaN NaN NaN NaN
5028 Black and White Ivan Kavanagh 12.0 83.0 18.0 0.0 Michael Parle 10.0 NaN Horror ... 10000.0 2007.0 5.0 6.7 1.33 105 NaN NaN NaN NaN
5029 Color Kiyoshi Kurosawa 78.0 111.0 62.0 6.0 Anna Nakagawa 89.0 94596.0 Crime|Horror|Mystery|Thriller ... 1000000.0 1997.0 13.0 7.4 1.85 817 NaN NaN NaN NaN
5030 Color Tadeo Garcia NaN 84.0 5.0 12.0 Michael Cortez 21.0 NaN Drama ... NaN 2004.0 20.0 6.1 NaN 22 NaN NaN NaN NaN
5031 Color Thomas L. Phillips 13.0 82.0 120.0 84.0 Joe Coffey 785.0 NaN Comedy|Horror|Thriller ... 200000.0 2012.0 98.0 5.4 16.00 424 NaN NaN NaN NaN
5032 Color Ash Baron-Cohen 10.0 98.0 3.0 152.0 Stanley B. Herman 789.0 NaN Crime|Drama ... NaN 1995.0 194.0 6.4 NaN 20 NaN NaN NaN NaN
5033 Color Shane Carruth 143.0 77.0 291.0 8.0 David Sullivan 291.0 424760.0 Drama|Sci-Fi|Thriller ... 7000.0 2004.0 45.0 7.0 1.85 19000 NaN NaN NaN NaN
5034 Color Neill Dela Llana 35.0 80.0 0.0 0.0 Edgar Tancangco 0.0 70071.0 Thriller ... 7000.0 2005.0 0.0 6.3 NaN 74 NaN NaN NaN NaN
5035 Color Robert Rodriguez 56.0 81.0 0.0 6.0 Peter Marquardt 121.0 2040920.0 Action|Crime|Drama|Romance|Thriller ... 7000.0 1992.0 20.0 6.9 1.37 0 NaN NaN NaN NaN
5036 Color Anthony Vallone NaN 84.0 2.0 2.0 John Considine 45.0 NaN Crime|Drama ... 3250.0 2005.0 44.0 7.8 NaN 4 NaN NaN NaN NaN
5037 Color Edward Burns 14.0 95.0 0.0 133.0 Caitlin FitzGerald 296.0 4584.0 Comedy|Drama ... 9000.0 2011.0 205.0 6.4 NaN 413 NaN NaN NaN NaN
5038 Color Scott Smith 1.0 87.0 2.0 318.0 Daphne Zuniga 637.0 NaN Comedy|Drama ... NaN 2013.0 470.0 7.7 NaN 84 NaN NaN NaN NaN
5039 Color NaN 43.0 43.0 NaN 319.0 Valorie Curry 841.0 NaN Crime|Drama|Mystery|Thriller ... NaN NaN 593.0 7.5 16.00 32000 NaN NaN NaN NaN
5040 Color Benjamin Roberds 13.0 76.0 0.0 0.0 Maxwell Moody 0.0 NaN Drama|Horror|Thriller ... 1400.0 2013.0 0.0 6.3 NaN 16 NaN NaN NaN NaN
5041 Color Daniel Hsia 14.0 100.0 0.0 489.0 Daniel Henney 946.0 10443.0 Comedy|Drama|Romance ... NaN 2012.0 719.0 6.3 2.35 660 NaN NaN NaN NaN
5042 Color Jon Gunn 43.0 90.0 16.0 16.0 Brian Herzlinger 86.0 85222.0 Documentary ... 1100.0 2004.0 23.0 6.6 1.85 456 NaN NaN NaN NaN

5043 rows × 32 columns

Explore Data


In [944]:
# Data Type per field
print('Variable dtypes:\n', df.dtypes, sep='')

# Dimensions
print('Dimensions:', df.shape)

# First few rows
df.head()


Variable dtypes:
color                         object
director_name                 object
num_critic_for_reviews       float64
duration                     float64
director_facebook_likes      float64
actor_3_facebook_likes       float64
actor_2_name                  object
actor_1_facebook_likes       float64
gross                        float64
genres                        object
actor_1_name                  object
movie_title                   object
num_voted_users                int64
cast_total_facebook_likes      int64
actor_3_name                  object
facenumber_in_poster         float64
plot_keywords                 object
movie_imdb_link               object
num_user_for_reviews         float64
language                      object
country                       object
content_rating                object
budget                       float64
title_year                   float64
actor_2_facebook_likes       float64
imdb_score                   float64
aspect_ratio                 float64
movie_facebook_likes           int64
Unnamed: 28                  float64
Unnamed: 29                  float64
Unnamed: 30                  float64
28                           float64
dtype: object
Dimensions: (5043, 32)
Out[944]:
color director_name num_critic_for_reviews duration director_facebook_likes actor_3_facebook_likes actor_2_name actor_1_facebook_likes gross genres ... budget title_year actor_2_facebook_likes imdb_score aspect_ratio movie_facebook_likes Unnamed: 28 Unnamed: 29 Unnamed: 30 28
0 Color James Cameron 723.0 178.0 0.0 855.0 Joel David Moore 1000.0 760505847.0 Action|Adventure|Fantasy|Sci-Fi ... 237000000.0 2009.0 936.0 7.9 1.78 33000 NaN NaN NaN NaN
1 Color Gore Verbinski 302.0 169.0 563.0 1000.0 Orlando Bloom 40000.0 309404152.0 Action|Adventure|Fantasy ... 300000000.0 2007.0 5000.0 7.1 2.35 0 NaN NaN NaN NaN
2 Color Sam Mendes 602.0 148.0 0.0 161.0 Rory Kinnear 11000.0 200074175.0 Action|Adventure|Thriller ... 245000000.0 2015.0 393.0 6.8 2.35 85000 NaN NaN NaN NaN
3 Color Christopher Nolan 813.0 164.0 22000.0 23000.0 Christian Bale 27000.0 448130642.0 Action|Thriller ... 250000000.0 2012.0 23000.0 8.5 2.35 164000 NaN NaN NaN NaN
4 NaN Doug Walker NaN NaN 131.0 NaN Rob Walker 131.0 NaN Documentary ... NaN NaN 12.0 7.1 NaN 0 NaN NaN NaN NaN

5 rows × 32 columns

Clean Data


In [945]:
# Marking missing values

df= df.replace(to_replace=[' '], value=[None])

In [946]:
# Remove all movies that do not have release date years
df1= df[df['title_year'].notnull()]

In [947]:
df1


Out[947]:
color director_name num_critic_for_reviews duration director_facebook_likes actor_3_facebook_likes actor_2_name actor_1_facebook_likes gross genres ... budget title_year actor_2_facebook_likes imdb_score aspect_ratio movie_facebook_likes Unnamed: 28 Unnamed: 29 Unnamed: 30 28
0 Color James Cameron 723.0 178.0 0.0 855.0 Joel David Moore 1000.0 760505847.0 Action|Adventure|Fantasy|Sci-Fi ... 237000000.0 2009.0 936.0 7.9 1.78 33000 NaN NaN NaN NaN
1 Color Gore Verbinski 302.0 169.0 563.0 1000.0 Orlando Bloom 40000.0 309404152.0 Action|Adventure|Fantasy ... 300000000.0 2007.0 5000.0 7.1 2.35 0 NaN NaN NaN NaN
2 Color Sam Mendes 602.0 148.0 0.0 161.0 Rory Kinnear 11000.0 200074175.0 Action|Adventure|Thriller ... 245000000.0 2015.0 393.0 6.8 2.35 85000 NaN NaN NaN NaN
3 Color Christopher Nolan 813.0 164.0 22000.0 23000.0 Christian Bale 27000.0 448130642.0 Action|Thriller ... 250000000.0 2012.0 23000.0 8.5 2.35 164000 NaN NaN NaN NaN
5 Color Andrew Stanton 462.0 132.0 475.0 530.0 Samantha Morton 640.0 73058679.0 Action|Adventure|Sci-Fi ... 263700000.0 2012.0 632.0 6.6 2.35 24000 NaN NaN NaN NaN
6 Color Sam Raimi 392.0 156.0 0.0 4000.0 James Franco 24000.0 336530303.0 Action|Adventure|Romance ... 258000000.0 2007.0 11000.0 6.2 2.35 0 NaN NaN NaN NaN
7 Color Nathan Greno 324.0 100.0 15.0 284.0 Donna Murphy 799.0 200807262.0 Adventure|Animation|Comedy|Family|Fantasy|Musi... ... 260000000.0 2010.0 553.0 7.8 1.85 29000 NaN NaN NaN NaN
8 Color Joss Whedon 635.0 141.0 0.0 19000.0 Robert Downey Jr. 26000.0 458991599.0 Action|Adventure|Sci-Fi ... 250000000.0 2015.0 21000.0 7.5 2.35 118000 NaN NaN NaN NaN
9 Color David Yates 375.0 153.0 282.0 10000.0 Daniel Radcliffe 25000.0 301956980.0 Adventure|Family|Fantasy|Mystery ... 250000000.0 2009.0 11000.0 7.5 2.35 10000 NaN NaN NaN NaN
10 Color Zack Snyder 673.0 183.0 0.0 2000.0 Lauren Cohan 15000.0 330249062.0 Action|Adventure|Sci-Fi ... 250000000.0 2016.0 4000.0 6.9 2.35 197000 NaN NaN NaN NaN
11 Color Bryan Singer 434.0 169.0 0.0 903.0 Marlon Brando 18000.0 200069408.0 Action|Adventure|Sci-Fi ... 209000000.0 2006.0 10000.0 6.1 2.35 0 NaN NaN NaN NaN
12 Color Marc Forster 403.0 106.0 395.0 393.0 Mathieu Amalric 451.0 168368427.0 Action|Adventure ... 200000000.0 2008.0 412.0 6.7 2.35 0 NaN NaN NaN NaN
13 Color Gore Verbinski 313.0 151.0 563.0 1000.0 Orlando Bloom 40000.0 423032628.0 Action|Adventure|Fantasy ... 225000000.0 2006.0 5000.0 7.3 2.35 5000 NaN NaN NaN NaN
14 Color Gore Verbinski 450.0 150.0 563.0 1000.0 Ruth Wilson 40000.0 89289910.0 Action|Adventure|Western ... 215000000.0 2013.0 2000.0 6.5 2.35 48000 NaN NaN NaN NaN
15 Color Zack Snyder 733.0 143.0 0.0 748.0 Christopher Meloni 15000.0 291021565.0 Action|Adventure|Fantasy|Sci-Fi ... 225000000.0 2013.0 3000.0 7.2 2.35 118000 NaN NaN NaN NaN
16 Color Andrew Adamson 258.0 150.0 80.0 201.0 Pierfrancesco Favino 22000.0 141614023.0 Action|Adventure|Family|Fantasy ... 225000000.0 2008.0 216.0 6.6 2.35 0 NaN NaN NaN NaN
17 Color Joss Whedon 703.0 173.0 0.0 19000.0 Robert Downey Jr. 26000.0 623279547.0 Action|Adventure|Sci-Fi ... 220000000.0 2012.0 21000.0 8.1 1.85 123000 NaN NaN NaN NaN
18 Color Rob Marshall 448.0 136.0 252.0 1000.0 Sam Claflin 40000.0 241063875.0 Action|Adventure|Fantasy ... 250000000.0 2011.0 11000.0 6.7 2.35 58000 NaN NaN NaN NaN
19 Color Barry Sonnenfeld 451.0 106.0 188.0 718.0 Michael Stuhlbarg 10000.0 179020854.0 Action|Adventure|Comedy|Family|Fantasy|Sci-Fi ... 225000000.0 2012.0 816.0 6.8 1.85 40000 NaN NaN NaN NaN
20 Color Peter Jackson 422.0 164.0 0.0 773.0 Adam Brown 5000.0 255108370.0 Adventure|Fantasy ... 250000000.0 2014.0 972.0 7.5 2.35 65000 NaN NaN NaN NaN
21 Color Marc Webb 599.0 153.0 464.0 963.0 Andrew Garfield 15000.0 262030663.0 Action|Adventure|Fantasy ... 230000000.0 2012.0 10000.0 7.0 2.35 56000 NaN NaN NaN NaN
22 Color Ridley Scott 343.0 156.0 0.0 738.0 William Hurt 891.0 105219735.0 Action|Adventure|Drama|History ... 200000000.0 2010.0 882.0 6.7 2.35 17000 NaN NaN NaN NaN
23 Color Peter Jackson 509.0 186.0 0.0 773.0 Adam Brown 5000.0 258355354.0 Adventure|Fantasy ... 225000000.0 2013.0 972.0 7.9 2.35 83000 NaN NaN NaN NaN
24 Color Chris Weitz 251.0 113.0 129.0 1000.0 Eva Green 16000.0 70083519.0 Adventure|Family|Fantasy ... 180000000.0 2007.0 6000.0 6.1 2.35 0 NaN NaN NaN NaN
25 Color Peter Jackson 446.0 201.0 0.0 84.0 Thomas Kretschmann 6000.0 218051260.0 Action|Adventure|Drama|Romance ... 207000000.0 2005.0 919.0 7.2 2.35 0 NaN NaN NaN NaN
26 Color James Cameron 315.0 194.0 0.0 794.0 Kate Winslet 29000.0 658672302.0 Drama|Romance ... 200000000.0 1997.0 14000.0 7.7 2.35 26000 NaN NaN NaN NaN
27 Color Anthony Russo 516.0 147.0 94.0 11000.0 Scarlett Johansson 21000.0 407197282.0 Action|Adventure|Sci-Fi ... 250000000.0 2016.0 19000.0 8.2 2.35 72000 NaN NaN NaN NaN
28 Color Peter Berg 377.0 131.0 532.0 627.0 Alexander Skarsg’Çrd 14000.0 65173160.0 Action|Adventure|Sci-Fi|Thriller ... 209000000.0 2012.0 10000.0 5.9 2.35 44000 NaN NaN NaN NaN
29 Color Colin Trevorrow 644.0 124.0 365.0 1000.0 Judy Greer 3000.0 652177271.0 Action|Adventure|Sci-Fi|Thriller ... 150000000.0 2015.0 2000.0 7.0 2.00 150000 NaN NaN NaN NaN
30 Color Sam Mendes 750.0 143.0 0.0 393.0 Helen McCrory 883.0 304360277.0 Action|Adventure|Thriller ... 200000000.0 2012.0 563.0 7.8 2.35 80000 NaN NaN NaN NaN
... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ...
5012 Color David Ayer 233.0 109.0 453.0 120.0 Martin Donovan 1000.0 10499968.0 Action|Crime|Drama|Thriller ... 35000000.0 2014.0 206.0 5.7 1.85 10000 NaN NaN NaN NaN
5013 Color Eric Eason 28.0 79.0 3.0 42.0 Panchito G’_mez 93.0 NaN Drama|Family ... 24000.0 2002.0 46.0 7.0 1.78 61 NaN NaN NaN NaN
5014 Color Uwe Boll 58.0 80.0 892.0 492.0 Katharine Isabelle 986.0 NaN Action|Crime|Thriller ... NaN 2009.0 918.0 6.3 2.35 0 NaN NaN NaN NaN
5015 Black and White Richard Linklater 61.0 100.0 0.0 0.0 Richard Linklater 5.0 1227508.0 Comedy|Drama ... 23000.0 1991.0 0.0 7.1 1.37 2000 NaN NaN NaN NaN
5016 Color Joseph Mazzella NaN 90.0 0.0 9.0 Mikaal Bates 313.0 NaN Crime|Drama|Thriller ... 25000.0 2015.0 25.0 4.8 NaN 33 NaN NaN NaN NaN
5017 Color Travis Legge 1.0 90.0 138.0 138.0 Suzi Lorraine 370.0 NaN Comedy|Romance ... 22000.0 2013.0 184.0 3.3 1.78 200 NaN NaN NaN NaN
5018 Color Alex Kendrick 5.0 120.0 589.0 4.0 Lisa Arnold 51.0 NaN Drama ... 20000.0 2003.0 49.0 6.9 1.85 725 NaN NaN NaN NaN
5019 Color Marcus Nispel 43.0 91.0 158.0 265.0 Brittany Curran 630.0 NaN Horror|Mystery|Thriller ... NaN 2015.0 512.0 4.6 1.85 0 NaN NaN NaN NaN
5020 NaN Brandon Landers NaN 143.0 8.0 8.0 Alana Kaniewski 720.0 NaN Drama|Horror|Thriller ... 17350.0 2011.0 19.0 3.0 NaN 33 NaN NaN NaN NaN
5021 Color Jay Duplass 51.0 85.0 157.0 10.0 Katie Aselton 830.0 192467.0 Comedy|Drama|Romance ... 15000.0 2005.0 224.0 6.6 NaN 297 NaN NaN NaN NaN
5022 Black and White Jim Chuchu 6.0 60.0 0.0 4.0 Olwenya Maina 147.0 NaN Drama ... 15000.0 2014.0 19.0 7.4 NaN 45 NaN NaN NaN NaN
5023 Color Daryl Wein 22.0 88.0 38.0 211.0 Heather Burns 331.0 76382.0 Romance ... 15000.0 2009.0 212.0 6.2 2.35 324 NaN NaN NaN NaN
5024 Color Jason Trost 42.0 78.0 91.0 86.0 Jason Trost 407.0 NaN Sci-Fi|Thriller ... 20000.0 2011.0 91.0 4.0 2.35 835 NaN NaN NaN NaN
5025 Color John Waters 73.0 108.0 0.0 105.0 Mink Stole 462.0 180483.0 Comedy|Crime|Horror ... 10000.0 1972.0 143.0 6.1 1.37 0 NaN NaN NaN NaN
5026 Color Olivier Assayas 81.0 110.0 107.0 45.0 B’atrice Dalle 576.0 136007.0 Drama|Music|Romance ... 4500.0 2004.0 133.0 6.9 2.35 171 NaN NaN NaN NaN
5027 Color Jafar Panahi 64.0 90.0 397.0 0.0 Nargess Mamizadeh 5.0 673780.0 Drama ... 10000.0 2000.0 0.0 7.5 1.85 697 NaN NaN NaN NaN
5028 Black and White Ivan Kavanagh 12.0 83.0 18.0 0.0 Michael Parle 10.0 NaN Horror ... 10000.0 2007.0 5.0 6.7 1.33 105 NaN NaN NaN NaN
5029 Color Kiyoshi Kurosawa 78.0 111.0 62.0 6.0 Anna Nakagawa 89.0 94596.0 Crime|Horror|Mystery|Thriller ... 1000000.0 1997.0 13.0 7.4 1.85 817 NaN NaN NaN NaN
5030 Color Tadeo Garcia NaN 84.0 5.0 12.0 Michael Cortez 21.0 NaN Drama ... NaN 2004.0 20.0 6.1 NaN 22 NaN NaN NaN NaN
5031 Color Thomas L. Phillips 13.0 82.0 120.0 84.0 Joe Coffey 785.0 NaN Comedy|Horror|Thriller ... 200000.0 2012.0 98.0 5.4 16.00 424 NaN NaN NaN NaN
5032 Color Ash Baron-Cohen 10.0 98.0 3.0 152.0 Stanley B. Herman 789.0 NaN Crime|Drama ... NaN 1995.0 194.0 6.4 NaN 20 NaN NaN NaN NaN
5033 Color Shane Carruth 143.0 77.0 291.0 8.0 David Sullivan 291.0 424760.0 Drama|Sci-Fi|Thriller ... 7000.0 2004.0 45.0 7.0 1.85 19000 NaN NaN NaN NaN
5034 Color Neill Dela Llana 35.0 80.0 0.0 0.0 Edgar Tancangco 0.0 70071.0 Thriller ... 7000.0 2005.0 0.0 6.3 NaN 74 NaN NaN NaN NaN
5035 Color Robert Rodriguez 56.0 81.0 0.0 6.0 Peter Marquardt 121.0 2040920.0 Action|Crime|Drama|Romance|Thriller ... 7000.0 1992.0 20.0 6.9 1.37 0 NaN NaN NaN NaN
5036 Color Anthony Vallone NaN 84.0 2.0 2.0 John Considine 45.0 NaN Crime|Drama ... 3250.0 2005.0 44.0 7.8 NaN 4 NaN NaN NaN NaN
5037 Color Edward Burns 14.0 95.0 0.0 133.0 Caitlin FitzGerald 296.0 4584.0 Comedy|Drama ... 9000.0 2011.0 205.0 6.4 NaN 413 NaN NaN NaN NaN
5038 Color Scott Smith 1.0 87.0 2.0 318.0 Daphne Zuniga 637.0 NaN Comedy|Drama ... NaN 2013.0 470.0 7.7 NaN 84 NaN NaN NaN NaN
5040 Color Benjamin Roberds 13.0 76.0 0.0 0.0 Maxwell Moody 0.0 NaN Drama|Horror|Thriller ... 1400.0 2013.0 0.0 6.3 NaN 16 NaN NaN NaN NaN
5041 Color Daniel Hsia 14.0 100.0 0.0 489.0 Daniel Henney 946.0 10443.0 Comedy|Drama|Romance ... NaN 2012.0 719.0 6.3 2.35 660 NaN NaN NaN NaN
5042 Color Jon Gunn 43.0 90.0 16.0 16.0 Brian Herzlinger 86.0 85222.0 Documentary ... 1100.0 2004.0 23.0 6.6 1.85 456 NaN NaN NaN NaN

4935 rows × 32 columns


In [948]:
## Change year to integer

df1['title_year']= df1['title_year'].astype(int)

# Sort by Title Year

df1= df1.sort_values('title_year')


/Users/eunicefamodimu/anaconda/lib/python3.6/site-packages/ipykernel/__main__.py:3: SettingWithCopyWarning: 
A value is trying to be set on a copy of a slice from a DataFrame.
Try using .loc[row_indexer,col_indexer] = value instead

See the caveats in the documentation: http://pandas.pydata.org/pandas-docs/stable/indexing.html#indexing-view-versus-copy
  app.launch_new_instance()

In [949]:
# Extract Variables of Interest

df1= df1[['movie_title', 'title_year', 'country' ,'genres', 'actor_1_name', 'actor_2_name', 'actor_3_name','gross', 'budget', 'imdb_score' , 'num_user_for_reviews', 'language', 'content_rating']]
df1.shape


Out[949]:
(4935, 13)

In [950]:
# Break Up Into 3 Data Frames

df_m1 = df1[['movie_title', 'title_year', 'country', 'genres', 'actor_1_name','gross', 'budget', 'imdb_score' , 'num_user_for_reviews', 'language', 'content_rating']]
df_m2 = df1[['movie_title', 'title_year', 'country', 'genres', 'actor_2_name','gross', 'budget', 'imdb_score' , 'num_user_for_reviews', 'language', 'content_rating']]
df_m3 = df1[['movie_title', 'title_year', 'country', 'genres', 'actor_3_name','gross', 'budget', 'imdb_score' , 'num_user_for_reviews', 'language', 'content_rating']]

In [951]:
# Rename actor columns column
df_m1= df_m1.rename(columns={'actor_1_name': 'actor_name', 'movie_title': 'Film', 'title_year': 'Year'})
df_m2= df_m2.rename(columns={'actor_2_name': 'actor_name', 'movie_title': 'Film', 'title_year': 'Year'})
df_m3= df_m3.rename(columns={'actor_3_name': 'actor_name', 'movie_title': 'Film', 'title_year': 'Year'})

Academy Awards Data

Academy Awards data from excel spreadsheet


In [929]:
award_dat= pd.read_excel("awards.xlsx")

In [930]:
award_dat


Out[930]:
Film Year Awards Nominations
0 Moonlight 2016 3 8
1 La La Land 2016 6 14
2 Hacksaw Ridge 2016 2 6
3 Manchester by the Sea 2016 2 6
4 Arrival 2016 1 8
5 Fences 2016 1 4
6 Fantastic Beasts and Where to Find Them 2016 1 2
7 The Jungle Book 2016 1 1
8 O.J.: Made in America 2016 1 1
9 Piper 2016 1 1
10 The Salesman 2016 1 1
11 Sing 2016 1 1
12 Suicide Squad 2016 1 1
13 The White Helmets 2016 1 1
14 Zootopia 2016 1 1
15 Spotlight 2015 2 6
16 Mad Max: Fury Road 2015 6 10
17 The Revenant 2015 3 12
18 Bridge of Spies 2015 1 6
19 The Big Short 2015 1 5
20 The Danish Girl 2015 1 4
21 Room 2015 1 4
22 The Hateful Eight 2015 1 3
23 Ex Machina 2015 1 2
24 Inside Out 2015 1 2
25 Amy 2015 1 1
26 Bear Story 2015 1 1
27 A Girl in the River: The Price of Forgiveness 2015 1 1
28 Son of Saul 2015 1 1
29 Spectre 2015 1 1
... ... ... ... ...
1241 White Shadows in the South Seas 1928 1 1
1242 White Wilderness 1958 1 2
1243 Who's Afraid of Virginia Woolf? 1966 5 13
1244 Who's Who in Animal Land 1944 1 1
1245 Why Korea? 1950 1 1
1246 Why Man Creates 1968 1 1
1247 Wild Wings 1966 1 1
1248 Wilson 1944 5 10
1249 Wings 1927 2 2
1250 Wings Over Everest 1935 1 1
1251 Winnie the Pooh and the Blustery Day 1968 1 1
1252 With a Song in My Heart 1952 1 5
1253 With Byrd at the South Pole 1930 1 1
1254 With the Marines at Tarawa 1944 1 1
1255 The Wizard of Oz 1939 2 6
1256 Woman of the Year 1942 1 2
1257 Wonder Man 1945 1 4
1258 The Wonderful World of the Brothers Grimm 1962 1 4
1259 World of Kids 1951 1 1
1260 World Without Sun (Le Monde sans soleil) 1964 1 1
1261 Wrestling Swordfish 1931 1 1
1262 Written on the Wind 1956 1 3
1263 Wuthering Heights 1939 1 8
1264 Yankee Doodle Dandy 1942 3 8
1265 The Yankee Doodle Mouse 1943 1 1
1266 A Year Toward Tomorrow 1966 1 1
1267 The Yearling 1946 2 7
1268 Yesterday, Today and Tomorrow (Ieri, oggi, dom... 1964 1 1
1269 You Can't Take It with You 1938 2 7
1270 Zorba the Greek (Alexis Zorbas) 1964 3 7

1271 rows × 4 columns


In [952]:
award_dat.dtypes


Out[952]:
Film           object
Year            int64
Awards          int64
Nominations     int64
dtype: object

Combine Data

Combine Race data, IMDB data, and Academy Awards data

Group 1 - Lead Actors

  • Add race and gender of main actors to movie data

In [932]:
# Group 1
combo1 = pd.merge(df_m1, df_rG,   # left and right df's
                 how='left',        # add to left 
                 on='actor_name'       # link with this variable/column 
                ) 

combo1


Out[932]:
Film Year country genres actor_name gross budget imdb_score num_user_for_reviews language content_rating race sex
0 Intolerance: Love's Struggle Throughout the Ages 1916 USA Drama|History|War Lillian Gish NaN 385907.0 8.0 88.0 NaN Not Rated NaN NaN
1 Over the Hill to the Poorhouse 1920 USA Crime|Drama Stephen Carr 3000000.0 100000.0 4.8 1.0 NaN NaN NaN NaN
2 The Big Parade 1925 USA Drama|Romance|War John Gilbert NaN 245000.0 8.3 45.0 NaN Not Rated NaN NaN
3 Metropolis 1927 Germany Drama|Sci-Fi Brigitte Helm 26435.0 6000000.0 8.3 413.0 German Not Rated NaN NaN
4 The Broadway Melody 1929 USA Musical|Romance Anita Page 2808000.0 379000.0 6.3 71.0 English Passed NaN NaN
5 Pandora's Box 1929 Germany Crime|Drama|Romance Louise Brooks 9950.0 NaN 8.0 84.0 German Not Rated NaN NaN
6 Hell's Angels 1930 USA Drama|War Jean Harlow NaN 3950000.0 7.8 53.0 English Passed NaN NaN
7 A Farewell to Arms 1932 USA Drama|Romance|War Gary Cooper NaN 800000.0 6.6 46.0 English Unrated NaN NaN
8 42nd Street 1933 USA Comedy|Musical|Romance Ginger Rogers 2300000.0 439000.0 7.7 97.0 English Unrated NaN NaN
9 She Done Him Wrong 1933 USA Comedy|Drama|History|Musical|Romance Mae West NaN 200000.0 6.5 59.0 English Approved NaN NaN
10 It Happened One Night 1934 USA Comedy|Romance Claudette Colbert NaN 325000.0 8.2 235.0 English Unrated NaN NaN
11 Top Hat 1935 USA Comedy|Musical|Romance Ginger Rogers 3000000.0 609000.0 7.8 98.0 English Approved NaN NaN
12 Modern Times 1936 USA Comedy|Drama|Family Paulette Goddard 163245.0 1500000.0 8.6 211.0 English G NaN NaN
13 The Charge of the Light Brigade 1936 USA Action|Adventure|Romance|War Errol Flynn NaN 1200000.0 7.1 52.0 English Approved NaN NaN
14 Snow White and the Seven Dwarfs 1937 USA Animation|Family|Fantasy|Musical Adriana Caselotti 184925485.0 2000000.0 7.7 204.0 English Approved NaN NaN
15 The Prisoner of Zenda 1937 USA Adventure|Drama|Romance David Niven NaN NaN 7.8 44.0 English Approved NaN NaN
16 Alexander's Ragtime Band 1938 USA Drama|Musical|Romance Tyrone Power NaN 2000000.0 7.0 29.0 English Approved NaN NaN
17 You Can't Take It with You 1938 USA Comedy|Drama|Romance Jean Arthur NaN 1644736.0 8.0 133.0 English Approved NaN NaN
18 Mr. Smith Goes to Washington 1939 USA Comedy|Drama Claude Rains NaN 1500000.0 8.2 245.0 English Not Rated NaN NaN
19 Gone with the Wind 1939 USA Drama|History|Romance|War Hattie McDaniel 198655278.0 3977000.0 8.2 706.0 English G African Descent Female
20 The Wizard of Oz 1939 USA Adventure|Family|Fantasy|Musical Margaret Hamilton 22202612.0 2800000.0 8.1 533.0 English Passed NaN NaN
21 The Blue Bird 1940 USA Drama|Family|Fantasy Spring Byington NaN NaN 6.5 25.0 English Approved NaN NaN
22 Boom Town 1940 USA Adventure|Drama|Romance|Western Hedy Lamarr NaN 1614000.0 7.1 34.0 English Passed NaN NaN
23 Rebecca 1940 USA Drama|Film-Noir|Mystery|Thriller Laurence Olivier NaN 1288000.0 8.2 276.0 English Not Rated NaN NaN
24 Pinocchio 1940 USA Animation|Family|Fantasy|Musical Mel Blanc 84300000.0 2600000.0 7.5 147.0 English Approved NaN NaN
25 Fantasia 1940 USA Animation|Family|Fantasy|Music Leopold Stokowski 76400000.0 2280000.0 7.8 230.0 English G NaN NaN
26 How Green Was My Valley 1941 USA Drama|Family Roddy McDowall NaN 1250000.0 7.8 124.0 English Approved NaN NaN
27 Casablanca 1942 USA Drama|Romance|War Humphrey Bogart NaN 950000.0 8.6 1123.0 English PG NaN NaN
28 Bambi 1942 USA Animation|Drama|Family Sam Edwards 102797150.0 NaN 7.4 136.0 English Approved NaN NaN
29 A Guy Named Joe 1943 USA Drama|Fantasy|Romance|War Spencer Tracy NaN 2627000.0 7.0 27.0 English Passed NaN NaN
... ... ... ... ... ... ... ... ... ... ... ... ... ...
4905 Hail, Caesar! 2016 UK Comedy|Mystery Scarlett Johansson 29997095.0 22000000.0 6.4 302.0 English PG-13 NaN NaN
4906 The Infiltrator 2016 UK Biography|Crime|Drama|Thriller Joseph Gilgun 14946229.0 25000000.0 7.3 29.0 English R NaN NaN
4907 Ben-Hur 2016 USA Adventure|Drama|History Morgan Freeman NaN 100000000.0 6.0 1.0 English PG-13 African Descent Male
4908 The Veil 2016 USA Horror Lily Rabe NaN 4000000.0 4.7 29.0 English R NaN NaN
4909 Eddie the Eagle 2016 UK Biography|Comedy|Drama|Sport Hugh Jackman 15785632.0 23000000.0 7.5 119.0 English PG-13 NaN NaN
4910 The Conjuring 2 2016 USA Horror|Mystery|Thriller Javier Botet 102310175.0 40000000.0 7.8 279.0 English R NaN NaN
4911 The Birth of a Nation 2016 USA Biography|Drama Jason Stuart NaN 10000000.0 5.4 8.0 English R NaN NaN
4912 Airlift 2016 India Action|Drama|History|Thriller|War Nimrat Kaur NaN 4400000.0 8.5 178.0 Hindi NaN NaN NaN
4913 Our Kind of Traitor 2016 UK Thriller Radivoje Bukvic 3108216.0 NaN 6.4 21.0 English R NaN NaN
4914 Jane Got a Gun 2016 USA Action|Drama|Western Natalie Portman 1512815.0 25000000.0 5.8 56.0 English R NaN NaN
4915 The Purge: Election Year 2016 France Action|Horror|Sci-Fi|Thriller Frank Grillo 78845130.0 10000000.0 6.1 94.0 English R NaN NaN
4916 Ben-Hur 2016 USA Adventure|Drama|History Morgan Freeman NaN 100000000.0 6.1 1.0 English PG-13 African Descent Male
4917 The Finest Hours 2016 USA Action|Drama|History|Thriller Michael Raymond-James 27550735.0 NaN 6.8 113.0 English PG-13 NaN NaN
4918 Triple 9 2016 USA Action|Crime|Drama|Thriller Kate Winslet 12626905.0 20000000.0 6.3 106.0 English R NaN NaN
4919 Antibirth 2016 USA Horror Natasha Lyonne NaN 3500000.0 6.3 2.0 English NaN NaN NaN
4920 The Jungle Book 2016 UK Adventure|Drama|Family|Fantasy Scarlett Johansson 362645141.0 175000000.0 7.8 398.0 English PG NaN NaN
4921 Dirty Grandpa 2016 USA Comedy Robert De Niro 35537564.0 11500000.0 6.0 166.0 English R NaN NaN
4922 London Has Fallen 2016 UK Action|Crime|Drama|Thriller Gerard Butler 62401264.0 60000000.0 5.9 323.0 English R NaN NaN
4923 Nerve 2016 USA Adventure|Crime|Mystery|Sci-Fi|Thriller Samira Wiley 28876924.0 20000000.0 7.1 35.0 English PG-13 African Descent Female
4924 Me Before You 2016 UK Drama|Romance Sam Claflin 56154094.0 20000000.0 7.6 130.0 English PG-13 NaN NaN
4925 Suicide Squad 2016 USA Action|Adventure|Comedy|Sci-Fi Will Smith 161087183.0 175000000.0 6.9 971.0 English PG-13 African Descent Male
4926 The BFG 2016 UK Adventure|Family|Fantasy Mark Rylance 52792307.0 140000000.0 6.8 106.0 English PG NaN NaN
4927 Deadpool 2016 USA Action|Adventure|Comedy|Romance|Sci-Fi Ryan Reynolds 363024263.0 58000000.0 8.1 1058.0 English R NaN NaN
4928 X-Men: Apocalypse 2016 USA Action|Adventure|Sci-Fi Jennifer Lawrence 154985087.0 178000000.0 7.3 622.0 English PG-13 NaN NaN
4929 The Legend of Tarzan 2016 USA Action|Adventure|Drama|Romance Christoph Waltz 124051759.0 180000000.0 6.6 239.0 English PG-13 NaN NaN
4930 Compadres 2016 Mexico Action|Comedy Kevin Pollak 3105269.0 3000000.0 5.0 8.0 English R NaN NaN
4931 Bad Moms 2016 USA Comedy Mila Kunis 55461307.0 20000000.0 6.7 46.0 English R NaN NaN
4932 Misconduct 2016 USA Drama|Thriller Al Pacino NaN 11000000.0 5.3 50.0 English R NaN NaN
4933 Risen 2016 USA Action|Adventure|Drama|Mystery Peter Firth 36874745.0 20000000.0 6.3 117.0 English PG-13 NaN NaN
4934 Keanu 2016 USA Action|Comedy Nia Long 20566327.0 15000000.0 6.4 84.0 English R African Descent Female

4935 rows × 13 columns


In [953]:
# Dimensions
combo1.shape


Out[953]:
(4935, 13)
  • Add number of awards per movie to combo data

In [954]:
# Group 1
combo1_1 = pd.merge(combo1, award_dat,   # left and right df's
                 how='left',        # add to left 
                 on=['Film','Year']      # link with this variable/column 
                ) 

combo1_1


Out[954]:
Film Year country genres actor_name gross budget imdb_score num_user_for_reviews language content_rating race sex Awards Nominations
0 Intolerance: Love's Struggle Throughout the Ages 1916 USA Drama|History|War Lillian Gish NaN 385907.0 8.0 88.0 NaN Not Rated NaN NaN NaN NaN
1 Over the Hill to the Poorhouse 1920 USA Crime|Drama Stephen Carr 3000000.0 100000.0 4.8 1.0 NaN NaN NaN NaN NaN NaN
2 The Big Parade 1925 USA Drama|Romance|War John Gilbert NaN 245000.0 8.3 45.0 NaN Not Rated NaN NaN NaN NaN
3 Metropolis 1927 Germany Drama|Sci-Fi Brigitte Helm 26435.0 6000000.0 8.3 413.0 German Not Rated NaN NaN NaN NaN
4 The Broadway Melody 1929 USA Musical|Romance Anita Page 2808000.0 379000.0 6.3 71.0 English Passed NaN NaN 1.0 3.0
5 Pandora's Box 1929 Germany Crime|Drama|Romance Louise Brooks 9950.0 NaN 8.0 84.0 German Not Rated NaN NaN NaN NaN
6 Hell's Angels 1930 USA Drama|War Jean Harlow NaN 3950000.0 7.8 53.0 English Passed NaN NaN NaN NaN
7 A Farewell to Arms 1932 USA Drama|Romance|War Gary Cooper NaN 800000.0 6.6 46.0 English Unrated NaN NaN 2.0 4.0
8 42nd Street 1933 USA Comedy|Musical|Romance Ginger Rogers 2300000.0 439000.0 7.7 97.0 English Unrated NaN NaN NaN NaN
9 She Done Him Wrong 1933 USA Comedy|Drama|History|Musical|Romance Mae West NaN 200000.0 6.5 59.0 English Approved NaN NaN NaN NaN
10 It Happened One Night 1934 USA Comedy|Romance Claudette Colbert NaN 325000.0 8.2 235.0 English Unrated NaN NaN 5.0 5.0
11 Top Hat 1935 USA Comedy|Musical|Romance Ginger Rogers 3000000.0 609000.0 7.8 98.0 English Approved NaN NaN NaN NaN
12 Modern Times 1936 USA Comedy|Drama|Family Paulette Goddard 163245.0 1500000.0 8.6 211.0 English G NaN NaN NaN NaN
13 The Charge of the Light Brigade 1936 USA Action|Adventure|Romance|War Errol Flynn NaN 1200000.0 7.1 52.0 English Approved NaN NaN 1.0 3.0
14 Snow White and the Seven Dwarfs 1937 USA Animation|Family|Fantasy|Musical Adriana Caselotti 184925485.0 2000000.0 7.7 204.0 English Approved NaN NaN 0.0 1.0
15 The Prisoner of Zenda 1937 USA Adventure|Drama|Romance David Niven NaN NaN 7.8 44.0 English Approved NaN NaN NaN NaN
16 Alexander's Ragtime Band 1938 USA Drama|Musical|Romance Tyrone Power NaN 2000000.0 7.0 29.0 English Approved NaN NaN 1.0 6.0
17 You Can't Take It with You 1938 USA Comedy|Drama|Romance Jean Arthur NaN 1644736.0 8.0 133.0 English Approved NaN NaN 2.0 7.0
18 Mr. Smith Goes to Washington 1939 USA Comedy|Drama Claude Rains NaN 1500000.0 8.2 245.0 English Not Rated NaN NaN 1.0 11.0
19 Gone with the Wind 1939 USA Drama|History|Romance|War Hattie McDaniel 198655278.0 3977000.0 8.2 706.0 English G African Descent Female 8.0 13.0
20 The Wizard of Oz 1939 USA Adventure|Family|Fantasy|Musical Margaret Hamilton 22202612.0 2800000.0 8.1 533.0 English Passed NaN NaN 2.0 6.0
21 The Blue Bird 1940 USA Drama|Family|Fantasy Spring Byington NaN NaN 6.5 25.0 English Approved NaN NaN NaN NaN
22 Boom Town 1940 USA Adventure|Drama|Romance|Western Hedy Lamarr NaN 1614000.0 7.1 34.0 English Passed NaN NaN NaN NaN
23 Rebecca 1940 USA Drama|Film-Noir|Mystery|Thriller Laurence Olivier NaN 1288000.0 8.2 276.0 English Not Rated NaN NaN 2.0 11.0
24 Pinocchio 1940 USA Animation|Family|Fantasy|Musical Mel Blanc 84300000.0 2600000.0 7.5 147.0 English Approved NaN NaN 2.0 2.0
25 Fantasia 1940 USA Animation|Family|Fantasy|Music Leopold Stokowski 76400000.0 2280000.0 7.8 230.0 English G NaN NaN 0.0 0.0
26 How Green Was My Valley 1941 USA Drama|Family Roddy McDowall NaN 1250000.0 7.8 124.0 English Approved NaN NaN 5.0 10.0
27 Casablanca 1942 USA Drama|Romance|War Humphrey Bogart NaN 950000.0 8.6 1123.0 English PG NaN NaN NaN NaN
28 Bambi 1942 USA Animation|Drama|Family Sam Edwards 102797150.0 NaN 7.4 136.0 English Approved NaN NaN NaN NaN
29 A Guy Named Joe 1943 USA Drama|Fantasy|Romance|War Spencer Tracy NaN 2627000.0 7.0 27.0 English Passed NaN NaN NaN NaN
... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ...
4905 Hail, Caesar! 2016 UK Comedy|Mystery Scarlett Johansson 29997095.0 22000000.0 6.4 302.0 English PG-13 NaN NaN NaN NaN
4906 The Infiltrator 2016 UK Biography|Crime|Drama|Thriller Joseph Gilgun 14946229.0 25000000.0 7.3 29.0 English R NaN NaN NaN NaN
4907 Ben-Hur 2016 USA Adventure|Drama|History Morgan Freeman NaN 100000000.0 6.0 1.0 English PG-13 African Descent Male NaN NaN
4908 The Veil 2016 USA Horror Lily Rabe NaN 4000000.0 4.7 29.0 English R NaN NaN NaN NaN
4909 Eddie the Eagle 2016 UK Biography|Comedy|Drama|Sport Hugh Jackman 15785632.0 23000000.0 7.5 119.0 English PG-13 NaN NaN NaN NaN
4910 The Conjuring 2 2016 USA Horror|Mystery|Thriller Javier Botet 102310175.0 40000000.0 7.8 279.0 English R NaN NaN NaN NaN
4911 The Birth of a Nation 2016 USA Biography|Drama Jason Stuart NaN 10000000.0 5.4 8.0 English R NaN NaN NaN NaN
4912 Airlift 2016 India Action|Drama|History|Thriller|War Nimrat Kaur NaN 4400000.0 8.5 178.0 Hindi NaN NaN NaN NaN NaN
4913 Our Kind of Traitor 2016 UK Thriller Radivoje Bukvic 3108216.0 NaN 6.4 21.0 English R NaN NaN NaN NaN
4914 Jane Got a Gun 2016 USA Action|Drama|Western Natalie Portman 1512815.0 25000000.0 5.8 56.0 English R NaN NaN NaN NaN
4915 The Purge: Election Year 2016 France Action|Horror|Sci-Fi|Thriller Frank Grillo 78845130.0 10000000.0 6.1 94.0 English R NaN NaN NaN NaN
4916 Ben-Hur 2016 USA Adventure|Drama|History Morgan Freeman NaN 100000000.0 6.1 1.0 English PG-13 African Descent Male NaN NaN
4917 The Finest Hours 2016 USA Action|Drama|History|Thriller Michael Raymond-James 27550735.0 NaN 6.8 113.0 English PG-13 NaN NaN NaN NaN
4918 Triple 9 2016 USA Action|Crime|Drama|Thriller Kate Winslet 12626905.0 20000000.0 6.3 106.0 English R NaN NaN NaN NaN
4919 Antibirth 2016 USA Horror Natasha Lyonne NaN 3500000.0 6.3 2.0 English NaN NaN NaN NaN NaN
4920 The Jungle Book 2016 UK Adventure|Drama|Family|Fantasy Scarlett Johansson 362645141.0 175000000.0 7.8 398.0 English PG NaN NaN 1.0 1.0
4921 Dirty Grandpa 2016 USA Comedy Robert De Niro 35537564.0 11500000.0 6.0 166.0 English R NaN NaN NaN NaN
4922 London Has Fallen 2016 UK Action|Crime|Drama|Thriller Gerard Butler 62401264.0 60000000.0 5.9 323.0 English R NaN NaN NaN NaN
4923 Nerve 2016 USA Adventure|Crime|Mystery|Sci-Fi|Thriller Samira Wiley 28876924.0 20000000.0 7.1 35.0 English PG-13 African Descent Female NaN NaN
4924 Me Before You 2016 UK Drama|Romance Sam Claflin 56154094.0 20000000.0 7.6 130.0 English PG-13 NaN NaN NaN NaN
4925 Suicide Squad 2016 USA Action|Adventure|Comedy|Sci-Fi Will Smith 161087183.0 175000000.0 6.9 971.0 English PG-13 African Descent Male 1.0 1.0
4926 The BFG 2016 UK Adventure|Family|Fantasy Mark Rylance 52792307.0 140000000.0 6.8 106.0 English PG NaN NaN NaN NaN
4927 Deadpool 2016 USA Action|Adventure|Comedy|Romance|Sci-Fi Ryan Reynolds 363024263.0 58000000.0 8.1 1058.0 English R NaN NaN NaN NaN
4928 X-Men: Apocalypse 2016 USA Action|Adventure|Sci-Fi Jennifer Lawrence 154985087.0 178000000.0 7.3 622.0 English PG-13 NaN NaN NaN NaN
4929 The Legend of Tarzan 2016 USA Action|Adventure|Drama|Romance Christoph Waltz 124051759.0 180000000.0 6.6 239.0 English PG-13 NaN NaN NaN NaN
4930 Compadres 2016 Mexico Action|Comedy Kevin Pollak 3105269.0 3000000.0 5.0 8.0 English R NaN NaN NaN NaN
4931 Bad Moms 2016 USA Comedy Mila Kunis 55461307.0 20000000.0 6.7 46.0 English R NaN NaN NaN NaN
4932 Misconduct 2016 USA Drama|Thriller Al Pacino NaN 11000000.0 5.3 50.0 English R NaN NaN NaN NaN
4933 Risen 2016 USA Action|Adventure|Drama|Mystery Peter Firth 36874745.0 20000000.0 6.3 117.0 English PG-13 NaN NaN NaN NaN
4934 Keanu 2016 USA Action|Comedy Nia Long 20566327.0 15000000.0 6.4 84.0 English R African Descent Female NaN NaN

4935 rows × 15 columns


In [955]:
# Dimensions
combo1_1.shape


Out[955]:
(4935, 15)

In [956]:
# Rename Actor Name to Actor_1
combo1_1= combo1_1.rename(columns={'race': 'race_1', 'actor_name': 'actor_1_name', 'sex': 'sex_1'})
# Set Index
combo1_1= combo1_1.set_index("Year", "Film")

In [957]:
# Replace missing
combo1_1['Awards']= (combo1_1['Awards'].replace(to_replace=[None], value=0))
combo1_1['race_1']= (combo1_1['race_1'].replace(to_replace=[None], value="Other"))
combo1_1['sex_1']= (combo1_1['sex_1'].replace(to_replace=[None], value="Other"))

In [958]:
# See
combo1_1


Out[958]:
Film country genres actor_1_name gross budget imdb_score num_user_for_reviews language content_rating race_1 sex_1 Awards Nominations
Year
1916 Intolerance: Love's Struggle Throughout the Ages USA Drama|History|War Lillian Gish NaN 385907.0 8.0 88.0 NaN Not Rated Other Other 0.0 NaN
1920 Over the Hill to the Poorhouse USA Crime|Drama Stephen Carr 3000000.0 100000.0 4.8 1.0 NaN NaN Other Other 0.0 NaN
1925 The Big Parade USA Drama|Romance|War John Gilbert NaN 245000.0 8.3 45.0 NaN Not Rated Other Other 0.0 NaN
1927 Metropolis Germany Drama|Sci-Fi Brigitte Helm 26435.0 6000000.0 8.3 413.0 German Not Rated Other Other 0.0 NaN
1929 The Broadway Melody USA Musical|Romance Anita Page 2808000.0 379000.0 6.3 71.0 English Passed Other Other 1.0 3.0
1929 Pandora's Box Germany Crime|Drama|Romance Louise Brooks 9950.0 NaN 8.0 84.0 German Not Rated Other Other 0.0 NaN
1930 Hell's Angels USA Drama|War Jean Harlow NaN 3950000.0 7.8 53.0 English Passed Other Other 0.0 NaN
1932 A Farewell to Arms USA Drama|Romance|War Gary Cooper NaN 800000.0 6.6 46.0 English Unrated Other Other 2.0 4.0
1933 42nd Street USA Comedy|Musical|Romance Ginger Rogers 2300000.0 439000.0 7.7 97.0 English Unrated Other Other 0.0 NaN
1933 She Done Him Wrong USA Comedy|Drama|History|Musical|Romance Mae West NaN 200000.0 6.5 59.0 English Approved Other Other 0.0 NaN
1934 It Happened One Night USA Comedy|Romance Claudette Colbert NaN 325000.0 8.2 235.0 English Unrated Other Other 5.0 5.0
1935 Top Hat USA Comedy|Musical|Romance Ginger Rogers 3000000.0 609000.0 7.8 98.0 English Approved Other Other 0.0 NaN
1936 Modern Times USA Comedy|Drama|Family Paulette Goddard 163245.0 1500000.0 8.6 211.0 English G Other Other 0.0 NaN
1936 The Charge of the Light Brigade USA Action|Adventure|Romance|War Errol Flynn NaN 1200000.0 7.1 52.0 English Approved Other Other 1.0 3.0
1937 Snow White and the Seven Dwarfs USA Animation|Family|Fantasy|Musical Adriana Caselotti 184925485.0 2000000.0 7.7 204.0 English Approved Other Other 0.0 1.0
1937 The Prisoner of Zenda USA Adventure|Drama|Romance David Niven NaN NaN 7.8 44.0 English Approved Other Other 0.0 NaN
1938 Alexander's Ragtime Band USA Drama|Musical|Romance Tyrone Power NaN 2000000.0 7.0 29.0 English Approved Other Other 1.0 6.0
1938 You Can't Take It with You USA Comedy|Drama|Romance Jean Arthur NaN 1644736.0 8.0 133.0 English Approved Other Other 2.0 7.0
1939 Mr. Smith Goes to Washington USA Comedy|Drama Claude Rains NaN 1500000.0 8.2 245.0 English Not Rated Other Other 1.0 11.0
1939 Gone with the Wind USA Drama|History|Romance|War Hattie McDaniel 198655278.0 3977000.0 8.2 706.0 English G African Descent Female 8.0 13.0
1939 The Wizard of Oz USA Adventure|Family|Fantasy|Musical Margaret Hamilton 22202612.0 2800000.0 8.1 533.0 English Passed Other Other 2.0 6.0
1940 The Blue Bird USA Drama|Family|Fantasy Spring Byington NaN NaN 6.5 25.0 English Approved Other Other 0.0 NaN
1940 Boom Town USA Adventure|Drama|Romance|Western Hedy Lamarr NaN 1614000.0 7.1 34.0 English Passed Other Other 0.0 NaN
1940 Rebecca USA Drama|Film-Noir|Mystery|Thriller Laurence Olivier NaN 1288000.0 8.2 276.0 English Not Rated Other Other 2.0 11.0
1940 Pinocchio USA Animation|Family|Fantasy|Musical Mel Blanc 84300000.0 2600000.0 7.5 147.0 English Approved Other Other 2.0 2.0
1940 Fantasia USA Animation|Family|Fantasy|Music Leopold Stokowski 76400000.0 2280000.0 7.8 230.0 English G Other Other 0.0 0.0
1941 How Green Was My Valley USA Drama|Family Roddy McDowall NaN 1250000.0 7.8 124.0 English Approved Other Other 5.0 10.0
1942 Casablanca USA Drama|Romance|War Humphrey Bogart NaN 950000.0 8.6 1123.0 English PG Other Other 0.0 NaN
1942 Bambi USA Animation|Drama|Family Sam Edwards 102797150.0 NaN 7.4 136.0 English Approved Other Other 0.0 NaN
1943 A Guy Named Joe USA Drama|Fantasy|Romance|War Spencer Tracy NaN 2627000.0 7.0 27.0 English Passed Other Other 0.0 NaN
... ... ... ... ... ... ... ... ... ... ... ... ... ... ...
2016 Hail, Caesar! UK Comedy|Mystery Scarlett Johansson 29997095.0 22000000.0 6.4 302.0 English PG-13 Other Other 0.0 NaN
2016 The Infiltrator UK Biography|Crime|Drama|Thriller Joseph Gilgun 14946229.0 25000000.0 7.3 29.0 English R Other Other 0.0 NaN
2016 Ben-Hur USA Adventure|Drama|History Morgan Freeman NaN 100000000.0 6.0 1.0 English PG-13 African Descent Male 0.0 NaN
2016 The Veil USA Horror Lily Rabe NaN 4000000.0 4.7 29.0 English R Other Other 0.0 NaN
2016 Eddie the Eagle UK Biography|Comedy|Drama|Sport Hugh Jackman 15785632.0 23000000.0 7.5 119.0 English PG-13 Other Other 0.0 NaN
2016 The Conjuring 2 USA Horror|Mystery|Thriller Javier Botet 102310175.0 40000000.0 7.8 279.0 English R Other Other 0.0 NaN
2016 The Birth of a Nation USA Biography|Drama Jason Stuart NaN 10000000.0 5.4 8.0 English R Other Other 0.0 NaN
2016 Airlift India Action|Drama|History|Thriller|War Nimrat Kaur NaN 4400000.0 8.5 178.0 Hindi NaN Other Other 0.0 NaN
2016 Our Kind of Traitor UK Thriller Radivoje Bukvic 3108216.0 NaN 6.4 21.0 English R Other Other 0.0 NaN
2016 Jane Got a Gun USA Action|Drama|Western Natalie Portman 1512815.0 25000000.0 5.8 56.0 English R Other Other 0.0 NaN
2016 The Purge: Election Year France Action|Horror|Sci-Fi|Thriller Frank Grillo 78845130.0 10000000.0 6.1 94.0 English R Other Other 0.0 NaN
2016 Ben-Hur USA Adventure|Drama|History Morgan Freeman NaN 100000000.0 6.1 1.0 English PG-13 African Descent Male 0.0 NaN
2016 The Finest Hours USA Action|Drama|History|Thriller Michael Raymond-James 27550735.0 NaN 6.8 113.0 English PG-13 Other Other 0.0 NaN
2016 Triple 9 USA Action|Crime|Drama|Thriller Kate Winslet 12626905.0 20000000.0 6.3 106.0 English R Other Other 0.0 NaN
2016 Antibirth USA Horror Natasha Lyonne NaN 3500000.0 6.3 2.0 English NaN Other Other 0.0 NaN
2016 The Jungle Book UK Adventure|Drama|Family|Fantasy Scarlett Johansson 362645141.0 175000000.0 7.8 398.0 English PG Other Other 1.0 1.0
2016 Dirty Grandpa USA Comedy Robert De Niro 35537564.0 11500000.0 6.0 166.0 English R Other Other 0.0 NaN
2016 London Has Fallen UK Action|Crime|Drama|Thriller Gerard Butler 62401264.0 60000000.0 5.9 323.0 English R Other Other 0.0 NaN
2016 Nerve USA Adventure|Crime|Mystery|Sci-Fi|Thriller Samira Wiley 28876924.0 20000000.0 7.1 35.0 English PG-13 African Descent Female 0.0 NaN
2016 Me Before You UK Drama|Romance Sam Claflin 56154094.0 20000000.0 7.6 130.0 English PG-13 Other Other 0.0 NaN
2016 Suicide Squad USA Action|Adventure|Comedy|Sci-Fi Will Smith 161087183.0 175000000.0 6.9 971.0 English PG-13 African Descent Male 1.0 1.0
2016 The BFG UK Adventure|Family|Fantasy Mark Rylance 52792307.0 140000000.0 6.8 106.0 English PG Other Other 0.0 NaN
2016 Deadpool USA Action|Adventure|Comedy|Romance|Sci-Fi Ryan Reynolds 363024263.0 58000000.0 8.1 1058.0 English R Other Other 0.0 NaN
2016 X-Men: Apocalypse USA Action|Adventure|Sci-Fi Jennifer Lawrence 154985087.0 178000000.0 7.3 622.0 English PG-13 Other Other 0.0 NaN
2016 The Legend of Tarzan USA Action|Adventure|Drama|Romance Christoph Waltz 124051759.0 180000000.0 6.6 239.0 English PG-13 Other Other 0.0 NaN
2016 Compadres Mexico Action|Comedy Kevin Pollak 3105269.0 3000000.0 5.0 8.0 English R Other Other 0.0 NaN
2016 Bad Moms USA Comedy Mila Kunis 55461307.0 20000000.0 6.7 46.0 English R Other Other 0.0 NaN
2016 Misconduct USA Drama|Thriller Al Pacino NaN 11000000.0 5.3 50.0 English R Other Other 0.0 NaN
2016 Risen USA Action|Adventure|Drama|Mystery Peter Firth 36874745.0 20000000.0 6.3 117.0 English PG-13 Other Other 0.0 NaN
2016 Keanu USA Action|Comedy Nia Long 20566327.0 15000000.0 6.4 84.0 English R African Descent Female 0.0 NaN

4935 rows × 14 columns

Group 2 - Supporting Actors 1

  • Add race and gender of main actors to movie data

In [959]:
# Group 2
combo2 = pd.merge(df_m2, df_rG,   # left and right df's
                 how='left',        # add to left 
                 on='actor_name'       # link with this variable/column 
                ) 

combo2


Out[959]:
Film Year country genres actor_name gross budget imdb_score num_user_for_reviews language content_rating race sex
0 Intolerance: Love's Struggle Throughout the Ages 1916 USA Drama|History|War Mae Marsh NaN 385907.0 8.0 88.0 NaN Not Rated NaN NaN
1 Over the Hill to the Poorhouse 1920 USA Crime|Drama Johnnie Walker 3000000.0 100000.0 4.8 1.0 NaN NaN NaN NaN
2 The Big Parade 1925 USA Drama|Romance|War Ren’e Ador’e NaN 245000.0 8.3 45.0 NaN Not Rated NaN NaN
3 Metropolis 1927 Germany Drama|Sci-Fi Gustav Fr’_hlich 26435.0 6000000.0 8.3 413.0 German Not Rated NaN NaN
4 The Broadway Melody 1929 USA Musical|Romance Bessie Love 2808000.0 379000.0 6.3 71.0 English Passed NaN NaN
5 Pandora's Box 1929 Germany Crime|Drama|Romance Francis Lederer 9950.0 NaN 8.0 84.0 German Not Rated NaN NaN
6 Hell's Angels 1930 USA Drama|War Marian Marsh NaN 3950000.0 7.8 53.0 English Passed NaN NaN
7 A Farewell to Arms 1932 USA Drama|Romance|War Helen Hayes NaN 800000.0 6.6 46.0 English Unrated NaN NaN
8 42nd Street 1933 USA Comedy|Musical|Romance Dick Powell 2300000.0 439000.0 7.7 97.0 English Unrated NaN NaN
9 She Done Him Wrong 1933 USA Comedy|Drama|History|Musical|Romance Gilbert Roland NaN 200000.0 6.5 59.0 English Approved NaN NaN
10 It Happened One Night 1934 USA Comedy|Romance Alan Hale NaN 325000.0 8.2 235.0 English Unrated NaN NaN
11 Top Hat 1935 USA Comedy|Musical|Romance Edward Everett Horton 3000000.0 609000.0 7.8 98.0 English Approved NaN NaN
12 Modern Times 1936 USA Comedy|Drama|Family Stanley Blystone 163245.0 1500000.0 8.6 211.0 English G NaN NaN
13 The Charge of the Light Brigade 1936 USA Action|Adventure|Romance|War David Niven NaN 1200000.0 7.1 52.0 English Approved NaN NaN
14 Snow White and the Seven Dwarfs 1937 USA Animation|Family|Fantasy|Musical Billy Gilbert 184925485.0 2000000.0 7.7 204.0 English Approved NaN NaN
15 The Prisoner of Zenda 1937 USA Adventure|Drama|Romance Mary Astor NaN NaN 7.8 44.0 English Approved NaN NaN
16 Alexander's Ragtime Band 1938 USA Drama|Musical|Romance Don Ameche NaN 2000000.0 7.0 29.0 English Approved NaN NaN
17 You Can't Take It with You 1938 USA Comedy|Drama|Romance Ann Miller NaN 1644736.0 8.0 133.0 English Approved NaN NaN
18 Mr. Smith Goes to Washington 1939 USA Comedy|Drama Jean Arthur NaN 1500000.0 8.2 245.0 English Not Rated NaN NaN
19 Gone with the Wind 1939 USA Drama|History|Romance|War George Reeves 198655278.0 3977000.0 8.2 706.0 English G NaN NaN
20 The Wizard of Oz 1939 USA Adventure|Family|Fantasy|Musical Terry 22202612.0 2800000.0 8.1 533.0 English Passed NaN NaN
21 The Blue Bird 1940 USA Drama|Family|Fantasy Nigel Bruce NaN NaN 6.5 25.0 English Approved NaN NaN
22 Boom Town 1940 USA Adventure|Drama|Romance|Western Spencer Tracy NaN 1614000.0 7.1 34.0 English Passed NaN NaN
23 Rebecca 1940 USA Drama|Film-Noir|Mystery|Thriller Joan Fontaine NaN 1288000.0 8.2 276.0 English Not Rated NaN NaN
24 Pinocchio 1940 USA Animation|Family|Fantasy|Musical Dickie Jones 84300000.0 2600000.0 7.5 147.0 English Approved NaN NaN
25 Fantasia 1940 USA Animation|Family|Fantasy|Music Deems Taylor 76400000.0 2280000.0 7.8 230.0 English G NaN NaN
26 How Green Was My Valley 1941 USA Drama|Family Walter Pidgeon NaN 1250000.0 7.8 124.0 English Approved NaN NaN
27 Casablanca 1942 USA Drama|Romance|War Claude Rains NaN 950000.0 8.6 1123.0 English PG NaN NaN
28 Bambi 1942 USA Animation|Drama|Family Donnie Dunagan 102797150.0 NaN 7.4 136.0 English Approved NaN NaN
29 A Guy Named Joe 1943 USA Drama|Fantasy|Romance|War Esther Williams NaN 2627000.0 7.0 27.0 English Passed NaN NaN
... ... ... ... ... ... ... ... ... ... ... ... ... ...
4905 Hail, Caesar! 2016 UK Comedy|Mystery Channing Tatum 29997095.0 22000000.0 6.4 302.0 English PG-13 NaN NaN
4906 The Infiltrator 2016 UK Biography|Crime|Drama|Thriller Amy Ryan 14946229.0 25000000.0 7.3 29.0 English R NaN NaN
4907 Ben-Hur 2016 USA Adventure|Drama|History Ayelet Zurer NaN 100000000.0 6.0 1.0 English PG-13 NaN NaN
4908 The Veil 2016 USA Horror Shannon Woodward NaN 4000000.0 4.7 29.0 English R NaN NaN
4909 Eddie the Eagle 2016 UK Biography|Comedy|Drama|Sport Taron Egerton 15785632.0 23000000.0 7.5 119.0 English PG-13 NaN NaN
4910 The Conjuring 2 2016 USA Horror|Mystery|Thriller Frances O'Connor 102310175.0 40000000.0 7.8 279.0 English R NaN NaN
4911 The Birth of a Nation 2016 USA Biography|Drama Nate Parker NaN 10000000.0 5.4 8.0 English R African Descent Male
4912 Airlift 2016 India Action|Drama|History|Thriller|War Sameer Ali Khan NaN 4400000.0 8.5 178.0 Hindi NaN NaN NaN
4913 Our Kind of Traitor 2016 UK Thriller Pawel Szajda 3108216.0 NaN 6.4 21.0 English R NaN NaN
4914 Jane Got a Gun 2016 USA Action|Drama|Western Noah Emmerich 1512815.0 25000000.0 5.8 56.0 English R NaN NaN
4915 The Purge: Election Year 2016 France Action|Horror|Sci-Fi|Thriller Joseph Julian Soria 78845130.0 10000000.0 6.1 94.0 English R NaN NaN
4916 Ben-Hur 2016 USA Adventure|Drama|History Ayelet Zurer NaN 100000000.0 6.1 1.0 English PG-13 NaN NaN
4917 The Finest Hours 2016 USA Action|Drama|History|Thriller Abraham Benrubi 27550735.0 NaN 6.8 113.0 English PG-13 NaN NaN
4918 Triple 9 2016 USA Action|Crime|Drama|Thriller Norman Reedus 12626905.0 20000000.0 6.3 106.0 English R NaN NaN
4919 Antibirth 2016 USA Horror Emmanuel Kabongo NaN 3500000.0 6.3 2.0 English NaN NaN NaN
4920 The Jungle Book 2016 UK Adventure|Drama|Family|Fantasy Bill Murray 362645141.0 175000000.0 7.8 398.0 English PG NaN NaN
4921 Dirty Grandpa 2016 USA Comedy Zoey Deutch 35537564.0 11500000.0 6.0 166.0 English R NaN NaN
4922 London Has Fallen 2016 UK Action|Crime|Drama|Thriller Radha Mitchell 62401264.0 60000000.0 5.9 323.0 English R NaN NaN
4923 Nerve 2016 USA Adventure|Crime|Mystery|Sci-Fi|Thriller Marc John Jefferies 28876924.0 20000000.0 7.1 35.0 English PG-13 African Descent Male
4924 Me Before You 2016 UK Drama|Romance Emilia Clarke 56154094.0 20000000.0 7.6 130.0 English PG-13 NaN NaN
4925 Suicide Squad 2016 USA Action|Adventure|Comedy|Sci-Fi Robin Atkin Downes 161087183.0 175000000.0 6.9 971.0 English PG-13 NaN NaN
4926 The BFG 2016 UK Adventure|Family|Fantasy Penelope Wilton 52792307.0 140000000.0 6.8 106.0 English PG NaN NaN
4927 Deadpool 2016 USA Action|Adventure|Comedy|Romance|Sci-Fi Ed Skrein 363024263.0 58000000.0 8.1 1058.0 English R NaN NaN
4928 X-Men: Apocalypse 2016 USA Action|Adventure|Sci-Fi Michael Fassbender 154985087.0 178000000.0 7.3 622.0 English PG-13 NaN NaN
4929 The Legend of Tarzan 2016 USA Action|Adventure|Drama|Romance Alexander Skarsg’Çrd 124051759.0 180000000.0 6.6 239.0 English PG-13 NaN NaN
4930 Compadres 2016 Mexico Action|Comedy H’ctor Jim’nez 3105269.0 3000000.0 5.0 8.0 English R NaN NaN
4931 Bad Moms 2016 USA Comedy Jay Hernandez 55461307.0 20000000.0 6.7 46.0 English R NaN NaN
4932 Misconduct 2016 USA Drama|Thriller Anthony Hopkins NaN 11000000.0 5.3 50.0 English R NaN NaN
4933 Risen 2016 USA Action|Adventure|Drama|Mystery Jan Cornet 36874745.0 20000000.0 6.3 117.0 English PG-13 NaN NaN
4934 Keanu 2016 USA Action|Comedy Will Forte 20566327.0 15000000.0 6.4 84.0 English R NaN NaN

4935 rows × 13 columns


In [960]:
combo2.shape


Out[960]:
(4935, 13)
  • Add number of awards per movie to combo data

In [961]:
# Group 2
combo2_1 = pd.merge(combo2, award_dat,   # left and right df's
                 how='left',        # add to left 
                 on=['Film','Year']      # link with this variable/column 
                ) 

combo2_1


Out[961]:
Film Year country genres actor_name gross budget imdb_score num_user_for_reviews language content_rating race sex Awards Nominations
0 Intolerance: Love's Struggle Throughout the Ages 1916 USA Drama|History|War Mae Marsh NaN 385907.0 8.0 88.0 NaN Not Rated NaN NaN NaN NaN
1 Over the Hill to the Poorhouse 1920 USA Crime|Drama Johnnie Walker 3000000.0 100000.0 4.8 1.0 NaN NaN NaN NaN NaN NaN
2 The Big Parade 1925 USA Drama|Romance|War Ren’e Ador’e NaN 245000.0 8.3 45.0 NaN Not Rated NaN NaN NaN NaN
3 Metropolis 1927 Germany Drama|Sci-Fi Gustav Fr’_hlich 26435.0 6000000.0 8.3 413.0 German Not Rated NaN NaN NaN NaN
4 The Broadway Melody 1929 USA Musical|Romance Bessie Love 2808000.0 379000.0 6.3 71.0 English Passed NaN NaN 1.0 3.0
5 Pandora's Box 1929 Germany Crime|Drama|Romance Francis Lederer 9950.0 NaN 8.0 84.0 German Not Rated NaN NaN NaN NaN
6 Hell's Angels 1930 USA Drama|War Marian Marsh NaN 3950000.0 7.8 53.0 English Passed NaN NaN NaN NaN
7 A Farewell to Arms 1932 USA Drama|Romance|War Helen Hayes NaN 800000.0 6.6 46.0 English Unrated NaN NaN 2.0 4.0
8 42nd Street 1933 USA Comedy|Musical|Romance Dick Powell 2300000.0 439000.0 7.7 97.0 English Unrated NaN NaN NaN NaN
9 She Done Him Wrong 1933 USA Comedy|Drama|History|Musical|Romance Gilbert Roland NaN 200000.0 6.5 59.0 English Approved NaN NaN NaN NaN
10 It Happened One Night 1934 USA Comedy|Romance Alan Hale NaN 325000.0 8.2 235.0 English Unrated NaN NaN 5.0 5.0
11 Top Hat 1935 USA Comedy|Musical|Romance Edward Everett Horton 3000000.0 609000.0 7.8 98.0 English Approved NaN NaN NaN NaN
12 Modern Times 1936 USA Comedy|Drama|Family Stanley Blystone 163245.0 1500000.0 8.6 211.0 English G NaN NaN NaN NaN
13 The Charge of the Light Brigade 1936 USA Action|Adventure|Romance|War David Niven NaN 1200000.0 7.1 52.0 English Approved NaN NaN 1.0 3.0
14 Snow White and the Seven Dwarfs 1937 USA Animation|Family|Fantasy|Musical Billy Gilbert 184925485.0 2000000.0 7.7 204.0 English Approved NaN NaN 0.0 1.0
15 The Prisoner of Zenda 1937 USA Adventure|Drama|Romance Mary Astor NaN NaN 7.8 44.0 English Approved NaN NaN NaN NaN
16 Alexander's Ragtime Band 1938 USA Drama|Musical|Romance Don Ameche NaN 2000000.0 7.0 29.0 English Approved NaN NaN 1.0 6.0
17 You Can't Take It with You 1938 USA Comedy|Drama|Romance Ann Miller NaN 1644736.0 8.0 133.0 English Approved NaN NaN 2.0 7.0
18 Mr. Smith Goes to Washington 1939 USA Comedy|Drama Jean Arthur NaN 1500000.0 8.2 245.0 English Not Rated NaN NaN 1.0 11.0
19 Gone with the Wind 1939 USA Drama|History|Romance|War George Reeves 198655278.0 3977000.0 8.2 706.0 English G NaN NaN 8.0 13.0
20 The Wizard of Oz 1939 USA Adventure|Family|Fantasy|Musical Terry 22202612.0 2800000.0 8.1 533.0 English Passed NaN NaN 2.0 6.0
21 The Blue Bird 1940 USA Drama|Family|Fantasy Nigel Bruce NaN NaN 6.5 25.0 English Approved NaN NaN NaN NaN
22 Boom Town 1940 USA Adventure|Drama|Romance|Western Spencer Tracy NaN 1614000.0 7.1 34.0 English Passed NaN NaN NaN NaN
23 Rebecca 1940 USA Drama|Film-Noir|Mystery|Thriller Joan Fontaine NaN 1288000.0 8.2 276.0 English Not Rated NaN NaN 2.0 11.0
24 Pinocchio 1940 USA Animation|Family|Fantasy|Musical Dickie Jones 84300000.0 2600000.0 7.5 147.0 English Approved NaN NaN 2.0 2.0
25 Fantasia 1940 USA Animation|Family|Fantasy|Music Deems Taylor 76400000.0 2280000.0 7.8 230.0 English G NaN NaN 0.0 0.0
26 How Green Was My Valley 1941 USA Drama|Family Walter Pidgeon NaN 1250000.0 7.8 124.0 English Approved NaN NaN 5.0 10.0
27 Casablanca 1942 USA Drama|Romance|War Claude Rains NaN 950000.0 8.6 1123.0 English PG NaN NaN NaN NaN
28 Bambi 1942 USA Animation|Drama|Family Donnie Dunagan 102797150.0 NaN 7.4 136.0 English Approved NaN NaN NaN NaN
29 A Guy Named Joe 1943 USA Drama|Fantasy|Romance|War Esther Williams NaN 2627000.0 7.0 27.0 English Passed NaN NaN NaN NaN
... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ...
4905 Hail, Caesar! 2016 UK Comedy|Mystery Channing Tatum 29997095.0 22000000.0 6.4 302.0 English PG-13 NaN NaN NaN NaN
4906 The Infiltrator 2016 UK Biography|Crime|Drama|Thriller Amy Ryan 14946229.0 25000000.0 7.3 29.0 English R NaN NaN NaN NaN
4907 Ben-Hur 2016 USA Adventure|Drama|History Ayelet Zurer NaN 100000000.0 6.0 1.0 English PG-13 NaN NaN NaN NaN
4908 The Veil 2016 USA Horror Shannon Woodward NaN 4000000.0 4.7 29.0 English R NaN NaN NaN NaN
4909 Eddie the Eagle 2016 UK Biography|Comedy|Drama|Sport Taron Egerton 15785632.0 23000000.0 7.5 119.0 English PG-13 NaN NaN NaN NaN
4910 The Conjuring 2 2016 USA Horror|Mystery|Thriller Frances O'Connor 102310175.0 40000000.0 7.8 279.0 English R NaN NaN NaN NaN
4911 The Birth of a Nation 2016 USA Biography|Drama Nate Parker NaN 10000000.0 5.4 8.0 English R African Descent Male NaN NaN
4912 Airlift 2016 India Action|Drama|History|Thriller|War Sameer Ali Khan NaN 4400000.0 8.5 178.0 Hindi NaN NaN NaN NaN NaN
4913 Our Kind of Traitor 2016 UK Thriller Pawel Szajda 3108216.0 NaN 6.4 21.0 English R NaN NaN NaN NaN
4914 Jane Got a Gun 2016 USA Action|Drama|Western Noah Emmerich 1512815.0 25000000.0 5.8 56.0 English R NaN NaN NaN NaN
4915 The Purge: Election Year 2016 France Action|Horror|Sci-Fi|Thriller Joseph Julian Soria 78845130.0 10000000.0 6.1 94.0 English R NaN NaN NaN NaN
4916 Ben-Hur 2016 USA Adventure|Drama|History Ayelet Zurer NaN 100000000.0 6.1 1.0 English PG-13 NaN NaN NaN NaN
4917 The Finest Hours 2016 USA Action|Drama|History|Thriller Abraham Benrubi 27550735.0 NaN 6.8 113.0 English PG-13 NaN NaN NaN NaN
4918 Triple 9 2016 USA Action|Crime|Drama|Thriller Norman Reedus 12626905.0 20000000.0 6.3 106.0 English R NaN NaN NaN NaN
4919 Antibirth 2016 USA Horror Emmanuel Kabongo NaN 3500000.0 6.3 2.0 English NaN NaN NaN NaN NaN
4920 The Jungle Book 2016 UK Adventure|Drama|Family|Fantasy Bill Murray 362645141.0 175000000.0 7.8 398.0 English PG NaN NaN 1.0 1.0
4921 Dirty Grandpa 2016 USA Comedy Zoey Deutch 35537564.0 11500000.0 6.0 166.0 English R NaN NaN NaN NaN
4922 London Has Fallen 2016 UK Action|Crime|Drama|Thriller Radha Mitchell 62401264.0 60000000.0 5.9 323.0 English R NaN NaN NaN NaN
4923 Nerve 2016 USA Adventure|Crime|Mystery|Sci-Fi|Thriller Marc John Jefferies 28876924.0 20000000.0 7.1 35.0 English PG-13 African Descent Male NaN NaN
4924 Me Before You 2016 UK Drama|Romance Emilia Clarke 56154094.0 20000000.0 7.6 130.0 English PG-13 NaN NaN NaN NaN
4925 Suicide Squad 2016 USA Action|Adventure|Comedy|Sci-Fi Robin Atkin Downes 161087183.0 175000000.0 6.9 971.0 English PG-13 NaN NaN 1.0 1.0
4926 The BFG 2016 UK Adventure|Family|Fantasy Penelope Wilton 52792307.0 140000000.0 6.8 106.0 English PG NaN NaN NaN NaN
4927 Deadpool 2016 USA Action|Adventure|Comedy|Romance|Sci-Fi Ed Skrein 363024263.0 58000000.0 8.1 1058.0 English R NaN NaN NaN NaN
4928 X-Men: Apocalypse 2016 USA Action|Adventure|Sci-Fi Michael Fassbender 154985087.0 178000000.0 7.3 622.0 English PG-13 NaN NaN NaN NaN
4929 The Legend of Tarzan 2016 USA Action|Adventure|Drama|Romance Alexander Skarsg’Çrd 124051759.0 180000000.0 6.6 239.0 English PG-13 NaN NaN NaN NaN
4930 Compadres 2016 Mexico Action|Comedy H’ctor Jim’nez 3105269.0 3000000.0 5.0 8.0 English R NaN NaN NaN NaN
4931 Bad Moms 2016 USA Comedy Jay Hernandez 55461307.0 20000000.0 6.7 46.0 English R NaN NaN NaN NaN
4932 Misconduct 2016 USA Drama|Thriller Anthony Hopkins NaN 11000000.0 5.3 50.0 English R NaN NaN NaN NaN
4933 Risen 2016 USA Action|Adventure|Drama|Mystery Jan Cornet 36874745.0 20000000.0 6.3 117.0 English PG-13 NaN NaN NaN NaN
4934 Keanu 2016 USA Action|Comedy Will Forte 20566327.0 15000000.0 6.4 84.0 English R NaN NaN NaN NaN

4935 rows × 15 columns


In [962]:
combo2_1.shape


Out[962]:
(4935, 15)

In [963]:
# Rename Actor Name to Actor_2
combo2_1= combo2_1.rename(columns={'race': 'race_2', 'actor_name': 'actor_2_name', 'sex': 'sex_2'})
# Set Index
combo2_1= combo2_1.set_index("Year", "Film")

In [964]:
# Replace missing 
combo2_1['Awards']= (combo2_1['Awards'].replace(to_replace=[None], value=0))
combo2_1['race_2']= (combo2_1['race_2'].replace(to_replace=[None], value="Other"))
combo2_1['sex_2']= (combo2_1['sex_2'].replace(to_replace=[None], value="Other"))

In [965]:
# See
combo2_1


Out[965]:
Film country genres actor_2_name gross budget imdb_score num_user_for_reviews language content_rating race_2 sex_2 Awards Nominations
Year
1916 Intolerance: Love's Struggle Throughout the Ages USA Drama|History|War Mae Marsh NaN 385907.0 8.0 88.0 NaN Not Rated Other Other 0.0 NaN
1920 Over the Hill to the Poorhouse USA Crime|Drama Johnnie Walker 3000000.0 100000.0 4.8 1.0 NaN NaN Other Other 0.0 NaN
1925 The Big Parade USA Drama|Romance|War Ren’e Ador’e NaN 245000.0 8.3 45.0 NaN Not Rated Other Other 0.0 NaN
1927 Metropolis Germany Drama|Sci-Fi Gustav Fr’_hlich 26435.0 6000000.0 8.3 413.0 German Not Rated Other Other 0.0 NaN
1929 The Broadway Melody USA Musical|Romance Bessie Love 2808000.0 379000.0 6.3 71.0 English Passed Other Other 1.0 3.0
1929 Pandora's Box Germany Crime|Drama|Romance Francis Lederer 9950.0 NaN 8.0 84.0 German Not Rated Other Other 0.0 NaN
1930 Hell's Angels USA Drama|War Marian Marsh NaN 3950000.0 7.8 53.0 English Passed Other Other 0.0 NaN
1932 A Farewell to Arms USA Drama|Romance|War Helen Hayes NaN 800000.0 6.6 46.0 English Unrated Other Other 2.0 4.0
1933 42nd Street USA Comedy|Musical|Romance Dick Powell 2300000.0 439000.0 7.7 97.0 English Unrated Other Other 0.0 NaN
1933 She Done Him Wrong USA Comedy|Drama|History|Musical|Romance Gilbert Roland NaN 200000.0 6.5 59.0 English Approved Other Other 0.0 NaN
1934 It Happened One Night USA Comedy|Romance Alan Hale NaN 325000.0 8.2 235.0 English Unrated Other Other 5.0 5.0
1935 Top Hat USA Comedy|Musical|Romance Edward Everett Horton 3000000.0 609000.0 7.8 98.0 English Approved Other Other 0.0 NaN
1936 Modern Times USA Comedy|Drama|Family Stanley Blystone 163245.0 1500000.0 8.6 211.0 English G Other Other 0.0 NaN
1936 The Charge of the Light Brigade USA Action|Adventure|Romance|War David Niven NaN 1200000.0 7.1 52.0 English Approved Other Other 1.0 3.0
1937 Snow White and the Seven Dwarfs USA Animation|Family|Fantasy|Musical Billy Gilbert 184925485.0 2000000.0 7.7 204.0 English Approved Other Other 0.0 1.0
1937 The Prisoner of Zenda USA Adventure|Drama|Romance Mary Astor NaN NaN 7.8 44.0 English Approved Other Other 0.0 NaN
1938 Alexander's Ragtime Band USA Drama|Musical|Romance Don Ameche NaN 2000000.0 7.0 29.0 English Approved Other Other 1.0 6.0
1938 You Can't Take It with You USA Comedy|Drama|Romance Ann Miller NaN 1644736.0 8.0 133.0 English Approved Other Other 2.0 7.0
1939 Mr. Smith Goes to Washington USA Comedy|Drama Jean Arthur NaN 1500000.0 8.2 245.0 English Not Rated Other Other 1.0 11.0
1939 Gone with the Wind USA Drama|History|Romance|War George Reeves 198655278.0 3977000.0 8.2 706.0 English G Other Other 8.0 13.0
1939 The Wizard of Oz USA Adventure|Family|Fantasy|Musical Terry 22202612.0 2800000.0 8.1 533.0 English Passed Other Other 2.0 6.0
1940 The Blue Bird USA Drama|Family|Fantasy Nigel Bruce NaN NaN 6.5 25.0 English Approved Other Other 0.0 NaN
1940 Boom Town USA Adventure|Drama|Romance|Western Spencer Tracy NaN 1614000.0 7.1 34.0 English Passed Other Other 0.0 NaN
1940 Rebecca USA Drama|Film-Noir|Mystery|Thriller Joan Fontaine NaN 1288000.0 8.2 276.0 English Not Rated Other Other 2.0 11.0
1940 Pinocchio USA Animation|Family|Fantasy|Musical Dickie Jones 84300000.0 2600000.0 7.5 147.0 English Approved Other Other 2.0 2.0
1940 Fantasia USA Animation|Family|Fantasy|Music Deems Taylor 76400000.0 2280000.0 7.8 230.0 English G Other Other 0.0 0.0
1941 How Green Was My Valley USA Drama|Family Walter Pidgeon NaN 1250000.0 7.8 124.0 English Approved Other Other 5.0 10.0
1942 Casablanca USA Drama|Romance|War Claude Rains NaN 950000.0 8.6 1123.0 English PG Other Other 0.0 NaN
1942 Bambi USA Animation|Drama|Family Donnie Dunagan 102797150.0 NaN 7.4 136.0 English Approved Other Other 0.0 NaN
1943 A Guy Named Joe USA Drama|Fantasy|Romance|War Esther Williams NaN 2627000.0 7.0 27.0 English Passed Other Other 0.0 NaN
... ... ... ... ... ... ... ... ... ... ... ... ... ... ...
2016 Hail, Caesar! UK Comedy|Mystery Channing Tatum 29997095.0 22000000.0 6.4 302.0 English PG-13 Other Other 0.0 NaN
2016 The Infiltrator UK Biography|Crime|Drama|Thriller Amy Ryan 14946229.0 25000000.0 7.3 29.0 English R Other Other 0.0 NaN
2016 Ben-Hur USA Adventure|Drama|History Ayelet Zurer NaN 100000000.0 6.0 1.0 English PG-13 Other Other 0.0 NaN
2016 The Veil USA Horror Shannon Woodward NaN 4000000.0 4.7 29.0 English R Other Other 0.0 NaN
2016 Eddie the Eagle UK Biography|Comedy|Drama|Sport Taron Egerton 15785632.0 23000000.0 7.5 119.0 English PG-13 Other Other 0.0 NaN
2016 The Conjuring 2 USA Horror|Mystery|Thriller Frances O'Connor 102310175.0 40000000.0 7.8 279.0 English R Other Other 0.0 NaN
2016 The Birth of a Nation USA Biography|Drama Nate Parker NaN 10000000.0 5.4 8.0 English R African Descent Male 0.0 NaN
2016 Airlift India Action|Drama|History|Thriller|War Sameer Ali Khan NaN 4400000.0 8.5 178.0 Hindi NaN Other Other 0.0 NaN
2016 Our Kind of Traitor UK Thriller Pawel Szajda 3108216.0 NaN 6.4 21.0 English R Other Other 0.0 NaN
2016 Jane Got a Gun USA Action|Drama|Western Noah Emmerich 1512815.0 25000000.0 5.8 56.0 English R Other Other 0.0 NaN
2016 The Purge: Election Year France Action|Horror|Sci-Fi|Thriller Joseph Julian Soria 78845130.0 10000000.0 6.1 94.0 English R Other Other 0.0 NaN
2016 Ben-Hur USA Adventure|Drama|History Ayelet Zurer NaN 100000000.0 6.1 1.0 English PG-13 Other Other 0.0 NaN
2016 The Finest Hours USA Action|Drama|History|Thriller Abraham Benrubi 27550735.0 NaN 6.8 113.0 English PG-13 Other Other 0.0 NaN
2016 Triple 9 USA Action|Crime|Drama|Thriller Norman Reedus 12626905.0 20000000.0 6.3 106.0 English R Other Other 0.0 NaN
2016 Antibirth USA Horror Emmanuel Kabongo NaN 3500000.0 6.3 2.0 English NaN Other Other 0.0 NaN
2016 The Jungle Book UK Adventure|Drama|Family|Fantasy Bill Murray 362645141.0 175000000.0 7.8 398.0 English PG Other Other 1.0 1.0
2016 Dirty Grandpa USA Comedy Zoey Deutch 35537564.0 11500000.0 6.0 166.0 English R Other Other 0.0 NaN
2016 London Has Fallen UK Action|Crime|Drama|Thriller Radha Mitchell 62401264.0 60000000.0 5.9 323.0 English R Other Other 0.0 NaN
2016 Nerve USA Adventure|Crime|Mystery|Sci-Fi|Thriller Marc John Jefferies 28876924.0 20000000.0 7.1 35.0 English PG-13 African Descent Male 0.0 NaN
2016 Me Before You UK Drama|Romance Emilia Clarke 56154094.0 20000000.0 7.6 130.0 English PG-13 Other Other 0.0 NaN
2016 Suicide Squad USA Action|Adventure|Comedy|Sci-Fi Robin Atkin Downes 161087183.0 175000000.0 6.9 971.0 English PG-13 Other Other 1.0 1.0
2016 The BFG UK Adventure|Family|Fantasy Penelope Wilton 52792307.0 140000000.0 6.8 106.0 English PG Other Other 0.0 NaN
2016 Deadpool USA Action|Adventure|Comedy|Romance|Sci-Fi Ed Skrein 363024263.0 58000000.0 8.1 1058.0 English R Other Other 0.0 NaN
2016 X-Men: Apocalypse USA Action|Adventure|Sci-Fi Michael Fassbender 154985087.0 178000000.0 7.3 622.0 English PG-13 Other Other 0.0 NaN
2016 The Legend of Tarzan USA Action|Adventure|Drama|Romance Alexander Skarsg’Çrd 124051759.0 180000000.0 6.6 239.0 English PG-13 Other Other 0.0 NaN
2016 Compadres Mexico Action|Comedy H’ctor Jim’nez 3105269.0 3000000.0 5.0 8.0 English R Other Other 0.0 NaN
2016 Bad Moms USA Comedy Jay Hernandez 55461307.0 20000000.0 6.7 46.0 English R Other Other 0.0 NaN
2016 Misconduct USA Drama|Thriller Anthony Hopkins NaN 11000000.0 5.3 50.0 English R Other Other 0.0 NaN
2016 Risen USA Action|Adventure|Drama|Mystery Jan Cornet 36874745.0 20000000.0 6.3 117.0 English PG-13 Other Other 0.0 NaN
2016 Keanu USA Action|Comedy Will Forte 20566327.0 15000000.0 6.4 84.0 English R Other Other 0.0 NaN

4935 rows × 14 columns

Group 3 - Supporting Actors 2

  • Add race and gender of main actors to movie data

In [966]:
# Group 3
combo3 = pd.merge(df_m3, df_rG,   # left and right df's
                 how='left',        # add to left 
                 on='actor_name'       # link with this variable/column 
                ) 

combo3


Out[966]:
Film Year country genres actor_name gross budget imdb_score num_user_for_reviews language content_rating race sex
0 Intolerance: Love's Struggle Throughout the Ages 1916 USA Drama|History|War Walter Long NaN 385907.0 8.0 88.0 NaN Not Rated NaN NaN
1 Over the Hill to the Poorhouse 1920 USA Crime|Drama Mary Carr 3000000.0 100000.0 4.8 1.0 NaN NaN NaN NaN
2 The Big Parade 1925 USA Drama|Romance|War Claire Adams NaN 245000.0 8.3 45.0 NaN Not Rated NaN NaN
3 Metropolis 1927 Germany Drama|Sci-Fi Rudolf Klein-Rogge 26435.0 6000000.0 8.3 413.0 German Not Rated NaN NaN
4 The Broadway Melody 1929 USA Musical|Romance Charles King 2808000.0 379000.0 6.3 71.0 English Passed NaN NaN
5 Pandora's Box 1929 Germany Crime|Drama|Romance Fritz Kortner 9950.0 NaN 8.0 84.0 German Not Rated NaN NaN
6 Hell's Angels 1930 USA Drama|War James Hall NaN 3950000.0 7.8 53.0 English Passed NaN NaN
7 A Farewell to Arms 1932 USA Drama|Romance|War Adolphe Menjou NaN 800000.0 6.6 46.0 English Unrated NaN NaN
8 42nd Street 1933 USA Comedy|Musical|Romance George Brent 2300000.0 439000.0 7.7 97.0 English Unrated NaN NaN
9 She Done Him Wrong 1933 USA Comedy|Drama|History|Musical|Romance Louise Beavers NaN 200000.0 6.5 59.0 English Approved African Descent Female
10 It Happened One Night 1934 USA Comedy|Romance Walter Connolly NaN 325000.0 8.2 235.0 English Unrated NaN NaN
11 Top Hat 1935 USA Comedy|Musical|Romance Eric Blore 3000000.0 609000.0 7.8 98.0 English Approved NaN NaN
12 Modern Times 1936 USA Comedy|Drama|Family Fred Malatesta 163245.0 1500000.0 8.6 211.0 English G NaN NaN
13 The Charge of the Light Brigade 1936 USA Action|Adventure|Romance|War Spring Byington NaN 1200000.0 7.1 52.0 English Approved NaN NaN
14 Snow White and the Seven Dwarfs 1937 USA Animation|Family|Fantasy|Musical Lucille La Verne 184925485.0 2000000.0 7.7 204.0 English Approved NaN NaN
15 The Prisoner of Zenda 1937 USA Adventure|Drama|Romance Ronald Colman NaN NaN 7.8 44.0 English Approved NaN NaN
16 Alexander's Ragtime Band 1938 USA Drama|Musical|Romance John Carradine NaN 2000000.0 7.0 29.0 English Approved NaN NaN
17 You Can't Take It with You 1938 USA Comedy|Drama|Romance Lionel Barrymore NaN 1644736.0 8.0 133.0 English Approved NaN NaN
18 Mr. Smith Goes to Washington 1939 USA Comedy|Drama Thomas Mitchell NaN 1500000.0 8.2 245.0 English Not Rated NaN NaN
19 Gone with the Wind 1939 USA Drama|History|Romance|War Thomas Mitchell 198655278.0 3977000.0 8.2 706.0 English G NaN NaN
20 The Wizard of Oz 1939 USA Adventure|Family|Fantasy|Musical Billie Burke 22202612.0 2800000.0 8.1 533.0 English Passed NaN NaN
21 The Blue Bird 1940 USA Drama|Family|Fantasy Gale Sondergaard NaN NaN 6.5 25.0 English Approved NaN NaN
22 Boom Town 1940 USA Adventure|Drama|Romance|Western Claudette Colbert NaN 1614000.0 7.1 34.0 English Passed NaN NaN
23 Rebecca 1940 USA Drama|Film-Noir|Mystery|Thriller George Sanders NaN 1288000.0 8.2 276.0 English Not Rated NaN NaN
24 Pinocchio 1940 USA Animation|Family|Fantasy|Musical Cliff Edwards 84300000.0 2600000.0 7.5 147.0 English Approved NaN NaN
25 Fantasia 1940 USA Animation|Family|Fantasy|Music NaN 76400000.0 2280000.0 7.8 230.0 English G NaN NaN
26 How Green Was My Valley 1941 USA Drama|Family Barry Fitzgerald NaN 1250000.0 7.8 124.0 English Approved NaN NaN
27 Casablanca 1942 USA Drama|Romance|War Conrad Veidt NaN 950000.0 8.6 1123.0 English PG NaN NaN
28 Bambi 1942 USA Animation|Drama|Family Ann Gillis 102797150.0 NaN 7.4 136.0 English Approved NaN NaN
29 A Guy Named Joe 1943 USA Drama|Fantasy|Romance|War Irene Dunne NaN 2627000.0 7.0 27.0 English Passed NaN NaN
... ... ... ... ... ... ... ... ... ... ... ... ... ...
4905 Hail, Caesar! 2016 UK Comedy|Mystery Alden Ehrenreich 29997095.0 22000000.0 6.4 302.0 English PG-13 NaN NaN
4906 The Infiltrator 2016 UK Biography|Crime|Drama|Thriller Olympia Dukakis 14946229.0 25000000.0 7.3 29.0 English R NaN NaN
4907 Ben-Hur 2016 USA Adventure|Drama|History Moises Arias NaN 100000000.0 6.0 1.0 English PG-13 NaN NaN
4908 The Veil 2016 USA Horror Kira McLean NaN 4000000.0 4.7 29.0 English R NaN NaN
4909 Eddie the Eagle 2016 UK Biography|Comedy|Drama|Sport Tim McInnerny 15785632.0 23000000.0 7.5 119.0 English PG-13 NaN NaN
4910 The Conjuring 2 2016 USA Horror|Mystery|Thriller Robin Atkin Downes 102310175.0 40000000.0 7.8 279.0 English R NaN NaN
4911 The Birth of a Nation 2016 USA Biography|Drama Aunjanue Ellis NaN 10000000.0 5.4 8.0 English R African Descent Female
4912 Airlift 2016 India Action|Drama|History|Thriller|War Purab Kohli NaN 4400000.0 8.5 178.0 Hindi NaN NaN NaN
4913 Our Kind of Traitor 2016 UK Thriller Grigoriy Dobrygin 3108216.0 NaN 6.4 21.0 English R NaN NaN
4914 Jane Got a Gun 2016 USA Action|Drama|Western Boyd Holbrook 1512815.0 25000000.0 5.8 56.0 English R NaN NaN
4915 The Purge: Election Year 2016 France Action|Horror|Sci-Fi|Thriller Mykelti Williamson 78845130.0 10000000.0 6.1 94.0 English R African Descent Male
4916 Ben-Hur 2016 USA Adventure|Drama|History Moises Arias NaN 100000000.0 6.1 1.0 English PG-13 NaN NaN
4917 The Finest Hours 2016 USA Action|Drama|History|Thriller Graham McTavish 27550735.0 NaN 6.8 113.0 English PG-13 NaN NaN
4918 Triple 9 2016 USA Action|Crime|Drama|Thriller Clifton Collins Jr. 12626905.0 20000000.0 6.3 106.0 English R NaN NaN
4919 Antibirth 2016 USA Horror Mark Webber NaN 3500000.0 6.3 2.0 English NaN NaN NaN
4920 The Jungle Book 2016 UK Adventure|Drama|Family|Fantasy Garry Shandling 362645141.0 175000000.0 7.8 398.0 English PG NaN NaN
4921 Dirty Grandpa 2016 USA Comedy Jason Mantzoukas 35537564.0 11500000.0 6.0 166.0 English R NaN NaN
4922 London Has Fallen 2016 UK Action|Crime|Drama|Thriller Julian Kostov 62401264.0 60000000.0 5.9 323.0 English R NaN NaN
4923 Nerve 2016 USA Adventure|Crime|Mystery|Sci-Fi|Thriller Emily Meade 28876924.0 20000000.0 7.1 35.0 English PG-13 NaN NaN
4924 Me Before You 2016 UK Drama|Romance Brendan Coyle 56154094.0 20000000.0 7.6 130.0 English PG-13 NaN NaN
4925 Suicide Squad 2016 USA Action|Adventure|Comedy|Sci-Fi Ike Barinholtz 161087183.0 175000000.0 6.9 971.0 English PG-13 NaN NaN
4926 The BFG 2016 UK Adventure|Family|Fantasy Rafe Spall 52792307.0 140000000.0 6.8 106.0 English PG NaN NaN
4927 Deadpool 2016 USA Action|Adventure|Comedy|Romance|Sci-Fi Stefan Kapicic 363024263.0 58000000.0 8.1 1058.0 English R NaN NaN
4928 X-Men: Apocalypse 2016 USA Action|Adventure|Sci-Fi Tye Sheridan 154985087.0 178000000.0 7.3 622.0 English PG-13 NaN NaN
4929 The Legend of Tarzan 2016 USA Action|Adventure|Drama|Romance Casper Crump 124051759.0 180000000.0 6.6 239.0 English PG-13 NaN NaN
4930 Compadres 2016 Mexico Action|Comedy Erick Elias 3105269.0 3000000.0 5.0 8.0 English R NaN NaN
4931 Bad Moms 2016 USA Comedy Jada Pinkett Smith 55461307.0 20000000.0 6.7 46.0 English R African Descent Female
4932 Misconduct 2016 USA Drama|Thriller Glen Powell NaN 11000000.0 5.3 50.0 English R NaN NaN
4933 Risen 2016 USA Action|Adventure|Drama|Mystery Mar’_a Botto 36874745.0 20000000.0 6.3 117.0 English PG-13 NaN NaN
4934 Keanu 2016 USA Action|Comedy Keegan-Michael Key 20566327.0 15000000.0 6.4 84.0 English R African Descent Male

4935 rows × 13 columns


In [967]:
combo3.shape


Out[967]:
(4935, 13)

In [968]:
# Group 2
combo3_1 = pd.merge(combo3, award_dat,   # left and right df's
                 how='left',        # add to left 
                 on=['Film','Year']      # link with this variable/column 
                ) 

combo3_1


Out[968]:
Film Year country genres actor_name gross budget imdb_score num_user_for_reviews language content_rating race sex Awards Nominations
0 Intolerance: Love's Struggle Throughout the Ages 1916 USA Drama|History|War Walter Long NaN 385907.0 8.0 88.0 NaN Not Rated NaN NaN NaN NaN
1 Over the Hill to the Poorhouse 1920 USA Crime|Drama Mary Carr 3000000.0 100000.0 4.8 1.0 NaN NaN NaN NaN NaN NaN
2 The Big Parade 1925 USA Drama|Romance|War Claire Adams NaN 245000.0 8.3 45.0 NaN Not Rated NaN NaN NaN NaN
3 Metropolis 1927 Germany Drama|Sci-Fi Rudolf Klein-Rogge 26435.0 6000000.0 8.3 413.0 German Not Rated NaN NaN NaN NaN
4 The Broadway Melody 1929 USA Musical|Romance Charles King 2808000.0 379000.0 6.3 71.0 English Passed NaN NaN 1.0 3.0
5 Pandora's Box 1929 Germany Crime|Drama|Romance Fritz Kortner 9950.0 NaN 8.0 84.0 German Not Rated NaN NaN NaN NaN
6 Hell's Angels 1930 USA Drama|War James Hall NaN 3950000.0 7.8 53.0 English Passed NaN NaN NaN NaN
7 A Farewell to Arms 1932 USA Drama|Romance|War Adolphe Menjou NaN 800000.0 6.6 46.0 English Unrated NaN NaN 2.0 4.0
8 42nd Street 1933 USA Comedy|Musical|Romance George Brent 2300000.0 439000.0 7.7 97.0 English Unrated NaN NaN NaN NaN
9 She Done Him Wrong 1933 USA Comedy|Drama|History|Musical|Romance Louise Beavers NaN 200000.0 6.5 59.0 English Approved African Descent Female NaN NaN
10 It Happened One Night 1934 USA Comedy|Romance Walter Connolly NaN 325000.0 8.2 235.0 English Unrated NaN NaN 5.0 5.0
11 Top Hat 1935 USA Comedy|Musical|Romance Eric Blore 3000000.0 609000.0 7.8 98.0 English Approved NaN NaN NaN NaN
12 Modern Times 1936 USA Comedy|Drama|Family Fred Malatesta 163245.0 1500000.0 8.6 211.0 English G NaN NaN NaN NaN
13 The Charge of the Light Brigade 1936 USA Action|Adventure|Romance|War Spring Byington NaN 1200000.0 7.1 52.0 English Approved NaN NaN 1.0 3.0
14 Snow White and the Seven Dwarfs 1937 USA Animation|Family|Fantasy|Musical Lucille La Verne 184925485.0 2000000.0 7.7 204.0 English Approved NaN NaN 0.0 1.0
15 The Prisoner of Zenda 1937 USA Adventure|Drama|Romance Ronald Colman NaN NaN 7.8 44.0 English Approved NaN NaN NaN NaN
16 Alexander's Ragtime Band 1938 USA Drama|Musical|Romance John Carradine NaN 2000000.0 7.0 29.0 English Approved NaN NaN 1.0 6.0
17 You Can't Take It with You 1938 USA Comedy|Drama|Romance Lionel Barrymore NaN 1644736.0 8.0 133.0 English Approved NaN NaN 2.0 7.0
18 Mr. Smith Goes to Washington 1939 USA Comedy|Drama Thomas Mitchell NaN 1500000.0 8.2 245.0 English Not Rated NaN NaN 1.0 11.0
19 Gone with the Wind 1939 USA Drama|History|Romance|War Thomas Mitchell 198655278.0 3977000.0 8.2 706.0 English G NaN NaN 8.0 13.0
20 The Wizard of Oz 1939 USA Adventure|Family|Fantasy|Musical Billie Burke 22202612.0 2800000.0 8.1 533.0 English Passed NaN NaN 2.0 6.0
21 The Blue Bird 1940 USA Drama|Family|Fantasy Gale Sondergaard NaN NaN 6.5 25.0 English Approved NaN NaN NaN NaN
22 Boom Town 1940 USA Adventure|Drama|Romance|Western Claudette Colbert NaN 1614000.0 7.1 34.0 English Passed NaN NaN NaN NaN
23 Rebecca 1940 USA Drama|Film-Noir|Mystery|Thriller George Sanders NaN 1288000.0 8.2 276.0 English Not Rated NaN NaN 2.0 11.0
24 Pinocchio 1940 USA Animation|Family|Fantasy|Musical Cliff Edwards 84300000.0 2600000.0 7.5 147.0 English Approved NaN NaN 2.0 2.0
25 Fantasia 1940 USA Animation|Family|Fantasy|Music NaN 76400000.0 2280000.0 7.8 230.0 English G NaN NaN 0.0 0.0
26 How Green Was My Valley 1941 USA Drama|Family Barry Fitzgerald NaN 1250000.0 7.8 124.0 English Approved NaN NaN 5.0 10.0
27 Casablanca 1942 USA Drama|Romance|War Conrad Veidt NaN 950000.0 8.6 1123.0 English PG NaN NaN NaN NaN
28 Bambi 1942 USA Animation|Drama|Family Ann Gillis 102797150.0 NaN 7.4 136.0 English Approved NaN NaN NaN NaN
29 A Guy Named Joe 1943 USA Drama|Fantasy|Romance|War Irene Dunne NaN 2627000.0 7.0 27.0 English Passed NaN NaN NaN NaN
... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ...
4905 Hail, Caesar! 2016 UK Comedy|Mystery Alden Ehrenreich 29997095.0 22000000.0 6.4 302.0 English PG-13 NaN NaN NaN NaN
4906 The Infiltrator 2016 UK Biography|Crime|Drama|Thriller Olympia Dukakis 14946229.0 25000000.0 7.3 29.0 English R NaN NaN NaN NaN
4907 Ben-Hur 2016 USA Adventure|Drama|History Moises Arias NaN 100000000.0 6.0 1.0 English PG-13 NaN NaN NaN NaN
4908 The Veil 2016 USA Horror Kira McLean NaN 4000000.0 4.7 29.0 English R NaN NaN NaN NaN
4909 Eddie the Eagle 2016 UK Biography|Comedy|Drama|Sport Tim McInnerny 15785632.0 23000000.0 7.5 119.0 English PG-13 NaN NaN NaN NaN
4910 The Conjuring 2 2016 USA Horror|Mystery|Thriller Robin Atkin Downes 102310175.0 40000000.0 7.8 279.0 English R NaN NaN NaN NaN
4911 The Birth of a Nation 2016 USA Biography|Drama Aunjanue Ellis NaN 10000000.0 5.4 8.0 English R African Descent Female NaN NaN
4912 Airlift 2016 India Action|Drama|History|Thriller|War Purab Kohli NaN 4400000.0 8.5 178.0 Hindi NaN NaN NaN NaN NaN
4913 Our Kind of Traitor 2016 UK Thriller Grigoriy Dobrygin 3108216.0 NaN 6.4 21.0 English R NaN NaN NaN NaN
4914 Jane Got a Gun 2016 USA Action|Drama|Western Boyd Holbrook 1512815.0 25000000.0 5.8 56.0 English R NaN NaN NaN NaN
4915 The Purge: Election Year 2016 France Action|Horror|Sci-Fi|Thriller Mykelti Williamson 78845130.0 10000000.0 6.1 94.0 English R African Descent Male NaN NaN
4916 Ben-Hur 2016 USA Adventure|Drama|History Moises Arias NaN 100000000.0 6.1 1.0 English PG-13 NaN NaN NaN NaN
4917 The Finest Hours 2016 USA Action|Drama|History|Thriller Graham McTavish 27550735.0 NaN 6.8 113.0 English PG-13 NaN NaN NaN NaN
4918 Triple 9 2016 USA Action|Crime|Drama|Thriller Clifton Collins Jr. 12626905.0 20000000.0 6.3 106.0 English R NaN NaN NaN NaN
4919 Antibirth 2016 USA Horror Mark Webber NaN 3500000.0 6.3 2.0 English NaN NaN NaN NaN NaN
4920 The Jungle Book 2016 UK Adventure|Drama|Family|Fantasy Garry Shandling 362645141.0 175000000.0 7.8 398.0 English PG NaN NaN 1.0 1.0
4921 Dirty Grandpa 2016 USA Comedy Jason Mantzoukas 35537564.0 11500000.0 6.0 166.0 English R NaN NaN NaN NaN
4922 London Has Fallen 2016 UK Action|Crime|Drama|Thriller Julian Kostov 62401264.0 60000000.0 5.9 323.0 English R NaN NaN NaN NaN
4923 Nerve 2016 USA Adventure|Crime|Mystery|Sci-Fi|Thriller Emily Meade 28876924.0 20000000.0 7.1 35.0 English PG-13 NaN NaN NaN NaN
4924 Me Before You 2016 UK Drama|Romance Brendan Coyle 56154094.0 20000000.0 7.6 130.0 English PG-13 NaN NaN NaN NaN
4925 Suicide Squad 2016 USA Action|Adventure|Comedy|Sci-Fi Ike Barinholtz 161087183.0 175000000.0 6.9 971.0 English PG-13 NaN NaN 1.0 1.0
4926 The BFG 2016 UK Adventure|Family|Fantasy Rafe Spall 52792307.0 140000000.0 6.8 106.0 English PG NaN NaN NaN NaN
4927 Deadpool 2016 USA Action|Adventure|Comedy|Romance|Sci-Fi Stefan Kapicic 363024263.0 58000000.0 8.1 1058.0 English R NaN NaN NaN NaN
4928 X-Men: Apocalypse 2016 USA Action|Adventure|Sci-Fi Tye Sheridan 154985087.0 178000000.0 7.3 622.0 English PG-13 NaN NaN NaN NaN
4929 The Legend of Tarzan 2016 USA Action|Adventure|Drama|Romance Casper Crump 124051759.0 180000000.0 6.6 239.0 English PG-13 NaN NaN NaN NaN
4930 Compadres 2016 Mexico Action|Comedy Erick Elias 3105269.0 3000000.0 5.0 8.0 English R NaN NaN NaN NaN
4931 Bad Moms 2016 USA Comedy Jada Pinkett Smith 55461307.0 20000000.0 6.7 46.0 English R African Descent Female NaN NaN
4932 Misconduct 2016 USA Drama|Thriller Glen Powell NaN 11000000.0 5.3 50.0 English R NaN NaN NaN NaN
4933 Risen 2016 USA Action|Adventure|Drama|Mystery Mar’_a Botto 36874745.0 20000000.0 6.3 117.0 English PG-13 NaN NaN NaN NaN
4934 Keanu 2016 USA Action|Comedy Keegan-Michael Key 20566327.0 15000000.0 6.4 84.0 English R African Descent Male NaN NaN

4935 rows × 15 columns


In [969]:
combo3_1.shape


Out[969]:
(4935, 15)

In [970]:
# Rename Actor Name to Actor_2
combo3_1= combo3_1.rename(columns={'race': 'race_3', 'actor_name': 'actor_3_name', 'sex': 'sex_3'})
# Set Index
combo3_1= combo3_1.set_index("Year", "Film")

In [971]:
# Replace missing
combo3_1['Awards']= (combo3_1['Awards'].replace(to_replace=[None], value=0))
combo3_1['race_3']= (combo3_1['race_3'].replace(to_replace=[None], value="Other"))
combo3_1['sex_3']= (combo3_1['sex_3'].replace(to_replace=[None], value="Other"))
# See
combo3_1


Out[971]:
Film country genres actor_3_name gross budget imdb_score num_user_for_reviews language content_rating race_3 sex_3 Awards Nominations
Year
1916 Intolerance: Love's Struggle Throughout the Ages USA Drama|History|War Walter Long NaN 385907.0 8.0 88.0 NaN Not Rated Other Other 0.0 NaN
1920 Over the Hill to the Poorhouse USA Crime|Drama Mary Carr 3000000.0 100000.0 4.8 1.0 NaN NaN Other Other 0.0 NaN
1925 The Big Parade USA Drama|Romance|War Claire Adams NaN 245000.0 8.3 45.0 NaN Not Rated Other Other 0.0 NaN
1927 Metropolis Germany Drama|Sci-Fi Rudolf Klein-Rogge 26435.0 6000000.0 8.3 413.0 German Not Rated Other Other 0.0 NaN
1929 The Broadway Melody USA Musical|Romance Charles King 2808000.0 379000.0 6.3 71.0 English Passed Other Other 1.0 3.0
1929 Pandora's Box Germany Crime|Drama|Romance Fritz Kortner 9950.0 NaN 8.0 84.0 German Not Rated Other Other 0.0 NaN
1930 Hell's Angels USA Drama|War James Hall NaN 3950000.0 7.8 53.0 English Passed Other Other 0.0 NaN
1932 A Farewell to Arms USA Drama|Romance|War Adolphe Menjou NaN 800000.0 6.6 46.0 English Unrated Other Other 2.0 4.0
1933 42nd Street USA Comedy|Musical|Romance George Brent 2300000.0 439000.0 7.7 97.0 English Unrated Other Other 0.0 NaN
1933 She Done Him Wrong USA Comedy|Drama|History|Musical|Romance Louise Beavers NaN 200000.0 6.5 59.0 English Approved African Descent Female 0.0 NaN
1934 It Happened One Night USA Comedy|Romance Walter Connolly NaN 325000.0 8.2 235.0 English Unrated Other Other 5.0 5.0
1935 Top Hat USA Comedy|Musical|Romance Eric Blore 3000000.0 609000.0 7.8 98.0 English Approved Other Other 0.0 NaN
1936 Modern Times USA Comedy|Drama|Family Fred Malatesta 163245.0 1500000.0 8.6 211.0 English G Other Other 0.0 NaN
1936 The Charge of the Light Brigade USA Action|Adventure|Romance|War Spring Byington NaN 1200000.0 7.1 52.0 English Approved Other Other 1.0 3.0
1937 Snow White and the Seven Dwarfs USA Animation|Family|Fantasy|Musical Lucille La Verne 184925485.0 2000000.0 7.7 204.0 English Approved Other Other 0.0 1.0
1937 The Prisoner of Zenda USA Adventure|Drama|Romance Ronald Colman NaN NaN 7.8 44.0 English Approved Other Other 0.0 NaN
1938 Alexander's Ragtime Band USA Drama|Musical|Romance John Carradine NaN 2000000.0 7.0 29.0 English Approved Other Other 1.0 6.0
1938 You Can't Take It with You USA Comedy|Drama|Romance Lionel Barrymore NaN 1644736.0 8.0 133.0 English Approved Other Other 2.0 7.0
1939 Mr. Smith Goes to Washington USA Comedy|Drama Thomas Mitchell NaN 1500000.0 8.2 245.0 English Not Rated Other Other 1.0 11.0
1939 Gone with the Wind USA Drama|History|Romance|War Thomas Mitchell 198655278.0 3977000.0 8.2 706.0 English G Other Other 8.0 13.0
1939 The Wizard of Oz USA Adventure|Family|Fantasy|Musical Billie Burke 22202612.0 2800000.0 8.1 533.0 English Passed Other Other 2.0 6.0
1940 The Blue Bird USA Drama|Family|Fantasy Gale Sondergaard NaN NaN 6.5 25.0 English Approved Other Other 0.0 NaN
1940 Boom Town USA Adventure|Drama|Romance|Western Claudette Colbert NaN 1614000.0 7.1 34.0 English Passed Other Other 0.0 NaN
1940 Rebecca USA Drama|Film-Noir|Mystery|Thriller George Sanders NaN 1288000.0 8.2 276.0 English Not Rated Other Other 2.0 11.0
1940 Pinocchio USA Animation|Family|Fantasy|Musical Cliff Edwards 84300000.0 2600000.0 7.5 147.0 English Approved Other Other 2.0 2.0
1940 Fantasia USA Animation|Family|Fantasy|Music NaN 76400000.0 2280000.0 7.8 230.0 English G Other Other 0.0 0.0
1941 How Green Was My Valley USA Drama|Family Barry Fitzgerald NaN 1250000.0 7.8 124.0 English Approved Other Other 5.0 10.0
1942 Casablanca USA Drama|Romance|War Conrad Veidt NaN 950000.0 8.6 1123.0 English PG Other Other 0.0 NaN
1942 Bambi USA Animation|Drama|Family Ann Gillis 102797150.0 NaN 7.4 136.0 English Approved Other Other 0.0 NaN
1943 A Guy Named Joe USA Drama|Fantasy|Romance|War Irene Dunne NaN 2627000.0 7.0 27.0 English Passed Other Other 0.0 NaN
... ... ... ... ... ... ... ... ... ... ... ... ... ... ...
2016 Hail, Caesar! UK Comedy|Mystery Alden Ehrenreich 29997095.0 22000000.0 6.4 302.0 English PG-13 Other Other 0.0 NaN
2016 The Infiltrator UK Biography|Crime|Drama|Thriller Olympia Dukakis 14946229.0 25000000.0 7.3 29.0 English R Other Other 0.0 NaN
2016 Ben-Hur USA Adventure|Drama|History Moises Arias NaN 100000000.0 6.0 1.0 English PG-13 Other Other 0.0 NaN
2016 The Veil USA Horror Kira McLean NaN 4000000.0 4.7 29.0 English R Other Other 0.0 NaN
2016 Eddie the Eagle UK Biography|Comedy|Drama|Sport Tim McInnerny 15785632.0 23000000.0 7.5 119.0 English PG-13 Other Other 0.0 NaN
2016 The Conjuring 2 USA Horror|Mystery|Thriller Robin Atkin Downes 102310175.0 40000000.0 7.8 279.0 English R Other Other 0.0 NaN
2016 The Birth of a Nation USA Biography|Drama Aunjanue Ellis NaN 10000000.0 5.4 8.0 English R African Descent Female 0.0 NaN
2016 Airlift India Action|Drama|History|Thriller|War Purab Kohli NaN 4400000.0 8.5 178.0 Hindi NaN Other Other 0.0 NaN
2016 Our Kind of Traitor UK Thriller Grigoriy Dobrygin 3108216.0 NaN 6.4 21.0 English R Other Other 0.0 NaN
2016 Jane Got a Gun USA Action|Drama|Western Boyd Holbrook 1512815.0 25000000.0 5.8 56.0 English R Other Other 0.0 NaN
2016 The Purge: Election Year France Action|Horror|Sci-Fi|Thriller Mykelti Williamson 78845130.0 10000000.0 6.1 94.0 English R African Descent Male 0.0 NaN
2016 Ben-Hur USA Adventure|Drama|History Moises Arias NaN 100000000.0 6.1 1.0 English PG-13 Other Other 0.0 NaN
2016 The Finest Hours USA Action|Drama|History|Thriller Graham McTavish 27550735.0 NaN 6.8 113.0 English PG-13 Other Other 0.0 NaN
2016 Triple 9 USA Action|Crime|Drama|Thriller Clifton Collins Jr. 12626905.0 20000000.0 6.3 106.0 English R Other Other 0.0 NaN
2016 Antibirth USA Horror Mark Webber NaN 3500000.0 6.3 2.0 English NaN Other Other 0.0 NaN
2016 The Jungle Book UK Adventure|Drama|Family|Fantasy Garry Shandling 362645141.0 175000000.0 7.8 398.0 English PG Other Other 1.0 1.0
2016 Dirty Grandpa USA Comedy Jason Mantzoukas 35537564.0 11500000.0 6.0 166.0 English R Other Other 0.0 NaN
2016 London Has Fallen UK Action|Crime|Drama|Thriller Julian Kostov 62401264.0 60000000.0 5.9 323.0 English R Other Other 0.0 NaN
2016 Nerve USA Adventure|Crime|Mystery|Sci-Fi|Thriller Emily Meade 28876924.0 20000000.0 7.1 35.0 English PG-13 Other Other 0.0 NaN
2016 Me Before You UK Drama|Romance Brendan Coyle 56154094.0 20000000.0 7.6 130.0 English PG-13 Other Other 0.0 NaN
2016 Suicide Squad USA Action|Adventure|Comedy|Sci-Fi Ike Barinholtz 161087183.0 175000000.0 6.9 971.0 English PG-13 Other Other 1.0 1.0
2016 The BFG UK Adventure|Family|Fantasy Rafe Spall 52792307.0 140000000.0 6.8 106.0 English PG Other Other 0.0 NaN
2016 Deadpool USA Action|Adventure|Comedy|Romance|Sci-Fi Stefan Kapicic 363024263.0 58000000.0 8.1 1058.0 English R Other Other 0.0 NaN
2016 X-Men: Apocalypse USA Action|Adventure|Sci-Fi Tye Sheridan 154985087.0 178000000.0 7.3 622.0 English PG-13 Other Other 0.0 NaN
2016 The Legend of Tarzan USA Action|Adventure|Drama|Romance Casper Crump 124051759.0 180000000.0 6.6 239.0 English PG-13 Other Other 0.0 NaN
2016 Compadres Mexico Action|Comedy Erick Elias 3105269.0 3000000.0 5.0 8.0 English R Other Other 0.0 NaN
2016 Bad Moms USA Comedy Jada Pinkett Smith 55461307.0 20000000.0 6.7 46.0 English R African Descent Female 0.0 NaN
2016 Misconduct USA Drama|Thriller Glen Powell NaN 11000000.0 5.3 50.0 English R Other Other 0.0 NaN
2016 Risen USA Action|Adventure|Drama|Mystery Mar’_a Botto 36874745.0 20000000.0 6.3 117.0 English PG-13 Other Other 0.0 NaN
2016 Keanu USA Action|Comedy Keegan-Michael Key 20566327.0 15000000.0 6.4 84.0 English R African Descent Male 0.0 NaN

4935 rows × 14 columns

Combine Data Frames

  • Add desired columns from combo2_1 and combo3_1 to combo1_1

In [972]:
# Combo 2
dat_desire2= combo2_1.reset_index(drop=False)[['actor_2_name', 'race_2', 'sex_2']]
dat_desire2.shape


Out[972]:
(4935, 3)

In [973]:
# Combo 3
dat_desire3= combo3_1.reset_index(drop=False)[['actor_3_name', 'race_3', 'sex_3']]
dat_desire3.shape


Out[973]:
(4935, 3)
  • Merge the data sets

In [974]:
df_all = pd.concat([combo1_1.reset_index(drop=False), dat_desire2, dat_desire3], axis=1)
# Sort by Year
df_all= df_all.sort_values("Year")
df_all.shape


Out[974]:
(4935, 21)

In [975]:
# Add numeric sum of black actors in each film

df_all['r1total']= df_all['race_1'].replace({'Other': 0, 'African Descent': 1})
df_all['r2total']= df_all['race_2'].replace({'Other': 0, 'African Descent': 1})
df_all['r3total']= df_all['race_3'].replace({'Other': 0, 'African Descent': 1})
df_all['sum_totals']= df_all['r1total']+ df_all['r2total']+df_all['r3total']
df_all


Out[975]:
Year Film country genres actor_1_name gross budget imdb_score num_user_for_reviews language ... actor_2_name race_2 sex_2 actor_3_name race_3 sex_3 r1total r2total r3total sum_totals
0 1916 Intolerance: Love's Struggle Throughout the Ages USA Drama|History|War Lillian Gish NaN 385907.0 8.0 88.0 NaN ... Mae Marsh Other Other Walter Long Other Other 0 0 0 0
1 1920 Over the Hill to the Poorhouse USA Crime|Drama Stephen Carr 3000000.0 100000.0 4.8 1.0 NaN ... Johnnie Walker Other Other Mary Carr Other Other 0 0 0 0
2 1925 The Big Parade USA Drama|Romance|War John Gilbert NaN 245000.0 8.3 45.0 NaN ... Ren’e Ador’e Other Other Claire Adams Other Other 0 0 0 0
3 1927 Metropolis Germany Drama|Sci-Fi Brigitte Helm 26435.0 6000000.0 8.3 413.0 German ... Gustav Fr’_hlich Other Other Rudolf Klein-Rogge Other Other 0 0 0 0
4 1929 The Broadway Melody USA Musical|Romance Anita Page 2808000.0 379000.0 6.3 71.0 English ... Bessie Love Other Other Charles King Other Other 0 0 0 0
5 1929 Pandora's Box Germany Crime|Drama|Romance Louise Brooks 9950.0 NaN 8.0 84.0 German ... Francis Lederer Other Other Fritz Kortner Other Other 0 0 0 0
6 1930 Hell's Angels USA Drama|War Jean Harlow NaN 3950000.0 7.8 53.0 English ... Marian Marsh Other Other James Hall Other Other 0 0 0 0
7 1932 A Farewell to Arms USA Drama|Romance|War Gary Cooper NaN 800000.0 6.6 46.0 English ... Helen Hayes Other Other Adolphe Menjou Other Other 0 0 0 0
8 1933 42nd Street USA Comedy|Musical|Romance Ginger Rogers 2300000.0 439000.0 7.7 97.0 English ... Dick Powell Other Other George Brent Other Other 0 0 0 0
9 1933 She Done Him Wrong USA Comedy|Drama|History|Musical|Romance Mae West NaN 200000.0 6.5 59.0 English ... Gilbert Roland Other Other Louise Beavers African Descent Female 0 0 1 1
10 1934 It Happened One Night USA Comedy|Romance Claudette Colbert NaN 325000.0 8.2 235.0 English ... Alan Hale Other Other Walter Connolly Other Other 0 0 0 0
11 1935 Top Hat USA Comedy|Musical|Romance Ginger Rogers 3000000.0 609000.0 7.8 98.0 English ... Edward Everett Horton Other Other Eric Blore Other Other 0 0 0 0
12 1936 Modern Times USA Comedy|Drama|Family Paulette Goddard 163245.0 1500000.0 8.6 211.0 English ... Stanley Blystone Other Other Fred Malatesta Other Other 0 0 0 0
13 1936 The Charge of the Light Brigade USA Action|Adventure|Romance|War Errol Flynn NaN 1200000.0 7.1 52.0 English ... David Niven Other Other Spring Byington Other Other 0 0 0 0
14 1937 Snow White and the Seven Dwarfs USA Animation|Family|Fantasy|Musical Adriana Caselotti 184925485.0 2000000.0 7.7 204.0 English ... Billy Gilbert Other Other Lucille La Verne Other Other 0 0 0 0
15 1937 The Prisoner of Zenda USA Adventure|Drama|Romance David Niven NaN NaN 7.8 44.0 English ... Mary Astor Other Other Ronald Colman Other Other 0 0 0 0
17 1938 You Can't Take It with You USA Comedy|Drama|Romance Jean Arthur NaN 1644736.0 8.0 133.0 English ... Ann Miller Other Other Lionel Barrymore Other Other 0 0 0 0
16 1938 Alexander's Ragtime Band USA Drama|Musical|Romance Tyrone Power NaN 2000000.0 7.0 29.0 English ... Don Ameche Other Other John Carradine Other Other 0 0 0 0
18 1939 Mr. Smith Goes to Washington USA Comedy|Drama Claude Rains NaN 1500000.0 8.2 245.0 English ... Jean Arthur Other Other Thomas Mitchell Other Other 0 0 0 0
19 1939 Gone with the Wind USA Drama|History|Romance|War Hattie McDaniel 198655278.0 3977000.0 8.2 706.0 English ... George Reeves Other Other Thomas Mitchell Other Other 1 0 0 1
20 1939 The Wizard of Oz USA Adventure|Family|Fantasy|Musical Margaret Hamilton 22202612.0 2800000.0 8.1 533.0 English ... Terry Other Other Billie Burke Other Other 0 0 0 0
21 1940 The Blue Bird USA Drama|Family|Fantasy Spring Byington NaN NaN 6.5 25.0 English ... Nigel Bruce Other Other Gale Sondergaard Other Other 0 0 0 0
22 1940 Boom Town USA Adventure|Drama|Romance|Western Hedy Lamarr NaN 1614000.0 7.1 34.0 English ... Spencer Tracy Other Other Claudette Colbert Other Other 0 0 0 0
23 1940 Rebecca USA Drama|Film-Noir|Mystery|Thriller Laurence Olivier NaN 1288000.0 8.2 276.0 English ... Joan Fontaine Other Other George Sanders Other Other 0 0 0 0
24 1940 Pinocchio USA Animation|Family|Fantasy|Musical Mel Blanc 84300000.0 2600000.0 7.5 147.0 English ... Dickie Jones Other Other Cliff Edwards Other Other 0 0 0 0
25 1940 Fantasia USA Animation|Family|Fantasy|Music Leopold Stokowski 76400000.0 2280000.0 7.8 230.0 English ... Deems Taylor Other Other NaN Other Other 0 0 0 0
26 1941 How Green Was My Valley USA Drama|Family Roddy McDowall NaN 1250000.0 7.8 124.0 English ... Walter Pidgeon Other Other Barry Fitzgerald Other Other 0 0 0 0
27 1942 Casablanca USA Drama|Romance|War Humphrey Bogart NaN 950000.0 8.6 1123.0 English ... Claude Rains Other Other Conrad Veidt Other Other 0 0 0 0
28 1942 Bambi USA Animation|Drama|Family Sam Edwards 102797150.0 NaN 7.4 136.0 English ... Donnie Dunagan Other Other Ann Gillis Other Other 0 0 0 0
29 1943 A Guy Named Joe USA Drama|Fantasy|Romance|War Spencer Tracy NaN 2627000.0 7.0 27.0 English ... Esther Williams Other Other Irene Dunne Other Other 0 0 0 0
... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ...
4933 2016 Risen USA Action|Adventure|Drama|Mystery Peter Firth 36874745.0 20000000.0 6.3 117.0 English ... Jan Cornet Other Other Mar’_a Botto Other Other 0 0 0 0
4851 2016 Race Canada Biography|Drama|Sport William Hurt 19097994.0 NaN 7.1 49.0 English ... Tony Curran Other Other David Kross Other Other 0 0 0 0
4880 2016 Jason Bourne UK Action|Thriller Matt Damon 108521835.0 120000000.0 7.1 297.0 English ... Riz Ahmed Other Other Ato Essandoh African Descent Male 0 0 1 1
4852 2016 How to Be Single USA Comedy|Romance Alison Brie 46813366.0 38000000.0 6.1 83.0 English ... Damon Wayans Jr. African Descent Male Nicholas Braun Other Other 0 1 0 1
4855 2016 Pride and Prejudice and Zombies USA Action|Horror|Romance Matt Smith 10907291.0 28000000.0 5.8 134.0 English ... Bella Heathcote Other Other Sam Riley Other Other 0 0 0 0
4878 2016 Godzilla Resurgence Japan Action|Adventure|Drama|Horror|Sci-Fi Mark Chinnery NaN NaN 8.2 13.0 Japanese ... Shin'ya Tsukamoto Other Other Atsuko Maeda Other Other 0 0 0 0
4877 2016 Irreplaceable France Comedy|Drama Fran’_ois Cluzet NaN NaN 7.2 1.0 French ... F’lix Moati Other Other Marianne Denicourt Other Other 0 0 0 0
4876 2016 The Shallows USA Drama|Horror|Thriller ’–scar Jaenada 54257433.0 17000000.0 6.8 139.0 English ... Brett Cullen Other Other Sedona Legge Other Other 0 0 0 0
4875 2016 Midnight Special Greece Adventure|Drama|Sci-Fi|Thriller Kirsten Dunst 3707794.0 18000000.0 6.7 156.0 English ... Sam Shepard Other Other Paul Sparks Other Other 0 0 0 0
4874 2016 God's Not Dead 2 USA Drama Benjamin A. Onyango 20773070.0 5000000.0 3.4 102.0 English ... Robin Givens African Descent Female Maria Canals-Barrera Other Other 0 1 0 1
4873 2016 Ben-Hur USA Adventure|Drama|History Morgan Freeman NaN NaN 6.1 1.0 English ... Ayelet Zurer Other Other Moises Arias Other Other 1 0 0 1
4872 2016 Alice Through the Looking Glass USA Adventure|Family|Fantasy Johnny Depp 76846624.0 170000000.0 6.4 131.0 English ... Alan Rickman Other Other Anne Hathaway Other Other 0 0 0 0
4871 2016 Hands of Stone Panama Action|Biography|Drama|Sport Robert De Niro NaN 20000000.0 7.2 1.0 English ... Jurnee Smollett-Bell African Descent Female Edgar Ram’_rez Other Other 0 1 0 1
4870 2016 Warcraft USA Action|Adventure|Fantasy Dominic Cooper 46978995.0 160000000.0 7.3 781.0 English ... Callum Rennie Other Other Ruth Negga Other Other 0 0 0 0
4869 2016 The Secret Life of Pets Japan Animation|Comedy|Family Steve Coogan 323505540.0 75000000.0 6.8 155.0 English ... Eric Stonestreet Other Other Albert Brooks Other Other 0 0 0 0
4868 2016 Cabin Fever USA Horror Dustin Ingram NaN NaN 3.7 40.0 English ... Gage Golightly Other Other Samuel Davis Other Other 0 0 0 0
4867 2016 Fight Valley USA Action|Drama Kari J. Kramer NaN 27000000.0 5.0 6.0 English ... Erin O'Brien Other Other Cabrina Collesides Other Other 0 0 0 0
4866 2016 Bad Moms USA Comedy Mila Kunis 55461307.0 20000000.0 6.7 46.0 English ... Jay Hernandez Other Other Jada Pinkett Smith African Descent Female 0 0 1 1
4865 2016 Kickboxer: Vengeance USA Action Matthew Ziff NaN 17000000.0 9.1 1.0 NaN ... T.J. Storm Other Other Sam Medina Other Other 0 0 0 0
4864 2016 The Young Messiah USA Drama Clive Russell 6462576.0 18500000.0 5.4 30.0 English ... Vincent Walsh Other Other Finn Ireland Other Other 0 0 0 0
4863 2016 The Dog Lover USA Drama Lea Thompson NaN 2000000.0 4.8 8.0 English ... Christina Moore Other Other Cullen Douglas Other Other 0 0 0 0
4862 2016 Indignation USA Drama Logan Lerman 560512.0 NaN 7.8 3.0 Hebrew ... Sarah Gadon Other Other Tracy Letts Other Other 0 0 0 0
4861 2016 Operation Chromite South Korea Action|Drama|History|War Liam Neeson 31662.0 12620000.0 6.8 1.0 English ... Dean Dawson Other Other Jung-jae Lee Other Other 0 0 0 0
4860 2016 My Big Fat Greek Wedding 2 USA Comedy|Family|Romance Nia Vardalos 59573085.0 18000000.0 6.1 103.0 English ... Louis Mandylor Other Other Joey Fatone Other Other 0 0 0 0
4859 2016 Now You See Me 2 USA Action|Adventure|Comedy|Crime|Mystery|Thriller Daniel Radcliffe 64685359.0 90000000.0 6.9 139.0 English ... Morgan Freeman African Descent Male Sanaa Lathan African Descent Female 0 1 1 2
4858 2016 The Legend of Tarzan USA Action|Adventure|Drama|Romance Christoph Waltz 124051759.0 180000000.0 6.6 239.0 English ... Alexander Skarsg’Çrd Other Other Casper Crump Other Other 0 0 0 0
4857 2016 Ride Along 2 USA Action|Comedy Olivia Munn 90835030.0 40000000.0 5.9 58.0 English ... Nadine Velazquez Other Other Bruce McGill Other Other 0 0 0 0
4856 2016 Xi you ji zhi: Sun Wukong san da Baigu Jing China Action|Adventure|Fantasy Li Gong NaN 68005000.0 6.0 9.0 English ... Aaron Kwok Other Other Eddie Peng Other Other 0 0 0 0
4853 2016 Kicks USA Adventure Tina Gilton NaN NaN 7.8 6.0 English ... Natalie Stephany Aguilar Other Other Justin Hall Other Other 0 0 0 0
4934 2016 Keanu USA Action|Comedy Nia Long 20566327.0 15000000.0 6.4 84.0 English ... Will Forte Other Other Keegan-Michael Key African Descent Male 1 0 1 2

4935 rows × 25 columns


In [976]:
# Extract Hollywood

hollywood= df_all[df_all['country'] == "USA"]
hollywood


Out[976]:
Year Film country genres actor_1_name gross budget imdb_score num_user_for_reviews language ... actor_2_name race_2 sex_2 actor_3_name race_3 sex_3 r1total r2total r3total sum_totals
0 1916 Intolerance: Love's Struggle Throughout the Ages USA Drama|History|War Lillian Gish NaN 385907.0 8.0 88.0 NaN ... Mae Marsh Other Other Walter Long Other Other 0 0 0 0
1 1920 Over the Hill to the Poorhouse USA Crime|Drama Stephen Carr 3000000.0 100000.0 4.8 1.0 NaN ... Johnnie Walker Other Other Mary Carr Other Other 0 0 0 0
2 1925 The Big Parade USA Drama|Romance|War John Gilbert NaN 245000.0 8.3 45.0 NaN ... Ren’e Ador’e Other Other Claire Adams Other Other 0 0 0 0
4 1929 The Broadway Melody USA Musical|Romance Anita Page 2808000.0 379000.0 6.3 71.0 English ... Bessie Love Other Other Charles King Other Other 0 0 0 0
6 1930 Hell's Angels USA Drama|War Jean Harlow NaN 3950000.0 7.8 53.0 English ... Marian Marsh Other Other James Hall Other Other 0 0 0 0
7 1932 A Farewell to Arms USA Drama|Romance|War Gary Cooper NaN 800000.0 6.6 46.0 English ... Helen Hayes Other Other Adolphe Menjou Other Other 0 0 0 0
8 1933 42nd Street USA Comedy|Musical|Romance Ginger Rogers 2300000.0 439000.0 7.7 97.0 English ... Dick Powell Other Other George Brent Other Other 0 0 0 0
9 1933 She Done Him Wrong USA Comedy|Drama|History|Musical|Romance Mae West NaN 200000.0 6.5 59.0 English ... Gilbert Roland Other Other Louise Beavers African Descent Female 0 0 1 1
10 1934 It Happened One Night USA Comedy|Romance Claudette Colbert NaN 325000.0 8.2 235.0 English ... Alan Hale Other Other Walter Connolly Other Other 0 0 0 0
11 1935 Top Hat USA Comedy|Musical|Romance Ginger Rogers 3000000.0 609000.0 7.8 98.0 English ... Edward Everett Horton Other Other Eric Blore Other Other 0 0 0 0
12 1936 Modern Times USA Comedy|Drama|Family Paulette Goddard 163245.0 1500000.0 8.6 211.0 English ... Stanley Blystone Other Other Fred Malatesta Other Other 0 0 0 0
13 1936 The Charge of the Light Brigade USA Action|Adventure|Romance|War Errol Flynn NaN 1200000.0 7.1 52.0 English ... David Niven Other Other Spring Byington Other Other 0 0 0 0
14 1937 Snow White and the Seven Dwarfs USA Animation|Family|Fantasy|Musical Adriana Caselotti 184925485.0 2000000.0 7.7 204.0 English ... Billy Gilbert Other Other Lucille La Verne Other Other 0 0 0 0
15 1937 The Prisoner of Zenda USA Adventure|Drama|Romance David Niven NaN NaN 7.8 44.0 English ... Mary Astor Other Other Ronald Colman Other Other 0 0 0 0
17 1938 You Can't Take It with You USA Comedy|Drama|Romance Jean Arthur NaN 1644736.0 8.0 133.0 English ... Ann Miller Other Other Lionel Barrymore Other Other 0 0 0 0
16 1938 Alexander's Ragtime Band USA Drama|Musical|Romance Tyrone Power NaN 2000000.0 7.0 29.0 English ... Don Ameche Other Other John Carradine Other Other 0 0 0 0
18 1939 Mr. Smith Goes to Washington USA Comedy|Drama Claude Rains NaN 1500000.0 8.2 245.0 English ... Jean Arthur Other Other Thomas Mitchell Other Other 0 0 0 0
19 1939 Gone with the Wind USA Drama|History|Romance|War Hattie McDaniel 198655278.0 3977000.0 8.2 706.0 English ... George Reeves Other Other Thomas Mitchell Other Other 1 0 0 1
20 1939 The Wizard of Oz USA Adventure|Family|Fantasy|Musical Margaret Hamilton 22202612.0 2800000.0 8.1 533.0 English ... Terry Other Other Billie Burke Other Other 0 0 0 0
21 1940 The Blue Bird USA Drama|Family|Fantasy Spring Byington NaN NaN 6.5 25.0 English ... Nigel Bruce Other Other Gale Sondergaard Other Other 0 0 0 0
22 1940 Boom Town USA Adventure|Drama|Romance|Western Hedy Lamarr NaN 1614000.0 7.1 34.0 English ... Spencer Tracy Other Other Claudette Colbert Other Other 0 0 0 0
23 1940 Rebecca USA Drama|Film-Noir|Mystery|Thriller Laurence Olivier NaN 1288000.0 8.2 276.0 English ... Joan Fontaine Other Other George Sanders Other Other 0 0 0 0
24 1940 Pinocchio USA Animation|Family|Fantasy|Musical Mel Blanc 84300000.0 2600000.0 7.5 147.0 English ... Dickie Jones Other Other Cliff Edwards Other Other 0 0 0 0
25 1940 Fantasia USA Animation|Family|Fantasy|Music Leopold Stokowski 76400000.0 2280000.0 7.8 230.0 English ... Deems Taylor Other Other NaN Other Other 0 0 0 0
26 1941 How Green Was My Valley USA Drama|Family Roddy McDowall NaN 1250000.0 7.8 124.0 English ... Walter Pidgeon Other Other Barry Fitzgerald Other Other 0 0 0 0
27 1942 Casablanca USA Drama|Romance|War Humphrey Bogart NaN 950000.0 8.6 1123.0 English ... Claude Rains Other Other Conrad Veidt Other Other 0 0 0 0
28 1942 Bambi USA Animation|Drama|Family Sam Edwards 102797150.0 NaN 7.4 136.0 English ... Donnie Dunagan Other Other Ann Gillis Other Other 0 0 0 0
29 1943 A Guy Named Joe USA Drama|Fantasy|Romance|War Spencer Tracy NaN 2627000.0 7.0 27.0 English ... Esther Williams Other Other Irene Dunne Other Other 0 0 0 0
30 1944 Bathing Beauty USA Comedy|Musical Esther Williams NaN 2361000.0 6.5 28.0 English ... Basil Rathbone Other Other Red Skelton Other Other 0 0 0 0
33 1945 Spellbound USA Film-Noir|Mystery|Romance|Thriller Norman Lloyd NaN 1696377.0 7.6 161.0 English ... Rhonda Fleming Other Other Leo G. Carroll Other Other 0 0 0 0
... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ...
4841 2016 Fifty Shades of Black USA Comedy Fred Willard 11675178.0 5000000.0 3.5 53.0 English ... Mike Epps African Descent Male Russell Peters Other Other 0 1 0 1
4837 2016 Mr. Church USA Drama Mckenna Grace NaN 8000000.0 8.0 5.0 English ... Lucy Fry Other Other Natalie Coughlin Other Other 0 0 0 0
4836 2016 Lights Out USA Horror Billy Burke 56536016.0 4900000.0 6.9 95.0 English ... Amiah Miller Other Other Gabriel Bateman Other Other 0 0 0 0
4835 2016 The Boss USA Comedy Peter Dinklage 63034755.0 29000000.0 5.3 96.0 English ... Tyler Labine Other Other Ben Falcone Other Other 0 0 0 0
4834 2016 The Forest USA Horror|Mystery|Thriller Eoin Macken 26583369.0 10000000.0 4.8 127.0 English ... Stephanie Vogt Other Other Gen Seto Other Other 0 0 0 0
4833 2016 Money Monster USA Crime|Drama|Thriller Julia Roberts 41008532.0 27000000.0 6.7 103.0 English ... Jack O'Connell Other Other Chris Bauer Other Other 0 0 0 0
4831 2016 Captain America: Civil War USA Action|Adventure|Sci-Fi Robert Downey Jr. 407197282.0 250000000.0 8.2 1022.0 English ... Scarlett Johansson Other Other Chris Evans Other Other 0 0 0 0
4830 2016 Neighbors 2: Sorority Rising USA Comedy Chlo’â Grace Moretz 55291815.0 35000000.0 6.0 111.0 English ... Ike Barinholtz Other Other Kiersey Clemons African Descent Female 0 0 1 1
4829 2016 Yoga Hosers USA Comedy|Fantasy|Horror|Thriller Johnny Depp NaN 5000000.0 4.8 4.0 English ... Haley Joel Osment Other Other Natasha Lyonne Other Other 0 0 0 0
4933 2016 Risen USA Action|Adventure|Drama|Mystery Peter Firth 36874745.0 20000000.0 6.3 117.0 English ... Jan Cornet Other Other Mar’_a Botto Other Other 0 0 0 0
4852 2016 How to Be Single USA Comedy|Romance Alison Brie 46813366.0 38000000.0 6.1 83.0 English ... Damon Wayans Jr. African Descent Male Nicholas Braun Other Other 0 1 0 1
4855 2016 Pride and Prejudice and Zombies USA Action|Horror|Romance Matt Smith 10907291.0 28000000.0 5.8 134.0 English ... Bella Heathcote Other Other Sam Riley Other Other 0 0 0 0
4876 2016 The Shallows USA Drama|Horror|Thriller ’–scar Jaenada 54257433.0 17000000.0 6.8 139.0 English ... Brett Cullen Other Other Sedona Legge Other Other 0 0 0 0
4874 2016 God's Not Dead 2 USA Drama Benjamin A. Onyango 20773070.0 5000000.0 3.4 102.0 English ... Robin Givens African Descent Female Maria Canals-Barrera Other Other 0 1 0 1
4873 2016 Ben-Hur USA Adventure|Drama|History Morgan Freeman NaN NaN 6.1 1.0 English ... Ayelet Zurer Other Other Moises Arias Other Other 1 0 0 1
4872 2016 Alice Through the Looking Glass USA Adventure|Family|Fantasy Johnny Depp 76846624.0 170000000.0 6.4 131.0 English ... Alan Rickman Other Other Anne Hathaway Other Other 0 0 0 0
4870 2016 Warcraft USA Action|Adventure|Fantasy Dominic Cooper 46978995.0 160000000.0 7.3 781.0 English ... Callum Rennie Other Other Ruth Negga Other Other 0 0 0 0
4868 2016 Cabin Fever USA Horror Dustin Ingram NaN NaN 3.7 40.0 English ... Gage Golightly Other Other Samuel Davis Other Other 0 0 0 0
4867 2016 Fight Valley USA Action|Drama Kari J. Kramer NaN 27000000.0 5.0 6.0 English ... Erin O'Brien Other Other Cabrina Collesides Other Other 0 0 0 0
4866 2016 Bad Moms USA Comedy Mila Kunis 55461307.0 20000000.0 6.7 46.0 English ... Jay Hernandez Other Other Jada Pinkett Smith African Descent Female 0 0 1 1
4865 2016 Kickboxer: Vengeance USA Action Matthew Ziff NaN 17000000.0 9.1 1.0 NaN ... T.J. Storm Other Other Sam Medina Other Other 0 0 0 0
4864 2016 The Young Messiah USA Drama Clive Russell 6462576.0 18500000.0 5.4 30.0 English ... Vincent Walsh Other Other Finn Ireland Other Other 0 0 0 0
4863 2016 The Dog Lover USA Drama Lea Thompson NaN 2000000.0 4.8 8.0 English ... Christina Moore Other Other Cullen Douglas Other Other 0 0 0 0
4862 2016 Indignation USA Drama Logan Lerman 560512.0 NaN 7.8 3.0 Hebrew ... Sarah Gadon Other Other Tracy Letts Other Other 0 0 0 0
4860 2016 My Big Fat Greek Wedding 2 USA Comedy|Family|Romance Nia Vardalos 59573085.0 18000000.0 6.1 103.0 English ... Louis Mandylor Other Other Joey Fatone Other Other 0 0 0 0
4859 2016 Now You See Me 2 USA Action|Adventure|Comedy|Crime|Mystery|Thriller Daniel Radcliffe 64685359.0 90000000.0 6.9 139.0 English ... Morgan Freeman African Descent Male Sanaa Lathan African Descent Female 0 1 1 2
4858 2016 The Legend of Tarzan USA Action|Adventure|Drama|Romance Christoph Waltz 124051759.0 180000000.0 6.6 239.0 English ... Alexander Skarsg’Çrd Other Other Casper Crump Other Other 0 0 0 0
4857 2016 Ride Along 2 USA Action|Comedy Olivia Munn 90835030.0 40000000.0 5.9 58.0 English ... Nadine Velazquez Other Other Bruce McGill Other Other 0 0 0 0
4853 2016 Kicks USA Adventure Tina Gilton NaN NaN 7.8 6.0 English ... Natalie Stephany Aguilar Other Other Justin Hall Other Other 0 0 0 0
4934 2016 Keanu USA Action|Comedy Nia Long 20566327.0 15000000.0 6.4 84.0 English ... Will Forte Other Other Keegan-Michael Key African Descent Male 1 0 1 2

3733 rows × 25 columns

Graphs

Nominations for films with African American lead and supporting actors

Versus non-black lead and supporting actors


In [977]:
fig, ax = plt.subplots()

# Movies with nominations
hollywood_AW= hollywood[hollywood['Nominations'] > 0]

# Create dataframe where 'sum_totals' != 0
interest1 = hollywood_AW[hollywood_AW['sum_totals'] > 0]
interest1_opp = hollywood_AW[hollywood_AW['sum_totals'] == 0]

# Africa American 
plot1= interest1.plot.kde(x='Year', y="Nominations", color="blue", ax = ax, label = 'Of African Descent', 
                          xlim= [-2, 13], subplots= True)
ax.vlines(interest1['Nominations'].mode(), 0, 0.4, color='red', lw=2)
ax.set_title("Nominations: Films with Black Lead and Supporting Actors" ) 

# Other races
plot2= interest1_opp.plot.kde(x='Year', y="Nominations", color="green", label = 'Not of African Descent', xlim= [-2, 13])
plot2.set_title("Nominations: Films with No Black Lead and Supporting Actors" ) 
plot2.vlines(interest1_opp['Nominations'].mode(), 0, 0.4, color='red', lw=2)


Out[977]:
<matplotlib.collections.LineCollection at 0x15532dbe0>

Gross Revenue of Films that contained African American Actors and Actresses

Graph of films that contained African American male actors and African American female actors, and the film's respective gross revenues. We see from the graph that film's with African American males significantly gross higher than films that have African American females.


In [980]:
# AA hollywood 
hollywoodR_F= hollywood.loc[(hollywood['sex_1'] == 'Female') | (hollywood['sex_2'] == 'Female') | 
                            (hollywood['sex_3'] == 'Female')]
hollywoodR_M= hollywood.loc[(hollywood['sex_1'] == 'Male') | (hollywood['sex_2'] == 'Male') | 
                            (hollywood['sex_3'] == 'Male')]
# Not AA hollywood
hollywoodN_F= hollywood.loc[(hollywood['sex_1'] == 'Female') | (hollywood['sex_2'] == 'Female') | 
                            (hollywood['sex_3'] == 'Female')]
hollywoodN_M= hollywood.loc[(hollywood['sex_1'] == 'Male') | (hollywood['sex_2'] == 'Male') | 
                            (hollywood['sex_3'] == 'Male')]

In [981]:
# Growth of black actors in high budget films, male vs. female
plt.style.use("seaborn-pastel")

fig, ax = plt.subplots(figsize = (8, 4)) # 2 subplots

# Male

hollywoodR_M.plot(x='Year', y="gross", ax=ax, kind="scatter",
                                 color="blue", label ='Male')
# Female
hollywoodR_F.plot(x='Year', y="gross", ax=ax, kind="scatter",
                                 color="#FF69B4", alpha= 0.7, label ='Female')

ax.set_xlim(1920, 2016)
ax.legend(loc='best')
ax.set_title("Revenue: Black Female and Male Lead and Supporting Actors" )


Out[981]:
<matplotlib.text.Text at 0x152d30c50>

Seperate graphs of gross revenue of films that contain African American lead and supporting actors, per year. We see that the trend of African American males in high revenue yielding films is increasing. The opposite can be seen for films with African American females in high grossing films; this is likely due to the outlier of Hattie McDaniel in the 1939 film "Gone with the Wind".


In [982]:
# MALE

# Data no NAN's
interest2= hollywoodR_M[hollywoodR_M['gross'].notnull()]

# Define variables
x= interest2['Year']
y= interest2['gross']

# Best fit line
fit= np.polyfit(x, y, deg=1)
fit_fn = np.poly1d(fit)
interest2['trendline']= fit_fn(interest2['Year'])

# Plot
ax1 = interest2.plot.hexbin(x='Year', y='gross', cmap = plt.cm.Greens,     # Hexbin plot
                           gridsize= 50, label ='male')                   
interest2.set_index(x, inplace=True)                                      # Index by year
interest2.trendline.sort_index(ascending=False).plot(ax=ax1, subplots= True)
plt.gca().invert_xaxis() 
ax1.set_title("Male" ) 




# FEMALE

# Define variables
u = interest3['Year']
v = interest3['gross']

# Best fit line
fit2= np.polyfit(u, v, deg=1)
fit2_fn = np.poly1d(fit2)
interest3['trendline']= fit2_fn(interest3['Year'])

# Plot
ax2 = interest3.plot.hexbin(x='Year', y='gross', cmap = plt.cm.Purples,     # Hexbin plot
                           gridsize= 50, label ='Female')                   
interest3.set_index(u, inplace=True)                                      # Index by year
interest3.trendline.sort_index(ascending=False).plot(ax=ax2)
plt.gca().invert_xaxis()  
ax2.set_title("Female" )


/Users/eunicefamodimu/anaconda/lib/python3.6/site-packages/ipykernel/__main__.py:13: SettingWithCopyWarning: 
A value is trying to be set on a copy of a slice from a DataFrame.
Try using .loc[row_indexer,col_indexer] = value instead

See the caveats in the documentation: http://pandas.pydata.org/pandas-docs/stable/indexing.html#indexing-view-versus-copy
Out[982]:
<matplotlib.text.Text at 0x153e9b6a0>

In [ ]: