Data about particular researchers is interesting to us. We are interested in their career path, disciplinarity, affilitations. So far we have two datasets we can work with. One comes from highlycited.com (provided by Thompson Reuters) and another comes from American Philosophical Society.

highlycited.com


In [1]:
import requests

In [2]:
rxls = requests.get('http://highlycited.com/highly_cited_2014.xlsx')
rcsv = requests.get('http://highlycited.com/highly_cited_2014.csv')

In [1]:
import pandas as pd
url = 'd:\Desktop\highly_cited_2014_june_2014.xlsx'
df = pd.read_excel(url)

In [2]:
df.columns


Out[2]:
Index([u'First_Name', u'Middle_Name', u'Family_Name', u'Category', u'Primary Affiliation', u'Secondary Affiliations'], dtype='object')

Let's see what kind of disciplines/fields to they have. They use very neutral name of 'category'.


In [46]:
all_categories = list(df['Category'].unique())
all_categories


Out[46]:
[u'Agricultural Sciences',
 u'Biology & Biochemistry',
 u'Chemistry',
 u'Clinical Medicine',
 u'Computer Science',
 u'Economics & Business',
 u'Engineering',
 u'Environment/Ecology',
 u'Geosciences',
 u'Immunology',
 u'Materials Science',
 u'Mathematics',
 u'Microbiology',
 u'Molecular Biology & Genetics',
 u'Neuroscience & Behavior',
 u'Pharmacology & Toxicology',
 u'Physics',
 u'Plant & Animal Science',
 u'Psychiatry/Psychology',
 u'Social Sciences, general',
 u'Space Science']

The list we got here is quite nice. We can say something about it. First, some disciplines are grouped together. For example: 'Biology & Biochemistry', 'Economics & Business', 'Environment/Ecology'. Second, 'Environment/Ecology' and 'Psychiatry/Psychology' are separated with "/" rathern than "&" like most other categories. Does this refer to some kind of different relations between the disciplines? Third, 'Social Sciences, general' - this cattegorie refers to social sciences in general, that means we should expect overal.


In [56]:
[x.replace('/', '&').split('&') for x in all_categories]


Out[56]:
[[u'Agricultural Sciences'],
 [u'Biology ', u' Biochemistry'],
 [u'Chemistry'],
 [u'Clinical Medicine'],
 [u'Computer Science'],
 [u'Economics ', u' Business'],
 [u'Engineering'],
 [u'Environment', u'Ecology'],
 [u'Geosciences'],
 [u'Immunology'],
 [u'Materials Science'],
 [u'Mathematics'],
 [u'Microbiology'],
 [u'Molecular Biology ', u' Genetics'],
 [u'Neuroscience ', u' Behavior'],
 [u'Pharmacology ', u' Toxicology'],
 [u'Physics'],
 [u'Plant ', u' Animal Science'],
 [u'Psychiatry', u'Psychology'],
 [u'Social Sciences, general'],
 [u'Space Science']]

From here we might choose few paths. One of paths might be connecting these fields with the classifications we already have. We might think of:

  • importing consensus map of science #that shows connections. This can be done with many inside topics. You can see how comparison of categorizations/classifications are done in another notebook
  • importing biglans #that shows types
  • importing organizational_structures

However, we will not be doing this here. Later we will have a link to another notebook that will do that. Comparing classifications <- not working yet

For now, lets look inti some simple stats.


In [62]:
df.groupby('Category').size().order(ascending=False)


Out[62]:
Category
Clinical Medicine               402
Molecular Biology & Genetics    200
Chemistry                       198
Biology & Biochemistry          196
Engineering                     187
Social Sciences, general        177
Plant & Animal Science          176
Geosciences                     159
Materials Science               147
Physics                         144
Environment/Ecology             137
Pharmacology & Toxicology       133
Neuroscience & Behavior         129
Computer Science                117
Microbiology                    114
Agricultural Sciences           112
Space Science                   106
Psychiatry/Psychology           100
Mathematics                      99
Economics & Business             95
Immunology                       87
dtype: int64

In [63]:
grpd_by_prim_afil = df.groupby('Primary Affiliation').size().order(ascending=False)
grpd_by_prim_afil.head()


Out[63]:
Primary Affiliation
Harvard University, USA                    105
Stanford University, USA                    56
Chinese Academy of Sciences, China          46
University of California, Berkeley, USA     41
NIH, USA                                    33
dtype: int64

In [64]:
%matplotlib inline
import matplotlib.pyplot as plt
plt.plot(grpd_by_prim_afil)


Out[64]:
[<matplotlib.lines.Line2D at 0x7fb0e90>]

In [65]:
df.groupby('Secondary Affiliations').size().order(ascending=False).head()


Out[65]:
Secondary Affiliations
King Abdulaziz University, Saudi Arabia          122
Harvard University, USA                           27
The University of Tokyo, Japan                    10
King Abdulaziz University (50%), Saudi Arabia      8
Massachusetts Gen Hosp, USA                        8
dtype: int64

Why so many researchers have King Abdulaziz University as their secondary affiliation??? This seems like very straighforward finding that can easily attract attention. To whom and how should I communicate it. We could think of posting this data right into some social media or my blog.

Let's look into institutions that lead different Categories.


In [45]:
for x in df['Category'].unique():
    grouped =  df[df['Category'] == x].groupby('Primary Affiliation').size()
    ordered = grouped.order(ascending=False).head(3)
    print (x, ordered), '\n'


(u'Agricultural Sciences', Primary Affiliation
USDA, USA                        8
University of Southampton, UK    6
Cornell University, USA          4
dtype: int64) 

(u'Biology & Biochemistry', Primary Affiliation
NIH, USA                                     26
University of California, Santa Cruz, USA    18
European Bioinformat Inst, UK                16
dtype: int64) 

(u'Chemistry', Primary Affiliation
Northwestern University, USA               11
University of California, Berkeley, USA    10
Chinese Academy of Sciences, China          9
dtype: int64) 

(u'Clinical Medicine', Primary Affiliation
Brigham & Womens Hosp, USA                                   21
Harvard University, USA                                      14
The University of Texas M. D. Anderson Cancer Center, USA    10
dtype: int64) 

(u'Computer Science', Primary Affiliation
Stanford University, USA                            8
Massachusetts Institute of Technology (MIT), USA    4
The University of Texas at Austin, USA              4
dtype: int64) 

(u'Economics & Business', Primary Affiliation
Harvard University, USA         12
University of Chicago, USA       8
Northwestern University, USA     4
dtype: int64) 

(u'Engineering', Primary Affiliation
Aalborg University, Denmark           4
Aerodyne Res Inc, USA                 4
Chinese Academy of Sciences, China    4
dtype: int64) 

(u'Environment/Ecology', Primary Affiliation
University of Minnesota, Twin Cities, USA    4
CNRS, France                                 3
University College London, UK                3
dtype: int64) 

(u'Geosciences', Primary Affiliation
Natl Ctr Atmospher Res, USA    11
NOAA, USA                       9
NASA, USA                       7
dtype: int64) 

(u'Immunology', Primary Affiliation
NIAID, USA                        6
The University of Tokyo, Japan    5
Osaka University, Japan           5
dtype: int64) 

(u'Materials Science', Primary Affiliation
Chinese Academy of Sciences, China             15
National University of Singapore, Singapore     4
Imperial College London, UK                     4
dtype: int64) 

(u'Mathematics', Primary Affiliation
Stanford University, USA                   7
King Abdulaziz University, Saudi Arabia    5
University of California, Berkeley, USA    3
dtype: int64) 

(u'Microbiology', Primary Affiliation
Wellcome Trust Sanger Inst, UK                   8
J Craig Venter Inst, USA                         7
University of Maryland, Baltimore County, USA    4
dtype: int64) 

(u'Molecular Biology & Genetics', Primary Affiliation
deCODE Genet, Iceland             10
Wellcome Trust Sanger Inst, UK     9
Harvard University, USA            8
dtype: int64) 

(u'Neuroscience & Behavior', Primary Affiliation
Harvard University, USA     11
Mayo Medical School, USA    10
Stanford University, USA     7
dtype: int64) 

(u'Pharmacology & Toxicology', Primary Affiliation
University of Washington, USA                       4
Imperial College London, UK                         3
University of North Carolina at Chapel Hill, USA    3
dtype: int64) 

(u'Physics', Primary Affiliation
University of California, Berkeley, USA    9
Chinese Academy of Sciences, China         6
Stanford University, USA                   5
dtype: int64) 

(u'Plant & Animal Science', Primary Affiliation
Ghent University, Belgium                     8
King Saud University, Saudi Arabia            7
Max Planck Inst Mol Plant Physiol, Germany    6
dtype: int64) 

(u'Psychiatry/Psychology', Primary Affiliation
Harvard University, USA     10
Columbia University, USA     5
Duke University, USA         5
dtype: int64) 

(u'Social Sciences, general', Primary Affiliation
Harvard University, USA                 13
VU University Amsterdam, Netherlands     4
Indonesian Ctr Archaeol, Indonesia       4
dtype: int64) 

(u'Space Science', Primary Affiliation
Princeton University, USA            8
The Johns Hopkins University, USA    5
Apache Point Observ, USA             5
dtype: int64) 

We are also able to retrieve what institutions are best in what field.


In [75]:
for x in df['Primary Affiliation'].unique():
    grouped =  df[df['Primary Affiliation'] == x].groupby('Category').size()
    ordered = grouped.order(ascending=False)
    if len(ordered) > 8:
        print (x, ordered), '\n'


(u'University of California, Berkeley, USA', Category
Chemistry                       10
Physics                          9
Plant & Animal Science           4
Mathematics                      3
Environment/Ecology              3
Biology & Biochemistry           3
Materials Science                2
Economics & Business             2
Computer Science                 2
Space Science                    1
Molecular Biology & Genetics     1
Agricultural Sciences            1
dtype: int64) 

(u'Cornell University, USA', Category
Agricultural Sciences           4
Physics                         3
Plant & Animal Science          2
Clinical Medicine               2
Social Sciences, general        1
Psychiatry/Psychology           1
Pharmacology & Toxicology       1
Molecular Biology & Genetics    1
Materials Science               1
Geosciences                     1
Environment/Ecology             1
Economics & Business            1
Computer Science                1
Biology & Biochemistry          1
dtype: int64) 

(u'Pennsylvania State University - University Park, USA', Category
Space Science                   2
Plant & Animal Science          2
Engineering                     2
Agricultural Sciences           2
Molecular Biology & Genetics    1
Microbiology                    1
Mathematics                     1
Geosciences                     1
Environment/Ecology             1
Economics & Business            1
Computer Science                1
dtype: int64) 

(u'University of California, Los Angeles, USA', Category
Chemistry                       4
Mathematics                     3
Space Science                   2
Psychiatry/Psychology           2
Physics                         2
Neuroscience & Behavior         2
Materials Science               2
Engineering                     2
Computer Science                2
Molecular Biology & Genetics    1
Environment/Ecology             1
Clinical Medicine               1
Biology & Biochemistry          1
Agricultural Sciences           1
dtype: int64) 

(u'Harvard University, USA', Category
Clinical Medicine               14
Social Sciences, general        13
Economics & Business            12
Neuroscience & Behavior         11
Psychiatry/Psychology           10
Molecular Biology & Genetics     8
Chemistry                        5
Biology & Biochemistry           4
Physics                          4
Computer Science                 3
Agricultural Sciences            3
Immunology                       3
Microbiology                     3
Geosciences                      2
Engineering                      2
Mathematics                      2
Pharmacology & Toxicology        2
Plant & Animal Science           2
Materials Science                2
dtype: int64) 

(u'Chinese Academy of Sciences, China', Category
Materials Science            15
Chemistry                     9
Physics                       6
Geosciences                   4
Engineering                   4
Pharmacology & Toxicology     3
Plant & Animal Science        2
Environment/Ecology           1
Computer Science              1
Agricultural Sciences         1
dtype: int64) 

(u'Swiss Federal Institute of Technology Zurich, Switzerland', Category
Mathematics                  3
Geosciences                  3
Plant & Animal Science       2
Physics                      2
Pharmacology & Toxicology    2
Social Sciences, general     1
Materials Science            1
Immunology                   1
Computer Science             1
Biology & Biochemistry       1
dtype: int64) 

(u'University of Cambridge, UK', Category
Molecular Biology & Genetics    5
Neuroscience & Behavior         4
Biology & Biochemistry          4
Physics                         2
Materials Science               2
Clinical Medicine               2
Plant & Animal Science          1
Geosciences                     1
Environment/Ecology             1
Computer Science                1
dtype: int64) 

(u'University of Toronto, Canada', Category
Neuroscience & Behavior         2
Environment/Ecology             2
Economics & Business            2
Computer Science                2
Space Science                   1
Social Sciences, general        1
Psychiatry/Psychology           1
Plant & Animal Science          1
Molecular Biology & Genetics    1
Clinical Medicine               1
Chemistry                       1
Biology & Biochemistry          1
dtype: int64) 

(u'University of Washington, USA', Category
Molecular Biology & Genetics    5
Pharmacology & Toxicology       4
Materials Science               4
Space Science                   3
Geosciences                     3
Clinical Medicine               3
Microbiology                    1
Mathematics                     1
Immunology                      1
Economics & Business            1
Biology & Biochemistry          1
dtype: int64) 

(u'Yale University, USA', Category
Psychiatry/Psychology           3
Physics                         3
Clinical Medicine               3
Immunology                      2
Environment/Ecology             2
Biology & Biochemistry          2
Plant & Animal Science          1
Neuroscience & Behavior         1
Molecular Biology & Genetics    1
Chemistry                       1
dtype: int64) 

(u'Imperial College London, UK', Category
Materials Science               4
Pharmacology & Toxicology       3
Molecular Biology & Genetics    3
Clinical Medicine               3
Physics                         2
Plant & Animal Science          1
Neuroscience & Behavior         1
Environment/Ecology             1
Engineering                     1
Biology & Biochemistry          1
dtype: int64) 

(u'Stanford University, USA', Category
Computer Science                8
Neuroscience & Behavior         7
Mathematics                     7
Clinical Medicine               7
Physics                         5
Chemistry                       4
Biology & Biochemistry          4
Social Sciences, general        2
Psychiatry/Psychology           2
Materials Science               2
Environment/Ecology             2
Engineering                     2
Pharmacology & Toxicology       1
Molecular Biology & Genetics    1
Immunology                      1
Economics & Business            1
dtype: int64) 

(u'University of Oxford, UK', Category
Clinical Medicine               8
Neuroscience & Behavior         6
Molecular Biology & Genetics    5
Social Sciences, general        4
Geosciences                     2
Psychiatry/Psychology           1
Plant & Animal Science          1
Microbiology                    1
Mathematics                     1
Immunology                      1
Engineering                     1
Chemistry                       1
Biology & Biochemistry          1
dtype: int64) 

(u'University of California, San Diego, USA', Category
Psychiatry/Psychology           3
Neuroscience & Behavior         3
Molecular Biology & Genetics    3
Computer Science                3
Clinical Medicine               3
Biology & Biochemistry          3
Social Sciences, general        2
Plant & Animal Science          2
Immunology                      2
Environment/Ecology             2
Geosciences                     1
Engineering                     1
Economics & Business            1
Chemistry                       1
dtype: int64) 

(u'University of Michigan - Ann Arbor, USA', Category
Molecular Biology & Genetics    7
Clinical Medicine               4
Space Science                   2
Social Sciences, general        2
Physics                         2
Mathematics                     2
Materials Science               2
Immunology                      2
Economics & Business            2
Chemistry                       2
Psychiatry/Psychology           1
Environment/Ecology             1
Computer Science                1
Biology & Biochemistry          1
dtype: int64) 

(u'Massachusetts Institute of Technology (MIT), USA', Category
Chemistry                       7
Molecular Biology & Genetics    6
Computer Science                4
Materials Science               3
Economics & Business            3
Physics                         2
Pharmacology & Toxicology       1
Neuroscience & Behavior         1
Mathematics                     1
Geosciences                     1
Clinical Medicine               1
Biology & Biochemistry          1
dtype: int64) 

(u'The Johns Hopkins University, USA', Category
Space Science                   5
Clinical Medicine               4
Social Sciences, general        3
Neuroscience & Behavior         3
Microbiology                    2
Pharmacology & Toxicology       1
Molecular Biology & Genetics    1
Materials Science               1
Computer Science                1
Biology & Biochemistry          1
dtype: int64) 

(u'Georgia Institute of Technology, USA', Category
Chemistry                    4
Materials Science            3
Psychiatry/Psychology        1
Physics                      1
Pharmacology & Toxicology    1
Geosciences                  1
Engineering                  1
Economics & Business         1
Computer Science             1
dtype: int64) 

(u'Northwestern University, USA', Category
Chemistry                   11
Clinical Medicine            5
Economics & Business         4
Social Sciences, general     2
Materials Science            2
Neuroscience & Behavior      1
Microbiology                 1
Engineering                  1
Computer Science             1
dtype: int64) 

(u'California Institute of Technology, USA', Category
Physics                      3
Geosciences                  3
Engineering                  3
Space Science                2
Chemistry                    2
Pharmacology & Toxicology    1
Neuroscience & Behavior      1
Economics & Business         1
Computer Science             1
dtype: int64) 

(u'University of North Carolina at Chapel Hill, USA', Category
Clinical Medicine               4
Social Sciences, general        3
Pharmacology & Toxicology       3
Molecular Biology & Genetics    2
Psychiatry/Psychology           1
Plant & Animal Science          1
Microbiology                    1
Economics & Business            1
Chemistry                       1
dtype: int64) 

(u'Princeton University, USA', Category
Space Science               8
Physics                     3
Mathematics                 3
Economics & Business        3
Computer Science            3
Engineering                 2
Social Sciences, general    1
Neuroscience & Behavior     1
Geosciences                 1
Chemistry                   1
dtype: int64) 

(u'University of Pennsylvania, USA', Category
Social Sciences, general        3
Neuroscience & Behavior         2
Microbiology                    2
Immunology                      2
Economics & Business            2
Chemistry                       2
Psychiatry/Psychology           1
Pharmacology & Toxicology       1
Molecular Biology & Genetics    1
Clinical Medicine               1
dtype: int64) 

(u'University of Minnesota, Twin Cities, USA', Category
Environment/Ecology         4
Mathematics                 3
Social Sciences, general    2
Engineering                 2
Computer Science            2
Plant & Animal Science      1
Neuroscience & Behavior     1
Materials Science           1
Geosciences                 1
Economics & Business        1
Clinical Medicine           1
Chemistry                   1
dtype: int64) 

(u'Duke University, USA', Category
Psychiatry/Psychology        5
Social Sciences, general     3
Plant & Animal Science       3
Microbiology                 3
Economics & Business         3
Clinical Medicine            3
Mathematics                  2
Environment/Ecology          2
Physics                      1
Pharmacology & Toxicology    1
Neuroscience & Behavior      1
Materials Science            1
Chemistry                    1
dtype: int64) 

(u'Columbia University, USA', Category
Psychiatry/Psychology       5
Geosciences                 3
Clinical Medicine           3
Social Sciences, general    2
Neuroscience & Behavior     2
Environment/Ecology         2
Space Science               1
Physics                     1
Mathematics                 1
Economics & Business        1
dtype: int64) 

(u'University of Melbourne, Australia', Category
Environment/Ecology         3
Neuroscience & Behavior     2
Social Sciences, general    1
Psychiatry/Psychology       1
Microbiology                1
Mathematics                 1
Materials Science           1
Engineering                 1
Economics & Business        1
dtype: int64) 


In [106]:
#Do the same with: affiliation.contains(pycountry names)

let's network analysis


In [84]:
import networkx as nx
g = nx.Graph()
for unique_affiliation in df['Primary Affiliation'].unique():
    grouped =  df[df['Primary Affiliation'] == unique_affiliation].groupby('Category').size()
    ordered = grouped.order(ascending=False)
    for CAT, EGORY in ordered.iteritems():
        g.add_edge(unique_affiliation, CAT, {'Weight':EGORY})

In [99]:
print(len(g.nodes()), len(g.edges()))


(1038, 1980)

In [113]:



---------------------------------------------------------------------------
AttributeError                            Traceback (most recent call last)
<ipython-input-113-3bc49a33bab8> in <module>()
      1 pos = nx.fruchterman_reingold_layout(g)
----> 2 nx.draw_nodes(g, pos)

AttributeError: 'module' object has no attribute 'draw_nodes'

In [214]:
pos = nx.fruchterman_reingold_layout(g)
plt.figure(figsize=(10,10))
nx.draw_networkx_nodes(g, pos, nodelist=[x for x,y in g.nodes_iter(data=True) if x in all_categories], alpha=0.1)
nx.draw_networkx_labels(g, pos, labels = {key: key for (key, value) in g.nodes_iter(data=True) if key in all_categories}, alpha=0.3)
nx.draw_networkx_nodes(g, pos, nodelist=[x for x,y in g.nodes_iter(data=True) if x not in all_categories], alpha=0.05, node_color='b')
nx.draw_networkx_edges(g, pos, alpha=0.05)


Out[214]:
<matplotlib.collections.LineCollection at 0x169c8a50>

In [215]:
g1 = nx.subgraph(g, [x for x in g.nodes() if (len(g.neighbors(x)) > 10)])
plt.figure(figsize=(10,10))
pos = nx.fruchterman_reingold_layout(g1)
nx.draw_networkx_nodes(g1, pos, nodelist=[x for x,y in g1.nodes_iter(data=True) if x in all_categories], alpha=0.1)
nx.draw_networkx_labels(g1, pos, labels = {key: key for (key, value) in g1.nodes_iter(data=True) if key in all_categories}, alpha=0.3)
nx.draw_networkx_nodes(g1, pos, nodelist=[x for x,y in g1.nodes_iter(data=True) if x not in all_categories], alpha=0.05, node_color='b')
nx.draw_networkx_edges(g1, pos, alpha=0.05)


Out[215]:
<matplotlib.collections.LineCollection at 0x16bd03b0>

In [105]:
nx.closeness_centrality(g) 
# Next, add this to all nodes 
# Remove 50 percent things.
# Also, we do not extract country names. this can be solved with pycountry module
# import pycountry
# len(pycountry.countries)


Out[105]:
{u', Germany': 0.26233240576777134,
 u', USA': 0.2697009102730819,
 u'3M Co, USA': 0.2648786717752235,
 u'ARS, USA': 0.2690012970168612,
 u'ASTAR, Singapore': 0.26233240576777134,
 u'AT&T Labs Res, USA': 0.2648786717752235,
 u'Aalborg University (80%), Denmark': 0.2690012970168612,
 u'Aalborg University, Denmark': 0.29893341020466996,
 u'Aarhus University, Denmark': 0.2834107679693905,
 u'Aaron Diamond AIDS Research Center, USA': 0.2606182457903996,
 u'Acad Sci Czech Republic, Czech': 0.2661021298434693,
 u'Academy of Sciences of the Czech Republic, Czech': 0.2648786717752235,
 u'Addenbrookes Hosp, UK': 0.28893842295904154,
 u'Adimab LLC, USA': 0.2606182457903996,
 u'Aerodyne Res Inc, USA': 0.2795901860339714,
 u'Agenzia Spaziale Italiana ASI Sci Data Ctr, Italy': 0.2591852036990752,
 u'Agr & Agri Food Canada, Canada': 0.26074930852401307,
 u'Agricultural Sciences': 0.35260115606936415,
 u'Agricultural University of Athens, Greece': 0.26074930852401307,
 u'Aix Marseille University, France': 0.28449931412894375,
 u'Akad Pedog Krakowie, Poland': 0.2591852036990752,
 u'Al Ain Wildlife Pk & Resort, United Arab Emirates': 0.2648786717752235,
 u'Alan Guttmacher Inst, USA': 0.26569305662311044,
 u'AlertMe, UK': 0.2633985267970536,
 u'Allegheny Gen Hosp, USA': 0.27943950417677177,
 u'Alliance for a Healthier Generation, USA': 0.26569305662311044,
 u'Amer Canc Soc, USA': 0.27943950417677177,
 u'Amirkabir University of Technology, Iran': 0.2758712423516893,
 u'Apache Point Observ, USA': 0.2591852036990752,
 u'Argonne Natl Lab, USA': 0.28403177211722813,
 u'Arizona State University - Tempe, USA': 0.28187007338950804,
 u'Assoc Nazl Med Cardiol Osped ANMCO Res Ctr, Italy': 0.27943950417677177,
 u'Associates of Cape Cod, USA': 0.26074930852401307,
 u'Astex Pharmaceut, UK': 0.2648786717752235,
 u'Atmospher Chem Serv, UK': 0.2632647880172633,
 u'Auburn University, USA': 0.2690012970168612,
 u'Auckland City Hosp, New Zealand': 0.27943950417677177,
 u'Austin Peay State University, USA': 0.2591852036990752,
 u'Autonomous University of Barcelona, Spain': 0.2648786717752235,
 u'Autonomous University of Madrid, Spain': 0.2625981261078754,
 u'Avid Radiopharmaceuticals, USA': 0.2594445834375782,
 u'BGI (60%), China': 0.2648786717752235,
 u'BGI Shenzhen, China': 0.2648786717752235,
 u'BNLMS, China': 0.2633985267970536,
 u'Babol Noshirvani University of Technology, Iran': 0.2690012970168612,
 u'Babraham Institute Cambridge, UK': 0.2648786717752235,
 u'Babson College, USA': 0.2581528503858601,
 u'Baker IDI Heart & Diabet Inst, Australia': 0.27943950417677177,
 u'Bangor University, UK': 0.26569305662311044,
 u'Bauhaus University, Weimar, Germany': 0.2777926600589338,
 u'Bavarian Ctr Appl Energy Res ZAE Bayern, Germany': 0.2633985267970536,
 u'Baylor College of Medicine, USA': 0.2975609756097561,
 u'Baylor University, USA': 0.25789604575976127,
 u'Beaujon Hosp, France': 0.27943950417677177,
 u'Beaumont Hosp, Ireland': 0.25957446808510637,
 u'Beihang University, China': 0.2625981261078754,
 u'Beijing Normal University, China': 0.2625981261078754,
 u'Beijing University of Posts and Telecommunications, China': 0.2625981261078754,
 u'Bern Univ Hosp, Switzerland': 0.27943950417677177,
 u'Beth Israel Deaconess Med Ctr, USA': 0.2849683979115141,
 u'Betty & Guy Beatty Ctr Integrat Res, USA': 0.27943950417677177,
 u'Bilkent University, Turkey': 0.2754316069057105,
 u'Biobyte Solut GmbH, Germany': 0.2633985267970536,
 u'Biogen Idec Inc, USA': 0.2648786717752235,
 u'Biogen Idec, USA': 0.25789604575976127,
 u'Biology & Biochemistry': 0.35746294381247845,
 u'Biotechnology industry, USA': 0.2606182457903996,
 u'BlackRock, USA': 0.2581528503858601,
 u'Boise State University, USA': 0.2591852036990752,
 u"Boston Children's Hospital & Harvard Medical School, USA": 0.27943950417677177,
 u'Boston College, USA': 0.2690012970168612,
 u'Boston Med Ctr, USA': 0.27943950417677177,
 u'Boston University, USA': 0.3128205128205128,
 u'Boston Vet Affairs Healthcare Syst, USA': 0.27943950417677177,
 u'Brigham & Womens Hosp, USA': 0.31471927162367225,
 u'Brigham &Womens Hosp, USA': 0.27943950417677177,
 u'Brigham Young University, USA': 0.26233240576777134,
 u'British Columbia Canc Agcy, Canada': 0.27943950417677177,
 u'British Columbia Canc Res Ctr, Canada': 0.2648786717752235,
 u'Broad Inst Harvard & MIT, USA': 0.2975609756097561,
 u'Broad Inst Harvard & Massachusetts Inst Technol M, USA': 0.2648786717752235,
 u'Broad Inst Harvard & Massachusetts Inst Technol, USA': 0.2648786717752235,
 u'Broad Inst MIT & Harvard, USA': 0.28893842295904154,
 u'Broad Inst Massachusetts Inst Technol & Harvard, USA': 0.2648786717752235,
 u'Broad Inst Massachusetts Inst Technol MIT & Harva, USA': 0.2594445834375782,
 u'Broad Inst, USA': 0.3067139899438036,
 u'Brookhaven Natl Lab, USA': 0.2690012970168612,
 u'Brown University, USA': 0.29893341020466996,
 u'Brunel University, UK': 0.2690012970168612,
 u'CBS KNAW Fungal Biodivers Ctr, Netherlands': 0.2654210391604812,
 u'CEA Saclay, France': 0.2698412698412698,
 u'CEA, France': 0.28187007338950804,
 u'CEFAS Lowestoft Lab, UK': 0.2648786717752235,
 u'CHU Pitie Salpetriere, France': 0.27943950417677177,
 u'CIMMYT, Mexico': 0.2654210391604812,
 u'CMS, USA': 0.26569305662311044,
 u'CMU, Switzerland': 0.2633985267970536,
 u'CNIC, Spain': 0.27943950417677177,
 u'CNR, Italy': 0.2769025367156208,
 u'CNRS, France': 0.28528198074277855,
 u'CSIC, Spain': 0.2936845086377797,
 u'CSIR Inst Microbial Technol, India': 0.26233240576777134,
 u'CSIRO Sustainable Ecosyst, Australia': 0.2648786717752235,
 u'CVPath Inst, USA': 0.27943950417677177,
 u'California Institute of Technology, USA': 0.3387781770663182,
 u'Canc Res & Biostat, USA': 0.27943950417677177,
 u'Canc Res UK, UK': 0.25789604575976127,
 u'Cardiff University, UK': 0.28512510310695627,
 u'Carleton University, Canada': 0.2648786717752235,
 u'Carnegie Inst Sci, USA': 0.2648786717752235,
 u'Carnegie Mellon University, USA': 0.3067139899438036,
 u'Case Western Reserve University, USA': 0.26719917547023964,
 u'Catholic University of Louvain, Belgium': 0.27311035027653413,
 u'Cedars Sinai Heart Inst, USA': 0.27943950417677177,
 u'Cedars Sinai Med Ctr, USA': 0.2648786717752235,
 u'Cent Food Technol Res Inst, India': 0.26074930852401307,
 u'Center National de la Recherche scientifique(CNRS), France': 0.27601809954751133,
 u'Center for International Forestry Research (CIFOR), Brazil': 0.2581528503858601,
 u'Central Queensland University, Australia': 0.2690012970168612,
 u'Central South University, China': 0.2690012970168612,
 u'Centre National de la Recherche Scientifique (CNRS), France': 0.2795901860339714,
 u'Chemistry': 0.3624606780845858,
 u'Chiba University (60%), Japan': 0.2654210391604812,
 u'Chiba University, Japan': 0.27012242771555095,
 u'Child Mind Inst, USA': 0.2594445834375782,
 u'Childrens Hosp & Reg Med Ctr, USA': 0.26569305662311044,
 u'Childrens Hosp Boston, USA': 0.2633985267970536,
 u'Childrens Hosp Med Ctr, USA': 0.26074930852401307,
 u'Childrens Hosp, USA': 0.2897457390332495,
 u'Children\u2019s Hospital Boston, USA': 0.2594445834375782,
 u'China Univ Geosci, China': 0.2632647880172633,
 u'Chinese Academy of Sciences, China': 0.363987363987364,
 u'Cincinnati Childrens Hosp Res Fdn, USA': 0.25789604575976127,
 u'Citigroup Inc USA, USA': 0.26233240576777134,
 u'City University of Hong Kong, Hong Kong, China': 0.3031277404267758,
 u'Clean Energy Res Ctr, Canada': 0.2690012970168612,
 u'Cleveland Clin Fdn, USA': 0.27943950417677177,
 u'Cleveland Clin, USA': 0.2915378127635648,
 u'Climate Anal & Consulting, Germany': 0.2632647880172633,
 u'Clinical Medicine': 0.38766355140186914,
 u'CoStim, USA': 0.27943950417677177,
 u'Cold Spring Harbor Lab, USA': 0.2648786717752235,
 u'Colorado State University, USA': 0.2632647880172633,
 u'Columbia University, USA': 0.35944540727902946,
 u'Commiss European Communities, Italy': 0.2632647880172633,
 u'Commissariat \xe0 l\u2019Energie Atomique et aux Energies Alternatives, CEA, France': 0.2632647880172633,
 u'Commonwealth Fund, USA': 0.26569305662311044,
 u'Commonwealth Sci & Ind Res Org, Australia': 0.2654210391604812,
 u'Computer Science': 0.3555022283167638,
 u'Consejo Superior de Investigaciones Cient\xedficas (CSIC), Spain': 0.2632647880172633,
 u'Consorzio Mario Negri Sud, Italy': 0.27943950417677177,
 u'Constantine the Philosopher University, Slovakia': 0.2648786717752235,
 u'Cooperat Inst Res Environm Studies, USA': 0.2632647880172633,
 u'Copenhagen Business School, Denmark': 0.26569305662311044,
 u'Cornell University, USA': 0.419328750505459,
 u'Ctr Biosyst Genom, Netherlands': 0.2654210391604812,
 u'Ctr Chirurg Marie Lannelongue, France': 0.27943950417677177,
 u'Ctr Dis Control & Prevent, USA': 0.3013658820110433,
 u'Ctr Genom Regulat CRG, Spain': 0.2736869886513592,
 u'Ctr Healthcare Res, USA': 0.26569305662311044,
 u'Ctr Int Climate & Environm Res Oslo CICERO, Norway': 0.2632647880172633,
 u'Ctr Medicaid Serv, USA': 0.26569305662311044,
 u'Ctr Medicare & Medicaid Serv CMS, USA': 0.26569305662311044,
 u'Ctr Nacl Biotecnol Consejo Super Invest Cient, Spain': 0.2654210391604812,
 u'Curtin University of Technology, Australia': 0.2632647880172633,
 u'Dalhousie University, Canada': 0.2648786717752235,
 u'Dalian University of Technology, China': 0.2661021298434693,
 u'Dana Farber Canc Inst, USA': 0.2946859903381642,
 u'Danube University Krems, Austria': 0.28893842295904154,
 u'Dartmouth College, USA': 0.26678672498070494,
 u'Delft University of Technology, Netherlands': 0.2625981261078754,
 u'Denver VA Med Ctr, USA': 0.25957446808510637,
 u'Detroit Medical Ctr, USA': 0.27943950417677177,
 u'Deutsch Herzzentrum Munich, Germany': 0.27943950417677177,
 u'Deutsch Zentrum Luft & Raumfahrt, Germany': 0.2632647880172633,
 u"Dongduk Women's University, South Korea": 0.2648786717752235,
 u'Donghua University, China': 0.2690012970168612,
 u'Dover Sci, USA': 0.26074930852401307,
 u'Dr Margarete Fischer Bosch Inst Clin Pharmacol, Germany': 0.2648786717752235,
 u'Dresden University of Technology, Germany': 0.27282294133122864,
 u'Drexel University, USA': 0.2815639424382297,
 u'DuPont Haskell Global Ctr Hlth & Environm Sci, USA': 0.2648786717752235,
 u'DuPont Haskell Lab Hlth & Environm Sci, USA': 0.2648786717752235,
 u'DuPont Pioneer, USA': 0.2654210391604812,
 u'Duke Clin Res Inst, USA': 0.27943950417677177,
 u'Duke Natl Univ Singapore, Singapore': 0.25957446808510637,
 u'Duke Translat Med Inst, USA': 0.27943950417677177,
 u'Duke University, USA': 0.3980806142034549,
 u'Dutch Polymer Inst, Netherlands': 0.2633985267970536,
 u'EAWAG, Switzerland': 0.2648786717752235,
 u'EBI, UK': 0.2633985267970536,
 u'EDHEC Business School, France': 0.2581528503858601,
 u'EMBL, UK': 0.2633985267970536,
 u'EPHYSE, France': 0.26074930852401307,
 u'EURECOM, France': 0.26233240576777134,
 u'Eagle Genomics, UK': 0.2633985267970536,
 u'East China University of Science and Technology, China': 0.27311035027653413,
 u'Eawag, Swiss Federal Institute of Aquatic Science and Technology, Switzerland': 0.2648786717752235,
 u'Ecole Centrale de Nantes, France': 0.2690012970168612,
 u'Ecole Normale Superieure - Paris, France': 0.26233240576777134,
 u'Ecole Polytechnique Federale de Lausanne (50%), Switzerland': 0.27311035027653413,
 u'Ecole Polytechnique Federale de Lausanne (80%), Switzerland': 0.27311035027653413,
 u'Ecole Polytechnique Federale de Lausanne, Switzerland': 0.2991058552062302,
 u'Ecole Polytechnique de Montreal, Canada': 0.2690012970168612,
 u'Economics & Business': 0.34786984233478696,
 u'Einaudi Institute for Economics and Finance, Italy': 0.2581528503858601,
 u'Eindhoven University of Technology, Netherlands': 0.27311035027653413,
 u'Emory University, USA': 0.3063515509601182,
 u'Empa Swiss Fed Labs Mat Sci & Technol, Switzerland': 0.2648786717752235,
 u'Endocannabinoid Res Grp, Italy': 0.2648786717752235,
 u'Energy Sciences Network, USA': 0.2654210391604812,
 u'Energy, Mining & Environment Portfolio, National Research Council of Canada, Canada': 0.2690012970168612,
 u'Engineering': 0.3678609435970202,
 u'Environm Canada, Canada': 0.2648786717752235,
 u'Environment/Ecology': 0.3601945119833275,
 u'Eotvos Lorand University, Hungary': 0.2591852036990752,
 u'Erasmus MC, Netherlands': 0.28893842295904154,
 u'Erasmus Med Ctr, Netherlands': 0.2606182457903996,
 u'Erasmus University, Netherlands': 0.3008413112851755,
 u'Erciyes University, Turkey': 0.2690012970168612,
 u'Estn Biol Donana EBD CSIC, Spain': 0.2648786717752235,
 u'European Bioinformat Inst EMBL EBI, UK': 0.2633985267970536,
 u'European Bioinformat Inst, UK': 0.2736869886513592,
 u'European Mol Biol Lab, Germany': 0.2736869886513592,
 u'European Molecular Biology Laboratory/European Bioinformatics Institute/EMBL-EBI, UK': 0.2736869886513592,
 u'European University Institute, Italy': 0.2581528503858601,
 u'Ewha Womans University, South Korea': 0.2661021298434693,
 u'Fed Inst Hydrol BfG, Germany': 0.2648786717752235,
 u'Federal University of Vicosa, Brazil': 0.2654210391604812,
 u'Federico Santa Mar\xeda Technical University, Chile': 0.2690012970168612,
 u'Feng Chia University, Taiwan, China': 0.2690012970168612,
 u'Fermilab Ctr Particle Astrophys, USA': 0.2591852036990752,
 u'Fermilab Natl Accelerator Lab, USA': 0.2591852036990752,
 u'Finnish Meteorological Institute, Finland': 0.2632647880172633,
 u'Florida Atlantic University, USA': 0.25957446808510637,
 u'Florida Institute of Technology, USA': 0.2600953097567093,
 u'Florida International University, USA': 0.2594445834375782,
 u'Florida State University, USA': 0.25957446808510637,
 u'Fontys University of Applied Sciences, Netherlands': 0.26569305662311044,
 u'Foundation Medicine, USA': 0.27943950417677177,
 u'Framingham Heart Dis Epidemiol Study, USA': 0.27943950417677177,
 u'Fraunhofer Inst Solar Energy Syst, Germany': 0.2690012970168612,
 u'Fred Hutchinson Canc Res Ctr, USA': 0.28829580205726996,
 u'Free University of Berlin, Germany': 0.270969427750196,
 u'Fudan University, China': 0.27311035027653413,
 u'Fuel Cell & Battery Consulting, Germany': 0.2690012970168612,
 u'GVM Care&Research \u2013 E.S. Health Science Foundation, Italy': 0.27943950417677177,
 u'Gaziosmanpasa University, Turkey': 0.2690012970168612,
 u'Gdansk University, Poland': 0.2648786717752235,
 u'Gemini Observ, USA': 0.2591852036990752,
 u'Genentech Inc, USA': 0.29535744802050695,
 u'Genome Sci Ctr, Canada': 0.26233240576777134,
 u'Georgetown University, USA': 0.2633985267970536,
 u'Georgia Institute of Technology (50%), USA': 0.2625981261078754,
 u'Georgia Institute of Technology, USA': 0.3387781770663182,
 u'Geosciences': 0.35721667240785393,
 u'German Center for Neurodegenerative Diseases (DZNE), Germany': 0.27943950417677177,
 u'German Res Ctr Environm Hlth, Germany': 0.2769025367156208,
 u'Ghent University, Belgium': 0.32558869701726845,
 u'Gilead Sci, USA': 0.27943950417677177,
 u'Gladstone Inst Neurol Dis, USA': 0.2594445834375782,
 u'GlaxoSmithKline Inc, UK': 0.2594445834375782,
 u'GlaxoSmithKline R&D, USA': 0.2648786717752235,
 u'GlaxoSmithKline, UK': 0.2769025367156208,
 u'Glostrup Univ Hosp, Denmark': 0.2648786717752235,
 u'Good Samaritan Hosp, USA': 0.27943950417677177,
 u'Gordon Life Science Institute, USA': 0.2719643325465513,
 u'Graz University of Technology, Austria': 0.2690012970168612,
 u'Great Lakes Bioenergy Res Ctr, USA': 0.2654210391604812,
 u'Grochowski Hosp, Poland': 0.27943950417677177,
 u'Grp Hlth Cooperat Puget Sound, USA': 0.26569305662311044,
 u'Gyeongsang National University, South Korea': 0.2600953097567093,
 u'HHMI Janelia Farm Res Campus, USA': 0.2719643325465513,
 u'HKU Pasteur Res Ctr, China': 0.27943950417677177,
 u'Hagedorn Res Inst, Denmark': 0.2648786717752235,
 u'Hamilton Hlth Sci, Canada': 0.27943950417677177,
 u'Hangzhou Normal University, China': 0.2600953097567093,
 u'Hannover Medical School, Germany': 0.27943950417677177,
 u'Harbin Institute of Technology, China': 0.2690012970168612,
 u'Harbor UCLA Med Ctr, USA': 0.27943950417677177,
 u'Harvard Smithsonian Ctr Astrophys, USA': 0.2764596107704612,
 u'Harvard University (75%), USA': 0.2633985267970536,
 u'Harvard University, USA': 0.4711494775102226,
 u'Health Protection Agcy, UK': 0.2606182457903996,
 u'Heart Res Inst, Australia': 0.27943950417677177,
 u'Hebrew SeniorLife, USA': 0.26569305662311044,
 u'Heidelberg Inst Theoret Studies, Germany': 0.2591852036990752,
 u'Heinrich Hertz Inst Nachrichtentech Berlin GmbH, Germany': 0.2690012970168612,
 u'Hellenic Health Foundation, Greece': 0.26569305662311044,
 u'Heriot Watt University, UK': 0.2648786717752235,
 u'Hewlett Packard Corp, USA': 0.2661021298434693,
 u'Hietzing Hosp, Austria': 0.27943950417677177,
 u'Hlth Sci Ctr, USA': 0.27943950417677177,
 u'Hong Kong Baptist University, Hong Kong, China': 0.2633985267970536,
 u'Hop Bicetre, France': 0.2606182457903996,
 u'Hop Bichat Claude Bernard, France': 0.27943950417677177,
 u'Hop Claude Huriez, France': 0.27943950417677177,
 u'Hop Hotel Dieu, France': 0.27943950417677177,
 u'Hop La Pitie Salpetriere, France': 0.27943950417677177,
 u'Hosei University, Japan': 0.2591852036990752,
 u'Hosp Badalona Germans Trias & Pujol, Spain': 0.2606182457903996,
 u'Hosp Clin Barcelona, Spain': 0.27943950417677177,
 u'Hosp Coracao, Brazil': 0.27943950417677177,
 u'Howard Hughes Med Inst, USA': 0.28465550370573706,
 u'HudsonAlpha Inst Biotechnol, USA': 0.2786885245901639,
 u'Hunan University, China': 0.2661021298434693,
 u'Hungarian Acad Sci, Hungary': 0.26233240576777134,
 u'IAVI, USA': 0.2606182457903996,
 u'IBM Corp, USA': 0.2625981261078754,
 u'IBM Thomas J Watson Res Ctr, USA': 0.2661021298434693,
 u'ICFO Inst Ciencies Foton, Spain': 0.2625981261078754,
 u'ICREA and ICE(CSIC-IEEC) (50%), Spain': 0.2625981261078754,
 u'INAF Ist Astrofis Spaziale & Fis Cosm, Italy': 0.2591852036990752,
 u'INRA, France': 0.28528198074277855,
 u'INRIA Grenoble, France': 0.2690012970168612,
 u'INSERM, France': 0.27943950417677177,
 u'IRCCS Mario Negri Institute for Pharmacological Research, Italy': 0.27943950417677177,
 u'IRCCS Salvatore Maugeri Fdn, Italy': 0.27943950417677177,
 u'ISIS, France': 0.2625981261078754,
 u'IST Austria, Austria': 0.2654210391604812,
 u'Icahn School of Medicine at Mount Sinai, USA': 0.3231536304144593,
 u'Ilmenau University of Technology, Germany': 0.2633985267970536,
 u'Imam Khomeini International University, Iran': 0.2600953097567093,
 u'Immunology': 0.3474036850921273,
 u'Imperial Coll Healthcare NHS Trust, UK': 0.27943950417677177,
 u'Imperial College London (90%), UK': 0.2633985267970536,
 u'Imperial College London, UK': 0.3853586027499071,
 u'Independent Biotechnology Consultant, USA': 0.2606182457903996,
 u'Indian Inst Technol Roorkee, India': 0.2690012970168612,
 u'Indiana University Bloomington, USA': 0.2648786717752235,
 u'Indiana University-Purdue University at Indianapolis, USA': 0.2761651131824234,
 u'Indonesian Ctr Archaeol, Indonesia': 0.26569305662311044,
 u'Inst Adv Study, USA': 0.2625981261078754,
 u'Inst Atmospher Sci & Climate ISAC, Italy': 0.2632647880172633,
 u'Inst Canc Res, UK': 0.27943950417677177,
 u'Inst Catala Oncol IDIBELL, Spain': 0.27943950417677177,
 u'Inst Genet & Mol Biol Cell IGBMC, France': 0.2648786717752235,
 u'Inst Genom Fonct, France': 0.2648786717752235,
 u'Inst Healthcare Improvement, USA': 0.26569305662311044,
 u'Inst Res Biomed, Switzerland': 0.25789604575976127,
 u'Inst Syst Biol, USA': 0.2648786717752235,
 u'Institute of Chemical Research of Catalonia (ICIQ), Spain': 0.2661021298434693,
 u'Int AIDS Vaccine Initiat, USA': 0.2606182457903996,
 u'Int Agcy Res Canc IARC, France': 0.26074930852401307,
 u'Int Assoc Hydrogen Energy, USA': 0.2690012970168612,
 u'Interdisciplinary Ctr, Israel': 0.26233240576777134,
 u'Intermt Med Ctr, USA': 0.27943950417677177,
 u'Iowa State University, USA': 0.2625981261078754,
 u'Isik University, Turkey': 0.2690012970168612,
 u'Islamic Azad University, Iran': 0.2690012970168612,
 u'Ist Sci San Raffaele, Italy': 0.27943950417677177,
 u'Italian Inst Technol, Italy': 0.2594445834375782,
 u'Italian Institute of Technology, Italy': 0.2633985267970536,
 u'J Craig Venter Inst, USA': 0.2841874486160592,
 u'JNCASR, India': 0.2633985267970536,
 u'James Cook University, Australia': 0.2648786717752235,
 u'Japan Int Ctr Agr Sci, Japan': 0.2654210391604812,
 u'Japan Int Res Ctr Agr Sci, Japan': 0.2654210391604812,
 u'Japan Sci & Technol Agcy JST, Japan': 0.2633985267970536,
 u'Jaume I University (50%), Spain': 0.2661021298434693,
 u'Jawaharlal Nehru Ctr Adv Sci Res, India': 0.27311035027653413,
 u'Jawaharlal Nehru University, India': 0.2690012970168612,
 u'Jiangnan University, China': 0.2777926600589338,
 u'Johannes Kepler University Linz, Austria': 0.2633985267970536,
 u'John Innes Ctr Plant Sci Res, UK': 0.2654210391604812,
 u'John Innes Ctr, UK': 0.2654210391604812,
 u'John Radcliffe Hosp, UK': 0.27943950417677177,
 u'Johns Hopkins Kimmel Canc Ctr, USA': 0.27943950417677177,
 u'Johns Hopkins Med Inst, USA': 0.26569305662311044,
 u'Joint Bioenergy Inst, USA': 0.2633985267970536,
 u'Joint Genome Inst, USA': 0.2726794635813831,
 u'Joint Univ Ottawa, Canada': 0.2625981261078754,
 u'Josai International University (35%), Japan': 0.2633985267970536,
 u'Joseph Fourier University (Grenoble 1), France': 0.29104687061465057,
 u'KBioscience Unit 7, UK': 0.2648786717752235,
 u'KU Leuven, Belgium': 0.33078149920255184,
 u'Kagawa University, Japan': 0.25789604575976127,
 u'Kaiser Permanente No Calif, USA': 0.27943950417677177,
 u'Kaiser Permanente Oncol Clin Trials, USA': 0.27943950417677177,
 u'Kaohsiung Medical University, Taiwan, China': 0.2600953097567093,
 u'Karlsruhe Institute of Technology, Germany': 0.2625981261078754,
 u'Karolinska Institute, Sweden': 0.2648786717752235,
 u'Kazusa DNA Res Inst, Japan': 0.2654210391604812,
 u'Kerckhoff Heart Ctr, Germany': 0.27943950417677177,
 u'King Abdulaziz University, Saudi Arabia': 0.3361426256077796,
 u'King Abdullah University of Science and Technology, Saudi Arabia': 0.2748476013782136,
 u'King Fahd University of Petroleum & Minerals, Saudi Arabia': 0.2600953097567093,
 u'King Saud University, Saudi Arabia': 0.3203583565029348,
 u"King's College London, UK": 0.27898843153080444,
 u'Knowledge & Evaluat Res Unit, USA': 0.27943950417677177,
 u'Knowledge Network, USA': 0.26569305662311044,
 u'Koch Inst Integrat Canc Res, USA': 0.2648786717752235,
 u'Kochi University, Japan': 0.2632647880172633,
 u'Korea Advanced Institute of Science and Technology, South Korea': 0.2648786717752235,
 u'Korea Inst Sci & Technol, South Korea': 0.2648786717752235,
 u'Korea Institue of Science & Technology, South Korea': 0.2648786717752235,
 u'Korea University, South Korea': 0.28095367109184505,
 u'Kyoto University, Japan': 0.3033050599590524,
 u'Kyung Hee University, South Korea': 0.26074930852401307,
 u'Kyushu Institute of Technology, Japan': 0.2600953097567093,
 u'Kyushu University, Japan': 0.2632647880172633,
 u'LI COR Biosci, USA': 0.26074930852401307,
 u'La Jolla Inst Allergy & Immunol, USA': 0.2648786717752235,
 u'Lab Sci Climat & Environm, France': 0.2632647880172633,
 u"Laboratoire des Sciences du Climat et de l'Environnement (CEA-CNRS-University of Versailles) and Institut Pierre Simon Laplace, France": 0.2632647880172633,
 u'Lahey Hosp & Med Ctr, USA': 0.2606182457903996,
 u'Lancaster University, UK': 0.26569305662311044,
 u'Lankenau Inst Med Res, USA': 0.27943950417677177,
 u'Lanzhou University, China': 0.2600953097567093,
 u'Laval University, Canada': 0.2661021298434693,
 u'Lawrence Berkeley Natl Lab, USA': 0.2732542819499341,
 u'Lawrence Livermore Natl Lab, USA': 0.2690012970168612,
 u'Leeds Teaching Hosp Trust, UK': 0.27943950417677177,
 u'Leibniz Inst Tropospher Res, Germany': 0.2632647880172633,
 u'Leiden University, Netherlands': 0.30242053076698744,
 u'Les Labs Servier, France': 0.2648786717752235,
 u'Liaoning University of Technology, China': 0.2690012970168612,
 u'LifeMap Sci LTD, Israel': 0.2633985267970536,
 u'Lincoln University, New Zealand': 0.2648786717752235,
 u'Linkoping University, Sweden': 0.2633985267970536,
 u'Liverpool John Moores University, UK': 0.2591852036990752,
 u'London Business School, UK': 0.2581528503858601,
 u'London School of Economics and Political Science, UK': 0.26569305662311044,
 u'London School of Hygiene and Tropical Medicine, UK': 0.27943950417677177,
 u'Los Alamos Natl Lab, USA': 0.27989203778677463,
 u'Los Angeles Biomed Res Inst, USA': 0.27943950417677177,
 u'Louisiana State University - Baton Rouge, USA': 0.26569305662311044,
 u'Lund University, Sweden': 0.29807415924116126,
 u'MRC Social & Publ Hlth Sci Unit, UK': 0.26569305662311044,
 u'MRC, UK': 0.27943950417677177,
 u'MTT Agrifood Res Finland, Finland': 0.26074930852401307,
 u'MacColl Ctr Hlth Care Innovat, USA': 0.26569305662311044,
 u'Macau University of Science and Technology, Macau, China': 0.2758712423516893,
 u'Macquarie University, Australia': 0.2837209302325581,
 u'MacroGenics, USA': 0.2606182457903996,
 u'Manchester Metropolitan University (MMU), UK': 0.2690012970168612,
 u'Massachusetts Gen Hosp, USA': 0.29435140505251206,
 u'Massachusetts General Hospital, USA': 0.25957446808510637,
 u'Massachusetts Institute of Technology (MIT), USA': 0.38223368964246224,
 u'Materials Science': 0.35746294381247845,
 u'Mathematics': 0.35140630294815317,
 u'Max Planck Gesell, Germany': 0.2625981261078754,
 u'Max Planck Inst Astron, Germany': 0.2591852036990752,
 u'Max Planck Inst Astrophys, Germany': 0.2591852036990752,
 u'Max Planck Inst Biochem, Germany': 0.2736869886513592,
 u'Max Planck Inst Biogeochem, Germany': 0.2648786717752235,
 u'Max Planck Inst Chem Ecol, Germany': 0.2654210391604812,
 u'Max Planck Inst Chem, Germany': 0.27210705851482553,
 u'Max Planck Inst Colloids & Interfaces, Germany': 0.27311035027653413,
 u'Max Planck Inst Dev Biol, Germany': 0.2654210391604812,
 u'Max Planck Inst Dynam Complex Tech Syst, Germany': 0.26233240576777134,
 u'Max Planck Inst Evolutionary Anthropol, Germany': 0.26569305662311044,
 u'Max Planck Inst Extraterr Phys, Germany': 0.2591852036990752,
 u'Max Planck Inst Informat, Germany': 0.26233240576777134,
 u'Max Planck Inst Kohlenforsch, Germany': 0.27311035027653413,
 u'Max Planck Inst Marine Microbiol, Germany': 0.2606182457903996,
 u'Max Planck Inst Meteorol, Germany': 0.2632647880172633,
 u'Max Planck Inst Mol Cell Biol & Genet, Germany': 0.2633985267970536,
 u'Max Planck Inst Mol Genet, Germany': 0.26233240576777134,
 u'Max Planck Inst Mol Plant Physiol, Germany': 0.2654210391604812,
 u'Max Planck Inst Plant Breeding Res, Germany': 0.2654210391604812,
 u'Max Planck Inst Polymer Res(Max Planck Institute for Poly Research), Germany': 0.2633985267970536,
 u'Max Planck Inst Polymer Res, Germany': 0.27311035027653413,
 u'Max Planck Inst Quantum Opt, Germany': 0.2625981261078754,
 u'Max Planck Inst Solid State Res, Germany': 0.2633985267970536,
 u'Max Planck Institute for Plant Breeding Research, Germany': 0.2654210391604812,
 u'Max Planck Institute for Solid State Research, Germany': 0.2625981261078754,
 u'Max Planck Soc, Germany': 0.26569305662311044,
 u'Max-Planck-Institut fur Plasmaphysik, Germany': 0.2690012970168612,
 u'Mayo Clinic, USA': 0.2594445834375782,
 u'Mayo Medical School, USA': 0.2849683979115141,
 u'McGill University, Canada': 0.27239296033622273,
 u'McMaster University, Canada': 0.2908835904628331,
 u'Med Res Council, UK': 0.28893842295904154,
 u'Medical University of Silesia, Poland': 0.27943950417677177,
 u'Medical University of Vienna, Austria': 0.2594445834375782,
 u'Medstar Res Inst, USA': 0.27943950417677177,
 u'Melbourne Hlth, Australia': 0.25957446808510637,
 u'Mem Sloan Kettering Canc Ctr, USA': 0.31338773043215473,
 u'Memorial Sloan Kettering Inst, USA': 0.25789604575976127,
 u'Memorial University of Newfoundland, Canada': 0.26928070631004936,
 u'Merck & Co Inc, USA': 0.26569305662311044,
 u'Merck Res Labs, USA': 0.30455212922173275,
 u'Merck Sharp & Dohme Corp, USA': 0.2594445834375782,
 u'Met Off Hadley Ctr, UK': 0.2632647880172633,
 u'Meteorol Serv Canada, Canada': 0.26074930852401307,
 u'Michigan State University, USA': 0.2834107679693905,
 u'Michigan Technological University, USA': 0.2690012970168612,
 u'Microbiology': 0.35236153584777435,
 u'Microsoft Res, USA': 0.2600953097567093,
 u'Missouri University of Science and Technology, USA': 0.2625981261078754,
 u'Mitre Corp, USA': 0.26233240576777134,
 u'Mol Bioeffects Branch, USA': 0.2648786717752235,
 u'Molecular Biology & Genetics': 0.3601945119833275,
 u'Molina Ctr Energy & Environm, USA': 0.2632647880172633,
 u'Monash University, Australia': 0.31244350708044594,
 u'Monogram Biosci Inc, USA': 0.2606182457903996,
 u'Monsanto Co, USA': 0.2648786717752235,
 u'Monsanto, USA': 0.2648786717752235,
 u'Morgan, USA': 0.2606182457903996,
 u'Morgridge Inst Res, USA': 0.27943950417677177,
 u'Mt Sinai Med Ctr, USA': 0.27943950417677177,
 u'NASA, USA': 0.28686030428769016,
 u'NCAR, USA': 0.2632647880172633,
 u'NCI, USA': 0.33397745571658616,
 u'NERC, UK': 0.2648786717752235,
 u'NHGRI, USA': 0.2648786717752235,
 u'NHS Blood Transplant, UK': 0.2648786717752235,
 u'NIA, USA': 0.3052693553135119,
 u'NIAAA, USA': 0.27210705851482553,
 u'NIAID, USA': 0.292524682651622,
 u'NIAMSD, USA': 0.25789604575976127,
 u'NIDDKD, USA': 0.2633985267970536,
 u'NIH, USA': 0.30617065249483316,
 u'NILU - Norweigan Institute for Air Research, Norway': 0.2632647880172633,
 u'NIMH, USA': 0.2777926600589338,
 u'NIMS, Japan': 0.2633985267970536,
 u'NIOZ Royal Netherlands Inst Sea Res, Netherlands': 0.2632647880172633,
 u'NIST, USA': 0.2625981261078754,
 u'NOAA ARL, USA': 0.26074930852401307,
 u'NOAA Earth Syst Res Lab, USA': 0.2632647880172633,
 u'NOAA, USA': 0.2632647880172633,
 u'Nagoya University, Japan': 0.2754316069057105,
 u'Nanjing University of Science and Technology, China': 0.2690012970168612,
 u'Nanjing University of Technology, China': 0.28765603328710126,
 u'Nankai University, China': 0.2633985267970536,
 u'Nanyang Technological University, Singapore': 0.27898843153080444,
 u'Nara Institute of Science and Technology, Japan': 0.2713949227950798,
 u'Nat Hist Museum London, UK': 0.26569305662311044,
 u'Nathan S Kline Inst Psychiat Res, USA': 0.2594445834375782,
 u'National Cheng Kung University, Taiwan, China': 0.2690012970168612,
 u'National Chiao Tung University, Taiwan, China': 0.2633985267970536,
 u'National Institute of Allergy and Infectious Diseases, USA': 0.2606182457903996,
 u'National Institute of Biomedical Innovation (NIBIO), Japan': 0.25789604575976127,
 u'National Institute of Diabetes & Digestive & Kidney Diseases, National Institutes of Health, USA': 0.2648786717752235,
 u'National Sun Yat-Sen University, Taiwan, China': 0.2758712423516893,
 u'National Taiwan University, Taiwan, China': 0.26233240576777134,
 u'National University of Cordoba, Argentina': 0.2648786717752235,
 u'National University of Ireland, Galway, Ireland': 0.28591122139509234,
 u'National University of Singapore, Singapore': 0.30014471780028945,
 u'National and Kapodistrian University of Athens, Greece': 0.27943950417677177,
 u'Natl Ctr Atmospher Res, USA': 0.2632647880172633,
 u'Natl Ctr Ecol Anal & Synth, USA': 0.2648786717752235,
 u'Natl Human Genome Res Inst, USA': 0.2648786717752235,
 u'Natl Inst Adv Ind Sci & Technol, Japan': 0.3041947785274274,
 u'Natl Inst Agrobiol Sci, Japan': 0.2654210391604812,
 u'Natl Inst Hlth & Welf, Finland': 0.2648786717752235,
 u'Natl Inst Hlth, USA': 0.2633985267970536,
 u'Natl Inst Mat Sci, Japan': 0.2633985267970536,
 u'Natl Inst Med Res, UK': 0.25789604575976127,
 u'Natl Lib Med, USA': 0.2633985267970536,
 u'Natl Opt Astron Observ, USA': 0.2591852036990752,
 u'Natl Renewable Energy Lab, USA': 0.2690012970168612,
 u'Natl Res Council CNR, Italy': 0.2632647880172633,
 u'Natl Res Council Canada, Canada': 0.2795901860339714,
 u'Natl Sci Fdn, USA': 0.2591852036990752,
 u'Naval Res Lab, USA': 0.2625981261078754,
 u'Netherlands Canc Inst, Netherlands': 0.2915378127635648,
 u'Netherlands Inst Mental Hlth & Addict, Netherlands': 0.25957446808510637,
 u'Neuroscience & Behavior': 0.3502195204322864,
 u'New Economic School, Russia': 0.2581528503858601,
 u'New York State Department of Health (70%), USA': 0.2648786717752235,
 u'New York University, USA': 0.3213511000929656,
 u'Newcastle University, UK': 0.26569305662311044,
 u'Nord Sch Publ Hlth, Sweden': 0.27943950417677177,
 u'North Carolina State University - Raleigh, USA': 0.30617065249483316,
 u'Northeast Normal University, China': 0.2736869886513592,
 u'Northeastern University, USA': 0.29351825643928675,
 u'Northwestern University, USA': 0.35599038791623755,
 u'Norwegian University of Science and Technology, Norway': 0.2600953097567093,
 u'Novartis Inst BioMed Res, USA': 0.27943950417677177,
 u'ORYGEN Youth Hlth Res Ctr, Australia': 0.25957446808510637,
 u'Oak Ridge National Laboratory, USA': 0.27282294133122864,
 u'Oak Ridge Natl Lab, USA': 0.2625981261078754,
 u'Oklahoma State University, USA': 0.2654210391604812,
 u'Ontario Canc Inst, Canada': 0.26233240576777134,
 u'Oregon Health and Science University, USA': 0.30101596516690854,
 u'Oregon State University, USA': 0.28528198074277855,
 u'Osaka University, Japan': 0.3320525136087096,
 u'Osped Niguarda Ca Granda, Italy': 0.27943950417677177,
 u'Osped Riuniti Bergamo, Italy': 0.27943950417677177,
 u'Osserv Astron Padova, Italy': 0.2591852036990752,
 u'Ottawa Hosp, Canada': 0.27943950417677177,
 u'PLA University of Science and Technology, China': 0.2777926600589338,
 u'Pacific NW Natl Lab, USA': 0.29807415924116126,
 u'Pacific Northwest National Laboratory, USA': 0.2632647880172633,
 u'Panjab University, India': 0.2648786717752235,
 u'Paul Sabatier University (Toulouse 3), France': 0.28829580205726996,
 u'Paul Scherrer Inst, Switzerland': 0.2632647880172633,
 u'Peking University, China': 0.2736869886513592,
 u'Pennsylvania State University - University Park, USA': 0.3611981887843957,
 u'Perimeter Inst Theoret Phys, Canada': 0.2625981261078754,
 u'Peter MacCallum Canc Ctr, Australia': 0.2648786717752235,
 u'Pfizer Inc, USA': 0.2648786717752235,
 u'Pfizer, USA': 0.25789604575976127,
 u'PharmaHungary Grp, Hungary': 0.2648786717752235,
 u'Pharmacology & Toxicology': 0.3601945119833275,
 u'Physics': 0.35599038791623755,
 u'Pierre and Marie Curie University - Paris 6, France': 0.29235974062588105,
 u'Pillsbury, USA': 0.2606182457903996,
 u'Plant & Animal Science': 0.3611981887843957,
 u'Pohang University of Science and Technology, South Korea': 0.27311035027653413,
 u'Polytechnic Institute of Milan, Italy': 0.2690012970168612,
 u'Polytechnic University of Catalonia, Spain': 0.2690012970168612,
 u'Polytechnic University of Valencia , Spain': 0.26569305662311044,
 u'Polytechnic University of Valencia, Spain': 0.2661021298434693,
 u'Potsdam Inst Climate Change Res, Germany': 0.2648786717752235,
 u'Princeton University (50%), USA': 0.2625981261078754,
 u'Princeton University, USA': 0.34601267934601265,
 u'Psychiatry/Psychology': 0.3504562352145995,
 u'Pukyong National University, South Korea': 0.26074930852401307,
 u'Purdue University - West Lafayette, USA': 0.31529340224992397,
 u'Queen Mary, U. of London, UK': 0.30049261083743845,
 u"Queen's University, Canada": 0.26569305662311044,
 u'RAND Corp, USA': 0.26569305662311044,
 u'RAND Hlth, USA': 0.27943950417677177,
 u'RIKEN Center for Emergent Matter Science(CEMS), Japan': 0.2625981261078754,
 u'RIKEN Center for Emergent Matter Science, RIKEN, Japan': 0.2625981261078754,
 u'RIKEN Center for Sustainable Resource Science, Japan': 0.2654210391604812,
 u'RIKEN Ctr Emergent Matter Sci, Japan': 0.2625981261078754,
 u'RIKEN Plant Sci Ctr, Japan': 0.2654210391604812,
 u'RIKEN, Japan': 0.28294679399727146,
 u'RKW Kompetenzzentrum (Projektleitung), Germany': 0.26074930852401307,
 u'RTI Int, USA': 0.2648786717752235,
 u'RWTH Aachen University, Germany': 0.27898843153080444,
 u'Radboud University Nijmegen, Netherlands': 0.28095367109184505,
 u'Renmin University of China, China': 0.2625981261078754,
 u'Rensselaer Polytechnic Institute, USA': 0.2591852036990752,
 u'Res Grp Nonlinear Anal & Applicat RGNAA, Iran': 0.2600953097567093,
 u'Res Inst Mol Pathol IMP, Austria': 0.2648786717752235,
 u'Research Center on Agricultural CRA-NUT, Italy': 0.26074930852401307,
 u'Rice University, USA': 0.34190570392350805,
 u'Rigshosp, Denmark': 0.27943950417677177,
 u'Robert Koch Inst, Germany': 0.26569305662311044,
 u'Rockefeller University, USA': 0.3011908219575951,
 u'Royal Acad Sci, Sweden': 0.2648786717752235,
 u'Royal Botanic Gardens, UK': 0.2654210391604812,
 u'Royal Marsden Hosp, UK': 0.27943950417677177,
 u'Royal Netherlands Academy of Arts and Sciences, Netherlands': 0.27943950417677177,
 u'Royal Observ, UK': 0.2591852036990752,
 u'Royal Soc Protect Birds, UK': 0.2648786717752235,
 u'Royal Swedish Acad Sci, Sweden': 0.2648786717752235,
 u'Rush University, USA': 0.2594445834375782,
 u'Russian Acad Sci, Russia': 0.2690012970168612,
 u'Russian Res Ctr Inst Phys & Power Engn, Russia': 0.2625981261078754,
 u'Rutgers, The State University of New Jersey - New Brunswick, USA': 0.3020681619574716,
 u'S African Natl Biodivers Inst, South Africa': 0.2648786717752235,
 u'SAIC Frederick Inc, USA': 0.2648786717752235,
 u'SB RAS, Russia': 0.2690012970168612,
 u'SLAC Natl Accelerator Lab, USA': 0.27311035027653413,
 u'SRI Int, USA': 0.2633985267970536,
 u'Sainsbury Lab, UK': 0.2654210391604812,
 u'Saint Xavier University, USA': 0.2690012970168612,
 u'Salk Inst Biol Studies, USA': 0.2725361366622865,
 u'Samuel Roberts Noble Fdn Inc, USA': 0.2654210391604812,
 u'San Diego State University, USA': 0.2697009102730819,
 u'San Raffaele Scientific Institute (50%), Italy': 0.2633985267970536,
 u'Sanford Burnham Med Res Inst, USA': 0.2633985267970536,
 u'Sangamo BioSci Inc, USA': 0.2633985267970536,
 u'School for Advanced Studies in the Social Sciences (EHESS), France': 0.2600953097567093,
 u'Scripps Clin, USA': 0.27943950417677177,
 u'Scripps Res Inst, USA': 0.2970495560011458,
 u'Scuola Scienza Materiali, Italy': 0.2690012970168612,
 u'Seattle BioMed, USA': 0.25789604575976127,
 u'Seattle Childrens Res Inst, USA': 0.25789604575976127,
 u'Selcuk University, Turkey': 0.26569305662311044,
 u'Seoul National University (50%), South Korea': 0.27311035027653413,
 u'Seoul National University, South Korea': 0.2739762219286658,
 u'Serbian Academy of Science, Serbia': 0.2600953097567093,
 u'Shanghai Jiao Tong University, China': 0.2879755623437934,
 u'Sila Sci & Energy Unltd Co, Turkey': 0.2690012970168612,
 u'Sila Sci, Turkey': 0.26569305662311044,
 u'Simon Fraser University, Canada': 0.26233240576777134,
 u'Siteman Canc Ctr, USA': 0.26569305662311044,
 u'Smith Sch Enterprise & Environm, UK': 0.2632647880172633,
 u'Social Sciences, general': 0.3617021276595745,
 u'Sojo University, Japan': 0.2648786717752235,
 u'South China University of Technology, China': 0.2633985267970536,
 u'Southeast University, China': 0.2758712423516893,
 u'Space Science': 0.3497470489038786,
 u'Space Telescope Sci Inst, USA': 0.2591852036990752,
 u'Spanish Natl Canc Res Ctr, Spain': 0.26233240576777134,
 u'St Jude Childrens Hosp, USA': 0.28893842295904154,
 u'St Jude Childrens Res Hosp, USA': 0.2648786717752235,
 u'St Lukes Hosp, USA': 0.27943950417677177,
 u'St Pauls Hosp, Canada': 0.27943950417677177,
 u'St Vincent Coll, USA': 0.2591852036990752,
 u"St. Jude Children's Research Hospital, USA": 0.2606182457903996,
 u'Stanford University, USA': 0.4337097448766207,
 u'State University of New York Upstate Medical University, USA': 0.25957446808510637,
 u'State University of New York at Albany, USA': 0.2632647880172633,
 u'State University of New York at Buffalo, USA': 0.2661021298434693,
 u'State University of New York at Stony Brook, USA': 0.2648786717752235,
 u'Stellenbosch University, South Africa': 0.2648786717752235,
 u'Stockholm University, Sweden': 0.2838762660826718,
 u'Sumitomo Hosp, Japan': 0.27943950417677177,
 u'Sun Yat-sen University, China': 0.2736869886513592,
 u'Sunnybrook Odette Canc Ctr, Canada': 0.27943950417677177,
 u'Swedish University of Agricultural Sciences, Sweden': 0.2773468841936347,
 u'Swinburne University of Technology, Australia': 0.2591852036990752,
 u'Swiss Fed Inst Aquat Sci & Technol, Switzerland': 0.2648786717752235,
 u'Swiss Fed Res Inst WSL, Switzerland': 0.2648786717752235,
 u'Swiss Federal Institute of Technology Zurich, Switzerland': 0.3555022283167638,
 u'Swiss Inst Bioinformat SIB, Switzerland': 0.2633985267970536,
 u'Swiss Inst Bioinformat, Switzerland': 0.2633985267970536,
 u'Syngenta Biotechnology, Inc., USA': 0.2654210391604812,
 u'Technical University Munich, Germany': 0.2862268837979575,
 u'Technical University of Denmark, Denmark': 0.29979762937265103,
 u'Technion-Israel Institute of Technology, Israel': 0.2777926600589338,
 u'Tel Aviv University, Israel': 0.2777926600589338,
 u'Temple University, USA': 0.2648786717752235,
 u'Texas A&M University - College Station, USA': 0.31529340224992397,
 u'Texas A&M University-Kingsville, USA': 0.2600953097567093,
 u'Texas Christian University, USA': 0.26569305662311044,
 u'Texas Southern University, USA': 0.2648786717752235,
 u'The Australian National University, Australia': 0.2648786717752235,
 u'The Chinese University of Hong Kong, Hong Kong, China': 0.27898843153080444,
 u'The College of William and Mary, USA': 0.2648786717752235,
 u'The George Institute for Global Health, Australia': 0.27943950417677177,
 u'The Hamner Institutes for Health Sciences, USA': 0.2648786717752235,
 u'The Hebrew University of Jerusalem, Israel': 0.27823987121008853,
 u'The Hong Kong Polytechnic University, Hong Kong, China': 0.2690012970168612,
 u'The Hong Kong University of Science and Technology, Hong Kong, China': 0.28765603328710126,
 u'The Hospital for Sick Children (50%), Canada': 0.2648786717752235,
 u'The Jackson Laboratory for Genomic Medicine, USA': 0.2648786717752235,
 u'The Johns Hopkins University, USA': 0.36449912126537787,
 u'The Ohio State University - Columbus, USA': 0.35116830342025057,
 u'The Sainsbury Laboratory, UK': 0.2654210391604812,
 u'The Scripps Research Institute, USA': 0.2606182457903996,
 u'The University of Adelaide, Australia': 0.31020041878552196,
 u'The University of Akron, USA': 0.2633985267970536,
 u'The University of Alabama - Tuscaloosa, USA': 0.2661021298434693,
 u'The University of Alabama at Birmingham, USA': 0.27210705851482553,
 u'The University of Auckland, New Zealand': 0.2633985267970536,
 u'The University of Calgary, Canada': 0.27764390896921015,
 u'The University of Dundee, UK': 0.2736869886513592,
 u'The University of Edinburgh, UK': 0.32075471698113206,
 u'The University of Georgia, USA': 0.2654210391604812,
 u'The University of Glasgow, UK': 0.31020041878552196,
 u'The University of Hong Kong (70%), Hong Kong, China': 0.2606182457903996,
 u'The University of Hong Kong, Hong Kong, China': 0.28893842295904154,
 u'The University of Jordan, Jordan': 0.2758712423516893,
 u'The University of Manchester, UK': 0.3193717277486911,
 u'The University of Montana - Missoula, USA': 0.27513929424250466,
 u'The University of New Mexico - Albuquerque, USA': 0.2732542819499341,
 u'The University of Queensland, Australia': 0.31414722811269313,
 u'The University of Reading, UK': 0.27210705851482553,
 u'The University of Sheffield, UK': 0.29893341020466996,
 u'The University of Texas Health Science Center at Houston, USA': 0.28893842295904154,
 u'The University of Texas Health Science Center at San Antonio, USA': 0.29687947323217867,
 u'The University of Texas M. D. Anderson Cancer Center, USA': 0.3050897322741983,
 u'The University of Texas Medical Branch at Galveston, USA': 0.2648786717752235,
 u'The University of Texas Southwestern Medical Center at Dallas, USA': 0.309459862727544,
 u'The University of Texas at Austin, USA': 0.3324783584482206,
 u'The University of Texas at Dallas, USA': 0.2581528503858601,
 u'The University of Texas at San Antonio, USA': 0.2661021298434693,
 u'The University of Tokyo, Japan': 0.307623850489469,
 u'The University of Western Australia, Australia': 0.2773468841936347,
 u'The Zucker Hillside Hospital, North Shore Long Island Jewish Health System, USA': 0.25957446808510637,
 u'Tianjin Polytech University, China': 0.2600953097567093,
 u'Tohoku University, Japan': 0.28591122139509234,
 u'Tokyo Institute of Technology, Japan': 0.2625981261078754,
 u'Tokyo Medical and Dental University, Japan': 0.2736869886513592,
 u'Tokyo Metropolitan Inst Med Sci, Japan': 0.2633985267970536,
 u'Tokyo University of Science, Japan': 0.2687224669603524,
 u'Tokyo University of Technology, Japan': 0.2690012970168612,
 u'Toulouse School of Economics, France': 0.2581528503858601,
 u'Toulouse Univ Hosp, France': 0.26074930852401307,
 u'Trinity College Dublin, Ireland': 0.2812584757255221,
 u'Tsinghua University, China': 0.2994513427663875,
 u'Tufts Med Ctr, USA': 0.27943950417677177,
 u'Tufts University, USA': 0.30101596516690854,
 u'Tulane University, USA': 0.27943950417677177,
 u'Tuscia University, Italy': 0.2648786717752235,
 u'U.S. Environmental Protection Agency, USA': 0.2648786717752235,
 u'UFZ Helmholtz Ctr Environm Res, Germany': 0.2648786717752235,
 u'US DOE Joint Genome Inst, USA': 0.2654210391604812,
 u'US DOE Joint Genome Institute , USA': 0.27601809954751133,
 u'US DOE Joint Genome Institute, USA': 0.28528198074277855,
 u'US FDA, USA': 0.27082789240010446,
 u'US Forest Serv, USA': 0.26074930852401307,
 u'US Geol Survey, USA': 0.2648786717752235,
 u'USAF, USA': 0.2648786717752235,
 u'USDA ARS, USA': 0.2654210391604812,
 u'USDA, USA': 0.26074930852401307,
 u'USN Observ, USA': 0.2591852036990752,
 u'UT Southwestern Medical Center, USA': 0.27943950417677177,
 u'Ulsan National Institute of Science and Technology, South Korea': 0.27311035027653413,
 u'Umea University, Sweden': 0.2654210391604812,
 u'Univ Klinikum Jena, Germany': 0.27943950417677177,
 u'Univ Med Ctr Utrecht, Netherlands': 0.2908835904628331,
 u'Univ Nat Resources & Appl Life Sci, Austria': 0.2606182457903996,
 u'Universidad Colegio Mayor de Cundinamarca, Colombia': 0.25957446808510637,
 u'University College Cork, Ireland': 0.2648786717752235,
 u'University College Dublin, Ireland': 0.27111111111111114,
 u'University College London, UK': 0.3407821229050279,
 u'University Kebangsaan Malaysia, Malaysia': 0.2600953097567093,
 u'University Libre Bruxelles, Belgium': 0.2690012970168612,
 u'University Mah, Turkey': 0.2690012970168612,
 u'University Rovira i Virgili, Spain': 0.26074930852401307,
 u'University Sains Malaysia, Malaysia': 0.2690012970168612,
 u'University of Aberdeen, UK': 0.274556526343659,
 u'University of Alaska - Fairbanks, USA': 0.2648786717752235,
 u'University of Alberta, Canada': 0.3107581660173809,
 u'University of Alicante, Spain': 0.2661021298434693,
 u'University of Amsterdam, Netherlands': 0.3006668599594085,
 u'University of Aquila, Italy': 0.2632647880172633,
 u'University of Arizona, USA': 0.30617065249483316,
 u'University of Arkansas at Fayetteville, USA': 0.27082789240010446,
 u'University of Arkansas at Little Rock, USA': 0.28893842295904154,
 u'University of Auvergne, France': 0.2654210391604812,
 u'University of Barcelona (75%), Spain': 0.27943950417677177,
 u'University of Barcelona, Spain': 0.2863849765258216,
 u'University of Basel, Switzerland': 0.2654210391604812,
 u'University of Bayreuth, Germany': 0.26074930852401307,
 u'University of Bergen, Norway': 0.2908835904628331,
 u'University of Bern, Switzerland': 0.29203041396789636,
 u'University of Bielefeld, Germany': 0.2654210391604812,
 u'University of Birmingham, UK': 0.31471927162367225,
 u'University of Bochum, Germany': 0.2736869886513592,
 u'University of Bologna, Italy': 0.28733721252424493,
 u'University of Bonn, Germany': 0.2764596107704612,
 u'University of Bordeaux 1, France': 0.26569305662311044,
 u'University of Bordeaux, France': 0.2908835904628331,
 u'University of Bristol, UK': 0.33397745571658616,
 u'University of British Columbia, Canada': 0.3185867895545315,
 u'University of Cagliari, Italy': 0.25957446808510637,
 u'University of Calabria, Italy': 0.2600953097567093,
 u'University of California, Berkeley, USA': 0.36475553992261694,
 u'University of California, Davis, USA': 0.33570734865652313,
 u'University of California, Irvine, USA': 0.3278533038254821,
 u'University of California, Los Angeles, USA': 0.41397205588822356,
 u'University of California, Riverside, USA': 0.3115049564433764,
 u'University of California, San Diego, USA': 0.4156312625250501,
 u'University of California, San Francisco, USA': 0.3348401679044236,
 u'University of California, Santa Barbara, USA': 0.3100149476831091,
 u'University of California, Santa Cruz, USA': 0.29962438601560243,
 u'University of Cambridge, UK': 0.3736936936936937,
 u'University of Cape Town, South Africa': 0.2755779962795642,
 u'University of Central Florida, USA': 0.2625981261078754,
 u'University of Chicago, USA': 0.32174992243251627,
 u'University of Cincinnati, USA': 0.3063515509601182,
 u'University of Colorado at Boulder, USA': 0.3370165745856354,
 u'University of Colorado at Denver, USA': 0.26788943425471456,
 u'University of Copenhagen, Denmark': 0.28512510310695627,
 u'University of Delaware, USA': 0.28465550370573706,
 u'University of Denver, USA': 0.2690012970168612,
 u'University of Duisburg-Essen, Germany': 0.27943950417677177,
 u'University of Durham, UK': 0.2591852036990752,
 u'University of East Anglia, UK': 0.2977318403674993,
 u'University of Eastern Finland, Finland': 0.2761651131824234,
 u'University of Erlangen-Nuremberg, Germany': 0.27311035027653413,
 u'University of Essex, UK': 0.26569305662311044,
 u'University of Evry, France': 0.2654210391604812,
 u'University of Exeter, UK': 0.28733721252424493,
 u'University of Ferrara, Italy': 0.26569305662311044,
 u'University of Florence, Italy': 0.2752853729758428,
 u'University of Florida, USA': 0.3284764016471334,
 u'University of Frankfurt, Germany': 0.27943950417677177,
 u'University of Freiburg, Germany': 0.2594445834375782,
 u'University of Fribourg, Switzerland': 0.2606182457903996,
 u'University of Geneva, Switzerland': 0.2835657642876675,
 u'University of Giessen, Germany': 0.3038382654556109,
 u'University of Goettingen, Germany': 0.2661021298434693,
 u'University of Gothenburg, Sweden': 0.2908835904628331,
 u'University of Granada, Spain': 0.2777926600589338,
 u'University of Groningen, Netherlands': 0.3324783584482206,
 u'University of Guelph, Canada': 0.2736869886513592,
 u'University of Hamburg, Germany': 0.2661021298434693,
 u'University of Hannover, Germany': 0.2600953097567093,
 u'University of Hasselt, Belgium': 0.26569305662311044,
 u'University of Hawaii at Manoa, USA': 0.26474342609139645,
 u'University of Heidelberg, Germany': 0.3365790327815644,
 u'University of Helsinki, Finland': 0.32016054337758565,
 u'University of Hohenheim, Germany': 0.26074930852401307,
 u'University of Houston, USA': 0.2633985267970536,
 u'University of Iceland, Iceland': 0.2648786717752235,
 u'University of Illes Balears, Spain': 0.2654210391604812,
 u'University of Illinois at Chicago, USA': 0.25957446808510637,
 u'University of Illinois at Urbana Champaign, USA': 0.2654210391604812,
 u'University of Illinois at Urbana-Champaign, USA': 0.31206740896780016,
 u'University of Innsbruck, Austria': 0.2698412698412698,
 u'University of Iowa, USA': 0.2632647880172633,
 u'University of Jaen, Spain': 0.26233240576777134,
 u'University of Kansas - Lawrence, USA': 0.2648786717752235,
 u'University of Karlsruhe, Germany': 0.2625981261078754,
 u'University of Kiel, Germany': 0.29807415924116126,
 u'University of Koeln, Germany': 0.2633985267970536,
 u'University of Konstanz, Germany': 0.2581528503858601,
 u'University of Lausanne, Switzerland': 0.28325594099972684,
 u'University of Laval, Canada': 0.2606182457903996,
 u'University of Leeds, UK': 0.30853912526033916,
 u'University of Leicester, UK': 0.2648786717752235,
 u'University of Leipzig, Germany': 0.26233240576777134,
 u'University of Liege, Belgium': 0.26074930852401307,
 u'University of Limerick, Ireland': 0.2661021298434693,
 u'University of Lisbon, Portugal': 0.2690012970168612,
 u'University of Liverpool, UK': 0.29335219236209337,
 u'University of Maastricht (95%), Netherlands': 0.25957446808510637,
 u'University of Maastricht, Netherlands': 0.27943950417677177,
 u'University of Malaya, Malaysia': 0.26569305662311044,
 u'University of Marburg, Germany': 0.2758712423516893,
 u'University of Maribor (50%), Slovenia': 0.2625981261078754,
 u'University of Maryland, Baltimore County, USA': 0.26788943425471456,
 u'University of Maryland, Baltimore, USA': 0.2716793293162169,
 u'University of Maryland, College Park, USA': 0.2890995260663507,
 u'University of Massachusetts Amherst, USA': 0.33922145894667977,
 u'University of Massachusetts Medical School - Worcester, USA': 0.2651495781130146,
 u'University of Melbourne, Australia': 0.33636068764190724,
 u'University of Miami, USA': 0.28233052001089026,
 u'University of Michigan - Ann Arbor, USA': 0.4027184466019417,
 u'University of Milan (93%), Italy': 0.25789604575976127,
 u'University of Milan - Bicocca, Italy': 0.27943950417677177,
 u'University of Minho, Portugal': 0.2625981261078754,
 u'University of Minnesota, Twin Cities, USA': 0.3977752205600307,
 u'University of Missouri - Kansas City, USA': 0.27943950417677177,
 u'University of Montreal, Canada': 0.28845618915159943,
 u'University of Muenster, Germany': 0.28095367109184505,
 u'University of Munich (50%), Germany': 0.2625981261078754,
 u'University of Munich, Germany': 0.29023229778897286,
 u'University of Naples Federico II, Italy': 0.26074930852401307,
 u'University of Nebraska - Lincoln, USA': 0.27111111111111114,
 u'University of Neuchatel, Switzerland': 0.2654210391604812,
 u'University of New England, Australia': 0.26569305662311044,
 u'University of New Hampshire - Durham, USA': 0.26569305662311044,
 u'University of New South Wales, Australia': 0.28279247341150804,
 u'University of Newcastle, Australia': 0.27943950417677177,
 u'University of Nis, Serbia': 0.2600953097567093,
 u'University of North Carolina at Chapel Hill, USA': 0.35944540727902946,
 u'University of North Texas - Denton, USA': 0.2654210391604812,
 u'University of Notre Dame, USA': 0.2748476013782136,
 u'University of Nottingham, UK': 0.2661021298434693,
 u'University of Oldenburg, Germany': 0.2648786717752235,
 u'University of Ontario Institute of Technology, Canada': 0.2690012970168612,
 u'University of Oregon, USA': 0.26569305662311044,
 u'University of Oslo, Norway': 0.28893842295904154,
 u'University of Otago, New Zealand': 0.25957446808510637,
 u'University of Ottawa, Canada': 0.2661021298434693,
 u'University of Oulu, Finland': 0.2648786717752235,
 u'University of Oxford, UK': 0.4046039797112759,
 u'University of Padua, Italy': 0.2648786717752235,
 u'University of Paris Descartes (Paris 5), France': 0.3008413112851755,
 u'University of Paris Diderot (Paris 7), France': 0.2758712423516893,
 u'University of Paris Sud (Paris 11), France': 0.3052693553135119,
 u'University of Parma, Italy': 0.26928070631004936,
 u'University of Patras, Greece': 0.2632647880172633,
 u'University of Pavia, Italy': 0.26233240576777134,
 u'University of Pennsylvania, USA': 0.35260115606936415,
 u'University of Pittsburgh, USA': 0.3348401679044236,
 u'University of Poitiers, France': 0.27943950417677177,
 u'University of Pompeu Fabra, Spain': 0.2581528503858601,
 u'University of Portsmouth, UK': 0.2591852036990752,
 u'University of Puerto Rico, USA': 0.2594445834375782,
 u'University of Rhode Island, USA': 0.26074930852401307,
 u'University of Rochester, USA': 0.2905575791538246,
 u'University of Rostock, Germany': 0.2661021298434693,
 u'University of Santiago Compostela, Spain': 0.2600953097567093,
 u'University of Sao Paulo, Brazil': 0.2795901860339714,
 u'University of Science Ho Chi Minh City, Vietnam': 0.26233240576777134,
 u'University of Science and Technology of China, China': 0.2835657642876675,
 u'University of South Carolina - Columbia, USA': 0.31263189629182997,
 u'University of South Florida (80%), USA': 0.2633985267970536,
 u'University of South Florida, USA': 0.28279247341150804,
 u'University of Southampton, UK': 0.3107581660173809,
 u'University of Southern California, USA': 0.34055829228243023,
 u'University of Strasbourg, France': 0.2648786717752235,
 u'University of Stuttgart, Germany': 0.2625981261078754,
 u'University of Surrey, UK': 0.26074930852401307,
 u'University of Sussex, UK': 0.2591852036990752,
 u'University of Tasmania, Australia': 0.2648786717752235,
 u'University of Technology, Sydney, Australia': 0.28325594099972684,
 u'University of Tehran, Iran': 0.2690012970168612,
 u'University of Tennessee - Knoxville, USA': 0.2690012970168612,
 u'University of Tennessee Health Science Center, USA': 0.3041947785274274,
 u'University of Toronto, Canada': 0.3847866419294991,
 u'University of Trieste, Italy': 0.2661021298434693,
 u'University of Tuebingen, Germany': 0.2654210391604812,
 u'University of Turin, Italy': 0.27943950417677177,
 u'University of Ulm, Germany': 0.29603197259491865,
 u'University of Ulster, Ireland': 0.26569305662311044,
 u'University of Utah, USA': 0.3052693553135119,
 u'University of Valencia, Spain': 0.2594445834375782,
 u'University of Vermont, USA': 0.2648786717752235,
 u'University of Versailles, France': 0.2661021298434693,
 u'University of Victoria, Canada': 0.28591122139509234,
 u'University of Vienna, Austria': 0.27943950417677177,
 u'University of Vigo, Spain': 0.26233240576777134,
 u'University of Virginia, USA': 0.28942227183924085,
 u'University of Wageningen, Netherlands': 0.28528198074277855,
 u'University of Warwick, UK': 0.2938509492774157,
 u'University of Washington, USA': 0.3681221157259496,
 u'University of Waterloo, Canada': 0.2633985267970536,
 u'University of Wisconsin - Madison, USA': 0.3401115119711381,
 u'University of Wollongong, Australia': 0.26569305662311044,
 u'University of Wuerzburg, Germany': 0.2940175786787638,
 u'University of Wuppertal, Germany': 0.2633985267970536,
 u'University of Yasuj, Iran': 0.2690012970168612,
 u'University of York, UK': 0.2648786717752235,
 u'University of Zaragoza, Spain': 0.2625981261078754,
 u'University of Zurich, Switzerland': 0.3370165745856354,
 ...}

Based on this we can see what institutions target very specific group of sciences and what institutions are more spread-out. To do that we will use something. :)

We might also thing of providing relative values according to total representatives of Category. Category should be value of $$S / S_t$$

American Philosophical Society

American Philosophial Society that has rather big membership. Their database of members is rather big.


In [13]:
def get_all():
    #gets many websites that have scientists in them.
    count = 1
    mylist = []
    for x in range(1, 2000, 20):
        payload = {'browse-all':'yes', 'sort':'creator', 'startDoc':'{}'.format(x)}
        r = requests.get('http://www.amphilsoc.org/memhist/search?', params=payload)
        mylist.append(r)
    return mylist

In [11]:
from bs4 import BeautifulSoup
soup = BeautifulSoup(r.content)


---------------------------------------------------------------------------
NameError                                 Traceback (most recent call last)
<ipython-input-11-cbbca10cb554> in <module>()
      1 from bs4 import BeautifulSoup
----> 2 soup = BeautifulSoup(r.content)

NameError: name 'r' is not defined

In [9]:
# will use them with regex
tags = ['Name:',
    'Institution:',
    'Year Elected:',
    'Class:' ,
    'Subdivision:',  
    'Residency:',
    'Living?:',
    'Birth Date:']

In [27]:
import re
res = soup.findAll('td', {'class':'docHit'})[0]
len(res.contents[0].contents[0])
len(res.contents[1].contents[0])
len(res.contents[2].contents[0])
len(res.contents[4].contents[0])
len(res.contents[5].contents[0])
for x in res.contents[5].contents[0].contents[0].children:
    print x


---------------------------------------------------------------------------
NameError                                 Traceback (most recent call last)
<ipython-input-27-405ee752da02> in <module>()
      1 import re
----> 2 res = soup.findAll('td', {'class':'docHit'})[0]
      3 len(res.contents[0].contents[0])
      4 len(res.contents[1].contents[0])
      5 len(res.contents[2].contents[0])

NameError: name 'soup' is not defined

In [154]:
#alternative
res = soup.findAll('td', {'class':'docHit'})
df = pd.read_html(str(res))
#not working properly

In [156]:
#alternative, reads one table
res = soup.findAll('div', {'id':'main_5'})
df = pd.read_html(str(res))
#not working properly

Future tasks

  • Scrap American Philosophical Society
  • Save findings in database
  • Do network analysis of affiliation to discipline connections.
  • Fuse this set with university set, look for matches.